-
Notifications
You must be signed in to change notification settings - Fork 124
Inclusion of tsp flag leads to incorrect DVT output results #1368
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi Kash, Row validation is only possible when rows are unique. When rows are not unique, the source/target comparison will result in apparently incorrect output. In a separate conversation you mentioned that the underlying issue was that the -tsp flag indiscriminately truncates both char and varchar fields. If the flag only truncated char fields, things would work for your customer. Can you elaborate the specific use case(s) ? Does the customer have a composite primary key where one field is a char and another is varchar ? Thanks. Sundar Mudupalli |
Hi, I have created a new branch
BQ table:
The following commands ran successfully.
Please test this version with the customer and let me know how this works out. Sundar Mudupalli |
Overview: the -tsp flag was implemented as an optional flag due to a bug in teradatasql that adds extra whitespaces to CHAR data types, resulting in incorrect DVT results.
Example: ‘ABC ’ CHAR(10) in TD and “ABC‘ in BQ will become identical when DVT executes with -tsp
We understand this raises a risk of duplicate primary keys in the instance where a table contains two different CHAR or VARCHAR PKs, which only differ by trailing spaces.
Observation: -tsp flag applied to CHAR/VARCHAR data results in incorrect DVT output results, even after accounting for the expected duplicate rows. We see identical hashes flagged as "fail" and different hashes flagged as "success"
The text was updated successfully, but these errors were encountered: