Table Compare transform Icon Table Compare

Description

The Table Compare transform compares the data from two tables (provided they have the same lay-out), finds differences between the data in the two tables and logs it.

Supported Engines

Hop Engine

Supported

Spark

Maybe Supported

Flink

Maybe Supported

Dataflow

Maybe Supported

Options

Reference/Comparison data tab

Option Description

Reference connection / Compare connection

Database connections from which the reference/compare table data will come.

Reference schema field / Compare schema field

contain the schema names for the reference/compare table.

Reference table field / Compare table field

contain the actual table names. This means that you could compare two tables with a different name, as long as they have the same column names.

Other fields tab

Option Description

Key fields field

should contain a comma separated list of they fields that make up the 'primary' key of the table(s) you are comparing. The primary key is needed because without this information the two tables cannot be correctly joined.

Exclude fields field

contains a comma separated list of columns that you want to exclude from the comparison. E.g. because they exist in the first table, but not in the second.

Additional fields tab

Option Description

Number of errors field

allows you to specify the name of the output column that will contain the total number of errors found for the comparison of your tables.

Number of reference/compare table records field

allows you to specify the name of the field that will contain the actual number of records found in each table.

Number of left/inner/right join errors field

allows you to specify the name of the field(s) that will contain the number of errors found for each join type.

Error handling key description input field

allows you specify the name of the output field for the 'where clause" of the record that gave an error.

Error handling reference/compare value input field

allows you to specify the output field names for the actual values that differ.