Table Output

Description

The Table Output transform inserts data into a relational database table.

Supported Engines

Hop Engine

Supported

Single Threaded

Supported

Native Spark

Supported

Beam Spark

Supported

Beam Flink

Supported

Beam Dataflow

Supported

Table Output is equivalent to the DML operator INSERT.

This transform provides configuration options for target table and a lot of housekeeping and/or performance-related options such as Commit Size and Use batch update for inserts.

If you have a database table that has identity columns and you are inserting a record, as part of the insert, the JDBC driver will typically return the auto-generated key it used when performing the insert.

Options

Option Description

Option	Description
Transform name	Name of the transform, this name has to be unique in a single pipeline.
Connection	The database connection to which data is written
Target Schema	The name of the Schema for the table to write data to. This is important for data sources that allow for table names with periods in them.
Target table	The name of the table to which data is written.
Commit size	Use transactions to insert rows in the database table. Commit the connection every N rows if N is larger than zero (0); otherwise, don’t use transactions. (Slower)
Truncate table	Select if you want the table to be truncated before the first row is inserted into the table
Truncate on first row	Select if you want the table to be truncated when the transform receives it’s first row
Ignore insert errors	Makes Hop ignore all insert errors such as violated primary keys. A maximum of 20 warnings will be logged however. This option is not available for batch inserts.
Specify database fields	Enable this option to specify the fields in the Database fields tab. Otherwise all fields are taken into account by default.
Automatically update table structure	Automatically manages table structure based on the incoming data stream. When enabled, the table is created if it doesn’t exist, and additional options become available for column management. This option is incompatible with "Specify database fields".
Always drop and recreate table	Drops and recreates the table on every execution. Only available when "Automatically update table structure" is enabled.
Add columns	Adds columns to the table that exist in the incoming stream but not in the table. Only available when "Automatically update table structure" is enabled.
Drop non-existing columns	Drops columns from the table that don’t exist in the incoming stream. Warning: This will result in data loss for the dropped columns. Only available when "Automatically update table structure" is enabled.
Change column data types	Changes column data types to match the incoming stream. Warning: Columns are dropped and recreated, resulting in data loss. Only available when "Automatically update table structure" is enabled.
Partition data over tables	Use to split the data over multiple tables. For example instead of inserting all data into table SALES, put the data into tables SALES_200510, SALES_200511, SALES_200512, … Use this on systems that don’t have partitioned tables and/or don’t allow inserts into UNION ALL views or the master of inherited tables. The view SALES allows you to report on the complete sales: `CREATE OR REPLACE VIEW SALES AS SELECT * FROM SALES_200501 UNION ALL SELECT * FROM SALES_200502 UNION ALL SELECT * FROM SALES_200503 UNION ALL SELECT * FROM SALES_200504 ...`
Use batch update for inserts	Enable if you want to use batch inserts. This feature groups inserts statements to limit round trips to the database. This is the fastest option and is enabled by default.
Is the name of the table defined in a field?	Use these options to split the data over one or more tables; the name of the target table is defined in the field you specify. For example if you store customer data in the field gender, the data might end up in tables M and F (Male and Female). There is an option to exclude the field containing the tablename from being inserted into the tables.
Field that contains name of table	When the option "Is the name of the table defined in a field?" is enabled, enter the field name to use here.
Store the table name field	When the option "Is the name of the table defined in a field?" is enabled, you can chose is this field should we written to the table or not.
Return auto-generated key	Enable if you want to get back the key that was generated by inserting a row into the table
Name of auto-generated key field	Specifies the name of the new field in the output rows that contains the auto-generated key
SQL	Generates the SQL to create the output table automatically

Transform name

Name of the transform, this name has to be unique in a single pipeline.

Connection

The database connection to which data is written

Target Schema

The name of the Schema for the table to write data to. This is important for data sources that allow for table names with periods in them.

Target table

The name of the table to which data is written.

Commit size

Use transactions to insert rows in the database table. Commit the connection every N rows if N is larger than zero (0); otherwise, don’t use transactions. (Slower)

Truncate table

Select if you want the table to be truncated before the first row is inserted into the table

Truncate on first row

Select if you want the table to be truncated when the transform receives it’s first row

Ignore insert errors

Makes Hop ignore all insert errors such as violated primary keys. A maximum of 20 warnings will be logged however. This option is not available for batch inserts.

Specify database fields

Enable this option to specify the fields in the Database fields tab. Otherwise all fields are taken into account by default.

Automatically update table structure

Automatically manages table structure based on the incoming data stream. When enabled, the table is created if it doesn’t exist, and additional options become available for column management. This option is incompatible with "Specify database fields".

Always drop and recreate table

Drops and recreates the table on every execution. Only available when "Automatically update table structure" is enabled.

Add columns

Adds columns to the table that exist in the incoming stream but not in the table. Only available when "Automatically update table structure" is enabled.

Drop non-existing columns

Drops columns from the table that don’t exist in the incoming stream. Warning: This will result in data loss for the dropped columns. Only available when "Automatically update table structure" is enabled.

Change column data types

Changes column data types to match the incoming stream. Warning: Columns are dropped and recreated, resulting in data loss. Only available when "Automatically update table structure" is enabled.

Partition data over tables

Use to split the data over multiple tables. For example instead of inserting all data into table SALES, put the data into tables SALES_200510, SALES_200511, SALES_200512, … Use this on systems that don’t have partitioned tables and/or don’t allow inserts into UNION ALL views or the master of inherited tables. The view SALES allows you to report on the complete sales:

CREATE OR REPLACE VIEW SALES AS
SELECT * FROM SALES_200501
UNION ALL
SELECT * FROM SALES_200502
UNION ALL
SELECT * FROM SALES_200503
UNION ALL
SELECT * FROM SALES_200504
...

Use batch update for inserts

Enable if you want to use batch inserts. This feature groups inserts statements to limit round trips to the database. This is the fastest option and is enabled by default.

Is the name of the table defined in a field?

Use these options to split the data over one or more tables; the name of the target table is defined in the field you specify. For example if you store customer data in the field gender, the data might end up in tables M and F (Male and Female). There is an option to exclude the field containing the tablename from being inserted into the tables.

Field that contains name of table

When the option "Is the name of the table defined in a field?" is enabled, enter the field name to use here.

Store the table name field

When the option "Is the name of the table defined in a field?" is enabled, you can chose is this field should we written to the table or not.

Return auto-generated key

Enable if you want to get back the key that was generated by inserting a row into the table

Name of auto-generated key field

Specifies the name of the new field in the output rows that contains the auto-generated key

SQL

Generates the SQL to create the output table automatically