Merge Join transform Icon Merge Join

Description

The Merge Join transform performs a classic merge join between data sets with data coming from two different input transforms.

This transform assumes your data is sorted on the join keys. Use Sort Rows transforms on the incoming streams to enforce sorting if necessary.

Join options include INNER, LEFT OUTER, RIGHT OUTER, and FULL OUTER.

Supported Engines

Hop Engine

Supported

Spark

Supported

Flink

Supported

Dataflow

Supported

Options

Option Description

First Transform

The first transform to read data from (left hand side of the join)

Second Transform

The second transform to read data from (right hand side of the join)

Join type

The join type that should be used; INNER, LEFT OUTER, RIGHT OUTER, and FULL OUTER

Key Field

The fields used for the join key, this only supports equal joins (key first transform = key second transform)