Apache Arrow

Apache Arrow is a cross-language, columnar in-memory data format and toolkit for efficient analytics and data interchange.

Hop leverages Arrow for high-performance streaming between pipelines and external tools, particularly Python.

Key features used in Hop:

  • Arrow IPC (streaming and file formats)

  • Arrow Flight (RPC-based streaming)

See the official documentation: