Read Parquet

Synopsis

This operator reads Apache Parquet files.

Description

Apache Parquet is a column-oriented data storage format from the Apache Hadoop ecosystem. It enables efficient data storage and processing. Although the Parquet format allows more complex structures such as lists and maps, this operator does not currently support them. The repetition count should therefore not be higher than 1; where a field holds more than one item, only the first data item is considered.
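The "first data item only" rule can be pictured with a small Python sketch. This is an illustration of the behavior described above, not the operator's actual implementation: the function name `first_item_only` and the sample records are hypothetical, and the handling of an empty repeated field (mapped to a missing value here) is an assumption.

```python
from typing import Any


def first_item_only(record: dict[str, Any]) -> dict[str, Any]:
    """Reduce every repeated (list) field to its first item; pass scalars through."""
    flat: dict[str, Any] = {}
    for name, value in record.items():
        if isinstance(value, list):
            # Repeated field: keep only the first item.
            # Assumption: an empty list becomes a missing value (None).
            flat[name] = value[0] if value else None
        else:
            flat[name] = value
    return flat


# Hypothetical records, as they might look after decoding a Parquet file.
records = [
    {"id": 1, "tags": ["a", "b"]},
    {"id": 2, "tags": []},
]

table = [first_item_only(r) for r in records]
```

In this sketch, the second tag of the first record is dropped silently. If such values matter, it may be safer to flatten or split repeated fields before writing the Parquet file.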

Input

file

A Parquet file can optionally be passed in as a file object. Such an object can be created by operators that have file output ports, such as the Open File operator.

Output

output

This port delivers a data table created from the Parquet file provided at the input port or loaded from the path given to the file parameter.

Parameters

Parquet file

Path to the specified Parquet file. It can also be selected using the 'Choose a file' button.