Acero - Streaming Execution Engine#
Warning
Acero is experimental and a stable API is not yet guaranteed.
Acero is a streaming query engine, which allows the computation to be expressed
as an “execution plan” (constructed using the Declaration
interface).
This enables to create a computation composed of pyarrow.compute
functions
and to execute this efficiently in a batched manner.
|
Helper class for declaring the nodes of an ExecPlan. |
Base class for the node options. |
|
|
A Source node which accepts a table. |
|
A Source node which yields batches from a Dataset scan. |
|
Make a node which excludes some rows from batches passed through it. |
|
Make a node which executes expressions on input batches, producing batches of the same length with new columns. |
|
Make a node which aggregates input batches, optionally grouped by keys. |
|
Make a node which applies a new ordering to the data. |
|
Make a node which implements join operation using hash join strategy. |
See also
- Substrait
Alternative way to run Acero from a standardized Substrait plan.