Module org.apache.arrow.format
Package org.apache.arrow.flatbuf
Class Description
Opaque binary data.
Logically the same as Binary, but the internal representation uses a view struct that contains the string length and either the string's entire data inline (for small strings) or an inlined prefix, an index of another buffer, and an offset pointing to a slice in that buffer (for non-small strings).
Optional compression for the memory buffers constituting IPC message bodies.
Provided for forward compatibility in case we need to support different strategies for compressing the IPC message body (like whole-body compression rather than buffer-level) in the future.
A Buffer represents a single contiguous memory segment.
Date is either a 32-bit or 64-bit signed integer type representing an elapsed time since UNIX epoch (1970-01-01), stored in either of two units:
  * Milliseconds (64 bits) indicating UNIX time elapsed since the epoch (no leap seconds), where the values are evenly divisible by 86400000
  * Days (32 bits) since the UNIX epoch
Exact decimal value represented as an integer value in two's complement.
For sending dictionary encoding information.
Dictionary encoding metadata. Maintained for forwards compatibility; in the future, Dictionaries might be explicit maps between integers and values, allowing for non-contiguous index values.
Endianness of the platform producing the data.
Represents Arrow Features that might not have full support within implementations.
A field represents a named column in a record / row batch or child of a nested type.
Data structures for describing a table row batch (a collection of equal-length Arrow arrays). Metadata about a field at some level of a nested type tree (but not its children).
Arrow File metadata.
User-defined key-value pairs to add custom metadata to Arrow. Key namespacing is the responsibility of the user.
Same as Binary, but with 64-bit offsets, allowing to represent extremely large data values.
Same as List, but with 64-bit offsets, allowing to represent extremely large data values.
Same as ListView, but with 64-bit offsets and sizes, allowing to represent extremely large data values.
Same as Utf8, but with 64-bit offsets, allowing to represent extremely large data values.
Represents the same logical types that List can, but contains offsets and sizes allowing for writes in any order and sharing of child values among list values.
A Map is a logical nested type that is represented as List<entries: Struct<key: K, value: V>>. In this layout, the keys and values are each respectively contiguous.
The root Message type. This union enables us to easily send different message types without redundant storage, and in the future we can easily add new message types.
These are stored in the flatbuffer in the Type union below.
A data header describing the shared memory layout of a "record" or "row" batch.
Contains two child arrays, run_ends and values.
A Schema describes the columns in a row batch.
Compressed Sparse format, that is matrix-specific.
EXPERIMENTAL: Data structures for sparse tensors. Coordinate (COO) format of sparse tensor index.
Compressed Sparse Fiber (CSF) sparse tensor index.
A Struct_ in the flatbuffer metadata is the same as an Arrow Struct (according to the physical memory layout).
Data structures for dense tensors. Shape data for a single axis in a tensor.
Time is either a 32-bit or 64-bit signed integer type representing an elapsed time since midnight, stored in either of four units: seconds, milliseconds, microseconds or nanoseconds.
Timestamp is a 64-bit signed integer representing an elapsed time since a fixed epoch, stored in either of four units: seconds, milliseconds, microseconds or nanoseconds, and is optionally annotated with a timezone.
Top-level Type value, enabling extensible type-specific metadata.
A union is a complex type with children in Field. By default, ids in the type vector refer to the offsets in the children. Optionally, typeIds provides an indirection between the child offset and the type id for each child: `typeIds[offset]` is the id used in the type vector.
Unicode with UTF-8 encoding.
Logically the same as Utf8, but the internal representation uses a view struct that contains the string length and either the string's entire data inline (for small strings) or an inlined prefix, an index of another buffer, and an offset pointing to a slice in that buffer (for non-small strings).
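
The view struct used by the Binary and Utf8 view types can be illustrated with a small, self-contained sketch. It is not part of org.apache.arrow.flatbuf: the class name ViewSketch and its methods are hypothetical and exist only for illustration. The concrete sizes (a 16-byte view, a 32-bit length, a 12-byte inline limit, a 4-byte prefix, and 32-bit buffer index and offset) are taken from the Arrow columnar format specification rather than from this page, and little-endian byte order is assumed.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;

/** Hypothetical illustration of the 16-byte view struct; not an Arrow API. */
final class ViewSketch {
    static final int VIEW_SIZE = 16;   // assumed total size of one view
    static final int INLINE_MAX = 12;  // assumed largest value stored inline

    /**
     * Builds a view for value. For values longer than INLINE_MAX the caller
     * states where the bytes live: data buffer bufferIndex, starting at offset.
     */
    static ByteBuffer encode(byte[] value, int bufferIndex, int offset) {
        ByteBuffer view = ByteBuffer.allocate(VIEW_SIZE).order(ByteOrder.LITTLE_ENDIAN);
        view.putInt(value.length);          // bytes 0..3: length
        if (value.length <= INLINE_MAX) {
            view.put(value);                // bytes 4..15: data, zero padded
        } else {
            view.put(value, 0, 4);          // bytes 4..7: prefix
            view.putInt(bufferIndex);       // bytes 8..11: buffer index
            view.putInt(offset);            // bytes 12..15: offset in that buffer
        }
        view.rewind();
        return view;
    }

    /** Resolves a view back to its bytes, reading from dataBuffers when needed. */
    static byte[] decode(ByteBuffer view, byte[][] dataBuffers) {
        int length = view.getInt(0);
        byte[] out = new byte[length];
        if (length <= INLINE_MAX) {
            for (int i = 0; i < length; i++) {
                out[i] = view.get(4 + i);   // small value: read the inline bytes
            }
        } else {
            int bufferIndex = view.getInt(8);
            int offset = view.getInt(12);
            System.arraycopy(dataBuffers[bufferIndex], offset, out, 0, length);
        }
        return out;
    }

    public static void main(String[] args) {
        byte[] small = "arrow".getBytes(StandardCharsets.UTF_8);
        byte[] large = "a value longer than twelve bytes".getBytes(StandardCharsets.UTF_8);
        byte[][] buffers = { large };       // one data buffer holding the large value

        ByteBuffer smallView = encode(small, 0, 0);
        ByteBuffer largeView = encode(large, 0, 0);

        System.out.println(new String(decode(smallView, buffers), StandardCharsets.UTF_8));
        System.out.println(new String(decode(largeView, buffers), StandardCharsets.UTF_8));
    }
}
```

The stored prefix lets a reader reject many unequal values by comparing only the views, without touching the data buffers, which is one motivation for this layout.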