Skip to main content

Module part

Module part 

Source
Expand description

Bulk part encoder/decoder.

Structsยง

BatchPruningStats ๐Ÿ”’
Adapter implementing PruningStatistics for BatchStats.
BatchStats ๐Ÿ”’
Per-batch min/max statistics for the first tag column in a MultiBulkPart.
BulkPart
A raw bulk part in the memtable.
BulkPartConverter
Converter that converts structs into BulkPart.
BulkPartEncodeMetrics
Metrics for encoding a part.
BulkPartEncoder
BulkPartMeta
EncodedBulkPart
MultiBulkPart
A collection of ordered RecordBatches representing a bulk part without parquet encoding.
UnorderedPart
A collection of small unordered bulk parts. Used to batch small parts together before merging them into a sorted part.

Enumsยง

PrimaryKeyColumnBuilder ๐Ÿ”’
Primary key column builder for handling strings specially.

Constantsยง

INIT_DICT_VALUE_CAPACITY ๐Ÿ”’

Functionsยง

align_parts ๐Ÿ”’
Align the JSON columns in BulkParts, to unified Arrow arrays. So that we can compute (concat, sort, etc.) on them.
align_schema_with_json_array ๐Ÿ”’
convert_bulk_part
Converts a BulkPart that is unordered and without encoded primary keys into a BulkPart with the same format as produced by BulkPartConverter.
new_primary_key_column_builders ๐Ÿ”’
predicate_references_column ๐Ÿ”’
Returns true if the predicate references the given column name.
record_batch_estimated_size
More accurate estimation of the size of a record batch.
should_prune_bulk_part ๐Ÿ”’
Returns true if the batch should be pruned (skipped) based on the first-tag min/max statistics and the predicate in the context. Returns false if no pruning is possible (no primary key, no predicate, or the batch matches the predicate).
sort_primary_key_record_batch
Sorts the record batch with primary key format.