Expand description
Utilities for scanners.
Structsยง
- Partition
Metrics - Metrics while reading a partition.
- Partition
Metrics ๐Inner - Partition
Metrics ๐List - List of PartitionMetrics.
- Scan
Metrics ๐Set - Verbose scan metrics for a partition.
- Series
Distributor ๐Metrics - Metrics for the series distributor.
- Split
Record ๐Batch Stream - A stream wrapper that splits record batches from an inner stream.
Constantsยง
- BATCH_
SIZE_ ๐THRESHOLD - Minimum batch size after splitting. The batch size is less than 60 because a series may only have 60 samples per hour.
- NUM_
SERIES_ ๐THRESHOLD - Number of series threshold for splitting batches.
- SPLIT_
ROW_ ๐THRESHOLD - Files with row count greater than this threshold can contribute to the estimation.
Functionsยง
- build_
file_ range_ scan_ stream - Build the stream of scanning the input
FileRanges. - build_
flat_ file_ range_ scan_ stream - Build the stream of scanning the input
FileRanges using flat reader that returns RecordBatch. - can_
split_ ๐series - maybe_
scan_ ๐flat_ other_ ranges - maybe_
scan_ ๐other_ ranges - scan_
file_ ๐ranges - Scans file ranges at
index. - scan_
flat_ ๐file_ ranges - Scans file ranges at
indexusing flat reader that returns RecordBatch. - scan_
flat_ ๐mem_ ranges - Scans memtable ranges at
indexusing flat format that returns RecordBatch. - scan_
mem_ ๐ranges - Scans memtable ranges at
index. - should_
split_ ๐flat_ batches_ for_ merge - Returns true if splitting flat record batches may improve merge performance.
- split_
record_ ๐batch - Splits the batch by timestamps.