Expand description
Prefilter framework for parquet reader.
Prefilter optimization reduces I/O by reading only a subset of columns first (the prefilter phase), applying filters to compute a refined row selection, then reading the remaining columns with the refined selection.
Structsยง
- Bulk
Filter ๐Plan - How the bulk-memtable read should apply each predicate.
- Cached
Primary ๐KeyFilter - Prefilter
Context ๐ - Context for prefiltering a row group.
- Prefilter
Context ๐Builder - Pre-built state for constructing PrefilterContext per row group.
- Prefilter
Result ๐ - Result of prefiltering a row group.
- Reader
Filter ๐Plan - How the parquet reader should apply each predicate.
Constantsยง
Functionsยง
- apply_
filters_ ๐to_ batch - build_
bulk_ ๐filter_ plan - build_
reader_ ๐filter_ plan - Splits a query [
Predicate] into aReaderFilterPlan: predicates that can run during the prefilter pass (on a reduced projection, to compute a refined row selection) versus predicates that must run on the normal read path (alongside the full projection). - compute_
projection_ ๐mask - Executes prefiltering on a row group.
- execute_
prefilter ๐ - matching_
row_ ๐ranges_ by_ primary_ key - prefilter_
flat_ ๐batch_ by_ primary_ key - Filters a flat-format record batch by primary key, returning only rows whose
primary key matches the filter. Returns
Noneif all rows are filtered out. - should_
use_ ๐prefilter