Expand description
Prefilter framework for parquet reader.
Prefilter optimization reduces I/O by reading only a subset of columns first (the prefilter phase), applying filters to compute a refined row selection, then reading the remaining columns with the refined selection.
Structsยง
- Bulk
Filter ๐Plan - How the bulk-memtable read should apply each predicate.
- Cached
Primary ๐KeyFilter - Prefilter
Context ๐ - Context for prefiltering a row group.
- Prefilter
Context ๐Builder - Pre-built state for constructing PrefilterContext per row group.
- Prefilter
Entry ๐ - Prefilter
Result ๐ - Result of prefiltering a row group.
- Reader
Filter ๐Plan - How the parquet reader should apply each predicate.
Enumsยง
- Prefilter
Entry ๐Kind
Constantsยง
Functionsยง
- all_
prefilter_ ๐entries - build_
bulk_ ๐filter_ plan - build_
prefilter_ ๐cache_ entries - build_
prefilter_ ๐masks - build_
reader_ ๐filter_ plan - Splits a query [
Predicate] into aReaderFilterPlan: predicates that can run during the prefilter pass (on a reduced projection, to compute a refined row selection) versus predicates that must run on the normal read path (alongside the full projection). - compute_
projection_ ๐count - compute_
projection_ ๐mask - Executes prefiltering on a row group.
- eval_
entry_ ๐mask - eval_
physical_ ๐filter_ mask - eval_
pk_ ๐group_ mask - eval_
simple_ ๐filter_ mask - execute_
prefilter ๐ - execute_
prefilter_ ๐by_ reading_ columns - execute_
prefilter_ ๐with_ result_ cache - matching_
row_ ๐ranges_ by_ primary_ key - non_
cacheable_ ๐physical_ filters - prefilter_
column_ ๐names_ for_ entries - prefilter_
flat_ ๐batch_ by_ primary_ key - Filters a flat-format record batch by primary key, returning only rows whose
primary key matches the filter. Returns
Noneif all rows are filtered out. - projection_
indices ๐ - refined_
selection_ ๐from_ mask - rows_
before_ ๐filter - should_
use_ ๐prefilter