Expand description
Sorted strings tables.
Modulesยง
- file
- Structures to describe metadata of files.
- file_
purger - file_
ref - index
- location
- parquet
- SST in parquet format.
- version ๐
- SST version.
Structsยง
- Flat
Schema Options - Options of flat schema.
- Series
Estimator ๐ - Gets the estimated number of series from record batches.
Enumsยง
- Format
Type - Format type of the SST file.
Constantsยง
- DEFAULT_
WRITE_ BUFFER_ SIZE - Default write buffer size, it should be greater than the default minimum upload part of S3 (5mb).
- DEFAULT_
WRITE_ CONCURRENCY - Default number of concurrent write, it only works on object store backend(e.g., S3).
- INTERNAL_
PARQUET_ ๐FIELD_ ID_ BASE - Parquet field ID base for internal columns (__primary_key, __sequence, __op_type). Uses bit 30 to distinguish from user column IDs and fit in positive i32 range.
- OP_
TYPE_ ๐PARQUET_ FIELD_ ID - Parquet field ID for the __op_type column.
- PARQUET_
FIELD_ ID_ KEY - Iceberg-compatible column field ID key stored in Parquet column metadata.
- PRIMARY_
KEY_ ๐PARQUET_ FIELD_ ID - Parquet field ID for the __primary_key column.
- SEQUENCE_
PARQUET_ ๐FIELD_ ID - Parquet field ID for the __sequence column.
Functionsยง
- concretize_
json_ ๐type - flat_
sst_ arrow_ schema_ column_ num - Returns the number of columns in the flat format.
- internal_
fields ๐ - Fields for internal columns.
- override_
pk_ ๐field_ to_ binary - Returns a copy of
schemawith the__primary_keyfield replaced by a plainBinaryfield. - tag_
maybe_ ๐to_ dictionary_ field - Helper function to create a dictionary field from a field if it is a string column.
- to_
dictionary_ ๐field - Helper function to create a dictionary field from a field.
- to_
flat_ sst_ arrow_ schema - Gets the arrow schema to store in parquet.
- to_
sst_ arrow_ schema - Gets the arrow schema to store in parquet.
- with_
field_ id - Adds
PARQUET:field_idmetadata to an Arrow field.