Description
Feature Request / Improvement
I am working on extending Apache Iceberg supported data types with a new complex type: 'ProtctedType'.
This new data type internally is a StructType including a header and a payload.
The Header to include at minimum:
- Encryption Provider ID
- Encryption Key ID
- Data Type
The payload to include the encrypted data as BinaryType.
The goal is to allow end user transparent interaction with the new type, allowing operations between encrypted data items, and clear text.
Further more, allow extension of puffin files to store aggregate data based on the clear text values, bloom filters, and optionally inverted index for gerex search without a full table scan.
Looking for guidance on how such data type can be introduced and what are the dependencies I would need to address with the various readers and writers.