Hello, I've searched around for a while and didn't find a similar question here or elsewhere, so I thought I'd ask...
I'm assessing the storage/access efficiency of Struct-type columns in Delta tables, and I'd like to know more about how Databricks stores the individual fields of a Struct. Can an SME add some details?
Example question I'm looking at: suppose I add an int field with low cardinality to a Struct column. As a top-level column in a columnar database, I believe this would be stored/accessed efficiently. Would it also be stored/accessed efficiently as a field inside a Struct column?
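To make clear what I mean by "stored efficiently", here is a toy sketch of my mental model of a columnar layout. It is plain Python, not Databricks or Delta code, and the row data, the column name `s`, the field name `status`, and both helper functions are made up for illustration. It splits a struct into one value list per field and run-length encodes the low-cardinality one. Whether Delta tables actually do something like this for Struct fields is exactly what I'm asking.

```python
from itertools import groupby

# Hypothetical rows: "s" is a struct column, "status" is a low-cardinality int field.
rows = [
    {"id": 1, "s": {"name": "a", "status": 0}},
    {"id": 2, "s": {"name": "b", "status": 0}},
    {"id": 3, "s": {"name": "c", "status": 0}},
    {"id": 4, "s": {"name": "d", "status": 1}},
    {"id": 5, "s": {"name": "e", "status": 1}},
    {"id": 6, "s": {"name": "f", "status": 0}},
]


def shred(rows, struct_col):
    """Split one struct column into a separate value list per field."""
    columns = {}
    for row in rows:
        for field, value in row[struct_col].items():
            columns.setdefault(f"{struct_col}.{field}", []).append(value)
    return columns


def run_length_encode(values):
    """Collapse consecutive repeats into (value, run_length) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


columns = shred(rows, "s")
encoded_status = run_length_encode(columns["s.status"])

print(columns["s.status"])  # one flat list holding only the status values
print(encoded_status)       # the same values as (value, run_length) pairs
```

If Struct fields are laid out this way, a query that reads only `s.status` would not need to touch `s.name` at all, which is the access pattern I'm hoping for.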
Note: I did find a Databricks page that (maybe) describes how Apache Arrow is used in Databricks Runtime 14+ (link below), but it discusses Arrow in the context of UDFs. I am using Structs in vanilla Delta tables and figured that was a significantly different case.
- https://www.databricks.com/blog/arrow-optimized-python-udfs-apache-sparktm-35#:~:text=In%20Apache%20...