How can I change the parquet compression algorithm from gzip to something else?
Spark, by default, uses gzip to store parquet files. I would like to change the compression algorithm from gzip to snappy or lz4.
- 18179 Views
- 9 replies
- 1 kudos
- 1 kudos
spark.sql("set spark.sql.parquet.compression.codec=gzip");
- 1 kudos