Spark SQL output multiple small files
We are having multiple joins involving a large table (about 500gb in size). The output of the joins is stored into multiple small files each of size 800kb-1.5mb. Because of this the job is split into multiple tasks and taking a long time to complete....
- 2119 Views
- 2 replies
- 0 kudos
Latest Reply
Hi @Arun Balaji​ , Could you please provide the error message you are receiving?
- 0 kudos