We are trying to create a DELTA table (CTAS statement) from 2 TB PARQUET file and its taking huge amount of time around 12~ hrs.
is it normal.? What are option to tune/optimize this ? are we doing anything wrong
Cluster : Interactive/30 Cores / 320 GB Memory / 4 workers