Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Hi @Oscar Dyremyhr , PIVOT doesn't support two FOR clauses. You can PIVOT either on month or on quarter.https://docs.databricks.com/sql/language-manual/sql-ref-syntax-qry-select-pivot.html
I'm using PySpark on Databricks and trying to pivot a 27753444 X 3 matrix.
If I do it in Spark DataFrame:
df = df.groupBy("A").pivot("B").avg("C")
it takes forever (after 2 hours and I canceled it).
If I convert it to pandas dataframe and then pivo...
Hi @Raymond_Hu,This means that the driver crashed because of an OOM (Out of memory) exception and after that, it's not able to establish a new connection with the driver. Please try below optionsTry increasing driver-side memory and then retry.You ca...