01-18-2023 08:57 AM
I have a query that I'm trying to insert overwrite into a table. In an effort to speed up the query I added a range join hint, and after adding it I started getting the error below. I can work around this by creating a temporary view of the same query and caching that temporary view; it will then let me insert overwrite into the table.
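Roughly, the pattern that works for me looks like this (a minimal sketch: the table names, join condition, and hint bin size are placeholders, not my actual query):

-- Direct INSERT OVERWRITE with the range join hint fails, so stage the
-- query as a cached temporary view first.
CREATE OR REPLACE TEMPORARY VIEW staged_result AS
SELECT /*+ RANGE_JOIN(r, 10) */ p.*, r.label
FROM points p
JOIN ranges r
  ON p.ts BETWEEN r.start_ts AND r.end_ts;

CACHE TABLE staged_result;

-- Inserting from the cached view then succeeds.
INSERT OVERWRITE TABLE target_table
SELECT * FROM staged_result;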
01-18-2023 10:50 PM
Could you please check the data types of your source and target tables?
There might be a mismatch between them.
01-19-2023 05:57 AM
Both are the same.
01-21-2025 06:59 AM
Hi, can you help me with this?
I am using this query to create a CSV in a volume named test_volsrr that I created:
INSERT OVERWRITE DIRECTORY '/Volumes/DATAMAX_DATABRICKS/staging/test_volsrr'
USING CSV OPTIONS ('delimiter' = ',', 'header' = 'true')
SELECT * FROM staging.extract1gb
DISTRIBUTE BY COALESCE(1);
I added DISTRIBUTE BY COALESCE(1) so that a single CSV gets generated instead of multiple CSVs. The extract1gb table is about 1 GB, but the CSV being created is around 230 GB, and because of this the query takes more than an hour to execute. Can someone please explain this issue and a solution to generate a CSV of optimal size so that execution becomes faster? I don't want to use PySpark.
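(My understanding is that COALESCE(1) with a single argument just evaluates to the literal 1, so the query above should be equivalent to the following, assuming Spark SQL semantics where a constant distribute key hashes every row to the same shuffle partition, hence one output file:)

-- A constant distribute key sends all rows to one shuffle partition,
-- so the writer emits exactly one CSV part file.
INSERT OVERWRITE DIRECTORY '/Volumes/DATAMAX_DATABRICKS/staging/test_volsrr'
USING CSV OPTIONS ('delimiter' = ',', 'header' = 'true')
SELECT * FROM staging.extract1gb
DISTRIBUTE BY 1;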
01-30-2023 02:11 PM
Could you please share your code and the full error stack trace? Check the driver logs for the stack trace.