Topics with Label: Spark Driver Crash

by oriole • New Contributor III

03-19-2023 12:35:30 PM

9048 Views
5 replies
2 kudos

Resolved! Spark Driver Crash Writing Large Text

I'm working with a large text variable, converting it into single-line JSON that Spark can process well. I'm using a single-node 256 GB, 32-core Standard_E32d_v4 "cluster", which should be plenty of memory for this dataset (haven't seen cluster memory u...

Data Engineering

Latest Reply

pvignesh92
Honored Contributor

03-20-2023 8:46:30 AM

2 kudos

@David Toft Hi, the current implementation of dbutils.fs is single-threaded: it performs the initial listing on the driver and subsequently launches a Spark job to perform the per-file operations. So I guess the put operation is running on a single cor...
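
To illustrate the point about everything going through a single call on the driver, here is a minimal sketch of one possible alternative: writing the large string with ordinary buffered file I/O in fixed-size slices instead of passing it whole to one put call. This is not the solution from the thread. The helper name `write_text_in_chunks`, the slice size, and the output path are all hypothetical, and it assumes the destination is reachable as a normal local file path from the driver. It is still single-threaded; it only avoids handing the entire string to one call.

```python
import os
import tempfile


def write_text_in_chunks(text: str, path: str, chunk_chars: int = 1_000_000) -> int:
    """Write `text` to `path` one slice at a time; return the characters written.

    Hypothetical helper for illustration only. `path` is assumed to be a
    regular file path that the driver process can open for writing.
    """
    written = 0
    with open(path, "w", encoding="utf-8") as handle:
        # Slice the string so each write call handles a bounded amount of data.
        for start in range(0, len(text), chunk_chars):
            written += handle.write(text[start:start + chunk_chars])
    return written


if __name__ == "__main__":
    # Stand-in for the large single-line JSON text; the real data is not shown.
    sample = '{"payload": "' + "x" * 5_000 + '"}\n'
    with tempfile.TemporaryDirectory() as tmp_dir:
        target = os.path.join(tmp_dir, "large_text.json")  # hypothetical path
        count = write_text_in_chunks(sample, target, chunk_chars=1_024)
        print(f"wrote {count} characters to {target}")
```

Whether this helps in a given workspace depends on how the storage is mounted, so treat it as something to try rather than a confirmed fix.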

Databricks Community
