I have Jupyter Notebook installed on my machine, working normally. I tested running a Spark application with the spark-submit command, but it returns a message that the file was not found. What do I need to do to make it work? Below is a file ...
Hi, I have not tested this in my lab yet, but could you please check and confirm whether this works: https://stackoverflow.com/questions/37861469/how-to-submit-spark-application-on-cmd
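For reference, the usual cause of that message is that spark-submit cannot resolve the path to the application script. A minimal sketch of the kind of invocation the linked answer discusses, with a hypothetical script path (quote it if it contains spaces):

```
spark-submit --master local[2] "C:\path\to\wordcount.py"
```

If the script lives in the current working directory, a bare filename like `spark-submit wordcount.py` should also resolve.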
Issue: I am attempting to create a PySpark job via the Databricks UI (with spark-submit) using the parameters below (the dependencies are in the PEX file), but I am getting an exception that the PEX file does not exist. It's my understanding that the -...
Hi, I'm facing the same issue trying to execute a PySpark job with spark-submit. I have explored the same solutions as you: the --files option, spark.pyspark.driver.python, and spark.executorEnv.PEX_ROOT. Have you made any progress in resolving the problem?
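Not verified on Databricks, but for reference this is the shape of the PEX setup those options come from: ship the PEX file to the executors with --files and point the driver and executor Python interpreters at it (all paths here are hypothetical):

```
spark-submit \
  --files /dbfs/FileStore/deps/myjob.pex \
  --conf spark.pyspark.driver.python=./myjob.pex \
  --conf spark.pyspark.python=./myjob.pex \
  --conf spark.executorEnv.PEX_ROOT=./.pex \
  main.py
```

A "file does not exist" error at submit time typically means the path given to --files is not visible from wherever the driver process runs.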
I have created a Databricks workspace in Azure and a cluster for Python 3. I am creating a job using spark-submit parameters. How do I specify multiple files in --py-files in the spark-submit command for a Databricks job? All the files to be specified in ...
Hi @Nandha Kumar, please go through the docs below on passing Python files to a job: https://docs.databricks.com/dev-tools/api/latest/jobs.html#sparkpythontask
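One detail that often trips people up: in spark-submit, --py-files takes a single comma-separated list of .py, .zip, or .egg files, not a repeated flag. A minimal sketch with hypothetical DBFS paths:

```
spark-submit \
  --py-files dbfs:/FileStore/code/utils.py,dbfs:/FileStore/code/helpers.zip \
  dbfs:/FileStore/code/main.py
```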
I want to add a few custom JARs to the Spark conf. Typically they would be submitted along with the spark-submit command, but in a Databricks notebook the Spark session is already initialized. So, I want to set the JARs in the "spark.jars" property in the...
Hi @dbansal, install the libraries/JARs while initialising the cluster. Please go through the documentation on this below: https://docs.databricks.com/libraries.html#upload-a-jar-python-egg-or-python-wheel
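For contrast, here is a minimal sketch of how spark.jars is normally set when you control session creation yourself (e.g., plain PySpark outside Databricks); on Databricks the session is pre-initialized, so the cluster-library route above is the way to go (the JAR path is hypothetical):

```python
from pyspark.sql import SparkSession

# spark.jars must be set before the SparkSession (and its JVM) starts;
# setting it on an already-running session has no effect.
spark = (
    SparkSession.builder
    .appName("custom-jars-demo")
    .config("spark.jars", "/path/to/custom-lib.jar")  # hypothetical local path
    .getOrCreate()
)
```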