02-03-2023 01:04 PM
I have Jupyter Notebook installed on my machine and it works normally. I tested running a Spark application with the spark-submit command, but it returns a message saying the file was not found. What do I need to do to make it work?
Below is a file with a simple example.
from pyspark.sql import SparkSession
from pyspark.sql.functions import year

if __name__ == "__main__":
    spark = SparkSession.builder.appName("Exemplo").getOrCreate()

    arqschema = "id INT, nome STRING, status STRING, cidade STRING, vendas INT, data STRING"
    # Use a raw string so "\t" in the Windows path is not read as a tab character
    despachantes = spark.read.csv(r"C:\test-spark\despachantes.csv", header=False, schema=arqschema)

    # The schema names this column "data", not "date"
    calculo = despachantes.select("data").groupBy(year("data")).count()
    # format("console") is only valid for streaming writes; use show() for batch output
    calculo.show()

    spark.stop()
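One easy-to-miss cause of "file not found" on Windows is the path string itself: in a plain Python string literal, "\t" is the TAB escape, so "C:\test-spark\..." silently becomes a different path. A minimal sketch of the difference (the paths here are just the ones from the example above):

```python
# In a regular string literal, "\t" is interpreted as a TAB character.
plain = "C:\test-spark\despachantes.csv"
# A raw string (r"...") keeps the backslashes literally.
raw = r"C:\test-spark\despachantes.csv"

print("\t" in plain)   # the "plain" path contains a tab, not "\t"
print(plain == raw)    # the two strings are not the same path
```

Forward slashes ("C:/test-spark/despachantes.csv") also avoid the problem and work fine with Spark on Windows.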
02-06-2023 05:38 PM
I managed to resolve it. It was a Java and Python version incompatibility with the Spark version I was using.
I will create a video explaining how to use Spark without Jupyter Notebook.
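For anyone else who lands here: once the Java/Python versions match your Spark build, a script like the one above can be run directly from the command line. A minimal sketch, assuming the example is saved as despachantes_job.py (the filename is my assumption):

```shell
# Run the job on the local machine using all cores, no Jupyter needed.
# "despachantes_job.py" is a hypothetical name for the script above.
spark-submit --master local[*] despachantes_job.py
```

Passing the script path relative to the directory you run the command from (or as an absolute path) also avoids the "file not found" error when the path is simply wrong.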
02-05-2023 10:20 AM
Hi, I have not tested this in my lab yet, but could you please check and confirm whether this works: https://stackoverflow.com/questions/37861469/how-to-submit-spark-application-on-cmd