weldermartins
Honored Contributor

Hi @Kaniz Fatma​, Ticket Number: #00125834.

It's been over a month since the ticket was opened, but still no response.

I just tested with Apache Spark 3.2.0 on the Azure platform, and the behavior is the same: it fails with the message "File not found". On community.cloud.databricks, however, the path is found and the expected result is returned.

weldermartins
Honored Contributor
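from pyspark import SparkFiles

# URL of the IBGE municipalities endpoint
municipios = "https://servicodados.ibge.gov.br/api/v1/localidades/municipios"
# Register the URL so the file is downloaded on every node
spark.sparkContext.addFile(municipios)

# Read the downloaded copy back through an explicit file:// path
municipiosDF = spark.read.option("multiLine", True).option("mode", "OVERRIDE").json("file://" + SparkFiles.get("municipios"))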

I did not understand.

Could you please change the code above as you suggested, @Kaniz Fatma?

Regards,

Welder Martins

weldermartins
Honored Contributor

Hi @Kaniz Fatma (Databricks), it ran without errors. The problem is that SparkFiles doesn't work on the Azure platform. For now I'm extracting the data from the API by other means, using urllib as a workaround. RDD will be deprecated as of Apache Spark version 3.0.

Thanks.
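For anyone following along, the urllib workaround mentioned above might look roughly like the sketch below. The helper name `fetch_json` and the timeout value are my own assumptions, not something from the ticket; it only downloads and parses the payload on the driver and does not touch SparkFiles at all.

```python
import json
import urllib.request


def fetch_json(url: str, timeout: float = 30.0):
    """Download `url` and parse the response body as JSON (hypothetical helper)."""
    # urlopen returns a file-like response object; read() gives the raw bytes
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    return json.loads(body)


# Example call (needs network access, so it is left commented out):
# records = fetch_json("https://servicodados.ibge.gov.br/api/v1/localidades/municipios")
```

The parsed records could then be handed to Spark, for example with `spark.createDataFrame` for flat records. Nested fields may need an explicit schema, so treat that step as something to verify in your own workspace.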

weldermartins
Honored Contributor

@Kaniz Fatma, hi. Do you have access to the tickets opened with Databricks? The ticket was opened in December 2021, and so far there has been no word on a timeline. Thanks.

User16764241763
Databricks Employee

@Hubert Dudek​ 

Have you tried with file:/// ?

As I recall, starting with Spark 3.2, it uses the native Hadoop file system when no file access scheme is specified.
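A minimal sketch of what I mean, assuming the file was registered with `addFile` as in the earlier post. The helper names `to_file_uri` and `read_added_json` are hypothetical, and I have not verified this on every runtime or cluster layout, so take it as a starting point rather than a confirmed fix.

```python
from pathlib import PurePosixPath


def to_file_uri(local_path: str) -> str:
    """Turn an absolute local path into an explicit file:/// URI."""
    # as_uri() raises ValueError for relative paths, which surfaces mistakes early
    return PurePosixPath(local_path).as_uri()


def read_added_json(spark, url: str, name: str):
    """Register `url` with addFile, then read the local copy with an explicit scheme."""
    # Imported lazily so the helper above can be used without a Spark installation
    from pyspark import SparkFiles

    spark.sparkContext.addFile(url)
    local_path = SparkFiles.get(name)  # absolute path of the downloaded copy
    return spark.read.option("multiLine", True).json(to_file_uri(local_path))
```

With the explicit `file:///` prefix, the path should be resolved against the local file system instead of whatever default file system the cluster is configured with.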


Hi, that was a few months ago. I need to check it again with a newer Databricks Runtime.


My blog: https://databrickster.medium.com/

Hubert-Dudek
Databricks MVP

I confirm that, as @Arvind Ravish said, adding file:/// solves the problem.

image.png



Hey,

But will this allocated path change? It should work the same way it does on the community platform. Thanks for the feedback, though.

I polished the syntax a bit (image attached).

