<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: SparkFiles - strange behavior on Azure databricks (runtime 10) in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33736#M24674</link>
    <description>&lt;P&gt;OK, it's a pity it can't be solved here. The ticket has been opened, though, and I'll post updates on its progress.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
    <pubDate>Tue, 14 Dec 2021 22:15:33 GMT</pubDate>
    <dc:creator>weldermartins</dc:creator>
    <dc:date>2021-12-14T22:15:33Z</dc:date>
    <item>
      <title>SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33725#M24663</link>
      <description>&lt;P&gt;When you use:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;from pyspark import SparkFiles
spark.sparkContext.addFile(url)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;the file is added to the non-DBFS path /local_disk0/, but when you then try to read it:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;spark.read.json(SparkFiles.get("file_name"))&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;Spark tries to read it from /dbfs/local_disk0/. I also tried file:// and many other creative approaches, and none of them work.&lt;/P&gt;&lt;P&gt;Of course, it works after using %sh cp to copy the file from /local_disk0/ to /dbfs/local_disk0/.&lt;/P&gt;&lt;P&gt;It looks like a bug, as if addFile was switched to DBFS on Azure Databricks but SparkFiles was not (in open-source Spark, files are added to and fetched from the workers).&lt;/P&gt;&lt;P&gt;I also couldn't find any setting to manually specify the root directory for SparkFiles.&lt;/P&gt;</description>
      <pubDate>Mon, 06 Dec 2021 17:29:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33725#M24663</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2021-12-06T17:29:41Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33726#M24664</link>
      <description>&lt;P&gt;Hello.&lt;/P&gt;&lt;P&gt;I'm in the same situation. Extracting data from an API with SparkFiles runs without error on Databricks Community Edition, but on Azure it produces the error described above.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Dec 2021 13:46:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33726#M24664</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2021-12-13T13:46:55Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33727#M24665</link>
      <description>&lt;P&gt;On Azure it produces the same error for me too.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Dec 2021 19:50:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33727#M24665</guid>
      <dc:creator>jorgeff</dc:creator>
      <dc:date>2021-12-13T19:50:21Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33728#M24666</link>
      <description>&lt;P&gt;@Kaniz Fatma @Piper Wilson, can you help escalate this issue? More people are reporting it.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Dec 2021 19:53:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33728#M24666</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2021-12-13T19:53:36Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33729#M24667</link>
      <description>&lt;P&gt;Hello everyone,&lt;/P&gt;&lt;P&gt;This problem is happening to me too, on Azure. Can somebody help us?&lt;/P&gt;</description>
      <pubDate>Mon, 13 Dec 2021 20:07:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33729#M24667</guid>
      <dc:creator>Marcos_Gois</dc:creator>
      <dc:date>2021-12-13T20:07:00Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33730#M24668</link>
      <description>&lt;P&gt;@Hubert Dudek​&amp;nbsp;- You got it!&lt;/P&gt;</description>
      <pubDate>Mon, 13 Dec 2021 21:43:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33730#M24668</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-12-13T21:43:29Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33731#M24669</link>
      <description>&lt;P&gt;Hi, I'm new here and I have a question. Will the bug only be addressed if the thread gets votes, comments and views?&lt;/P&gt;</description>
      <pubDate>Tue, 14 Dec 2021 11:59:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33731#M24669</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2021-12-14T11:59:57Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33732#M24670</link>
      <description>&lt;P&gt;Someone should get back to us.&lt;/P&gt;</description>
      <pubDate>Tue, 14 Dec 2021 12:52:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33732#M24670</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2021-12-14T12:52:54Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33733#M24671</link>
      <description>&lt;P&gt;&lt;A href="https://community.databricks.com/s/profile/0053f000000tvpbAAA" alt="https://community.databricks.com/s/profile/0053f000000tvpbAAA" target="_blank"&gt;@Kaniz Fatma&lt;/A&gt;&amp;nbsp;(Databricks)&amp;nbsp;&lt;A href="https://community.databricks.com/s/profile/0053f000000WG7QAAW" alt="https://community.databricks.com/s/profile/0053f000000WG7QAAW" target="_blank"&gt;@Piper&lt;/A&gt;&amp;nbsp;(Customer)&lt;/P&gt;&lt;P&gt;Hi, how are you?&lt;/P&gt;&lt;P&gt;Is there a solution for this problem?&lt;/P&gt;</description>
      <pubDate>Tue, 14 Dec 2021 19:34:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33733#M24671</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2021-12-14T19:34:52Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33734#M24672</link>
      <description>&lt;P&gt;@Prabakar Ammeappin @Werner Stinckens @Jose Gonzalez, maybe you could take a look at this issue as well &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 14 Dec 2021 20:05:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33734#M24672</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2021-12-14T20:05:12Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33735#M24673</link>
      <description>&lt;P&gt;@Hubert Dudek​, @Dev John​, @Marcos Gois​, @Jorge Fernandes​, and @welder martins​&amp;nbsp;- Are you able to open a support ticket here - &lt;A href="https://help.databricks.com/s/contact-us" target="test_blank"&gt;https://help.databricks.com/s/contact-us&lt;/A&gt;?&lt;/P&gt;</description>
      <pubDate>Tue, 14 Dec 2021 20:59:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33735#M24673</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-12-14T20:59:01Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33736#M24674</link>
      <description>&lt;P&gt;OK, it's a pity it can't be solved here. The ticket has been opened, though, and I'll post updates on its progress.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Tue, 14 Dec 2021 22:15:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33736#M24674</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2021-12-14T22:15:33Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33737#M24675</link>
      <description>&lt;P&gt;@welder martins​&amp;nbsp;- Thank you for opening the ticket. We want to cover all our bases. &lt;/P&gt;</description>
      <pubDate>Wed, 15 Dec 2021 18:54:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33737#M24675</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-12-15T18:54:07Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33739#M24677</link>
      <description>&lt;P&gt;Yes, this solution was already discussed on Stack Overflow. The problem is that this Spark functionality should be adapted in DBR to handle everything automatically via DBFS, and it seems to have been adapted only partly.&lt;/P&gt;</description>
      <pubDate>Tue, 28 Dec 2021 10:53:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33739#M24677</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2021-12-28T10:53:34Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33741#M24679</link>
      <description>&lt;P&gt;Hello everyone, any news?&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;&lt;P&gt;@Kaniz Fatma​&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jan 2022 11:49:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33741#M24679</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2022-01-24T11:49:11Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33743#M24681</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma, Ticket Number: #00125834.&lt;/P&gt;&lt;P&gt;It's been over a month since the ticket was opened, and there is still no response.&lt;/P&gt;&lt;P&gt;I just tested it with Apache Spark 3.2.0 on Azure and the behavior is the same: it fails with the message "File not found". On community.cloud.databricks, however, the path is found and the read returns the expected result.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jan 2022 12:42:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33743#M24681</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2022-01-24T12:42:49Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33745#M24683</link>
      <description>&lt;PRE&gt;&lt;CODE&gt;from pyspark import SparkFiles

# IBGE API endpoint for municipalities (read below as JSON)
municipios = "https://servicodados.ibge.gov.br/api/v1/localidades/municipios"
# Register the URL so Spark downloads the file for this job
spark.sparkContext.addFile(municipios)

# "OVERRIDE" is not a recognized JSON parse mode; "PERMISSIVE" is the default one
municipiosDF = spark.read.option("multiLine", True).option("mode", "PERMISSIVE").json("file://" + SparkFiles.get("municipios"))&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;I didn't understand.&lt;/P&gt;&lt;P&gt;Could you please change the code above according to your instructions? @Kaniz Fatma&lt;/P&gt;&lt;P&gt;Regards,&lt;/P&gt;&lt;P&gt;Welder Martins&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jan 2022 13:14:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33745#M24683</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2022-01-24T13:14:38Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33748#M24686</link>
      <description>&lt;P&gt;Hi @Kaniz Fatma (Databricks), it ran without errors. The problem is that SparkFiles doesn't work on the Azure platform. I'm extracting the data from the API by other means. For now I'm using urllib as a stopgap. I'd also rather not rely on RDDs, since the DataFrame API is the recommended one in Spark 3.x.&lt;/P&gt;&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jan 2022 16:55:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33748#M24686</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2022-01-24T16:55:24Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33749#M24687</link>
      <description>&lt;P&gt;@Kaniz Fatma Hi, do you have access to the tickets opened with Databricks? The ticket was opened in December 2021 and so far there has been no word on a timeline. Thanks.&lt;/P&gt;</description>
      <pubDate>Tue, 25 Jan 2022 11:39:51 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33749#M24687</guid>
      <dc:creator>weldermartins</dc:creator>
      <dc:date>2022-01-25T11:39:51Z</dc:date>
    </item>
    <item>
      <title>Re: SparkFiles - strange behavior on Azure databricks (runtime 10)</title>
      <link>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33751#M24689</link>
      <description>&lt;P&gt;@Hubert Dudek&lt;/P&gt;&lt;P&gt;Have you tried with file:/// ?&lt;/P&gt;&lt;P&gt;As I recall, starting with Spark 3.2, it honors the native Hadoop file system if no file access protocol is specified.&lt;/P&gt;</description>
      <pubDate>Mon, 14 Mar 2022 15:20:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/sparkfiles-strange-behavior-on-azure-databricks-runtime-10/m-p/33751#M24689</guid>
      <dc:creator>User16764241763</dc:creator>
      <dc:date>2022-03-14T15:20:53Z</dc:date>
    </item>
  </channel>
</rss>

