<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Exporting data from databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29924#M21615</link>
    <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I get "file is too large" in some of the cells. In the others, there is no "download" option showing up at the bottom of the cells. (I have a json file)&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 09 Sep 2020 13:34:32 GMT</pubDate>
    <dc:creator>KR87</dc:creator>
    <dc:date>2020-09-09T13:34:32Z</dc:date>
    <item>
      <title>Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29908#M21599</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I couldn't find a way in the documentation to export an RDD as a text file to a local folder using Python. Is it possible?&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Feb 2016 15:45:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29908#M21599</guid>
      <dc:creator>DimitrisMpizos</dc:creator>
      <dc:date>2016-02-08T15:45:52Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29909#M21600</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Sounds like you're looking for saveAsTextFile().&lt;/P&gt;
&lt;P&gt;Refer to the documentation here:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://spark.apache.org/docs/latest/api/python/pyspark.html#pyspark.RDD.saveAsTextFile" target="test_blank"&gt;https://spark.apache.org/docs/latest/api/python/pyspark.html#pyspark.RDD.saveAsTextFile&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Feb 2016 19:20:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29909#M21600</guid>
      <dc:creator>raela</dc:creator>
      <dc:date>2016-02-08T19:20:41Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29910#M21601</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;You can save a data file to the FileStore and then retrieve it from your shard. See the Databricks Guide -&amp;gt; Product Overview -&amp;gt; FileStore.&lt;/P&gt;
&lt;P&gt;You can access the files saved there by going to: &lt;/P&gt;
&lt;P&gt;&lt;A href="https://xxxxx.cloud.databricks.com/files/folder/specific_file" target="test_blank"&gt;https://xxxxx.cloud.databricks.com/files/folder/specific_file&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 08 Feb 2016 22:33:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29910#M21601</guid>
      <dc:creator>miklos</dc:creator>
      <dc:date>2016-02-08T22:33:18Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29911#M21602</link>
      <description>&lt;P&gt;Hello all, hello @Miklos_C,&lt;/P&gt;&lt;P&gt;How do you access the files? I don't understand what the "xxxxx" in the URL "&lt;A href="https://xxxxx.cloud.databricks.com/files/folder/specific_file" alt="https://xxxxx.cloud.databricks.com/files/folder/specific_file" target="_blank"&gt;https://xxxxx.cloud.databricks.com/files/folder/specific_file&lt;/A&gt;" stands for. Do we need to replace it with the 16-digit sequence that identifies us? With our username or email address? I'm using Databricks Community Edition; do I need to put the string "community" somewhere?&lt;/P&gt;&lt;P&gt;Thank you for your answer.&lt;/P&gt;&lt;P&gt;Have a nice day.&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2016 14:23:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29911#M21602</guid>
      <dc:creator>lefish</dc:creator>
      <dc:date>2016-03-31T14:23:00Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29912#M21603</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Do you know what xxxxx is in the URL? I have the same problem.&lt;/P&gt;
&lt;P&gt;Regards&lt;/P&gt;
&lt;P&gt;Diego &lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Oct 2016 12:28:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29912#M21603</guid>
      <dc:creator>gachet</dc:creator>
      <dc:date>2016-10-13T12:28:47Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29913#M21604</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Simply replace it with "community".&lt;/P&gt;&lt;P&gt;Thibault&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 13 Oct 2016 13:45:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29913#M21604</guid>
      <dc:creator>lefish</dc:creator>
      <dc:date>2016-10-13T13:45:44Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29914#M21605</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Thanks a lot. It works!&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 19 Oct 2016 20:22:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29914#M21605</guid>
      <dc:creator>gachet</dc:creator>
      <dc:date>2016-10-19T20:22:52Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29915#M21606</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Do you know how to access the FileStore with wget?&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 16:08:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29915#M21606</guid>
      <dc:creator>gachet</dc:creator>
      <dc:date>2016-10-20T16:08:28Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29916#M21607</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Absolutely not... &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 16:35:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29916#M21607</guid>
      <dc:creator>lefish</dc:creator>
      <dc:date>2016-10-20T16:35:43Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29917#M21608</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;This won't work because you'd have to authenticate with Databricks in order to download the file. The FileStore is suitable for things like loading JavaScript libraries, but not for extracting data from Databricks. To download data, you should connect to Amazon S3 or use the DBFS API.&lt;/P&gt;
&lt;P&gt;&lt;A href="http://docs.databricks.com/spark/latest/data-sources/amazon-s3.html" target="test_blank"&gt;http://docs.databricks.com/spark/latest/data-sources/amazon-s3.html&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="http://docs.databricks.com/api/latest/dbfs.html" target="test_blank"&gt;http://docs.databricks.com/api/latest/dbfs.html&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 20 Oct 2016 18:25:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29917#M21608</guid>
      <dc:creator>Bill_Chambers</dc:creator>
      <dc:date>2016-10-20T18:25:02Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29918#M21609</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Thank you for the answer.&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 24 Oct 2016 19:58:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29918#M21609</guid>
      <dc:creator>gachet</dc:creator>
      <dc:date>2016-10-24T19:58:23Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29919#M21610</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I use S3 as an intermediary:&lt;/P&gt;
&lt;PRE&gt;&lt;CODE&gt;rdd.saveAsTextFile("s3a://...")&lt;/CODE&gt;&lt;/PRE&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 31 May 2017 18:51:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29919#M21610</guid>
      <dc:creator>grfiv</dc:creator>
      <dc:date>2017-05-31T18:51:10Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29920#M21611</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;What is the code to save a data object to the FileStore? I have a list object with JSON elements that I want to save to local disk, but I am unable to do so.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 09 Jun 2017 15:08:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29920#M21611</guid>
      <dc:creator>nassir_m</dc:creator>
      <dc:date>2017-06-09T15:08:27Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29921#M21612</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;There should really be a way to simply export a file to your desktop.&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 04 Feb 2018 16:51:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29921#M21612</guid>
      <dc:creator>PaulLintilhac</dc:creator>
      <dc:date>2018-02-04T16:51:38Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29922#M21613</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Goal: &lt;B&gt;Export a file to your local desktop&lt;/B&gt;&lt;/P&gt;
&lt;P&gt;Workaround: &lt;B&gt;Basically, you have to use "Create table in Notebook" with DBFS.&lt;/B&gt;&lt;/P&gt;
&lt;P&gt;The steps are:&lt;/P&gt;
&lt;OL&gt;&lt;LI&gt;Click the "Data" icon.&lt;/LI&gt;&lt;LI&gt;Click the "Add Data" button.&lt;/LI&gt;&lt;LI&gt;Click the "DBFS" button.&lt;/LI&gt;&lt;LI&gt;In the 1st pane, "Select a file from DBFS", click the "FileStore" folder icon.&lt;/LI&gt;&lt;LI&gt;In the 2nd pane, scroll down to locate the "tables" folder icon.&lt;/LI&gt;&lt;LI&gt;Click to select the "tables" folder icon.&lt;/LI&gt;&lt;LI&gt;In the 3rd pane, click to select the "FileStore" folder icon.&lt;/LI&gt;&lt;LI&gt;In the 4th pane, click to select the "tables" folder icon.&lt;/LI&gt;&lt;LI&gt;In the 5th pane, locate your file (mine was a .csv) and click to select it.&lt;/LI&gt;&lt;LI&gt;Click the "Create table in Notebook" button.&lt;/LI&gt;&lt;LI&gt;This will create a notebook.&lt;/LI&gt;&lt;LI&gt;Click "Run All" to run the notebook (create a cluster, if needed).&lt;/LI&gt;&lt;LI&gt;Click "OK" when prompted to "Attach and Run".&lt;/LI&gt;&lt;LI&gt;A download icon will appear in a couple of places; click it to download the file to your local machine.&lt;/LI&gt;&lt;/OL&gt;
&lt;P&gt;&lt;I&gt;(Tweak this notebook to download different files)&lt;/I&gt;&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 25 Mar 2019 15:18:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29922#M21613</guid>
      <dc:creator>Manu1</dc:creator>
      <dc:date>2019-03-25T15:18:04Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29923#M21614</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Any suggestions for automating this process without human interaction? Where can I get directories for a local machine or network drive?&lt;/P&gt;
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 18 Mar 2020 18:28:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29923#M21614</guid>
      <dc:creator>KR87</dc:creator>
      <dc:date>2020-03-18T18:28:13Z</dc:date>
    </item>
    <item>
      <title>Re: Exporting data from databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29924#M21615</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I get "file is too large" in some of the cells. In the others, there is no "download" option showing up at the bottom of the cells. (I have a json file)&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Sep 2020 13:34:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/exporting-data-from-databricks/m-p/29924#M21615</guid>
      <dc:creator>KR87</dc:creator>
      <dc:date>2020-09-09T13:34:32Z</dc:date>
    </item>
  </channel>
</rss>

