<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How do I download and unzip datasets from Kaggle into DBFS? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7254#M3173</link>
    <description>&lt;P&gt;Hi, You can refer to &lt;A href="https://docs.databricks.com/files/unzip-files.html" alt="https://docs.databricks.com/files/unzip-files.html" target="_blank"&gt;https://docs.databricks.com/files/unzip-files.html&lt;/A&gt;. You can curl the file you want and then it can be unzipped as mentioned in the doc. &lt;/P&gt;&lt;P&gt;Please let us know if this helps.&lt;/P&gt;&lt;P&gt;Also, please tag @Debayan with your next update which will notify me. &lt;/P&gt;</description>
    <pubDate>Thu, 23 Mar 2023 05:43:00 GMT</pubDate>
    <dc:creator>Debayan</dc:creator>
    <dc:date>2023-03-23T05:43:00Z</dc:date>
    <item>
      <title>How do I download and unzip datasets from Kaggle into DBFS?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7250#M3169</link>
      <description />
      <pubDate>Wed, 22 Mar 2023 14:30:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7250#M3169</guid>
      <dc:creator>StephanieAlba</dc:creator>
      <dc:date>2023-03-22T14:30:08Z</dc:date>
    </item>
    <item>
      <title>Re: How do I download and unzip datasets from Kaggle into DBFS?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7251#M3170</link>
      <description>&lt;P&gt;As shown &lt;A href="https://stackoverflow.com/a/75813525/1290485" alt="https://stackoverflow.com/a/75813525/1290485" target="_blank"&gt;here&lt;/A&gt; on StackOverflow&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;import opendatasets as od
&amp;nbsp;
od.download("https://www.kaggle.com/competitions/tlvmc-parkinsons-freezing-gait-prediction/data","/dbfs/FileStore/mypath/")&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;The output, when running this, shows first the zip being downloaded. Once the download is complete, it automatically extracts or unzips the files.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;Extracting archive /dbfs/FileStore/mypath/tlvmc-parkinsons-freezing-gait-prediction/tlvmc-parkinsons-freezing-gait-prediction.zip to /dbfs/FileStore/mypath/tlvmc-parkinsons-freezing-gait-prediction
&amp;nbsp;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 22 Mar 2023 14:32:26 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7251#M3170</guid>
      <dc:creator>StephanieAlba</dc:creator>
      <dc:date>2023-03-22T14:32:26Z</dc:date>
    </item>
    <item>
      <title>Re: How do I download and unzip datasets from Kaggle into DBFS?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7252#M3171</link>
      <description>&lt;P&gt;Hi @Stephanie Rivera​. In the Databricks notebook, you can handle this with either Python, Scala or bash. &lt;/P&gt;&lt;P&gt; I have not tried it but below should work as this command works in native shell. &lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;%sh curl some_url --output myfile.zip
&amp;nbsp;
%sh unzip myfile.zip -d "some directory"&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 22 Mar 2023 15:22:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7252#M3171</guid>
      <dc:creator>pvignesh92</dc:creator>
      <dc:date>2023-03-22T15:22:30Z</dc:date>
    </item>
    <item>
      <title>Re: How do I download and unzip datasets from Kaggle into DBFS?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7253#M3172</link>
      <description>&lt;P&gt;@Stephanie Rivera​&amp;nbsp;please download u r kaggle file and unzip, if it is less than 100mb, you can follow below steps to directly to upload &lt;A href="https://docs.databricks.com/ingestion/add-data/upload-data.html" target="test_blank"&gt;https://docs.databricks.com/ingestion/add-data/upload-data.html&lt;/A&gt; else please take below example and replace u r zip url and target &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;%sh curl &lt;A href="https://resources.lendingclub.com/LoanStats3a.csv.zip" target="test_blank"&gt;https://resources.lendingclub.com/LoanStats3a.csv.zip&lt;/A&gt; --output /tmp/LoanStats3a.csv.zip&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;unzip /tmp/LoanStats3a.csv.zip&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 22 Mar 2023 16:14:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7253#M3172</guid>
      <dc:creator>karthik_p</dc:creator>
      <dc:date>2023-03-22T16:14:10Z</dc:date>
    </item>
    <item>
      <title>Re: How do I download and unzip datasets from Kaggle into DBFS?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7254#M3173</link>
      <description>&lt;P&gt;Hi, You can refer to &lt;A href="https://docs.databricks.com/files/unzip-files.html" alt="https://docs.databricks.com/files/unzip-files.html" target="_blank"&gt;https://docs.databricks.com/files/unzip-files.html&lt;/A&gt;. You can curl the file you want and then it can be unzipped as mentioned in the doc. &lt;/P&gt;&lt;P&gt;Please let us know if this helps.&lt;/P&gt;&lt;P&gt;Also, please tag @Debayan with your next update which will notify me. &lt;/P&gt;</description>
      <pubDate>Thu, 23 Mar 2023 05:43:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-do-i-download-and-unzip-datasets-from-kaggle-into-dbfs/m-p/7254#M3173</guid>
      <dc:creator>Debayan</dc:creator>
      <dc:date>2023-03-23T05:43:00Z</dc:date>
    </item>
  </channel>
</rss>

