<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Local Development on Databricks in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33915#M24816</link>
    <description>&lt;P&gt;I use it for local development of our libraries. It works fine, but I have not yet used it to submit to clusters.&lt;/P&gt;</description>
    <pubDate>Tue, 23 Aug 2022 07:32:04 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2022-08-23T07:32:04Z</dc:date>
    <item>
      <title>Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33914#M24815</link>
      <description>&lt;P&gt;&lt;B&gt;How to Develop Locally on Databricks with your Favorite IDE&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://dbx.readthedocs.io/en/latest/" alt="https://dbx.readthedocs.io/en/latest/" target="_blank"&gt;&lt;U&gt;dbx&lt;/U&gt;&lt;/A&gt; is a Databricks Labs project that lets you develop code locally and then submit it to Databricks interactive and job compute clusters from your favorite local IDE, such as VS Code, PyCharm, IntelliJ, or Eclipse (&lt;A href="https://docs.databricks.com/dev-tools/dbx.html?_ga=2.111066757.1834991441.1661144247-1468227001.1645719051" alt="https://docs.databricks.com/dev-tools/dbx.html?_ga=2.111066757.1834991441.1661144247-1468227001.1645719051" target="_blank"&gt;&lt;U&gt;AWS&lt;/U&gt;&lt;/A&gt; | &lt;A href="https://docs.microsoft.com/en-us/azure/databricks/dev-tools/dbx" alt="https://docs.microsoft.com/en-us/azure/databricks/dev-tools/dbx" target="_blank"&gt;&lt;U&gt;Azure&lt;/U&gt;&lt;/A&gt; | &lt;A href="https://docs.gcp.databricks.com/dev-tools/dbx.html" alt="https://docs.gcp.databricks.com/dev-tools/dbx.html" target="_blank"&gt;&lt;U&gt;GCP&lt;/U&gt;&lt;/A&gt;).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;dbx is an extension of the Databricks CLI. It makes it easy to manage multiple execution environments and deployment configurations, and it provides pre-built templates for integration with popular CI tools such as GitHub Actions, Azure DevOps, and GitLab.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Databricks has an &lt;A href="https://docs.databricks.com/dev-tools/vscode-ext.html" alt="https://docs.databricks.com/dev-tools/vscode-ext.html" target="_blank"&gt;official extension for VS Code&lt;/A&gt; for executing locally written code against jobs or all-purpose clusters. In addition, there is an official &lt;A href="https://docs.databricks.com/dev-tools/sqltools-driver.html" alt="https://docs.databricks.com/dev-tools/sqltools-driver.html" target="_blank"&gt;Databricks Driver for SQLTools in VS Code&lt;/A&gt; to browse SQL objects and run SQL queries in Databricks workspaces from VS Code.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Let us know in the comments if you have had a chance to test out dbx or our VS Code plugins for local IDE development!&lt;/P&gt;</description>
      <pubDate>Tue, 23 Aug 2022 07:17:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33914#M24815</guid>
      <dc:creator>isaac_gritz</dc:creator>
      <dc:date>2022-08-23T07:17:30Z</dc:date>
    </item>
    <item>
      <title>Re: Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33915#M24816</link>
      <description>&lt;P&gt;I use it for local development of our libraries. It works fine, but I have not yet used it to submit to clusters.&lt;/P&gt;</description>
      <pubDate>Tue, 23 Aug 2022 07:32:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33915#M24816</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-08-23T07:32:04Z</dc:date>
    </item>
    <item>
      <title>Re: Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33916#M24817</link>
      <description>&lt;P&gt;I've found "dbx" really interesting. In particular, it makes interactions with Databricks from the local environment very smooth. I love how the dbx documentation describes the entire development process: it's the first time I have seen support for good engineering practices in the development phase.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;There's one use case that dbx is not helping me with. I would like to develop a model locally using only pyspark while accessing the data on dbfs. I'm willing to use "dbutils", but to run it locally I need 'databricks-connect', which doesn't support Databricks Runtime 11, the latest one. Is there any other way to use dbutils locally?&lt;/P&gt;</description>
      <pubDate>Tue, 30 Aug 2022 14:58:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33916#M24817</guid>
      <dc:creator>matiasm</dc:creator>
      <dc:date>2022-08-30T14:58:33Z</dc:date>
    </item>
    <item>
      <title>Re: Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33917#M24818</link>
      <description>&lt;P&gt;Hi @Matias Marenchino, unfortunately you cannot run dbutils locally, but you can use dbx execute against an interactive cluster for a more interactive development experience.&lt;/P&gt;</description>
      <pubDate>Wed, 31 Aug 2022 16:56:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33917#M24818</guid>
      <dc:creator>isaac_gritz</dc:creator>
      <dc:date>2022-08-31T16:56:49Z</dc:date>
    </item>
    <item>
      <title>Re: Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33918#M24819</link>
      <description>&lt;P&gt;dbx is great for deployment, but hopefully Spark Connect will be released as soon as possible.&lt;/P&gt;</description>
      <pubDate>Fri, 18 Nov 2022 14:51:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33918#M24819</guid>
      <dc:creator>xiangzhu</dc:creator>
      <dc:date>2022-11-18T14:51:07Z</dc:date>
    </item>
    <item>
      <title>Re: Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33919#M24820</link>
      <description>&lt;P&gt;We could build a helper function that detects whether we are running on generic PySpark or on a Databricks cluster. That way, when the Databricks dbutils aren't available, we'd have a stand-in that would allow us to work disconnected until our code is ready to deploy to a cluster. @Isaac Gritz&lt;/P&gt;</description>
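      <content:encoded><![CDATA[<P>A minimal sketch of the helper idea above, not something from the thread itself. It assumes Databricks runtimes set the DATABRICKS_RUNTIME_VERSION environment variable and that plain local PySpark does not. LocalFsStandIn, get_fs, and the ./local_dbfs folder are hypothetical names; the stand-in only borrows the put/head/ls method names from dbutils.fs and returns simplified values, not the real return types.</P>

```python
import os
from pathlib import Path


def running_on_databricks() -> bool:
    """Return True when the code appears to run on a Databricks cluster.

    Assumption: Databricks runtimes set DATABRICKS_RUNTIME_VERSION,
    while a plain local PySpark install does not.
    """
    return "DATABRICKS_RUNTIME_VERSION" in os.environ


class LocalFsStandIn:
    """Hypothetical stand-in for a small part of dbutils.fs, backed by local disk."""

    def __init__(self, root: str) -> None:
        self.root = Path(root)

    def _resolve(self, path: str) -> Path:
        # Map "dbfs:/some/file" (or "/some/file") to a path under the local root.
        relative = path.removeprefix("dbfs:").lstrip("/")
        return self.root / relative

    def put(self, path: str, contents: str) -> None:
        target = self._resolve(path)
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_text(contents)

    def head(self, path: str) -> str:
        return self._resolve(path).read_text()

    def ls(self, path: str) -> list[str]:
        # Simplified: returns sorted file names instead of FileInfo objects.
        return sorted(p.name for p in self._resolve(path).iterdir())


def get_fs(dbutils=None, local_root: str = "./local_dbfs"):
    """Return dbutils.fs on a cluster, or the local stand-in otherwise."""
    if running_on_databricks() and dbutils is not None:
        return dbutils.fs
    return LocalFsStandIn(local_root)
```

<P>Calling code would ask get_fs() for a file system object once and use it everywhere, so only that one function needs to know where the code is running. Anything beyond these three methods would have to be stubbed the same way.</P>]]></content:encoded>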
      <pubDate>Tue, 03 Jan 2023 14:51:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33919#M24820</guid>
      <dc:creator>slowder</dc:creator>
      <dc:date>2023-01-03T14:51:07Z</dc:date>
    </item>
    <item>
      <title>Re: Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33920#M24821</link>
      <description>&lt;P&gt;I'm actually not a fan of dbx. I prefer the AWS Glue interactive sessions way of using the IDE. It's exactly like the web notebook experience. I can see the reason why dbx exists, but I'd still like to use a regular notebook experience in my IDE.&lt;/P&gt;</description>
      <pubDate>Wed, 04 Jan 2023 21:04:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/33920#M24821</guid>
      <dc:creator>Jfoxyyc</dc:creator>
      <dc:date>2023-01-04T21:04:53Z</dc:date>
    </item>
    <item>
      <title>Re: Local Development on Databricks</title>
      <link>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/115518#M45104</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;You can use any existing IDE. I'm using PyCharm. I have created my own utils to run code on Databricks. I keep environment variables in a .env file and, using the SDK, I create a SparkSession object and a WorkspaceClient object that you can use to read credentials, etc. (it gives you dbutils functionality). In your pyproject.toml you only need the databricks-connect library (the version matching your cluster runtime).&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;"databricks-connect == 15.4.6",&lt;/PRE&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;PRE&gt;#ENV Variables for Databricks SDK&lt;BR /&gt;DATABRICKS_HOST=https://....&lt;BR /&gt;DATABRICKS_TOKEN=xxxxx&lt;BR /&gt;CLUSTER_ID=xxxx&lt;/PRE&gt;&lt;/DIV&gt;</description>
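      <content:encoded><![CDATA[<P>A rough sketch of what such utils might look like; the poster's actual code is not shown in the thread. load_env_file and connect are hypothetical names. The sketch assumes the databricks-connect and databricks-sdk packages are installed, and that DatabricksSession.builder.remote and WorkspaceClient accept host, token, and cluster_id as shown, which is worth checking against the versions you install.</P>

```python
from pathlib import Path


def load_env_file(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for raw in Path(path).read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Tolerate stray spaces around the key and value.
        values[key.strip()] = value.strip()
    return values


def connect(env: dict[str, str]):
    """Create a remote SparkSession and a WorkspaceClient from the settings."""
    # Imported lazily so the parser above works without these packages.
    from databricks.connect import DatabricksSession
    from databricks.sdk import WorkspaceClient

    spark = DatabricksSession.builder.remote(
        host=env["DATABRICKS_HOST"],
        token=env["DATABRICKS_TOKEN"],
        cluster_id=env["CLUSTER_ID"],
    ).getOrCreate()
    workspace = WorkspaceClient(
        host=env["DATABRICKS_HOST"], token=env["DATABRICKS_TOKEN"]
    )
    return spark, workspace
```

<P>With a .env file like the one above, connect(load_env_file()) would return both objects, and the dbutils-style helpers should then be reachable through workspace.dbutils. Keep the .env file out of version control, since it holds the token.</P>]]></content:encoded>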
      <pubDate>Tue, 15 Apr 2025 13:10:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/local-development-on-databricks/m-p/115518#M45104</guid>
      <dc:creator>kmodelew</dc:creator>
      <dc:date>2025-04-15T13:10:59Z</dc:date>
    </item>
  </channel>
</rss>

