<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic What's the best way to develop Apache Spark Jobs from an IDE (such as IntelliJ/PyCharm)? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/what-s-the-best-way-to-develop-apache-spark-jobs-from-an-ide/m-p/25978#M18120</link>
    <description>&lt;P&gt;A number of people like developing locally using an IDE and then deploying. What are the recommended ways to do that with Databricks jobs?&lt;/P&gt;</description>
    <pubDate>Mon, 07 Jun 2021 17:53:57 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2021-06-07T17:53:57Z</dc:date>
    <item>
      <title>What's the best way to develop Apache Spark Jobs from an IDE (such as IntelliJ/PyCharm)?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-s-the-best-way-to-develop-apache-spark-jobs-from-an-ide/m-p/25978#M18120</link>
      <description>&lt;P&gt;A number of people like developing locally using an IDE and then deploying. What are the recommended ways to do that with Databricks jobs?&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jun 2021 17:53:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-s-the-best-way-to-develop-apache-spark-jobs-from-an-ide/m-p/25978#M18120</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-06-07T17:53:57Z</dc:date>
    </item>
    <item>
      <title>Re: What's the best way to develop Apache Spark Jobs from an IDE (such as IntelliJ/PyCharm)?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-s-the-best-way-to-develop-apache-spark-jobs-from-an-ide/m-p/25979#M18121</link>
      <description>&lt;P&gt;The Databricks Runtime and Apache Spark share the same base API, so a Spark job that runs locally can also run on Databricks with all available Databricks features.&lt;/P&gt;&lt;P&gt;Create the SparkSession with SparkSession.builder.getOrCreate(). On Databricks the SparkSession is created by the environment and treated as a singleton, so getOrCreate() returns that existing session instead of building a new one.&lt;/P&gt;&lt;P&gt;You can also test with Databricks Connect. Databricks Connect replaces Apache Spark/PySpark on your local machine and lets your local machine execute jobs on a Databricks cluster: &lt;A href="https://docs.databricks.com/dev-tools/databricks-connect.html" target="test_blank"&gt;https://docs.databricks.com/dev-tools/databricks-connect.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jun 2021 17:57:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-s-the-best-way-to-develop-apache-spark-jobs-from-an-ide/m-p/25979#M18121</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2021-06-07T17:57:56Z</dc:date>
    </item>
  </channel>
</rss>

