<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Differences among python libraries in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93434#M4401</link>
    <description>&lt;P&gt;I am confused about the differences between the various Python libraries for Databricks, especially among &lt;A href="https://pypi.org/project/databricks-connect/" target="_blank" rel="noopener"&gt;databricks-connect&lt;/A&gt;, &lt;A href="https://pypi.org/project/databricks-api/" target="_blank" rel="noopener"&gt;databricks-api&lt;/A&gt;, &lt;A href="https://pypi.org/project/databricks-sql-connector/" target="_blank"&gt;databricks-sql-connector&lt;/A&gt;, and &lt;A href="https://pypi.org/project/databricks-sdk/" target="_blank" rel="noopener"&gt;databricks-sdk&lt;/A&gt;. It seems like databricks-connect is the official offering from Databricks? Which library should I use for what purposes?&lt;/P&gt;</description>
    <pubDate>Thu, 10 Oct 2024 13:27:59 GMT</pubDate>
    <dc:creator>endaemon</dc:creator>
    <dc:date>2024-10-10T13:27:59Z</dc:date>
    <item>
      <title>Differences among python libraries</title>
      <link>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93434#M4401</link>
      <description>&lt;P&gt;I am confused about the differences between the various Python libraries for Databricks, especially among &lt;A href="https://pypi.org/project/databricks-connect/" target="_blank" rel="noopener"&gt;databricks-connect&lt;/A&gt;, &lt;A href="https://pypi.org/project/databricks-api/" target="_blank" rel="noopener"&gt;databricks-api&lt;/A&gt;, &lt;A href="https://pypi.org/project/databricks-sql-connector/" target="_blank"&gt;databricks-sql-connector&lt;/A&gt;, and &lt;A href="https://pypi.org/project/databricks-sdk/" target="_blank" rel="noopener"&gt;databricks-sdk&lt;/A&gt;. It seems like databricks-connect is the official offering from Databricks? Which library should I use for what purposes?&lt;/P&gt;</description>
      <pubDate>Thu, 10 Oct 2024 13:27:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93434#M4401</guid>
      <dc:creator>endaemon</dc:creator>
      <dc:date>2024-10-10T13:27:59Z</dc:date>
    </item>
    <item>
      <title>Re: Differences among python libraries</title>
      <link>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93444#M4402</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/125985"&gt;@endaemon&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Those are completely different libraries, and each one has a specific purpose.&lt;/P&gt;&lt;H3&gt;1. Databricks Connect&lt;/H3&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Purpose&lt;/STRONG&gt;: This is the official library provided by Databricks for connecting local Python environments to a Databricks cluster.&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Use Case&lt;/STRONG&gt;: It lets you write Spark code on your local machine and execute it on a remote Databricks cluster, which makes it very useful for development and testing. You can develop, test, and debug Spark code locally as if it were running in Databricks.&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;2. databricks-api (legacy, use Databricks Python SDK)&lt;/H3&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Purpose&lt;/STRONG&gt;: This library is a third-party wrapper for the Databricks REST API.&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Use Case&lt;/STRONG&gt;: It is meant for programmatic management and automation of Databricks resources. You can use it to create and manage clusters, jobs, and other resources in Databricks, but it does not provide Spark job execution capabilities.&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;3. databricks-sql-connector&lt;/H3&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Purpose&lt;/STRONG&gt;: This is specifically for connecting to Databricks SQL endpoints.&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Use Case&lt;/STRONG&gt;: If your primary need is to execute SQL queries against Databricks SQL endpoints or SQL endpoints in Unity Catalog, this library is the appropriate choice.&lt;/LI&gt;&lt;/UL&gt;&lt;H3&gt;4. databricks-sdk&lt;/H3&gt;&lt;UL&gt;&lt;LI&gt;&lt;STRONG&gt;Purpose&lt;/STRONG&gt;: This is the official Databricks SDK for Python and offers a comprehensive interface for interacting with all aspects of Databricks.&lt;/LI&gt;&lt;LI&gt;&lt;STRONG&gt;Use Case&lt;/STRONG&gt;: It provides a more modern and complete interface than databricks-api, supporting operations across different Databricks services such as clusters, jobs, DBFS, and more. It's ideal for developers building complex applications or automated systems that need to interact with various Databricks features.&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Thu, 10 Oct 2024 14:17:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93444#M4402</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-10-10T14:17:15Z</dc:date>
    </item>
    <item>
      <title>Re: Differences among python libraries</title>
      <link>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93457#M4403</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;Thank you for typing all that up. It is very clear and helpful.&lt;/P&gt;&lt;P&gt;Two follow-ups, if I may:&lt;/P&gt;&lt;P&gt;1. If one's primary goal is to execute SQL queries, why prefer databricks-sql-connector over a generic JDBC or ODBC package?&lt;/P&gt;&lt;P&gt;2. Did I miss any other important "official" Databricks packages?&lt;/P&gt;</description>
      <pubDate>Thu, 10 Oct 2024 14:49:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93457#M4403</guid>
      <dc:creator>endaemon</dc:creator>
      <dc:date>2024-10-10T14:49:21Z</dc:date>
    </item>
    <item>
      <title>Re: Differences among python libraries</title>
      <link>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93458#M4404</link>
      <description>&lt;P&gt;1. According to Databricks, databricks-sql-connector is easier to set up than alternatives such as pyodbc.&lt;/P&gt;&lt;P&gt;2. I think you already listed the most important ones. Take a look at the link below for more:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/en/dev-tools/sql-drivers-tools.html" target="_blank"&gt;https://docs.databricks.com/en/dev-tools/sql-drivers-tools.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 10 Oct 2024 14:58:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/93458#M4404</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-10-10T14:58:07Z</dc:date>
    </item>
    <item>
      <title>Re: Differences among python libraries</title>
      <link>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/94992#M4463</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;, thanks for the explanation. It is really helpful.&lt;/P&gt;</description>
      <pubDate>Sat, 19 Oct 2024 08:41:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/differences-among-python-libraries/m-p/94992#M4463</guid>
      <dc:creator>bhanu_gautam</dc:creator>
      <dc:date>2024-10-19T08:41:02Z</dc:date>
    </item>
  </channel>
</rss>

