10-10-2024 06:19 AM - edited 10-10-2024 06:27 AM
I am confused as to the differences between various python libraries for databricks: especially with regard to differences among [databricks-connect](https://pypi.org/project/databricks-connect/), [databricks-api](https://pypi.org/project/databricks-api/), [databricks-sql-connector](https://pypi.org/project/databricks-sql-connector/), and [databricks-sdk](https://pypi.org/project/databricks-sdk/). It seems like databricks-connect is the official offering from Databricks? Which library should I use for what purposes?
10-10-2024 07:17 AM
Hi @endaemon ,
Those are completly different libraries, each one have specific purpose.
1. Databricks Connect
10-10-2024 07:17 AM
Hi @endaemon ,
Those are completly different libraries, each one have specific purpose.
1. Databricks Connect
10-19-2024 01:41 AM
@szymon_dybczak , Thanks for the explanation it is really helpful
10-10-2024 07:49 AM
Thank you for typing all that up. It is very clear and helpful.
Two follow ups if I may:
1. If one's primary goal is to execute SQL queries why prefer databricks sql connector over a generic jdbc or odbc package?
2. Did I miss any other important Databricks "official" packages?
10-10-2024 07:58 AM
1. According to databricks is easier to setup than for example pyODBC
2. I think you already listed most important ones. You can take a look at below link for more:
https://docs.databricks.com/en/dev-tools/sql-drivers-tools.html
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group