2 weeks ago - last edited 2 weeks ago
I am confused as to the differences between various python libraries for databricks: especially with regard to differences among [databricks-connect](https://pypi.org/project/databricks-connect/), [databricks-api](https://pypi.org/project/databricks-api/), [databricks-sql-connector](https://pypi.org/project/databricks-sql-connector/), and [databricks-sdk](https://pypi.org/project/databricks-sdk/). It seems like databricks-connect is the official offering from Databricks? Which library should I use for what purposes?
2 weeks ago
Hi @endaemon ,
Those are completly different libraries, each one have specific purpose.
1. Databricks Connect
2 weeks ago
Hi @endaemon ,
Those are completly different libraries, each one have specific purpose.
1. Databricks Connect
Saturday
@szymon_dybczak , Thanks for the explanation it is really helpful
2 weeks ago
Thank you for typing all that up. It is very clear and helpful.
Two follow ups if I may:
1. If one's primary goal is to execute SQL queries why prefer databricks sql connector over a generic jdbc or odbc package?
2. Did I miss any other important Databricks "official" packages?
2 weeks ago
1. According to databricks is easier to setup than for example pyODBC
2. I think you already listed most important ones. You can take a look at below link for more:
https://docs.databricks.com/en/dev-tools/sql-drivers-tools.html
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group