10-10-2024 06:19 AM - edited 10-10-2024 06:27 AM
I am confused as to the differences between various python libraries for databricks: especially with regard to differences among [databricks-connect](https://pypi.org/project/databricks-connect/), [databricks-api](https://pypi.org/project/databricks-api/), [databricks-sql-connector](https://pypi.org/project/databricks-sql-connector/), and [databricks-sdk](https://pypi.org/project/databricks-sdk/). It seems like databricks-connect is the official offering from Databricks? Which library should I use for what purposes?
10-10-2024 07:17 AM
Hi @endaemon ,
Those are completly different libraries, each one have specific purpose.
1. Databricks Connect
10-10-2024 07:17 AM
Hi @endaemon ,
Those are completly different libraries, each one have specific purpose.
1. Databricks Connect
10-19-2024 01:41 AM
@szymon_dybczak , Thanks for the explanation it is really helpful
10-10-2024 07:49 AM
Thank you for typing all that up. It is very clear and helpful.
Two follow ups if I may:
1. If one's primary goal is to execute SQL queries why prefer databricks sql connector over a generic jdbc or odbc package?
2. Did I miss any other important Databricks "official" packages?
10-10-2024 07:58 AM
1. According to databricks is easier to setup than for example pyODBC
2. I think you already listed most important ones. You can take a look at below link for more:
https://docs.databricks.com/en/dev-tools/sql-drivers-tools.html
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now