- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-07-2023 04:21 PM
Lakehouse architectures seem enticing, especially from the standpoint of querying the data lake directly as it sits (as opposed to first migrating the data to an external data warehouse). While documentation and support seems pretty clear regarding support BI platforms like Tableau and Power BI, there is really no reference for using it as source for queries through custom APIs.
I'm wondering if anyone has recommendations for querying Databricks SQL from .NET Web APIs through ODBC connections? Is this even a supported or recommended use case? Would the recommendation for these types of queries be to execute them against more of a traditional data warehouse? If it's supported, what are the downsides of querying directly through Databricks SQL (Cost? Performance? Tooling?). I'm guessing I'd be somewhat limited since I'd need to install the Databricks ODBC driver in order to query it? Also, it seems there are limits to the number of concurret calls that can be made per cluster?
Many of these limitations make me think I should still push data to external data warehouses for these scenarios, but I wanted to see what everyone out there thinks.
Thanks!
Geoff
Accepted Solutions
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-07-2023 05:28 PM
That is the wonderful thing about the Lakehouse, data is in open formats with open API's. Please see these options for querying via Go, Node.js, Python as well as via API
https://www.databricks.com/blog/2022/06/29/connect-from-anywhere-to-databricks-sql.html
As you said, you can also download and install the ODBC driver to connect from different applications:
https://docs.databricks.com/integrations/jdbc-odbc-bi.html
Databricks SQL Warehouses can scale vertically for data throughput as well as horizontally for concurrency, please review your concurrency requirements with your Databricks account team for specific guidance.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-07-2023 05:28 PM
That is the wonderful thing about the Lakehouse, data is in open formats with open API's. Please see these options for querying via Go, Node.js, Python as well as via API
https://www.databricks.com/blog/2022/06/29/connect-from-anywhere-to-databricks-sql.html
As you said, you can also download and install the ODBC driver to connect from different applications:
https://docs.databricks.com/integrations/jdbc-odbc-bi.html
Databricks SQL Warehouses can scale vertically for data throughput as well as horizontally for concurrency, please review your concurrency requirements with your Databricks account team for specific guidance.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
03-07-2023 08:33 PM
these are really awesome details

