Warehousing & Analytics

by JustinM • New Contributor II

01-08-2024 10:49:43 PM

1245 Views
4 replies
2 kudos

Cannot connect to SQL Warehouse using JDBC connector in Spark

When trying to connect to a SQL warehouse using the JDBC connector with Spark, the error below is thrown. Note that connecting directly to a cluster with similar connection parameters works without issue; the error only occurs with SQL Warehouses. py4j...

Latest Reply

jmms
Visitor

2 hours ago

2 kudos

Same error here. I am trying to save a Spark DataFrame to Delta Lake using the JDBC driver and PySpark with this code: #Spark session spark_session = SparkSession.builder \ .appName("RCT-API") \ .config("spark.metrics.namespace", "rct-a...

3 More Replies

by DataFarmer • New Contributor II

2 weeks ago

131 Views
1 replies
0 kudos

Data Warehouse in Databricks Date values as date or int: what is recommended?

In relational data warehouse systems it was best practice to represent date values as YYYYMMDD integer values in tables. Date comparisons could be done easily without date functions and with low performance impact. Is this still the recomme...

Latest Reply

Ajay-Pandey
Esteemed Contributor III

9 hours ago

0 kudos

Hi @DataFarmer, in Databricks I would advise you to use the date type instead of int. This will make your life much simpler when working with date data.
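
To illustrate why a date type tends to be easier to work with than a YYYYMMDD integer, here is a minimal sketch in plain Python (not Databricks SQL). The helper names are hypothetical and exist only for this illustration. It shows that comparisons work on both representations, while arithmetic on the integer form silently produces invalid or misleading values.

```python
from datetime import date, timedelta


def next_day_int(yyyymmdd: int) -> int:
    """Naive 'next day' on a YYYYMMDD integer: plain integer addition."""
    return yyyymmdd + 1


def next_day_date(d: date) -> date:
    """'Next day' on a real date value: calendar-aware addition."""
    return d + timedelta(days=1)


def is_valid_yyyymmdd(value: int) -> bool:
    """Check whether a YYYYMMDD integer corresponds to a real calendar date."""
    try:
        date(value // 10000, (value // 100) % 100, value % 100)
        return True
    except ValueError:
        return False


jan31_int, feb01_int = 20240131, 20240201
jan31, feb01 = date(2024, 1, 31), date(2024, 2, 1)

# Ordering comparisons behave the same in both representations.
print(jan31_int < feb01_int, jan31 < feb01)

# Arithmetic does not: the integer form rolls into a non-existent day.
print(next_day_int(jan31_int), is_valid_yyyymmdd(next_day_int(jan31_int)))
print(next_day_date(jan31))

# Differences are also misleading on integers (70 "days" instead of 1).
print(feb01_int - jan31_int, (feb01 - jan31).days)
```

The same reasoning applies to SQL date columns: ordering works either way, but intervals, differences, and validation only come for free with a real date type.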

by Kroy • Contributor

12-15-2023 6:34:28 PM

474 Views
3 replies
2 kudos

Not able to create SQL warehouse cluster in free subscription

I have taken a free subscription to Azure Databricks, but when I try to create a 2X-Small warehouse cluster, I get the error below. Help appreciated.

Latest Reply

TimJB
New Contributor

yesterday

2 kudos

Can somebody please answer this? I'm having the same issue.

2 More Replies

by florent • New Contributor III

06-29-2022 5:12:57 AM

1586 Views
7 replies
6 kudos

Resolved! Is it possible to deliver a SQL dashboard created in a Dev workspace to a Prod workspace?

In order to create a CI/CD pipeline to deliver dashboards (here, monitoring), how can I export and import a dashboard created in Databricks SQL from one workspace to another? Thanks

Latest Reply

miranda_luna_db
Contributor II

Friday

6 kudos

The recommendation is to update your legacy dashboard to Lakeview and then leverage the built-in export/import support.

6 More Replies

by Kaizen • Contributor III

Thursday

90 Views
1 replies
0 kudos

Command to display all computes available in your workspace

Hi, is there a command you could use to list all computes configured in your workspace (active and inactive)? This would be really helpful for anyone managing the platform to pull all the metadata (tags, etc.) and quickly evaluate all the configura...

Latest Reply

daniel_sahal
Esteemed Contributor

Friday

0 kudos

@Kaizen You've got three ways of doing this:
- Using REST API (https://docs.databricks.com/api/workspace/clusters/list)
- Using CLI (https://github.com/databricks/cli/blob/main/docs/commands.md#databricks-clusters-list---list-all-clusters)
- Using Pyth...
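
As a rough sketch of the REST API option, the snippet below uses only the Python standard library. It assumes the `/api/2.1/clusters/list` path and the `clusters`, `cluster_id`, `cluster_name`, `state`, and `custom_tags` response fields; check these against the linked API reference for your workspace. The host, token, and sample payload are placeholders, the function names are hypothetical, and pagination is not handled. Note that this endpoint lists clusters only, so SQL warehouses would need a separate call.

```python
import json
import urllib.request


def build_list_request(host: str, token: str) -> urllib.request.Request:
    """Build an authenticated GET request for the clusters list endpoint."""
    url = f"{host.rstrip('/')}/api/2.1/clusters/list"
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})


def summarize_clusters(payload: dict) -> list[dict]:
    """Reduce the API response to id, name, state, and tags per cluster."""
    return [
        {
            "id": c.get("cluster_id"),
            "name": c.get("cluster_name"),
            "state": c.get("state"),
            "tags": c.get("custom_tags", {}),
        }
        for c in payload.get("clusters", [])
    ]


def list_clusters(host: str, token: str) -> list[dict]:
    """Call the endpoint and summarize the first page of results."""
    with urllib.request.urlopen(build_list_request(host, token)) as resp:
        return summarize_clusters(json.load(resp))


# Hypothetical response, used here only to show the summarized shape.
sample_payload = {
    "clusters": [
        {"cluster_id": "abc-123", "cluster_name": "etl", "state": "RUNNING",
         "custom_tags": {"team": "data"}},
        {"cluster_id": "def-456", "cluster_name": "adhoc", "state": "TERMINATED"},
    ]
}
for row in summarize_clusters(sample_payload):
    print(row)
```

With real credentials, `list_clusters("https://<workspace-host>", "<token>")` would return the same shape for both running and terminated clusters.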

by Ramakrishnan83 • New Contributor III

a week ago

113 Views
1 replies
0 kudos

Intermittent SQL Failure on Databricks SQL Warehouse

Team, I set up a SQL Warehouse cluster to support requests from mobile devices through the REST API. I read through the documentation on the concurrent query limit, which is 10. But in my scenario I had 5 small clusters and the query monitoring indicated the...

Latest Reply

Kaniz
Community Manager

Wednesday

0 kudos

Hi @Ramakrishnan83, Databricks SQL does indeed support concurrent read requests. However, the exact definition of concurrency can vary based on the cluster configuration and workload. By default, Databricks limits the number of concurrent queries per...

by pankaj2264 • New Contributor II

2 weeks ago

1052 Views
2 replies
1 kudos

Using profile_metrics and drift_metrics

Is there any business use case where profile_metrics and drift_metrics are used by Databricks customers? If so, kindly provide a scenario where this feature can be leveraged, e.g. data lineage or table metadata updates.

Latest Reply

MohsenJ
New Contributor III

2 weeks ago

1 kudos

Hey @pankaj2264, both the profile metrics and drift metrics tables are created and used by Lakehouse Monitoring to assess the performance of your model and data over time or relative to a baseline table. You can find all the relevant information here: Intro...

1 More Replies

by techuser • New Contributor III

11-27-2023 7:18:48 PM

4391 Views
10 replies
1 kudos

Resolved! Databricks Liquid Cluster

Hi, is it possible to convert an existing partitioned Delta table that already contains data to liquid clustering? If so, can you please suggest the steps required? I tried and searched but couldn't find any. Is it that liquid clustering can be done only for new Delta table...

Latest Reply

Raja_Databricks
New Contributor II

2 weeks ago

1 kudos

Does Liquid Clustering accept MERGE, and how can an upsert be done efficiently with a liquid-clustered Delta table?

9 More Replies

by rocky5 • New Contributor III

2 weeks ago

465 Views
1 replies
0 kudos

Resolved! Incorrect results of row_number() function

I wrote simple code: from pyspark.sql import SparkSession from pyspark.sql.window import Window from pyspark.sql.functions import row_number, max import pyspark.sql.functions as F streaming_data = spark.read.table("x") window = Window.partitionBy("BK...

Latest Reply

ThomazRossito
New Contributor II

2 weeks ago

0 kudos

Hi, in my opinion the result is correct. What needs to be noted in the result is that it is sorted by the "Onboarding_External_LakehouseId" column, so if there are "BK_AccountApplicationId" values with the same code, they will be partitioned into 2 row_numbers. Just...
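
The behaviour described in this reply can be sketched without Spark. The snippet below is a plain-Python emulation of `row_number()` over a window partitioned by one column and ordered by another. It is not the original poster's code (which is truncated above): the function name and sample rows are hypothetical, and only the two column names are taken from the thread.

```python
from collections import defaultdict


def row_number(rows: list[dict], partition_key: str, order_key: str) -> list[dict]:
    """Emulate row_number() OVER (PARTITION BY partition_key ORDER BY order_key)."""
    counters = defaultdict(int)
    numbered = []
    # Sort by partition first, then by the ordering column within each partition.
    for row in sorted(rows, key=lambda r: (r[partition_key], r[order_key])):
        counters[row[partition_key]] += 1
        numbered.append({**row, "row_number": counters[row[partition_key]]})
    return numbered


# Hypothetical sample: application 100 appears twice, application 200 once.
sample_rows = [
    {"BK_AccountApplicationId": 100, "Onboarding_External_LakehouseId": 7},
    {"BK_AccountApplicationId": 200, "Onboarding_External_LakehouseId": 3},
    {"BK_AccountApplicationId": 100, "Onboarding_External_LakehouseId": 5},
]

for row in row_number(sample_rows, "BK_AccountApplicationId", "Onboarding_External_LakehouseId"):
    print(row)
```

Rows that share a partition value receive consecutive numbers (1, 2, ...) in the order of the sort column, and a partition with a single row only ever gets 1. Seeing a 2 is therefore expected whenever the partition column repeats.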

by jcozar • Contributor

3 weeks ago

675 Views
2 replies
0 kudos

Join multiple streams with watermarks

Hi! I receive three streams from a Postgres CDC. These three tables (invoices, users and products) need to be joined. I want to use a left join with respect to the invoices stream. In order to compute correct results and release old state, I use watermarks a...

Latest Reply

Kaniz
Community Manager

3 weeks ago

0 kudos

Hi @jcozar, It seems you’re encountering an issue with multiple event time columns in your Spark Structured Streaming join. Let’s break down the problem and find a solution. Event Time Columns: In Spark Structured Streaming, event time is crucia...

1 More Replies

by jcozar • Contributor

3 weeks ago

819 Views
2 replies
0 kudos

Read Structured Streaming state information

Hi! I am exploring the read state functionality in Spark streaming: https://docs.databricks.com/en/structured-streaming/read-state.html When I start a streaming query like this: ( ... .writeStream .option("checkpointLocation", f"{CHECKPOIN...

Latest Reply

Kaniz
Community Manager

3 weeks ago

0 kudos

Hi @jcozar, Execute the streaming query again to construct the state schema. Ensure that the checkpoint location (dbfs:/tmp/checkpoints/experiment_2_2) is correct and accessible.

1 More Replies

by Linglin • New Contributor II

3 weeks ago

97 Views
0 replies
0 kudos

Add Visualization in Notebook to Dashboard, how to set default add to Dashboard Bottom

I'm creating a dashboard with multiple visualizations from a notebook. Whenever I add a new visualization, its default position in the dashboard is the top left, which messes up all the formatting I did for the previous graphs. Is there a way to default add to the bottom o...

by rocky5 • New Contributor III

3 weeks ago

177 Views
1 replies
0 kudos

Stream static join with aggregation

Hi, I am trying to make a stream-static join with aggregation, with no luck. I have a streaming table where I am getting events with two nested arrays:
ID  Array1  Array2
1   [1,2]   [3,4]
I need to make two joins to static dictionary tables (without an...

Latest Reply

Kaniz
Community Manager

3 weeks ago

0 kudos

Hi @rocky5, you want to perform a stream-static join with aggregation in Databricks SQL, where you have a streaming table with nested arrays and need to join it with static dictionary tables based on IDs contained in those arrays. Here are the ...

by Hubert-Dudek • Esteemed Contributor III

3 weeks ago

193 Views
1 replies
0 kudos

1 min auto termination

A SQL warehouse can auto-terminate after 1 minute, not only the 5 minutes offered in the UI. Just run a simple CLI command. Of course, with such a low auto termination, you lose the benefit of CACHE, but for some ad-hoc queries, it is the perfect setup when combined with serve...

Latest Reply

Ayushi_Suthar
Honored Contributor

3 weeks ago

0 kudos

Hi @Hubert-Dudek , Hope you are doing well! Could you please clarify more on your ask here? However, from the above details, the SQL warehouse mentioned is auto-terminating after 1 minute of inactivity because the Auto stop is set to 1 minute. Howe...
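
The CLI command mentioned in this thread is not shown in the excerpt, so the following is only a sketch of one possible equivalent: a request to the SQL Warehouses edit endpoint that sets `auto_stop_mins`. It assumes the `/api/2.0/sql/warehouses/{id}/edit` path, that the endpoint accepts a body containing only this field, and that the workspace permits a 1-minute value. The host, token, and warehouse ID are placeholders, and the function name is hypothetical.

```python
import json
import urllib.request


def build_auto_stop_request(
    host: str, token: str, warehouse_id: str, minutes: int
) -> urllib.request.Request:
    """Build a POST request that sets a warehouse's auto-stop time in minutes."""
    if minutes < 0:
        raise ValueError("minutes must be zero or positive")
    url = f"{host.rstrip('/')}/api/2.0/sql/warehouses/{warehouse_id}/edit"
    body = json.dumps({"auto_stop_mins": minutes}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Placeholder values: replace with a real workspace URL, token, and warehouse ID.
request = build_auto_stop_request("https://example.test", "tok", "1234abcd", 1)
print(request.get_method(), request.full_url, request.data.decode("utf-8"))
# To actually send it: urllib.request.urlopen(request)
```

Building the request separately from sending it makes it easy to inspect the payload before changing a live warehouse.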

by Noortje • New Contributor II

a month ago

553 Views
3 replies
0 kudos

Databricks Looker Studio connector

Hi all! The Databricks Looker Studio connector has now been available for a few weeks. Tested the connector but running into several issues: I am used to working with dynamic queries, so I am able to use date parameters (similar to BigQuery Looker St...

BI tool connector

Looker Studio

Latest Reply

Noortje
New Contributor II

3 weeks ago

0 kudos

Hi @Kaniz Hope you're doing well! I am very curious about the following thing: However, there might be workarounds or alternative approaches to achieve similar functionality. You could explore using Looker’s native features for dynamic filtering or c...

2 More Replies
