Warehousing & Analytics

by MadelynM • Databricks Employee

07-03-2024 10:20:04 AM

3208 Views
0 replies
0 kudos

[Recap] Data + AI Summit 2024 - Warehousing & Analytics | Improve performance and increase insights

Here's your Data + AI Summit 2024 - Warehousing & Analytics recap as you use intelligent data warehousing to improve performance and increase your organization’s productivity with analytics, dashboards and insights. Keynote: Data Warehouse presente...

Screenshot 2024-07-03 at 10.15.26 AM.png

Warehousing & Analytics

AI BI Dashboards

AI BI Genie

Databricks SQL

3208 Views
0 replies
0 kudos

07-03-2024 10:20:04 AM

by qwerty3 • Contributor

09-26-2024 8:38:40 AM

1576 Views
3 replies
0 kudos

Unable to obtain count of dataframe

I am unable to obtain a count of a dataframe, it always get stuck at 1 stage, I have tried reducing the size, what can be the issue? How can I read cluster logs to identify the issue?

Warehousing & Analytics

Reply

1576 Views
3 replies
0 kudos

09-26-2024 8:38:40 AM

View Replies

Latest Reply

qwerty3
Contributor

09-26-2024 8:50:54 AM

0 kudos

Driver memory is good enough, it is able to handle 90 lakhs data, what I am giving it is definitely less than that, what can I do about skewed data and shuffling?

0 kudos

09-26-2024 8:50:54 AM

2 More Replies

by JS_L • New Contributor II

09-23-2024 2:18:30 AM

1686 Views
2 replies
1 kudos

ERROR: key not found in SQL when trying to pass the result of a CTE as a function parameter

Hi Community,I try to pass the result of a CTE as a function parameter as code below WITH t1 AS ( SELECT array_join(collect_list(output), ',') AS x FROM my_catalog.my_db.get_x(:startTime, :endTime) ) SELECT 'AM_offline' as Type, CASE WHEN off...

Warehousing & Analytics

Reply

1686 Views
2 replies
1 kudos

09-23-2024 2:18:30 AM

View Replies

Latest Reply

JS_L
New Contributor II

09-24-2024 8:46:12 PM

1 kudos

Hi @szymon_dybczak Thanks for replying. I don't the issue is related to datatype, since the query works if I pass the subquery to _x parameter without CTE.Please see as below code:SELECT 'AM_offline' as Type, CASE WHEN offline_ratio > 1.5 THEN 'no-Go...

1 kudos

09-24-2024 8:46:12 PM

1 More Replies

by sachamourier • Contributor

09-20-2024 7:27:58 AM

3062 Views
3 replies
0 kudos

Importing Python files into another Workspace Python file does not work

I have created Python modules containing some Python functions and I would like to import them from a notebook contained in the Workspace. For example, I have a "etl" directory, containing a "snapshot.py" file with some Python functions, and an empty...

Warehousing & Analytics

Databricks

Modules

python

Reply

3062 Views
3 replies
0 kudos

09-20-2024 7:27:58 AM

View Replies

Latest Reply

filipniziol
Esteemed Contributor

09-20-2024 7:45:45 AM

0 kudos

Hi @sachamourier ,It will work, but you need carefully craft path to sys.path.append(), you even do not need __init__.py to make it work.Try to hard-code the path to the snapshot.py in workspace.Add this to your notebook: import sys import os absolu...

0 kudos

09-20-2024 7:45:45 AM

2 More Replies

by LeoRickli • New Contributor II

09-06-2024 9:33:51 AM

1842 Views
4 replies
2 kudos

Serverless SQL warehouses on GCP?

According to the official Databricks documentation on GCP, I should have the ability to deploy a serverless SQL warehouse inside Databricks. Following the documentation, it is requested to turn on Serverless SQL warehouses (On), but there is nothing ...

Warehousing & Analytics

Reply

1842 Views
4 replies
2 kudos

09-06-2024 9:33:51 AM

View Replies

Latest Reply

LeoRickli
New Contributor II

09-17-2024 10:18:44 AM

2 kudos

Hello @filipniziol, thanks for the response.I'm the workspace owner. I just gave myself the account admin (Metastore admin) but still got nothing new.

2 kudos

09-17-2024 10:18:44 AM

3 More Replies

by EmmaP • New Contributor III

09-11-2024 1:16:02 AM

3554 Views
5 replies
2 kudos

Understand cluster activity Serverless SQL

Hello, Following abnormally high costs when using serverless sql on September 9 and 10, I noticed that the cluster sometimes stays on for an hour even though it's not receiving any new requests, and that the auto-stop is set to 5 minutes of inactivit...

Warehousing & Analytics

Reply

3554 Views
5 replies
2 kudos

09-11-2024 1:16:02 AM

View Replies

Latest Reply

RCo
New Contributor III

09-16-2024 9:19:38 AM

2 kudos

Hi @EmmaP!I have encountered this. Even though the UI says that they are complete, they actually are not. While the query itself completed, the client is still fetching the data from the SQL Warehouse.To check if this is your issue, from the monitori...

2 kudos

09-16-2024 9:19:38 AM

4 More Replies

by Hubert-Dudek • Databricks MVP

03-13-2024 3:02:58 AM

5301 Views
2 replies
2 kudos

PDF report from databricks

You can now send pdf reports from Lakeview dashboards. Just hit subscribe (you can also add subscribers by yourself in schedule settings) #databricks

Warehousing & Analytics

Reply

5301 Views
2 replies
2 kudos

03-13-2024 3:02:58 AM

View Replies

Latest Reply

bernsb
New Contributor II

09-12-2024 5:11:24 AM

2 kudos

Cool. This is a very convenient feature since most people now use the PDF format when working with text files. If anyone has ever had any issues with this format, I can say that I recently needed to merge several PDF files into one, and with the help...

2 kudos

09-12-2024 5:11:24 AM

1 More Replies

by xwen • New Contributor II

09-10-2024 3:01:42 AM

1253 Views
1 replies
1 kudos

how to modify data type of a column explicitly via DBSQL

is there a SQL equivalent of overwriteSchema ?https://docs.databricks.com/en/delta/update-schema.html#explicitly-update-schema-to-change-column-type-or-name

Warehousing & Analytics

Reply

1253 Views
1 replies
1 kudos

09-10-2024 3:01:42 AM

View Replies

Latest Reply

xwen
New Contributor II

09-10-2024 7:41:08 AM

1 kudos

In place schema adjustment =>Then ALTER TABLE XXX ADD/DROP COLUMN XXX INTExamplecreate table test (id int, first_name string, last_name string ); insert into test values (1, 'john', 'smith'); alter table test add column age int; select * from testCr...

1 kudos

09-10-2024 7:41:08 AM

by msolcuadrado • New Contributor II

09-06-2024 8:38:03 AM

2019 Views
1 replies
0 kudos

SQL warehouse autostop not working

I'm using a SQL warehouse with autostop after 5 minutes of inactivity. However, the cluster is constantly activating and deactivating without any explanation. There are no queries being executed, and I can't identify any reasons why it is happening,...

Warehousing & Analytics

Reply

2019 Views
1 replies
0 kudos

09-06-2024 8:38:03 AM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

09-09-2024 12:32:40 AM

0 kudos

Hi @msolcuadrado ,In your case I would try to contact directly with databricks support team. This is a serious issue and I feel your pain. They should help you pinpoint an excat cause + maybe you'll get a refund

0 kudos

09-09-2024 12:32:40 AM

by Rich85 • New Contributor

09-05-2024 1:02:20 AM

6223 Views
1 replies
0 kudos

Incorrect syntax near '=' error that I can't solve

Hi,I'm receiving the error Incorrect syntax near '=' when I run simple queries like the example below. This only happens when I use a column created using a CASE statement in the WHERE clause. I can use any other column in the WHERE clause, includi...

Warehousing & Analytics

Reply

6223 Views
1 replies
0 kudos

09-05-2024 1:02:20 AM

View Replies

Latest Reply

Kayla
Valued Contributor II

09-05-2024 4:57:23 AM

0 kudos

What jumps out to me at first is the backticks on `Peak Vertical Force / BW`, but I'm assuming that's just a column name and not an attempt at division.Next that jumps out is TestType and TestTypeName being aliased as testType and testTypeName- spark...

0 kudos

09-05-2024 4:57:23 AM

by mathiaskvist • New Contributor III

08-26-2024 1:26:10 AM

2069 Views
4 replies
0 kudos

SQL Warehouse REST statement execution validation fails with DECLARE SET

Hi,I'm using the REST API for SQL Warehouse in order to execute queries. I have experienced multiple times that query validation fails over the REST API, while executing the same query in the Databricks UI on the same cluster succeeds. An example: [P...

Warehousing & Analytics

Reply

2069 Views
4 replies
0 kudos

08-26-2024 1:26:10 AM

View Replies

Latest Reply

adriennn
Valued Contributor

08-29-2024 1:01:03 AM

0 kudos

Had to try for myself and it seems the sql execution context in the REST API is different than that of an *.sql script, notebook or query made against an sql warehouse through the ui. The error stems from the fact that the SET command can also be use...

0 kudos

08-29-2024 1:01:03 AM

3 More Replies

by JosephX • New Contributor

08-23-2024 7:37:29 AM

927 Views
1 replies
0 kudos

optimize query from power bi desktop

How to tuning databricks query performance from Power BI Dosktop

Warehousing & Analytics

Reply

927 Views
1 replies
0 kudos

08-23-2024 7:37:29 AM

View Replies

Latest Reply

Brahmareddy
Esteemed Contributor

08-23-2024 8:14:14 AM

0 kudos

Hi Joeshph,How are you doing today?Give a try with below inputs and let me know if works well.Filter and aggregate data in Databricks to reduce load before it reaches Power BI. Use DirectQuery carefully, simplify measures, and reduce the number of vi...

0 kudos

08-23-2024 8:14:14 AM

by paulocorrea • New Contributor II

07-30-2024 8:06:35 AM

3237 Views
3 replies
0 kudos

Issue with Lateral Column Alias (LCA)

I have a query using LCA. When referencing another table that has a column with the same name as the column used as LCA, the behavior of the query changes and it starts referencing the table column instead of the column that is already in the select ...

Warehousing & Analytics

Reply

3237 Views
3 replies
0 kudos

07-30-2024 8:06:35 AM

View Replies

Latest Reply

ClausStier
New Contributor II

08-23-2024 3:59:21 AM

0 kudos

Hi @Kaniz_Fatma,we had the same problem as @paulocorrea.That's why it would be correct for to me to throw an error on ambiguous columns and the LCA could/must be addressed with a default identifier.Thanks

0 kudos

08-23-2024 3:59:21 AM

2 More Replies

by RickB • New Contributor II

08-22-2024 9:01:12 AM

1528 Views
3 replies
1 kudos

SQL Positional parameters: INVALID_PARAMETER_MARKER_VALUE.DUPLICATE_NAME

When trying to execute a query via sql warehouse, I get the following error:INVALID_PARAMETER_MARKER_VALUE.DUPLICATE_NAMEthe sql statement uses ? placeholders and the correct number of arguments are being passed.I am not able to use named placeholder...

Warehousing & Analytics

Reply

1528 Views
3 replies
1 kudos

08-22-2024 9:01:12 AM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

08-22-2024 9:41:13 AM

1 kudos

Hi @RickB ,Which API are you using to invoke this? Parameter markers can be provided by:Python using its pyspark.sql.SparkSession.sql() API.Scala using its org.apache.spark.sql.SparkSession.sql() API.Java using its org.apache.spark.sql.SparkSession.s...

1 kudos

08-22-2024 9:41:13 AM

2 More Replies

by Akshay_Petkar • Valued Contributor

08-22-2024 6:08:48 AM

1750 Views
2 replies
2 kudos

SQL Differences When Using SSMS with Databricks Lakehouse Federation

I'm planning to connect SQL Server Management Studio (SSMS) with Databricks using Lakehouse Federation. I understand that there are some differences in the SQL dialects between SSMS and Databricks SQL. For instance, in SSMS, we use TOP 10 to limit th...

Warehousing & Analytics

Reply

1750 Views
2 replies
2 kudos

08-22-2024 6:08:48 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

08-23-2024 1:20:40 AM

2 kudos

To add on this:if you really have to use T-SQL (the MS dialect of SQL), you can define the SQL warehouse from databricks as a linked server on your SQL server.As said: SSMS is merely a sql client, the SQL dialect to be used is defined by the database...

2 kudos

08-23-2024 1:20:40 AM

1 More Replies

by AZHAR-QUADRI • New Contributor

08-21-2024 11:44:30 PM

2744 Views
1 replies
0 kudos

How to create my First Dashboard in Lakeview

Hello Community . I am a newbie here having an experience in tableau and power bi . I wanted to explore Dashboard creation in Lakeview . I have created a free trial databricks account . Although there are plenty of articles and videos on how to crea...

Warehousing & Analytics

Reply

2744 Views
1 replies
0 kudos

08-21-2024 11:44:30 PM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

08-22-2024 10:05:29 AM

0 kudos

Hi @AZHAR-QUADRI ,You probably created workspace in a standard tier, that's why you can't see side bar. Recreate your workspace as a premium tier.

0 kudos

08-22-2024 10:05:29 AM

Databricks Community

Forum Posts

[Recap] Data + AI Summit 2024 - Warehousing & Analytics | Improve performance and increase insights

Unable to obtain count of dataframe

ERROR: key not found in SQL when trying to pass the result of a CTE as a function parameter

Importing Python files into another Workspace Python file does not work

Serverless SQL warehouses on GCP?

Understand cluster activity Serverless SQL

PDF report from databricks

how to modify data type of a column explicitly via DBSQL

SQL warehouse autostop not working

Incorrect syntax near '=' error that I can't solve

SQL Warehouse REST statement execution validation fails with DECLARE SET

optimize query from power bi desktop

Issue with Lateral Column Alias (LCA)

SQL Positional parameters: INVALID_PARAMETER_MARKER_VALUE.DUPLICATE_NAME

SQL Differences When Using SSMS with Databricks Lakehouse Federation

How to create my First Dashboard in Lakeview

Join Us as a Local Community Builder!

How to make a table in databricks using excel file

Show values as rows instead of columns in pivot ta...

Intermittent connectivity issues between Power BI ...

Metric View measure on joined table

Understanding what impacts "Optimizing query & pru...