Warehousing & Analytics

by MadelynM • Databricks Employee

07-03-2024 10:20:04 AM

1814 Views
0 replies
0 kudos

[Recap] Data + AI Summit 2024 - Warehousing & Analytics | Improve performance and increase insights

Here's your Data + AI Summit 2024 - Warehousing & Analytics recap as you use intelligent data warehousing to improve performance and increase your organization’s productivity with analytics, dashboards and insights. Keynote: Data Warehouse presente...

Warehousing & Analytics

AI BI Dashboards

AI BI Genie

Databricks SQL

by omjohn • New Contributor

08-04-2024 6:56:25 PM

790 Views
0 replies
0 kudos

Sparklyr error in spark_apply: Error: java.lang.NoSuchMethodError

When trying to incorporate an R package into my Spark workflow using the spark_apply() function in sparklyr, I get the error: Error: java.lang.NoSuchMethodError: org.apache.spark.sql.catalyst.encoders.RowEncoder$.apply(Lorg/apache/spark/sql/types/StructT...

Warehousing & Analytics

by dweben • New Contributor II

07-29-2024 10:37:42 AM

642 Views
0 replies
0 kudos

Databricks Azure dashboard chart issue

Hi all, I just started using Databricks Dashboards on Azure. While generating a line chart with an x-axis based on a date column, I applied a filter that removes some of the data. The above is without filtering, the below is with filtering. You can o...

Warehousing & Analytics

by Akshay_Petkar • Contributor III

07-26-2024 3:21:11 AM

532 Views
0 replies
0 kudos

Issue with Column Name Conflict While Importing .gz File into Spark DataFrame

I'm encountering an issue while importing a .gz file containing JSON data into a Spark DataFrame in Databricks. The error indicates a column name conflict. Could you please advise on how to resolve this issue and handle duplicate column names during ...

Warehousing & Analytics

by PM0 • New Contributor

07-25-2024 4:50:40 AM

1003 Views
0 replies
0 kudos

Rate limit Usage

We have Databricks tables and an API that queries that data. The queries are dynamic, and the API allows users to query anything. However, users can query a lot of data and consume a lot of DBUs, and generic rate limiting won't help, as any sin...

Warehousing & Analytics
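
One possible direction for the rate-limiting question above is to meter an estimated cost per query rather than the number of requests, so that a single heavy query draws down more of a user's budget than many light ones. The following is a minimal sketch under assumptions that are not in the original post: the class name `CostBudget`, the idea of a numeric cost estimate per query (for example, derived from estimated bytes scanned), and the refill rate are all hypothetical. It does not call any Databricks API.

```python
import time
from typing import Optional


class CostBudget:
    """Token bucket whose tokens represent estimated query cost, not request count.

    Hypothetical helper: the cost units and how they are estimated are assumptions.
    """

    def __init__(self, capacity: float, refill_per_second: float,
                 now: Optional[float] = None) -> None:
        self.capacity = capacity                    # maximum cost units a user can hold
        self.refill_per_second = refill_per_second  # cost units restored per second
        self.tokens = capacity                      # start with a full budget
        self.updated_at = time.monotonic() if now is None else now

    def try_spend(self, cost: float, now: Optional[float] = None) -> bool:
        """Return True and deduct `cost` if the budget allows it, else return False."""
        now = time.monotonic() if now is None else now
        elapsed = max(0.0, now - self.updated_at)
        # Refill in proportion to elapsed time, never exceeding capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_second)
        self.updated_at = now
        if cost > self.tokens:
            return False  # reject (or queue) the query; the budget is left unchanged
        self.tokens -= cost
        return True
```

An API layer could keep one `CostBudget` per user and call `try_spend(estimated_cost)` before submitting each query. How to estimate the cost up front is left open here, since the post does not say what information is available before the query runs.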

by pauloquantile • New Contributor III

09-04-2023 3:51:22 AM

14124 Views
6 replies
1 kudos

PowerBI "Token expired while fetching results: TEAuthTokenExpired."

Hi everyone, We are currently stumbling upon a big challenge with loading data into PowerBI. I need some advice! To give a bit of context: we introduced Databricks instead of Azure Synapse for a client of ours. We are currently busy with moving all...

Warehousing & Analytics

OAuth

PowerBI

Latest Reply

viralpatel
New Contributor II

07-23-2024 11:48:45 PM

1 kudos

@Retired_mod Is there any further update from Databricks that would help here, or is what @pauloquantile mentioned the only workaround?

5 More Replies

by Spoon_Man • New Contributor II

07-22-2024 12:17:13 AM

903 Views
2 replies
2 kudos

Converting an Entity Attribute Value Data Model to Struct datatype

I'm new to Databricks and I have a source data model that stores the data as Name-Value pairs (i.e. normalised) in two columns in the table.

EntityID  Name    Value
1         Field1  SomeValue1
1         Field2  SomeValue2
1         Field3  SomeValue3
2         Field1  SomeValue1
2         Field3  SomeValue3

The defi...

Warehousing & Analytics

Latest Reply

szymon_dybczak
Esteemed Contributor III

07-22-2024 6:03:47 AM

2 kudos

Your first approach didn't work because named_struct needs its arguments in odd positions to be foldable. You can think of it this way: at compile time, the compiler needs to "see" this value. That's why even if you prepared proper expression ...

1 More Replies
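
The reshaping asked about in this thread can be illustrated outside Spark. The sketch below is plain Python, not the poster's or the replier's Spark SQL: it pivots the Name-Value rows from the question into one record per EntityID. The list of field names is fixed up front, which loosely mirrors the reply's point that the field names of a struct must be known when the query is compiled. The function name `pivot_eav` and the `FIELDS` tuple are hypothetical.

```python
from collections import defaultdict

# Sample rows copied from the question: (EntityID, Name, Value).
ROWS = [
    (1, "Field1", "SomeValue1"),
    (1, "Field2", "SomeValue2"),
    (1, "Field3", "SomeValue3"),
    (2, "Field1", "SomeValue1"),
    (2, "Field3", "SomeValue3"),
]

# Assumption: the full set of attribute names is known ahead of time.
FIELDS = ("Field1", "Field2", "Field3")


def pivot_eav(rows, fields):
    """Group name-value rows by entity and emit one fixed-shape record per entity."""
    grouped = defaultdict(dict)
    for entity_id, name, value in rows:
        grouped[entity_id][name] = value
    # Attributes missing for an entity become None, so every record has the same keys.
    return {
        entity_id: {field: attrs.get(field) for field in fields}
        for entity_id, attrs in grouped.items()
    }


records = pivot_eav(ROWS, FIELDS)
```

Here entity 2 has no Field2 row, so its record carries `None` for that field rather than omitting the key.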

by Executivecars • New Contributor

04-25-2022 10:20:15 PM

1051 Views
2 replies
0 kudos

Looking for Chauffeur service Melbourne? Visit Executive cars for Chauffeur service Melbourne. ✓24/7 Operation ✓100+ Vehicles ✓ Chauffeur Cars Melbour...

Looking for Chauffeur service Melbourne? Visit Executive cars for Chauffeur service Melbourne. ✓24/7 Operation ✓100+ Vehicles ✓ Chauffeur Cars Melbourne.

Warehousing & Analytics

Latest Reply

FiveLittleFish
New Contributor II

07-18-2024 11:28:21 PM

0 kudos

If you're in Melbourne and need reliable chauffeur services, Executive Cars seems like a great option with its extensive fleet and round-the-clock availability. Safety and reliability are crucial in the transportation industry. Speaking of which, hav...

1 More Replies

by KrisMcDougal • New Contributor

07-16-2024 12:30:13 PM

1011 Views
2 replies
2 kudos

Is there any way to see who ran a notebook?

I'm trying to find out who runs a notebook. I can see who created the notebook, and I can find out when the notebook was run, but there doesn't seem to be any information on who ran it, only on who created it.

Warehousing & Analytics

Latest Reply

Rishabh-Pandey
Esteemed Contributor

07-18-2024 2:48:04 AM

2 kudos

SELECT * FROM system.logs WHERE event_type = 'notebook_run' ORDER BY timestamp DESC;

@KrisMcDougal With this code you can check which user ID started the first command of the notebook, so you will know who started the notebook.

1 More Replies

by OluPopoola • New Contributor II

07-08-2024 11:13:26 AM

867 Views
1 replies
0 kudos

Can't use Delta Live Tables to read MSK using IAM authentication

Hi all, I am trying to use Delta Live Tables to connect to MSK. We have set up serverless MSK clusters that use IAM for authentication. I cannot connect to them from a DLT notebook. Near enough the same code works on normal clusters that have Java libr...

Warehousing & Analytics

Latest Reply

OluPopoola
New Contributor II

07-08-2024 12:12:57 PM

0 kudos

Just rephrasing the question: I am trying to use DLT to connect to serverless MSK clusters authenticated by IAM. The code works on ordinary clusters but doesn't work when run on DLT clusters. I think the issue is the authentication because we can ...

by Akshay_Petkar • Contributor III

07-04-2024 12:36:21 AM

794 Views
1 replies
1 kudos

Impact of Overwriting Databricks SQL Tables on Versioning

When changes are made to a Databricks SQL table, a new version is created. If changes to the table are made using Spark or Python in a notebook and the table is overwritten, will a new version be created, or will it remain as version number 0?

Warehousing & Analytics

Latest Reply

Rishabh-Pandey
Esteemed Contributor

07-05-2024 3:18:36 AM

1 kudos

When changes are made to a Databricks SQL table (Delta table) using Spark or Python in a notebook, and the table is overwritten, a new version will indeed be created. It will not remain as version number 0. Each overwrite operation increments the ver...

by Akshay_Petkar • Contributor III

06-27-2024 5:17:09 AM

1523 Views
0 replies
0 kudos

Are AWS Databricks external tables Delta tables?

If I create an external table on AWS Databricks, will it be a Delta table? If not, is there a way to make it a Delta table, or is there no Delta capability for external tables?

Warehousing & Analytics

by Akshay_Petkar • Contributor III

06-26-2024 5:22:02 AM

1394 Views
1 replies
0 kudos

Does Databricks offer auto-tuning?

Does Databricks offer auto-tuning?

Warehousing & Analytics

Latest Reply

raphaelblg
Databricks Employee

06-26-2024 1:02:54 PM

0 kudos

Yes. First of all, open-source Spark already has a set of auto-tuning features called Adaptive Query Execution (AQE). Here are more details: https://spark.apache.org/docs/latest/sql-performance-tuning.html#adaptive-query-execution. For even bett...

by Shaimaa • New Contributor II

06-25-2024 8:13:21 AM

830 Views
1 replies
0 kudos

Running a query against multiple parquet files from a folder

I am running a query against multiple parquet files:

SELECT SUM(CASE WHEN match_result.year_incorporated IS NOT NULL AND match_result.year_incorporated != '' THEN 1 ELSE 0 END) FROM parquet.`s3://folder_path/*`

For some files, the field `year_incorpo...

Warehousing & Analytics

Latest Reply

daniel_sahal
Esteemed Contributor

06-25-2024 11:30:21 PM

0 kudos

@Shaimaa The column type mismatch between the files could be the issue here. For example, if column 'xyz' is of type INTEGER in one file and of type STRING in another, Spark will give you a schema conversion error. Below is a ...

by Shaimaa • New Contributor II

06-24-2024 10:39:38 AM

1776 Views
3 replies
0 kudos

Null object while running a query against parquet

I am running this query against parquet:

SELECT SUM(CASE WHEN match_result.ecommerce.has_online_payments THEN 1 ELSE 0 END) FROM parquet.`s3://folder_path/*`

When all the values of the object `match_result.ecommerce` are null, I get the following erro...

Warehousing & Analytics

Latest Reply

Shaimaa
New Contributor II

06-25-2024 4:44:22 AM

0 kudos

None of these solutions with coalesce work, because it's "match_result.ecommerce" that is null, not "match_result.ecommerce.has_online_payments". So it's still trying to extract a value from a null. Please help me modify the query accordingly.

2 More Replies
