Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

by Riccardo96 (New Contributor II)
  • 2478 Views
  • 3 replies
  • 0 kudos

Dataframe Count before and after write command do not match

Hi, I have noticed strange behaviour in a notebook I am developing. When I use the notebook to read a single file it works correctly, but when I set it to read multiple files at once, using the recursive lookup option, I have noticed...

Latest Reply
Riccardo96
New Contributor II
  • 0 kudos

I just found out I was populating a column with random values; these values are filtered in a join, so at each write and count the numbers change.

2 More Replies
by jeft (New Contributor II)
  • 797 Views
  • 2 replies
  • 0 kudos

Error ingesting data from MongoDB into Databricks

spark = SparkSession.builder \
    .appName("MongoDBToDatabricks") \
    .config("spark.jars.packages", "org.mongodb.spark:mongo-spark-connector_2.12:10.4.0") \
    .config("spark.mongodb.read.connection.uri", mongodb_uri) \
    .config("spark.mongodb.write.connection.u...

Latest Reply
Nam_Nguyen
Databricks Employee
  • 0 kudos

Hello @jeft , will you be able to share some screenshots of the driver logs?

1 More Replies
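The truncated snippet above can be reconstructed as a sketch. A common cause of this class of error is that `spark.jars.packages` set inside a Databricks notebook has no effect, because the cluster's SparkSession already exists by the time the notebook runs; installing the connector as a cluster library (Maven coordinate) is the usual fix. The URI, database, and collection names below are placeholders:

```python
# Config for the MongoDB Spark connector (version from the post above).
# The connection URI is a placeholder.
mongo_conf = {
    "spark.jars.packages": "org.mongodb.spark:mongo-spark-connector_2.12:10.4.0",
    "spark.mongodb.read.connection.uri": "mongodb+srv://user:pass@cluster.example.net/",
}

# On a Databricks cluster the read would look roughly like this
# (commented out so the sketch stays self-contained):
# from pyspark.sql import SparkSession
# spark = SparkSession.builder.getOrCreate()  # already exists on Databricks
# df = (spark.read.format("mongodb")
#       .option("spark.mongodb.read.connection.uri",
#               mongo_conf["spark.mongodb.read.connection.uri"])
#       .option("database", "mydb")        # placeholder
#       .option("collection", "mycoll")    # placeholder
#       .load())

print(sorted(mongo_conf))
```

If the connector is installed as a cluster library, the `spark.jars.packages` entry is not needed at all; only the connection URI options remain.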
by Dom1 (New Contributor III)
  • 5676 Views
  • 5 replies
  • 3 kudos

Show log4j messages in run output

Hi, I have an issue when running JAR jobs. I expect to see logs in the output window of a run. Unfortunately, I can only see messages that are generated with "System.out.println" or "System.err.println". Everything that is logged via slf4j is only ...

Latest Reply
dbal
New Contributor III
  • 3 kudos

Any update on this? I am also facing this issue.

4 More Replies
by Volker (Contributor)
  • 1675 Views
  • 4 replies
  • 0 kudos

Failed job with "A fatal error has been detected by the Java Runtime Environment"

Hi community, I have a question regarding an error that I sometimes get when running a job: # A fatal error has been detected by the Java Runtime Environment: # SIGSEGV (0xb) at pc=0x00007fc941e74996, pid=940, tid=0x00007fc892dff640 # JRE versio...

Latest Reply
Volker
Contributor
  • 0 kudos

In the last run there was additional information in the error message: # A fatal error has been detected by the Java Runtime Environment: # SIGSEGV (0xb) at pc=0x00007f168e094210, pid=1002, tid=0x00007f15dd1ff640 # JRE version: OpenJDK Run...

3 More Replies
by LasseL (New Contributor III)
  • 6144 Views
  • 6 replies
  • 3 kudos

Resolved! The best practice to remove old data from DLT pipeline created tables

Hi, I didn't find any "reasonable" way to clean old data from DLT pipeline tables. In DLT we have used materialized views and streaming tables (SCD1, append only). What is the best way to delete old data from the tables (storage size increases linearly...

Latest Reply
TinasheChinyati
New Contributor III
  • 3 kudos

@LasseL 1. Enable Change Data Capture (CDC): Enable CDC before deleting data to ensure Delta tables track inserts, updates, and deletes. This allows downstream pipelines to handle deletions correctly. ALTER TABLE your_table SET TBLPROPERTIES ('delta.e...

5 More Replies
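The truncated reply above can be sketched end to end. The table name and the 365-day cut-off are placeholders, and the property being spelled out is presumably `delta.enableChangeDataFeed`; the statements here only build the SQL strings, with the actual `spark.sql` calls left commented:

```python
# Hypothetical table name; retention window of 365 days is an example.
table = "catalog.schema.your_table"

# 1. Enable Change Data Feed so downstream consumers can see the deletes.
enable_cdf = (
    f"ALTER TABLE {table} "
    "SET TBLPROPERTIES ('delta.enableChangeDataFeed' = 'true')"
)

# 2. Delete rows older than the retention window.
delete_old = (
    f"DELETE FROM {table} "
    "WHERE event_date < date_sub(current_date(), 365)"
)

# On a Databricks cluster:
# spark.sql(enable_cdf)
# spark.sql(delete_old)
print(enable_cdf)
print(delete_old)
```

Note that for DLT streaming tables, downstream streams reading the table may need `skipChangeCommits` (or a full refresh) to tolerate the deletes; deleted files are only physically removed by a later VACUUM.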
by Thor (New Contributor III)
  • 1174 Views
  • 1 reply
  • 1 kudos

Resolved! Asynchronous progress tracking with foreachbatch

Hello, currently the docs say that async progress tracking is available only for the Kafka sink: https://docs.databricks.com/en/structured-streaming/async-progress-checking.html I would like to know if it would work for any sink that is "exactly once"? I exp...

Latest Reply
cgrant
Databricks Employee
  • 1 kudos

Asynchronous progress tracking is a feature designed for ultra low latency use cases. You can read more in the open source SPIP doc here, but the expected gain in time is in the hundreds of milliseconds, which seems insignificant when doing merge ope...

by Krishna2110 (New Contributor II)
  • 630 Views
  • 1 reply
  • 0 kudos

Catalog Sample Data is not visible with all purpose cluster

Hi all, I need some help. Even though I have cluster access and can run a notebook attached to the cluster, when I go into the Catalog to view the sample data I get an error. Here is the error (@ipriyanksingh, FYR). Can anyone please help us...

Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @Krishna2110, based on the error, are you using a token? Ensure that the access token is valid and has not expired. Is your workspace Unity Catalog enabled, and what are your cluster settings for browsing the data?

by menonshiji (New Contributor)
  • 2184 Views
  • 1 reply
  • 0 kudos

#HelpPost for Azure Blob to Databricks connection.

Hi, there is a set of .csv/.txt files in a storage container, i.e. Azure Blob Storage / Azure Data Lake Storage Gen2. I would like to ingest the files into Databricks. Datasets and linked services were created on both ends. Also an all-purpose cluster was created in Bric...

Latest Reply
cgrant
Databricks Employee
  • 0 kudos

These errors occur when you are not authenticated / properly authorized to access the storage account. Ensure that you've set proper storage credential configurations, and that those credentials have proper access. Documentation here.

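As a sketch of the "storage credential configurations" the reply refers to, here are the Spark configs for accessing ADLS Gen2 with a service principal (OAuth). All names are placeholders, and on Unity Catalog workspaces, storage credentials and external locations are generally preferred over session configs:

```python
# Service-principal (OAuth) configs for ADLS Gen2; every name is a placeholder.
storage_account = "mystorageaccount"
suffix = f"{storage_account}.dfs.core.windows.net"

configs = {
    f"fs.azure.account.auth.type.{suffix}": "OAuth",
    f"fs.azure.account.oauth.provider.type.{suffix}":
        "org.apache.hadoop.fs.azurebfs.oauth2.ClientCredsTokenProvider",
    f"fs.azure.account.oauth2.client.id.{suffix}": "<application-id>",
    # On Databricks: dbutils.secrets.get("scope", "key") instead of a literal
    f"fs.azure.account.oauth2.client.secret.{suffix}": "<client-secret>",
    f"fs.azure.account.oauth2.client.endpoint.{suffix}":
        "https://login.microsoftonline.com/<tenant-id>/oauth2/token",
}

# On a cluster:
# for k, v in configs.items():
#     spark.conf.set(k, v)
# df = spark.read.csv(f"abfss://container@{suffix}/path/to/files")
print(len(configs))
```

The service principal also needs an RBAC role on the storage account (typically Storage Blob Data Reader/Contributor); without it, the same "not authorized" errors appear even with valid credentials.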
by jeremy98 (Honored Contributor)
  • 1337 Views
  • 2 replies
  • 0 kudos

Start another workflow after a job run of the same workflow completes

Hello community, I'm using DABs and I want to know if it is possible to configure in the YAML file a logic that lets me run a workflow only after the previous job run of the same workflow has finished. Is it possible to do it? Do I need to create a task that che...

Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hello @jeremy98, Yes, it is possible to configure a YAML file to run a workflow only if the previous job run of the same workflow has finished. You can achieve this by defining dependencies between tasks within the workflow. You can specify task depe...

1 More Replies
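The suggestion above can be sketched as a DABs job definition. All names and paths are placeholders; `depends_on` orders tasks within a single run, while `max_concurrent_runs: 1` combined with queueing is what keeps a new run waiting until the previous run of the same job has finished:

```yaml
# Sketch of a Databricks Asset Bundles job; resource, task, and notebook
# names are placeholders.
resources:
  jobs:
    my_workflow:
      name: my_workflow
      max_concurrent_runs: 1   # never run two instances at once
      queue:
        enabled: true          # queue new runs instead of skipping them
      tasks:
        - task_key: first_task
          notebook_task:
            notebook_path: ./notebooks/first.py
        - task_key: second_task
          depends_on:
            - task_key: first_task   # runs only after first_task succeeds
          notebook_task:
            notebook_path: ./notebooks/second.py
```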
by ctiwari7 (New Contributor II)
  • 1367 Views
  • 2 replies
  • 0 kudos

Databricks workflow job

Hi team, I am trying to execute a workflow job which takes a parameter as a unique identifier. I am using this job parameter to push down to tasks. I was hoping there is a way to use the Python uuid4() function to generate a unique id every tim...

Latest Reply
Stefan-Koch
Valued Contributor II
  • 0 kudos

Hi ctiwari7, a possible way to do that: create a Python file which generates the UUID and then pass it to jobs.taskValues. This is described here: https://docs.databricks.com/en/jobs/task-values.html As a test, I created a Python file with the follo...

1 More Replies
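The approach in the reply above can be sketched in a few lines: a first task generates the UUID and publishes it via task values for downstream tasks. The task and key names are placeholders, and the `dbutils` calls only exist inside a Databricks job, so they are commented here:

```python
# Generate a unique id once per job run and share it across tasks.
import uuid

run_id = str(uuid.uuid4())  # e.g. "3f2b8c1a-..." unique per run

# Inside a Databricks job task (task key "generate_id" is a placeholder):
# dbutils.jobs.taskValues.set(key="run_id", value=run_id)
#
# ...and in a downstream task:
# run_id = dbutils.jobs.taskValues.get(taskKey="generate_id", key="run_id")

print(run_id)
```

This keeps the id generation in one place, so every task in the run sees the same value rather than each task generating its own.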
by ctiwari7 (New Contributor II)
  • 2061 Views
  • 2 replies
  • 1 kudos

get job run link based on the job name or the submit body

This is the current code (ignore indentation) that I am using; it takes the list of all running jobs and then filters the list to get the run id of the matching job name. I want to know if there is a better way to optimise this. Legacy d...

Latest Reply
ctiwari7
New Contributor II
  • 1 kudos

Even the REST API provides the job details based on the job id, which I would need to get from the job_name that I have. This seems like the only possible solution, since job_id is the true identifier of any workflow job, considering we can have mu...

1 More Replies
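One way to avoid listing all runs, assuming the Jobs 2.1 API: `GET /api/2.1/jobs/list` accepts a `name` filter, so the job_id can be resolved directly from the job name. Host, token, and job name below are placeholders, and the actual HTTP call is commented so the sketch stays self-contained:

```python
# Resolve a job_id from a job name via the Jobs 2.1 list endpoint.
from urllib.parse import urlencode

def jobs_list_url(host: str, job_name: str) -> str:
    """Build the URL for GET /api/2.1/jobs/list filtered by job name."""
    return f"{host}/api/2.1/jobs/list?{urlencode({'name': job_name})}"

url = jobs_list_url("https://adb-1234567890123456.7.azuredatabricks.net",
                    "nightly_etl")

# import requests
# resp = requests.get(url, headers={"Authorization": f"Bearer {token}"})
# job_id = resp.json()["jobs"][0]["job_id"]

print(url)
```

Once the job_id is known, each run object returned by `/api/2.1/jobs/runs/list` carries a ready-made `run_page_url` field, so the run link does not need to be assembled by hand.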
by Isa1 (New Contributor III)
  • 2271 Views
  • 6 replies
  • 3 kudos

Resolved! Moving existing Delta Live Table to Asset Bundle

Hi! I am creating an Asset Bundle which also includes my streaming Delta Live Tables pipelines. I want to move these DLT pipelines into the Asset Bundle without having to run my DLT streaming pipeline on all historical files (this takes a lot of comput...

Latest Reply
Walter_C
Databricks Employee
  • 3 kudos

When you change the path to the notebook or the name of the pipeline in your Delta Live Table (DLT) pipeline, it can indeed cause issues. Specifically, changing the path to the notebook or the name of the pipeline can lead to the recreation of the pi...

5 More Replies
by shadowinc (New Contributor III)
  • 1115 Views
  • 1 reply
  • 2 kudos

Delete Partition Folders

Hello team, as Databricks moved away from hive-style partitioning, we can see some 2-letter partition folders created. And I have observed that VACUUM doesn't delete these folders (even though they are empty). Is there any way to delete those usi...

Labels: Data Engineering, delta, vacuum
Latest Reply
Alberto_Umana
Databricks Employee
  • 2 kudos

Hello @shadowinc, VACUUM is used to clean up unused and stale data files that are no longer referenced by a Delta table and are older than a specified retention period (default is 7 days). It does not remove empty directories. I think manual cleanup ...

by Hubert-Dudek (Databricks MVP)
  • 19357 Views
  • 6 replies
  • 19 kudos

Resolved! Optimize and Vacuum - which is the best order of operations?

Optimize -> Vacuum, or Vacuum -> Optimize?

Latest Reply
shadowinc
New Contributor III
  • 19 kudos

What about REORG TABLE? https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/delta-reorg-table Does it help or make sense to add REORG, then Optimize -> Vacuum, every week? Reorganize a Delta Lake table by rewriting files to purge ...

5 More Replies
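The order discussed in this thread can be sketched as a weekly maintenance sequence; the table name is a placeholder. REORG and OPTIMIZE rewrite data files first, and VACUUM then removes the files they superseded. The statements are only built as strings here, with the `spark.sql` calls commented:

```python
# Weekly Delta maintenance, in the order discussed above (placeholder table).
table = "catalog.schema.events"

maintenance = [
    f"REORG TABLE {table} APPLY (PURGE)",  # optional: rewrite files to purge soft-deleted data
    f"OPTIMIZE {table}",                   # compact small files (optionally ZORDER BY ...)
    f"VACUUM {table}",                     # drop files no longer referenced by the table
]

# On a Databricks cluster:
# for stmt in maintenance:
#     spark.sql(stmt)
for stmt in maintenance:
    print(stmt)
```

Running VACUUM last matters because the files that OPTIMIZE (or REORG) replaces only become unreferenced, and therefore deletable, after the rewrite, subject to the retention period.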
by ayush19 (New Contributor III)
  • 942 Views
  • 2 replies
  • 0 kudos

Running a jar on Databricks shared cluster using Airflow

Hello, I have a requirement to run a jar already installed on a Databricks cluster. It needs to be orchestrated using Apache Airflow. I followed the docs for the operator which can be used to do so: https://airflow.apache.org/docs/apache-airflow-provid...

Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hello @ayush19, here are some suggestions, but I would need to check how the parameters are configured. Use an existing cluster: instead of creating a new cluster each time, configure the DatabricksSubmitRunOperator to use an existing cluster. This can...

1 More Replies
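The "use an existing cluster" suggestion can be sketched as a submit-run payload. The cluster id, jar path, and class name are placeholders, and the operator itself (which needs `apache-airflow-providers-databricks` installed) is commented so the payload sketch stays self-contained:

```python
# Runs-submit payload pointing a jar task at an already-running cluster.
submit_run_json = {
    "run_name": "jar-via-airflow",
    "tasks": [
        {
            "task_key": "run_jar",
            "existing_cluster_id": "0123-456789-abcdefgh",  # placeholder
            "spark_jar_task": {"main_class_name": "com.example.Main"},
            "libraries": [{"jar": "dbfs:/FileStore/jars/my_job.jar"}],
        }
    ],
}

# In an Airflow DAG:
# from airflow.providers.databricks.operators.databricks import (
#     DatabricksSubmitRunOperator,
# )
# run_jar = DatabricksSubmitRunOperator(
#     task_id="run_jar",
#     databricks_conn_id="databricks_default",
#     json=submit_run_json,
# )
print(submit_run_json["tasks"][0]["task_key"])
```

Reusing an existing all-purpose cluster avoids the per-run cluster startup time, at the cost of sharing the cluster's resources with other workloads.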
