Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

Sangeethagk
by New Contributor
  • 1206 Views
  • 1 replies
  • 0 kudos

TypeError: ColSpec.__init__() got an unexpected keyword argument 'required'

Hi Team, one of my customers is facing the issue below. Has anyone faced this issue before? Any help would be appreciated.
import mlflow
mlflow.set_registry_uri("databricks-uc")
catalog_name = "system"
embed = mlflow.pyfunc.spark_udf(spark, f"models:/system...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Sangeethagk, It looks like you’re encountering a couple of issues related to mlflow.pyfunc.spark_udf() and model dependencies. TypeError: ColSpec.__init__() got an unexpected keyword argument ‘required’: This error occurs when you’re using mlflo...

  • 0 kudos
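A minimal sketch of the version-alignment fix the reply points toward, assuming the root cause is that ColSpec's 'required' field only exists in newer MLflow releases, so the cluster's MLflow must be at least as new as the version that logged the model. The model URI below is a placeholder mirroring the truncated snippet in the post, and spark is the ambient Databricks session.

# If needed, upgrade MLflow in the notebook first, e.g.:
#   %pip install --upgrade mlflow
#   dbutils.library.restartPython()
import mlflow

mlflow.set_registry_uri("databricks-uc")

catalog_name = "system"
# Placeholder model URI; the schema, model name, and version are assumptions.
embed = mlflow.pyfunc.spark_udf(
    spark,
    f"models:/{catalog_name}.ai.bge_large_en_v1_5/1",
)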
ttamas
by New Contributor III
  • 710 Views
  • 2 replies
  • 1 kudos

Get the triggering task's name

Hi, I have tasks that depend on each other. I would like to get variables from task1, which triggers task2. This is how I solved my problem: following the suggestion in https://community.databricks.com/t5/data-engineering/how-to-pass-parameters-to-a-quot-...

Latest Reply
Kaniz_Fatma
Community Manager
  • 1 kudos

Hi @ttamas, Thank you for sharing your approach! It’s true that handling task dependencies and passing values between tasks in Databricks can sometimes be complex. Databricks now supports dynamic value references in notebooks. Instead of using dbutil...

  • 1 kudos
1 More Replies
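A minimal sketch of the two mechanisms discussed in this thread, task values and dynamic value references; the task names (task1/task2) and the key name are assumptions.

# In task1's notebook: publish a value for downstream tasks.
dbutils.jobs.taskValues.set(key="run_date", value="2024-06-30")

# In task2's notebook: read it back explicitly...
run_date = dbutils.jobs.taskValues.get(taskKey="task1", key="run_date", default="")

# ...or avoid dbutils in task2 altogether by passing it as a task parameter
# with a dynamic value reference in the job configuration:
#   {{tasks.task1.values.run_date}}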
Kjetil
by New Contributor III
  • 930 Views
  • 3 replies
  • 2 kudos

Resolved! Autoloader to concatenate CSV files that update regularly into a single parquet dataframe.

I have multiple large CSV files. One or more of these files changes now and then (a few times a day). The changes in the CSV files are both updates and appends (so both new rows and updates of old ones). I want to combine all CSV files into a datafr...

Latest Reply
jose_gonzalez
Moderator
  • 2 kudos

Hi @Kjetil, Please let us know if you still have an issue or if @-werners-' response could be marked as the best solution. Thank you

  • 2 kudos
2 More Replies
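A minimal Auto Loader sketch for the pattern discussed in this thread; the source path, schema/checkpoint locations, and target table are placeholders. Note that by default Auto Loader tracks files by name and does not reprocess a file that is updated in place, which is why options such as cloudFiles.allowOverwrites (or copying changed files to a landing folder) come up in the replies.

(
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "csv")
    .option("cloudFiles.schemaLocation", "/tmp/checkpoints/csv_schema")
    .option("cloudFiles.allowOverwrites", "true")  # assumption: reprocess updated files
    .option("header", "true")
    .load("abfss://landing@myaccount.dfs.core.windows.net/csv/")   # placeholder path
    .writeStream
    .option("checkpointLocation", "/tmp/checkpoints/csv_ingest")
    .trigger(availableNow=True)
    .toTable("main.bronze.combined_csv")               # placeholder target table
)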
KSI
by New Contributor II
  • 745 Views
  • 1 replies
  • 0 kudos

Variant datatype

I'm checking on the variant datatype and noted that whenever a JSON string is stored as a variant datatype, it needs to be cast in order to filter and extract values, i.e.
SELECT sum(jsondatavar:Value::double)
FROM table
WHERE jsondatavar:customer::int = 1000
Here j...

Latest Reply
Mounika_Tarigop
New Contributor III
  • 0 kudos

Could you please try using SQL functions:
SELECT SUM(CAST(get_json_object(jsondatavar, '$.Value') AS DOUBLE)) AS total_value
FROM table
WHERE CAST(get_json_object(jsondatavar, '$.customer') AS INT) = 1000

  • 0 kudos
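A short sketch contrasting the two styles in this thread: the path/cast syntax from the question versus the get_json_object approach in the reply. The table and column names are placeholders; the path syntax assumes a VARIANT (or JSON string) column on a recent runtime, while get_json_object operates on a JSON string column.

# Path extraction with explicit casts, as in the post:
spark.sql("""
    SELECT SUM(jsondatavar:Value::DOUBLE) AS total_value
    FROM my_table
    WHERE jsondatavar:customer::INT = 1000
""").show()

# Equivalent using get_json_object on a JSON string column, as in the reply:
spark.sql("""
    SELECT SUM(CAST(get_json_object(jsondatavar, '$.Value') AS DOUBLE)) AS total_value
    FROM my_table
    WHERE CAST(get_json_object(jsondatavar, '$.customer') AS INT) = 1000
""").show()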
Sweta
by New Contributor II
  • 332 Views
  • 1 replies
  • 2 kudos

Optimized option to write updates to Aurora PostgresDB from Databricks/spark

Hello All, We want to update our Postgres tables from our Spark Structured Streaming workflow on Databricks. We are using the foreachBatch utility to write to this sink. I want to understand an optimized way to do this at near-real-time latency, avoidi...

Latest Reply
Kaniz_Fatma
Community Manager
  • 2 kudos

Hi @Sweta, Your question about optimizing updates to PostgreSQL tables from a Spark structured streaming workflow is quite relevant, and I’m glad you’re exploring different approaches. Running updates directly on the main table in PostgreSQL using ...

  • 2 kudos
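A hedged foreachBatch sketch of the staging-table pattern for near-real-time updates to Postgres; the source table, JDBC URL, secret scope, and staging table are all placeholders.

def upsert_to_postgres(batch_df, batch_id):
    # Append each micro-batch to a staging table over JDBC; a follow-up
    # MERGE/UPSERT from staging into the main table keeps locks on it short.
    (
        batch_df.write.format("jdbc")
        .option("url", "jdbc:postgresql://aurora-host:5432/mydb")
        .option("dbtable", "staging.events")
        .option("user", dbutils.secrets.get("my_scope", "pg_user"))
        .option("password", dbutils.secrets.get("my_scope", "pg_password"))
        .mode("append")
        .save()
    )

stream_df = spark.readStream.table("main.silver.events")  # placeholder streaming source

(
    stream_df.writeStream
    .foreachBatch(upsert_to_postgres)
    .option("checkpointLocation", "/tmp/checkpoints/pg_sink")
    .start()
)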
Jiri_Koutny
by New Contributor III
  • 4857 Views
  • 11 replies
  • 3 kudos

Delay in files update on filesystem

Hi, I noticed that there is quite a significant delay (2 - 10s) between making a change to some file in Repos via Databricks file edit window and propagation of such change to the filesystem. Our engineers and scientists use YAML config files. If the...

Latest Reply
Irka
New Contributor II
  • 3 kudos

Is there a solution to this? BTW, the "ls" command trick didn't work for me.

  • 3 kudos
10 More Replies
Prajwal_082
by New Contributor II
  • 643 Views
  • 3 replies
  • 0 kudos

Overwriting a delta table using DLT

Hello, We are trying to ingest a bunch of CSV files that we receive on a daily basis using DLT. We chose a streaming table for this purpose, but since a streaming table is append-only, records keep adding up daily, which will cause multiple rows in downst...

Latest Reply
giuseppegrieco
New Contributor III
  • 0 kudos

In your scenario, if the data loaded on day 2 also includes all the data from day 1, you can still apply a "remove duplicates" logic. For instance, you could compute a hashdiff by hashing all the columns and use this to exclude rows you've already se...

  • 0 kudos
2 More Replies
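A hedged sketch of the hashdiff idea from the reply above, written as a DLT streaming table; the source path and table name are placeholders. Note that dropDuplicates on a stream keeps state for every hash it has seen, which suits this daily-file volume but not unbounded streams.

import dlt
from pyspark.sql import functions as F

@dlt.table(name="daily_csv_bronze")
def daily_csv_bronze():
    df = (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "csv")
        .option("header", "true")
        .load("/Volumes/main/landing/daily_csv/")   # placeholder path
    )
    # Fingerprint every row by hashing all columns, then drop rows whose
    # fingerprint has already been seen in an earlier file.
    return (
        df.withColumn("hashdiff", F.sha2(F.concat_ws("||", *df.columns), 256))
          .dropDuplicates(["hashdiff"])
    )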
Kjetil
by New Contributor III
  • 812 Views
  • 1 replies
  • 1 kudos

Resolved! Read and process large CSV files that update regularly

I've got a lot of large CSV files (> 1 GB) that update regularly (stored in Data Lake Gen 2). The task is to concatenate these files into a single dataframe that is written to parquet format. However, since these files update very often, I get a rea...

Latest Reply
daniel_sahal
Esteemed Contributor
  • 1 kudos

@Kjetil Since they are getting updated often, IMO making a copy would make sense. What you could try is to create a Microsoft.Storage.BlobCreated event to replicate the .CSV into the secondary bucket. However, best practice would be to have some kind...

  • 1 kudos
virementz
by New Contributor II
  • 958 Views
  • 4 replies
  • 0 kudos

Cluster Failed to Start - Cluster-scoped init script failed: Script exit status is non-zero

I have been using a cluster-scoped init script for around 1 year already and everything has been working fine. But suddenly, the Databricks cluster has failed to restart since last Thursday (13th June 2024). It returns this error: "Failed to add 2 container...

Latest Reply
Wojciech_BUK
Valued Contributor III
  • 0 kudos

Just maybe there is no outbound connection on DEV from the cluster VNET to the URL you are trying to reach? You can spin up an all-purpose cluster and try testing the connection with the %sh magic command.

  • 0 kudos
3 More Replies
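A quick outbound-connectivity check along the lines Wojciech_BUK suggests, run from an all-purpose cluster; the URL is a placeholder for whatever endpoint the init script downloads from.

import requests

url = "https://repos.example.com/some/package.deb"   # placeholder endpoint
try:
    resp = requests.head(url, timeout=10, allow_redirects=True)
    print(f"Reached {url}: HTTP {resp.status_code}")
except requests.exceptions.RequestException as exc:
    print(f"No outbound connectivity to {url}: {exc}")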
Phani1
by Valued Contributor II
  • 270 Views
  • 1 replies
  • 0 kudos

Databricks Cross platform data access

Hi Team, We have a requirement: data is stored on the S3 platform, while our Databricks is hosted on Azure. Our objective is to access the data from the S3 location. Could you kindly provide us with the most suitable approach for this scenario? e.g. exte...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Phani1, To access data from an S3 location in Azure Databricks, you have a few options: Using AWS Keys and Secret Scopes: Configure Spark properties to set your AWS keys stored in secret scopes as environment variables. Create a secret scope t...

  • 0 kudos
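A hedged sketch of the "AWS keys in a secret scope" option from the reply; the secret scope, key names, and bucket path are placeholders.

access_key = dbutils.secrets.get(scope="aws", key="access_key_id")
secret_key = dbutils.secrets.get(scope="aws", key="secret_access_key")

# Per-notebook Spark configuration for s3a access from Azure Databricks.
spark.conf.set("fs.s3a.access.key", access_key)
spark.conf.set("fs.s3a.secret.key", secret_key)

df = spark.read.parquet("s3a://my-bucket/path/to/data/")   # placeholder bucket
display(df)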
skirock
by New Contributor
  • 519 Views
  • 1 replies
  • 0 kudos

DLT live tables error while reading file from datalake gen2

I am getting the following error while running a cell in Python. The same file runs fine when I upload the JSON file into Databricks and then give that path to the df.read syntax. When I use DLT for the same file in the data lake, it gives me the follo...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @skirock, PySpark provides built-in testing utilities for unit testing. These utilities are standalone and can work with any test framework or CI test pipeline. For simple ad-hoc validation cases, consider using functions like assertDataFram...

  • 0 kudos
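A small sketch of the PySpark testing utility the reply mentions (available in PySpark 3.5+ and recent Databricks runtimes); the DataFrames here are toy examples.

from pyspark.testing import assertDataFrameEqual

expected = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "val"])
actual = spark.createDataFrame([(1, "a"), (2, "b")], ["id", "val"])

assertDataFrameEqual(actual, expected)  # raises an assertion error if the frames differ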
Mahesh_Yadav
by New Contributor
  • 440 Views
  • 1 replies
  • 0 kudos

How to Export lineage data directly from unity catalog without using system tables

I have been trying to check if there is any direct way to export lineage hierarchy data in Databricks. I have tried to build a workaround solution by accessing system tables as per this link: Monitor usage with system tables - Azure Databricks | Micro...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Mahesh_Yadav, You can capture lineage data directly from Unity Catalog in Databricks without relying on system tables. Here’s how you can achieve this: Using Unity Catalog: Go to your Databricks landing page. Click “New” in the sidebar and sel...

  • 0 kudos
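A hedged sketch of pulling table lineage through the Databricks lineage-tracking REST API rather than system tables; the workspace URL, secret scope, and table name are placeholders, and the exact parameter names should be checked against the current API documentation.

import requests

host = "https://adb-1234567890123456.7.azuredatabricks.net"   # placeholder workspace URL
token = dbutils.secrets.get(scope="my_scope", key="pat")

resp = requests.get(
    f"{host}/api/2.0/lineage-tracking/table-lineage",
    headers={"Authorization": f"Bearer {token}"},
    params={"table_name": "main.my_schema.my_table", "include_entity_lineage": "true"},
)
resp.raise_for_status()
print(resp.json())   # upstream/downstream tables and notebooks for the table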
ChingizK
by New Contributor III
  • 351 Views
  • 1 replies
  • 0 kudos

Exclude a job from bundle deployment in PROD

My question is regarding Databricks Asset Bundles. I have defined a databricks.yml file the following way:
bundle:
  name: my_bundle_name
include:
  - resources/jobs/*.yml
targets:
  dev:
    mode: development
    default: true
    workspace: ...

Latest Reply
giuseppegrieco
New Contributor III
  • 0 kudos

Hello, if you want, you can deploy specific jobs only in the development environment. Since you have only two environments, a straightforward approach is to modify your jobs YAML definition as follows:
resources:
  jobs:
    # Define the jobs to be de...

  • 0 kudos
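A hedged YAML sketch of one way to keep a job out of prod, along the lines of the reply above: declare the dev-only job under the dev target's resources instead of at the bundle's top level. All names and paths are placeholders.

bundle:
  name: my_bundle_name

include:
  - resources/jobs/*.yml

targets:
  dev:
    mode: development
    default: true
    resources:
      jobs:
        dev_only_job:              # placeholder dev-only job
          name: dev-only-job
          tasks:
            - task_key: main
              notebook_task:
                notebook_path: ../src/dev_only_notebook.py
  prod:
    mode: production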
HASSAN_UPPAL123
by New Contributor II
  • 2105 Views
  • 1 replies
  • 0 kudos

Resolved! Getting com.databricks.client.jdbc.Driver is not found error while connecting to databricks

Hi Community, I need help regarding a class-not-found issue. I'm trying to connect to Databricks in Python via jaydebeapi, provided the proper class name `com.databricks.client.jdbc.Driver` and jar path `databricks-jdbc-2.6.34.jar`, but I'm getting com.data...

Latest Reply
User16502773013
Contributor II
  • 0 kudos

Hello @HASSAN_UPPAL123, The class name is correct; for the jar, please try downloading the latest from here. This issue may also be a classpath issue where the jar is not exported correctly in your client setup. I see similar issues/suggested solutions ...

  • 0 kudos
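A hedged jaydebeapi sketch with the driver jar passed explicitly, which sidesteps most classpath issues; the server hostname, HTTP path, token, and jar location are placeholders.

import jaydebeapi

conn = jaydebeapi.connect(
    "com.databricks.client.jdbc.Driver",
    "jdbc:databricks://<server-hostname>:443;httpPath=<http-path>;"
    "AuthMech=3;UID=token;PWD=<personal-access-token>",
    jars="/path/to/databricks-jdbc-2.6.34.jar",   # explicit jar on the classpath
)
cursor = conn.cursor()
cursor.execute("SELECT 1")
print(cursor.fetchall())
cursor.close()
conn.close()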
Avinash_Narala
by Contributor
  • 5203 Views
  • 4 replies
  • 1 kudos

Resolved! export notebook

Hi, I want to export a notebook programmatically in Python. Is there a way to leverage the Databricks CLI in Python, or any other way to export the notebook to my local PC?

Latest Reply
Pri-databricks
New Contributor II
  • 1 kudos

Is there a way to export a notebook through Terraform? If so, please provide examples. With terraform-provider-databricks.exe we are able to export all the notebooks from the workspace, but not a single notebook. Any suggestions?

  • 1 kudos
3 More Replies
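A hedged sketch of exporting a single notebook with the Databricks SDK for Python, one way to do this programmatically from a local machine; the notebook path and output filename are placeholders, and authentication is assumed to come from environment variables or a configured profile.

import base64

from databricks.sdk import WorkspaceClient
from databricks.sdk.service.workspace import ExportFormat

w = WorkspaceClient()  # reads host/token from env vars or ~/.databrickscfg

exported = w.workspace.export(
    path="/Users/someone@example.com/my_notebook",   # placeholder workspace path
    format=ExportFormat.SOURCE,
)
with open("my_notebook.py", "wb") as f:
    f.write(base64.b64decode(exported.content))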
