Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
I am in the process of designing a Medallion architecture where the data sources include REST API calls, JSON files, SQL Server, and Azure Event Hubs. For the Silver and Gold layers, I plan to leverage Delta Live Tables (DLT). However, I am seeking gu...
I get this error regarding database validation when setting up a Lakeflow Connect CDC pipeline (see screenshot). The two links mentioned in the message are broken; they give me a "404 - Content Not Found" when I try to open them.
Hi bricklayers, originally from a Teradata background and relatively new to Databricks, I was in need of brushing up on my Python and GitHub CI/CD skills, so I’ve spun up a repo for a project I’m calling Terabricks. The aim is to provide a space for mak...
Fantastic initiative, @hobrob. I have used Teradata for a good 5+ years, though pre-2014/15. So I will be closely following it and am very happy to contribute to it. Thanks.
Hi, we are ingesting data using the Databricks Lakeflow SQL connector from two different SQL Server databases hosted on separate servers. As part of the setup: we created two separate ingestion gateways, and we created two separate ingestion pipelines. Both pi...
Hi @Louis_Frolio , I’ve successfully ingested one SQL database using the Lakeflow SQL connector. As part of the setup, I created an ingestion pipeline along with a gateway, and it is working as expected - when I run or re-run the pipeline, it picks u...
Hi all, we are in the process of migrating from directory listing to managed file events in Azure Databricks. Our data is stored in an Azure Data Lake container with the following folder structure: To enable file events in Unity Catalog (UC), I created...
Recommended approach to continue your existing pattern:
- Keep the external location enabled for file events at the high-level path (/Landing).
- Run a separate Structured Streaming job for each table, specifying the full sub-path in the .load() function (...
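A minimal sketch of what one such per-table stream could look like, assuming Auto Loader (cloudFiles) with managed file events enabled on the external location; the storage account, file format, sub-path, checkpoint location, and target table names are all illustrative assumptions:

```python
# Hypothetical per-table stream: Auto Loader reading one sub-path under /Landing.
# Paths, format, table names, and checkpoint location are assumptions.
(
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.useManagedFileEvents", "true")  # assumes UC managed file events
    .load("abfss://landing@mystorageacct.dfs.core.windows.net/Landing/table_a")
    .writeStream
    .option("checkpointLocation", "/Volumes/main/default/checkpoints/table_a")
    .trigger(availableNow=True)
    .toTable("main.bronze.table_a")
)
```

Each table gets its own checkpoint, so the streams can be scheduled and backfilled independently while sharing the single file-events-enabled external location.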
Hi team, I’m processing ~5,000 EMR notes with a Databricks notebook. The job reads from `crc_lakehouse.bronze.emr_notes`, runs SciSpaCy UMLS entity extraction plus a fine-tuned BERT sentiment model per partition, and builds a DataFrame (`df_entities`...
You’re right that the behaviour is weird at first glance (“5k rows on a 64 GB cluster and I blow up on write”), but your stack trace is actually very revealing: this isn’t a classic Delta write / shuffle OOM – it’s SciSpaCy/UMLS falling over when loa...
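A common mitigation for this failure mode (a sketch, not necessarily the poster's exact fix): load the heavy SciSpaCy pipeline once per partition inside mapPartitions instead of once per record, so each task holds a single copy of the model. The model name, column names, and partition count below are assumptions:

```python
# Sketch: load the SciSpaCy model once per partition rather than per record.
# "en_core_sci_sm" and the note_id/note_text columns are illustrative assumptions.
def extract_entities(rows):
    import spacy
    nlp = spacy.load("en_core_sci_sm")  # loaded once for the whole partition
    for row in rows:
        doc = nlp(row.note_text)
        for ent in doc.ents:
            yield (row.note_id, ent.text, ent.label_)

df_entities = (
    spark.table("crc_lakehouse.bronze.emr_notes")
    .repartition(32)  # keep partitions small enough for the model's memory footprint
    .rdd.mapPartitions(extract_entities)
    .toDF(["note_id", "entity", "label"])
)
```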
We are running a Deep Clone script to copy catalogs between environments; this script is run through a job (run by a service principal) on DBR 16.4.12. Some tables are deep cloned and others are dropped and recreated to load partial data. The ones dropped are re...
Happy Monday @DarioB, I did some digging and would like to provide you with some helpful hints/tips.
Thanks for the detailed context—this is a known rough edge in DBR 16.x when recreating tables that have row tracking materialized.
What’s happening ...
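One possible workaround, sketched under the assumption (per the reply above) that materialized row tracking on the recreated tables is the culprit: recreate the partial-load table with row tracking explicitly disabled via the delta.enableRowTracking table property. The catalog, schema, table, and filter below are hypothetical:

```python
# Sketch: recreate the partial-load table with row tracking disabled up front,
# so the drop-and-recreate path does not carry materialized row tracking metadata.
# All names and the load_date filter are illustrative assumptions.
spark.sql("DROP TABLE IF EXISTS target_catalog.schema.partial_table")
spark.sql("""
    CREATE TABLE target_catalog.schema.partial_table
    TBLPROPERTIES ('delta.enableRowTracking' = 'false')
    AS SELECT * FROM source_catalog.schema.partial_table
    WHERE load_date >= '2024-01-01'
""")
```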
A very generic question. Here are general rules and best practices related to the Databricks well-architected framework: https://docs.databricks.com/aws/en/lakehouse-architecture/well-architected Take a deeper look at operational excellence, reliability an...
Trying to load a CSV file from a private S3 bucket. Please clarify the requirements to do this: Can I do it in Community Edition (if yes, then how)? How do I do it in the premium version? I have an IAM role, and I also have an access key & secret.
Assuming you have these prerequisites:
- A private S3 bucket (e.g., s3://my-private-bucket/data/file.csv)
- An IAM user or role with access (list/get) to that bucket
- The AWS Access Key ID and Secret Access Key (client and secret)
The most straightforward w...
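A minimal sketch of the access-key approach, assuming the bucket and path from the thread and a hypothetical secret scope named "aws" (prefer instance profiles or Unity Catalog external locations in production; keys are shown here only for illustration):

```python
# Sketch: read a CSV from a private S3 bucket using access keys pulled from a
# secret scope. The scope/key names and bucket path are assumptions.
spark.conf.set("fs.s3a.access.key", dbutils.secrets.get("aws", "access_key"))
spark.conf.set("fs.s3a.secret.key", dbutils.secrets.get("aws", "secret_key"))

df = (
    spark.read.option("header", "true")
    .option("inferSchema", "true")
    .csv("s3a://my-private-bucket/data/file.csv")
)
display(df)
```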
As a data engineer deeply passionate about how data and AI can come together to create real-world impact, I’m excited to share my project for the Databricks Free Edition Hackathon 2025 — Future of Movie Discovery (FMD). Built entirely on Databricks F...
Hi @Brahmareddy, really enjoyed your hackathon demo. You’ve set a high bar for NLP-focused projects. I picked up a lot from your approach and it’s definitely given me ideas to try out. For my hackathon entry, I took a similar direction using pyspark.m...
I'm working with Delta tables using the Iceberg UniForm feature to enable Iceberg-compatible reads. I’m trying to understand how metadata cleanup works in this setup. Specifically, does the VACUUM operation—which removes old Delta Lake metadata based ...
Here’s how to approach cleaning and maintaining Apache Iceberg metadata on Databricks, and how it differs from Delta workflows.
First, know your table type
For Unity Catalog–managed Iceberg tables, Databricks runs table maintenance for you (predicti...
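For context on the Delta side of a UniForm-enabled table, the familiar maintenance commands still apply to the Delta data files and transaction log; the Iceberg metadata that UniForm generates is handled separately, as the reply above outlines. A minimal sketch, with the table name and retention window as assumptions:

```python
# Sketch: routine Delta-side maintenance on a UniForm-enabled table.
# VACUUM removes unreferenced data files past the retention window; it does not
# govern the Iceberg metadata snapshots that UniForm produces.
spark.sql("OPTIMIZE main.analytics.orders_uniform")
spark.sql("VACUUM main.analytics.orders_uniform RETAIN 168 HOURS")
```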
Hi team, we have our Bronze (append), Silver (append), and Gold (merge) tables loaded using Spark Structured Streaming continuously, with a processing-time trigger (3 secs). We also run maintenance jobs on the tables, like OPTIMIZE and VACUUM, and we perform DELETE for som...
Hi Mark, but the real problem is that our streaming job runs 24x7, 365 days a year, and we can't afford any further latency for our data flowing to the Gold layer. We don't have any window to pause or slow our streaming, and we continuously get the data feed actually s...
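One relevant knob for this situation (a sketch, assuming the downstream stream reads a Delta table that also receives DELETEs from the maintenance job): Structured Streaming can skip the data-changing commits produced by deletes/updates rather than failing, which lets maintenance run without pausing the stream. Table names, checkpoint path, and the append-style sink are assumptions:

```python
# Sketch: stream from a Delta table that also receives DELETE/UPDATE commits.
# skipChangeCommits tells the reader to ignore data-changing commits so the
# stream keeps running while maintenance jobs modify the source table.
(
    spark.readStream.format("delta")
    .option("skipChangeCommits", "true")
    .table("main.silver.events")
    .writeStream
    .option("checkpointLocation", "/Volumes/main/default/checkpoints/gold_events")
    .trigger(processingTime="3 seconds")
    .toTable("main.gold.events")
)
```

Note that skipped commits never reach the sink, so this fits pipelines where the deletes are retention/cleanup rather than corrections that must propagate.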
We are testing an ingestion from Kafka to Databricks using a streaming table. The streaming table was created by a DAB deployed to "production", which runs as a service principal. This means the service principal is the "owner" of the table. We now wan...
You’ve hit two limitations:
- Streaming tables don’t allow SET OWNER – ownership cannot be changed.
- Lakeflow pipeline ID changes require pipeline-level permissions – if you’re not the pipeline owner, you can’t run ALTER STREAMING TABLE ... SET PIPELINE_I...
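A sketch of the usual workaround: instead of changing ownership, grant your team CAN_MANAGE on the underlying pipeline, for example via the Databricks SDK for Python. The pipeline ID and group name below are illustrative assumptions:

```python
# Sketch: grant a group CAN_MANAGE on the Lakeflow pipeline so non-owners can
# manage it without changing table ownership. IDs/names are assumptions.
from databricks.sdk import WorkspaceClient
from databricks.sdk.service.iam import AccessControlRequest, PermissionLevel

w = WorkspaceClient()
w.permissions.update(
    request_object_type="pipelines",
    request_object_id="1234-abcd-pipeline-id",
    access_control_list=[
        AccessControlRequest(
            group_name="data-engineers",
            permission_level=PermissionLevel.CAN_MANAGE,
        )
    ],
)
```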
Hello everyone, in our implementation of the Medallion Architecture, we want to stream changes with Spark Structured Streaming. I would like some advice on how to use a Delta table as a source correctly, and whether there is a performance (memory usage) concern in t...
In your scenario using Medallion Architecture with Delta tables as both streaming source and sink, it is important to understand Spark Structured Streaming behavior and performance characteristics, especially with joins and memory usage. Here is a di...
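A minimal sketch of the pattern under discussion, with all table and checkpoint names assumed: reading a Delta table as a stream and joining it to a static dimension table. A stream-static join is stateless, since the static side is re-read each micro-batch, so memory pressure stays bounded compared with a stream-stream join:

```python
# Sketch: Delta table as streaming source, stream-static join into a Silver table.
# The static side (customers) is re-read per micro-batch; no join state is kept.
orders_stream = spark.readStream.format("delta").table("main.bronze.orders")
customers = spark.table("main.silver.customers")  # static dimension side

(
    orders_stream.join(customers, "customer_id", "left")
    .writeStream
    .option("checkpointLocation", "/Volumes/main/default/checkpoints/silver_orders")
    .toTable("main.silver.orders_enriched")
)
```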