Data Engineering

Forum Posts

Sorted by:

by BobCat62 • New Contributor III

yesterday

168 Views
3 replies
1 kudos

How to copy notebooks from local to the tarrget folder via asset bundles

Hi all,I am able to deploy Databricks assets to the target workspace. Jobs and workflows can also be created successfully.But I have aspecial requirement, that I copy the note books to the target folder on databricks workspace.Example:on Local I have...

Data Engineering

168 Views
3 replies
1 kudos

yesterday

View Replies

Latest Reply

BobCat62
New Contributor III

9 hours ago

1 kudos

Hello @ashraf1395 ,Nice to hear you and thank you for your hints.Actually with your idea, I could reach half of my aim you can see here the folder structure in my VS code:and here is part of my `databrick.yml` file:targets: dev: # The default tar...

1 kudos

9 hours ago

2 More Replies

by cmathieu • New Contributor II

yesterday

200 Views
3 replies
0 kudos

DAB - All projects files deployed

I have an issue with DAB where all the project files, starting from root ., get deployed to the /files folder in the bundle. I would prefer being able to deploy certain util notebooks, but not all the files of the project. I'm able to not deploy any ...

Data Engineering

200 Views
3 replies
0 kudos

yesterday

View Replies

Latest Reply

ashraf1395
Honored Contributor

42m ago

0 kudos

@cmathieu , It will support deployment of whole directory and not others as well.

0 kudos

42m ago

2 More Replies

by khangnguyen164 • New Contributor

yesterday

69 Views
2 replies
0 kudos

Error "insert concurrent to Delta Lake" when 2 streaming merge data to same table at the same time

Hello everyone ,We currently have 2 streaming (Bronze job) created on 2 tasks in the same job, running the same compute job and both merge data into the same table (Silver table). If I create it like above, sometimes I get an error related to "insert...

Data Engineering

69 Views
2 replies
0 kudos

yesterday

View Replies

Latest Reply

khangnguyen164
New Contributor

3 hours ago

0 kudos

Anyone else can help me this case

0 kudos

3 hours ago

1 More Replies

by HoussemBL • New Contributor III

01-21-2025 6:39:30 AM

350 Views
3 replies
0 kudos

External tables in DLT pipelines

Hello community,I have implemented a DLT pipeline.In the "Destination" setting of the pipeline I have specified a unity catalog with target schema of type external referring to an S3 destination.My DLT pipeline works well. Yet, I noticed that all str...

Data Engineering

350 Views
3 replies
0 kudos

01-21-2025 6:39:30 AM

View Replies

Latest Reply

Sushil_saini
Visitor

5 hours ago

0 kudos

This won't work.best approach is create dlt sink to write to delta external table. This pipeline should only be 1 step. Read table and append flow using data sink. It works fine.

0 kudos

5 hours ago

2 More Replies

by a_user12 • New Contributor III

10 hours ago

36 Views
1 replies
0 kudos

databricks bundle Deploy: exit code 0 even if an error occurs

We have a CI/CD pipeline where we run:databricks bundle deploy [...]The code works fine, however, if we missconfigure it, we see in the output an error message such asDeploying resources... Updating deployment state... Warning: Detected unresolved va...

Data Engineering

asset bundle

36 Views
1 replies
0 kudos

10 hours ago

View Replies

Latest Reply

a_user12
New Contributor III

9 hours ago

0 kudos

you can close it: it was an ci/cd issue

0 kudos

9 hours ago

by gsouza • Visitor

9 hours ago

28 Views
0 replies
0 kudos

Databricks asset bundle occasionally duplicating jobs

Since last year, we have adopted Databricks Asset Bundles for deploying our workflows to the production and staging environments. The tool has proven to be quite effective, and we currently use Azure DevOps Pipelines to automate bundle deployment, tr...

Data Engineering

28 Views
0 replies
0 kudos

9 hours ago

by IGRACH • New Contributor II

11 hours ago

23 Views
0 replies
0 kudos

Unable to delete a table

When I try to delete a table, I'm getting this error:[ErrorClass=INVALID_STATE] TABLE catalog.schema.table_name cannot be deleted because it is being shared via Delta Sharing.I have checked on the internet about it, but could not find any info about ...

Data Engineering

23 Views
0 replies
0 kudos

11 hours ago

by Rajt1 • Visitor

11 hours ago

9 Views
0 replies
0 kudos

Job , Task, Stage Creation

I am running below code -df = spark.read.json('xyz.json')df.countI want to understand the actual working of the spark. How many jobs & stages will be created. I want to understand the detailed & easier concept of how it works?

Data Engineering

9 Views
0 replies
0 kudos

11 hours ago

by mrstevegross • Contributor

yesterday

159 Views
3 replies
0 kudos

Attempt to use a custom container with an instance pool fails

I am trying to run a job with (1) custom containers, and (2) via an instance pool. Here's the setup:The custom container is just the DBR-provided `databricksruntime/standard:12.2-LTS`The instance pool is defined via the UI (see screenshot, below).At ...

Data Engineering

159 Views
3 replies
0 kudos

yesterday

View Replies

Latest Reply

mrstevegross
Contributor

12 hours ago

0 kudos

I think I have solved this. I added a URL for `preloaded_docker_image` to my instance pool, and the job worked correctly.This suggests that the DBR docs for preloaded_docker_image are incomplete; they should clarify that a user must add an entry in o...

0 kudos

12 hours ago

2 More Replies

by ADuma • New Contributor III

13 hours ago

37 Views
0 replies
0 kudos

Job sometimes failing due to library installation error of Pypi library

I am running a job on a Cluster from a compute pool that is installing a package from our Azure Artifacts Feed. My task is supposed to run a wheel task from our library which has about a dozen dependencies.For more than 95% of the runs this job works...

Data Engineering

37 Views
0 replies
0 kudos

13 hours ago

by matanper • New Contributor III

07-25-2023 7:07:19 AM

4247 Views
6 replies
1 kudos

Custom docker image fails to initalize

I'm trying to use a custom docker image for my job. This is my docker file:FROM databricksruntime/standard:12.2-LTS COPY . . RUN /databricks/python3/bin/pip install -U pip RUN /databricks/python3/bin/pip install -r requirements.txt USER rootMy job ...

Data Engineering

4247 Views
6 replies
1 kudos

07-25-2023 7:07:19 AM

View Replies

Latest Reply

mrstevegross
Contributor

13 hours ago

1 kudos

Did y'all ever figure this out? I'm running in a similar issue.

1 kudos

13 hours ago

5 More Replies

by dc-rnc • New Contributor II

13 hours ago

50 Views
0 replies
0 kudos

Writing to Delta Table and retrieving back the IDs doesn't work

Hi.I have a workflow in which I write few rows into a Delta Table with auto-generated IDs. Then, I need to retrieve them back just after they're written into the table to collect those generated IDs, so I read the table and I use two columns (one is ...

Data Engineering

50 Views
0 replies
0 kudos

13 hours ago

by p_romm • New Contributor III

14 hours ago

43 Views
0 replies
0 kudos

INVALID_HANDLE.SESSION_NOT_FOUND

We run several workflows and tasks parallel using serverless compute. In many different places of code we started to get errors as below. It looks like that when one task fails, every other that run at the same moment fails as well. After retry on on...

Data Engineering

43 Views
0 replies
0 kudos

14 hours ago

by badari_narayan • New Contributor II

Thursday

107 Views
1 replies
0 kudos

Having an issue assigning databricks_current_metastore with terraform provider

I am trying to assign my databricks_current_metastore on terraform and I get the following error back as an output Error: cannot read current metastore: cannot get client current metastore: invalid Databricks Workspace configurationwith data.databric...

Data Engineering

107 Views
1 replies
0 kudos

Thursday

View Replies

Latest Reply

Panda
Valued Contributor

14 hours ago

0 kudos

@badari_narayan Based on above terraform code, you are trying to use the databricks.accounts provider to read the current workspace metastore, which is incorrect — the databricks_current_metastore data source is a workspace-level resource, and must b...

0 kudos

14 hours ago

by jdlogos • New Contributor II

a week ago

185 Views
2 replies
1 kudos

apply_changes_from_snapshot with expectations

Hi,Question: Are expectations supposed to function in conjunction with create_streaming_table() and apply_changes_from_snapshot?Our team is investigating Delta Live Tables and we have a working prototype using Autoloader to ingest some files from a m...

Data Engineering

185 Views
2 replies
1 kudos

a week ago

View Replies

Latest Reply

jdlogos
New Contributor II

14 hours ago

1 kudos

Hi Stefan-Koch,We reached out to our account rep and was instructed to create an Azure support ticket since we do not yet have a paid support plan. We are hoping to negotiate for paid support. However, I do not believe the documentation surrounding...

1 kudos

14 hours ago

1 More Replies

User

Count

1611

763

345

286

252

Databricks Community

Forum Posts

How to copy notebooks from local to the tarrget folder via asset bundles

DAB - All projects files deployed

Error "insert concurrent to Delta Lake" when 2 streaming merge data to same table at the same time

External tables in DLT pipelines

databricks bundle Deploy: exit code 0 even if an error occurs

Databricks asset bundle occasionally duplicating jobs

Unable to delete a table

Job , Task, Stage Creation

Attempt to use a custom container with an instance pool fails

Job sometimes failing due to library installation error of Pypi library

Custom docker image fails to initalize

Writing to Delta Table and retrieving back the IDs doesn't work

INVALID_HANDLE.SESSION_NOT_FOUND

Having an issue assigning databricks_current_metastore with terraform provider

apply_changes_from_snapshot with expectations

Join Us as a Local Community Builder!

Revert cluster DBR version to last DBR

Delta Live Tables are refreshed in parallel rather...

How can I efficiently remove backslashes during a ...

Partitioning vs. Clustering for a 50 TiB Delta Lak...

Run failed with termination code: RunExecutionErro...