Data Engineering

Forum Posts

Sorted by:

by CaptainJack • New Contributor III

07-31-2024 4:07:34 AM

1807 Views
4 replies
1 kudos

Get taskValue from job as task, and then pass it to next task.

I have workflow like this.1 task: job as a task. Inside this job there is task which is seting parameter x as taskValue using dbutils.jobs.taskValues.set. 2. task dependent on previous job as a task. I would like to access this parameter x. I tried t...

Data Engineering

1807 Views
4 replies
1 kudos

07-31-2024 4:07:34 AM

View Replies

Latest Reply

NandiniN
Databricks Employee

08-02-2024 6:16:34 PM

1 kudos

I see, I have requested for someone else to guide you on this. cc: @Retired_mod

1 kudos

08-02-2024 6:16:34 PM

3 More Replies

by turtleXturtle • New Contributor II

07-31-2024 7:53:50 PM

996 Views
1 replies
0 kudos

Delta share existing parquet files in R2

Hi - I have existing parquet files in Cloudflare R2 storage (created outside of Databricks). I would like to share them via Delta Share, but I keep running into an error. Is it possible to share existing parquet files without duplicating them?I did...

Data Engineering

996 Views
1 replies
0 kudos

07-31-2024 7:53:50 PM

View Replies

Latest Reply

turtleXturtle
New Contributor II

08-02-2024 1:11:15 PM

0 kudos

Thanks @Retired_mod. It's currently possible to share a delta table stored in an S3 external location without duplication or doing the `DEEP CLONE` first. Is it on the roadmap to support this for R2 as well?

0 kudos

08-02-2024 1:11:15 PM

by MYB24 • New Contributor III

01-10-2024 8:24:06 AM

9063 Views
6 replies
0 kudos

Resolved! Error: cannot create mws credentials: invalid Databricks Account configuration

Good Evening, I am configuring databricks_mws_credentials through Terraform on AWS. I am getting the following error:Error: cannot create mws credentials: invalid Databricks Account configuration││ with module.databricks.databricks_mws_credentials.t...

Data Engineering

AWS

credentials

Databricks

Terraform

9063 Views
6 replies
0 kudos

01-10-2024 8:24:06 AM

View Replies

Latest Reply

Alexandre467
New Contributor II

07-29-2024 7:48:36 AM

0 kudos

Hello, I'm facing a similaire Issue. I try to update my TF with properly authentification and I have this error ?! ╷ │ Error: cannot create mws credentials: failed visitor: context canceled │ │ with databricks_mws_credentials.this, │ on main.tf ...

0 kudos

07-29-2024 7:48:36 AM

5 More Replies

by riccostamendes • New Contributor II

01-25-2023 5:21:22 PM

60050 Views
3 replies
0 kudos

Just a doubt, can we develop a kedro project in databricks?

I am asking this because up to now I have just seen some examples of deploying a pre-existent kedro project in databricks in order to run some pipelines...

Data Engineering

60050 Views
3 replies
0 kudos

01-25-2023 5:21:22 PM

View Replies

Latest Reply

noklam
New Contributor II

08-02-2024 9:40:15 AM

0 kudos

Hi! Kedro Dev here. You can surely develop Kedro on Databricks, in fact we have a lot of Kedro project running on Databricks. In the past there has been some friction, mainly because Kedro are project based while Databricks focus a lot on notebook. T...

0 kudos

08-02-2024 9:40:15 AM

2 More Replies

by georgef • New Contributor III

05-06-2024 7:18:19 AM

4231 Views
2 replies
1 kudos

Resolved! Cannot import relative python paths

Hello,Some variations of this question have been asked before but there doesn't seem to be an answer for the following simple use case:I have the following file structure on a Databricks Asset Bundles project: src --dir1 ----file1.py --dir2 ----file2...

Data Engineering

4231 Views
2 replies
1 kudos

05-06-2024 7:18:19 AM

View Replies

Latest Reply

m997al
Contributor III

08-02-2024 7:11:00 AM

1 kudos

Hi. This was a long-standing issue for me too. This solution may not be what is desired, but it works perfectly for my needs.In my python code, I have this structure:if __name__ == '__main__': # directory structure where "mycode" is this code ...

1 kudos

08-02-2024 7:11:00 AM

1 More Replies

by unity_Catalog • New Contributor III

08-02-2024 7:10:32 AM

468 Views
0 replies
0 kudos

Migrating legacy dashboards from source workspace to target workspace

I am trying to migrate legacy dashboards from source workspace to target workspace using python code. Is there any API's to GET legacy dashboards and POST API for migrating. When I am trying to migrate legacy dashboards and the underlying queries, I ...

Data Engineering

468 Views
0 replies
0 kudos

08-02-2024 7:10:32 AM

by Pierre1 • New Contributor

08-01-2024 7:04:17 AM

1942 Views
1 replies
2 kudos

DLT with Unity Catalog: Multipart table name

Hello,I can't seem to find up to date info on how to handle catalog.schema.table in DLT live table and Unity catalog.My statement is the following and is failing with the error: Multipart table name is not supported. Any workaround possible?Thanks a ...

Data Engineering

1942 Views
1 replies
2 kudos

08-01-2024 7:04:17 AM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

08-02-2024 4:46:58 AM

2 kudos

Hi @Pierre1 ,Actually, you don't provide this infromation in the code. You specify this information when you create dlt pipeline.If you do not select a catalog and target schema for a pipeline, tables are not published to Unity Catalog and can only b...

2 kudos

08-02-2024 4:46:58 AM

by mbdata • New Contributor II

08-11-2020 2:34:04 AM

40920 Views
6 replies
8 kudos

Resolved! Toggle line comment

I work with Azure Databricks. The shortcut Ctrl + / to toggle line comment doesn't work on AZERTY keyboard on Firefox... Do you know this issue ? Is there an other shortcut I can try ? Thanks !

Data Engineering

40920 Views
6 replies
8 kudos

08-11-2020 2:34:04 AM

View Replies

Latest Reply

Flo
New Contributor III

06-27-2024 9:12:40 AM

8 kudos

'cmd + shift + 7' works for me!I'm using an AZERTY keyboard on Chrome for MacOS.

8 kudos

06-27-2024 9:12:40 AM

5 More Replies

by Anonymous3 • New Contributor

08-02-2024 2:34:10 AM

678 Views
0 replies
0 kudos

How to migrate Legacy Dashboard from one workspace to another workspace?

Is it possible to migrate Databricks Legacy Dashboards along with associated components such as queries and datasets from one workspace to another? I have attempted to export the Legacy Dashboards along with the queries and datasets, but upon importi...

Data Engineering

678 Views
0 replies
0 kudos

08-02-2024 2:34:10 AM

by vishal48 • New Contributor II

08-02-2024 2:29:52 AM

618 Views
0 replies
1 kudos

Integrating row and column level security in parent child tables with masking only selected rows

Currently I am working with a project where we need to mask PIIs in few columns for VIP customers only.Let me explain briefly with example:Table A: [personid, status, address, UID, VIPFLAG] --> Mask "UID" and "address" only where VIPFLAG is 1Table ...

Data Engineering

618 Views
0 replies
1 kudos

08-02-2024 2:29:52 AM

by guangyi • Contributor III

08-01-2024 10:42:56 PM

1500 Views
3 replies
1 kudos

Resolved! Complex type variable in Databricks.yml not working

For example here I extract the schedule parameter as a complex type variable: variables: schedule: description: schedule time type: complex default: quartz_cron_expression: '0 22 17 * * ?' timezone_id: Asia/Shanghai pa...

Data Engineering

1500 Views
3 replies
1 kudos

08-01-2024 10:42:56 PM

View Replies

Latest Reply

pavlosskev
New Contributor III

08-02-2024 2:17:27 AM

1 kudos

If the validation is fine on your colleague's laptop and not on yours, my first assumption would be that it's a version issue. Do you have the same Databricks CLI version as your colleagues? You can check with databricks --version Also according to...

1 kudos

08-02-2024 2:17:27 AM

2 More Replies

by Kotekaman • New Contributor

08-01-2024 9:42:34 PM

451 Views
1 replies
1 kudos

Merge Update in Notebook Faster Than Scala script

Hi Folks,I tested running a merge update using SQL queries in a notebook, and it is faster than using a Scala script. Both tests were done using the same cluster size in Databricks.How can I make the Scala script as fast as the SQL notebook?

Data Engineering

451 Views
1 replies
1 kudos

08-01-2024 9:42:34 PM

View Replies

Latest Reply

Witold
Honored Contributor

08-02-2024 12:01:57 AM

1 kudos

Have you already compared both query plans?

1 kudos

08-02-2024 12:01:57 AM

by CaptainJack • New Contributor III

07-30-2024 7:44:03 AM

2656 Views
3 replies
2 kudos

Resolved! Error Handling and Custom Messages in Workflows

I would like to be able to get custom error's message ideally visible from Workflows > Jobs UI.1. For example, workflow failed because file was missing and could not find it, in this case I am getting "Status" Failed and "Error Code" RunExecutionErro...

Data Engineering

2656 Views
3 replies
2 kudos

07-30-2024 7:44:03 AM

View Replies

Latest Reply

Edthehead
Contributor III

08-01-2024 10:28:59 PM

2 kudos

What you can do is pass the custom error message you want from the notebook back to the workflow output = f"There was an error with {error_code} : {error_msg}"dbutils.notebook.exit(output) Then when you are fetching the status of your pipeline, you c...

2 kudos

08-01-2024 10:28:59 PM

2 More Replies

by Manthansingh • New Contributor

08-01-2024 6:28:42 AM

1304 Views
2 replies
0 kudos

Writing part files in single text file

i want to write all my part file into a single text file is there anything i can do

Data Engineering

1304 Views
2 replies
0 kudos

08-01-2024 6:28:42 AM

View Replies

Latest Reply

Edthehead
Contributor III

08-01-2024 7:27:56 PM

0 kudos

When writing a pyspark dataframe to a file, it will always write to a part file by default. This is because of partitions, even if there is only 1 partitions.To write into a single file you can convert the pyspark dataframe to a pandas dataframe and ...

0 kudos

08-01-2024 7:27:56 PM

1 More Replies

by herry • New Contributor III

12-06-2021 3:12:04 AM

4007 Views
4 replies
4 kudos

Resolved! Get the list of loaded files from Autoloader

Hello,We can use Autoloader to track the files that have been loaded from S3 bucket or not. My question about Autoloader: is there a way to read the Autoloader database to get the list of files that have been loaded?I can easily do this in AWS Glue j...

Data Engineering

4007 Views
4 replies
4 kudos

12-06-2021 3:12:04 AM

View Replies

Latest Reply

Anonymous
Not applicable

12-09-2021 8:12:42 AM

4 kudos

@Herry Ramli - Would you be happy to mark Hubert's answer as best so that other members can find the solution more easily?Thanks!

4 kudos

12-09-2021 8:12:42 AM

3 More Replies

User

Count

1611

763

345

286

252

Databricks Community

Forum Posts

Get taskValue from job as task, and then pass it to next task.

Delta share existing parquet files in R2

Resolved! Error: cannot create mws credentials: invalid Databricks Account configuration

Just a doubt, can we develop a kedro project in databricks?

Resolved! Cannot import relative python paths

Migrating legacy dashboards from source workspace to target workspace

DLT with Unity Catalog: Multipart table name

Resolved! Toggle line comment

How to migrate Legacy Dashboard from one workspace to another workspace?

Integrating row and column level security in parent child tables with masking only selected rows

Resolved! Complex type variable in Databricks.yml not working

Merge Update in Notebook Faster Than Scala script

Resolved! Error Handling and Custom Messages in Workflows

Writing part files in single text file

Resolved! Get the list of loaded files from Autoloader

Join Us as a Local Community Builder!

CREATE TEMP TABLE

Error while reading file from Cloud Storage

Permission Issue in Delta Lake Course

Connecting to SQL on Databricks Using SQLAlchemy o...

databricks bundle Deploy: exit code 0 even if an e...