Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

ChristianRRL
by Honored Contributor
  • 79 Views
  • 2 replies
  • 0 kudos

Get task_run_id that is nested in a job_run task

Hi, I'm wondering if there is an easier way to accomplish this. I can use Dynamic Value reference to pull the run_id of Parent 1 into Parent 2; however, what I'm looking for is for Child 1's task run_id to be referenced within Parent 2. Currently I am ...

Latest Reply
anuj_lathi
Databricks Employee
  • 0 kudos

Hi — good question. The cleanest way to do this is with task values, no REST API needed. Approach: Task Values (Recommended) In Child 1's notebook, capture its own run_id and set it as a task value: import json   ctx = json.loads(     dbutils.noteboo...
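The snippet above is cut off, so here is a minimal runnable sketch of the same task-values pattern. The `currentRunId` field name and the `child1_run_id` key are assumptions; inspect the context JSON your runtime actually returns before relying on them.

```python
import json

def task_run_id_from_context(context_json: str) -> str:
    """Extract this task's run id from the notebook context JSON.

    ASSUMPTION: the context exposes a "currentRunId" object with an
    "id" field; verify against your runtime's actual JSON.
    """
    ctx = json.loads(context_json)
    return str(ctx["currentRunId"]["id"])

# Inside Child 1 (Databricks only; dbutils does not exist locally):
# ctx_json = dbutils.notebook.entry_point.getDbutils().notebook() \
#     .getContext().toJson()
# dbutils.jobs.taskValues.set(key="child1_run_id",
#                             value=task_run_id_from_context(ctx_json))
```

Downstream tasks can then read the value with `dbutils.jobs.taskValues.get(...)`; whether a value set inside a run_job task is visible to the outer workflow depends on your setup, so test that boundary explicitly.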

1 More Replies
Steffen
by New Contributor III
  • 18 Views
  • 1 reply
  • 0 kudos

Partition optimization strategy for a task that massively inflates the size of a dataframe

Hello, we are facing some optimization problems for a workflow that interpolates raw measurements to one-second intervals. We are currently using dbl tempo for this, but had the same issues when doing a simple approach with a window function. We have the ...

Latest Reply
balajij8
Contributor
  • 0 kudos

Hi, key points below. Time Window Chunking: avoid interpolating a full week of data in a single Spark action. Split the workload into daily or 12-hour slices. This caps maximum memory pressure, enables parallel execution, and simplifies failure recovery....
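The chunking idea above can be sketched as a plain helper that cuts the full time range into fixed-width windows; each window then drives one bounded interpolation pass instead of one huge job. The slice width and the filter shown in the comment are illustrative.

```python
from datetime import datetime, timedelta

def time_slices(start: datetime, end: datetime, hours: int = 12):
    """Split [start, end) into fixed-width windows so each
    interpolation pass handles a bounded slice of data instead of
    the whole week at once."""
    slices = []
    cursor = start
    while cursor < end:
        upper = min(cursor + timedelta(hours=hours), end)
        slices.append((cursor, upper))
        cursor = upper
    return slices

# Each (lo, hi) pair then drives one filtered pass, e.g.
# df.where((df.ts >= lo) & (df.ts < hi)) before resampling with tempo.
```

Keeping the slices non-overlapping also makes failure recovery simple: a failed slice can be re-run alone.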

norbitek
by New Contributor II
  • 19 Views
  • 1 reply
  • 0 kudos

variant_explode_outer stopped working after the last DBX runtime patch

Hi all, I import the following JSON into a Delta table VARIANT column: { "data": [ { "group": 1, "manager": "no", "firstname": "John", "lastname": "Smith", "active": "false", ...

Latest Reply
emma_s
Databricks Employee
  • 0 kudos

Hi,  I've been testing this on a workspace at my end and see exactly the same thing. I'd first recommend raising a support ticket for this.  In the meantime you can use the following workaround: I reproduced it on DBR 18.0 using readStream + cloudFil...

abhishek0306
by Visitor
  • 42 Views
  • 2 replies
  • 0 kudos

Databricks file-based trigger to SharePoint

Hi, can we create a file-based trigger from a SharePoint location for Excel files in Databricks? My need is to copy the Excel files from SharePoint to external volumes in Databricks, so can it be done using a trigger such that whenever the file drops in ...

Latest Reply
balajij8
Contributor
  • 0 kudos

@abhishek0306 SharePoint does not natively support the event notifications required for Databricks File Arrival Triggers. You can use the below: Azure Logic Apps: create a workflow with the "When a file is created in a folder" SharePoint trigger. The workflow...
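To make the hand-off concrete: after the SharePoint trigger fires, the Logic App's HTTP action can POST to the Databricks Jobs API `run-now` endpoint. A sketch of building that request body follows; the parameter name `source_file_url` is illustrative, since your job defines its own job-level parameters.

```python
import json

def run_now_payload(job_id: int, file_url: str) -> str:
    """Build the body for POST /api/2.1/jobs/run-now on the
    Databricks Jobs API, passing the new SharePoint file URL to the
    copy job as a job-level parameter.

    ASSUMPTION: "source_file_url" is a placeholder parameter name.
    """
    return json.dumps({
        "job_id": job_id,
        "job_parameters": {"source_file_url": file_url},
    })
```

The Logic App would supply a Databricks token (or service-principal credential) in the Authorization header; the triggered job then reads the parameter and copies the file into the external volume.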

1 More Replies
mordex
by New Contributor III
  • 138 Views
  • 2 replies
  • 0 kudos

Databricks workflows for APIs with different frequencies (cluster keeps restarting)

Hey everyone, I'm stuck with a Databricks workflow design and could use some advice. Currently, we are calling 70+ APIs. Right now the workflow looks something l...

Latest Reply
emma_s
Databricks Employee
  • 0 kudos

You're right that job clusters are the wrong fit here. The cold start time (including serverless, which is still 25-50s) makes anything under 5 minutes impractical when the cluster terminates between runs. The simplest approach: all-purpose cluster +...
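One way to act on this advice is to stop scheduling 70+ independent workflows and instead bucket the APIs by cadence, so each cadence becomes one job running many short tasks on a warm cluster. A minimal sketch, assuming each API is described by a dict with `name` and `frequency` keys (both names are illustrative):

```python
from collections import defaultdict

def group_by_frequency(api_configs):
    """Bucket API definitions by polling cadence so each bucket
    becomes one scheduled job (one warm cluster, many short tasks)
    instead of 70+ independent workflows, each paying its own
    cluster cold start."""
    buckets = defaultdict(list)
    for cfg in api_configs:
        buckets[cfg["frequency"]].append(cfg["name"])
    return dict(buckets)
```

Each bucket then maps to one job schedule (e.g. a 5-minute job, an hourly job), which is what keeps the sub-5-minute cadences viable despite cold-start overhead.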

1 More Replies
holychs
by Databricks Partner
  • 369 Views
  • 2 replies
  • 0 kudos

Resolved! Run failed with error message Cluster was terminated. Reason: JOB_FINISHED (SUCCESS)

I am running a notebook through a workflow using an all-purpose cluster ("data_security_mode": "USER_ISOLATION"). I am seeing some strange behaviour with the cluster during the run. While the job is still running, the cluster gets terminated with the Reason: Re...

Data Engineering
clusterds
clusters
jobs
Workflows
Latest Reply
anuj_lathi
Databricks Employee
  • 0 kudos

Hi — the JOB_FINISHED (SUCCESS) termination reason is the key clue here. It means another job that was using the same all-purpose cluster finished, and its completion triggered the cluster termination — taking your still-running job down with it. Mos...
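One way to avoid this coupling is to give the job a dedicated job cluster instead of pointing it at a shared all-purpose cluster, so no other job's completion can tear it down mid-run. A sketch of the relevant task-settings fragment, following the Jobs API `new_cluster` shape; the DBR version and node type are placeholders:

```python
# Task settings fragment: "new_cluster" replaces "existing_cluster_id",
# so the cluster is created for this run and terminated only when this
# run finishes. Values below are placeholders, not recommendations.
job_task_settings = {
    "task_key": "my_notebook_task",
    "notebook_task": {"notebook_path": "/Workspace/path/to/notebook"},
    "new_cluster": {
        "spark_version": "15.4.x-scala2.12",   # placeholder DBR version
        "node_type_id": "Standard_DS3_v2",     # placeholder node type
        "num_workers": 2,
        "data_security_mode": "USER_ISOLATION",
    },
}
```

The trade-off is cold-start time per run; if several jobs must share compute, check which job's settings (or cluster policy) terminate the shared cluster on completion.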

1 More Replies
vamsi_simbus
by Databricks Partner
  • 134 Views
  • 2 replies
  • 1 kudos

Resolved! Drill-down support in Databricks SQL (Lakeview) Dashboards

Hi all, does Databricks SQL (Lakeview) Dashboards support native drill-down functionality (for example: Category → Subcategory → SKU)? Currently, we see support for cross-filtering, parameters, and drill-through within the same dataset, but hierarchica...

Latest Reply
anuj_lathi
Databricks Employee
  • 1 kudos

Hi — good question. You're right that Lakeview doesn't have native hierarchical drill-down (click Category → auto-expand to Subcategory → SKU). But you can get fairly close by combining the features you mentioned. Here are the practical patterns: 1. ...
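One of those patterns can be sketched as a parameter-driven query: a dashboard parameter picks the drill level, and the dataset aggregates to that depth. The hierarchy column names and the `sales`/`amount` identifiers below are illustrative, not from the original post.

```python
HIERARCHY = ["category", "subcategory", "sku"]  # illustrative columns

def drill_query(level: int, table: str = "sales") -> str:
    """Build the aggregation for one drill level. A dashboard
    parameter can then switch which query a widget runs,
    approximating hierarchical drill-down with the features Lakeview
    does have (parameters + cross-filtering)."""
    cols = ", ".join(HIERARCHY[: level + 1])
    return (f"SELECT {cols}, SUM(amount) AS total "
            f"FROM {table} GROUP BY {cols}")
```

In practice you would also filter each deeper level by the value clicked at the level above (via cross-filtering or a second parameter).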

1 More Replies
Phani1
by Databricks MVP
  • 167 Views
  • 3 replies
  • 0 kudos

Best Practices for Implementing Automated, Scalable, and Auditable Purge Mechanism on Azure Databric

 Hi All, I'm looking to implement an automated, scalable, and auditable purge mechanism on Azure Databricks to manage data retention, deletion and archival policies across our Unity Catalog-governed Delta tables.I've come across various approaches, s...

Latest Reply
AbhaySingh
Databricks Employee
  • 0 kudos

Here is my action plan if it helps!
Phase 1: Foundation
  • Migrate to UC managed tables (if not already)
  • Enable Predictive Optimization at catalog level
  • Set delta.deletedFileRetentionDuration per layer
Phase 2: Retention Policies
  • Enab...
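The per-layer retention step can be made auditable by generating the SQL a scheduled purge job would run per table. A minimal sketch; the medallion-layer retention values are assumptions for illustration, not recommended settings:

```python
# ASSUMPTION: retention hours per layer are illustrative placeholders.
RETENTION_HOURS = {"bronze": 7 * 24, "silver": 30 * 24, "gold": 90 * 24}

def purge_statements(catalog: str, schema: str, table: str, layer: str):
    """Emit the SQL a scheduled purge job would run for one table:
    set the deleted-file retention property, then VACUUM past it.
    Logging these statements (plus who/when) gives the audit trail."""
    fqn = f"{catalog}.{schema}.{table}"
    hours = RETENTION_HOURS[layer]
    return [
        f"ALTER TABLE {fqn} SET TBLPROPERTIES "
        f"('delta.deletedFileRetentionDuration' = 'interval {hours} hours')",
        f"VACUUM {fqn} RETAIN {hours} HOURS",
    ]
```

A driver job would iterate Unity Catalog tables, run each statement via `spark.sql`, and write the emitted statements to an audit table.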

2 More Replies
fdubourdeau
by New Contributor
  • 66 Views
  • 1 reply
  • 0 kudos

Resolved! Querying CDF on a Delta-Sharing table after data type change in the Table (INT to DECIMAL)

Hi, I am trying to query the CDF of a Delta Sharing table that has had a change in the data type of one of its columns. The change was from an INT to a DECIMAL. When reading the specific version where the schema change happened, I am receiving an error ment...

Latest Reply
anuj_lathi
Databricks Employee
  • 0 kudos

Hi — this is a known limitation of Change Data Feed. Here's what's happening and your options. Why This Happens Changing a column from INT to DECIMAL is a non-additive schema change. When reading CDF in batch mode, Delta Lake applies a single schema ...
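A common way around a single-schema batch read is to split the CDF version range so no batch straddles the schema-change version, then read each sub-range separately. A sketch of the range-splitting logic, with the CDF read options shown only as comments (table name is a placeholder):

```python
def split_at_schema_change(start_version: int, end_version: int,
                           change_version: int):
    """Split an inclusive CDF version range into sub-ranges that do
    not straddle the schema-change version, so each batch read sees
    one consistent schema."""
    if not (start_version <= change_version <= end_version):
        return [(start_version, end_version)]
    ranges = []
    if change_version > start_version:
        ranges.append((start_version, change_version - 1))
    ranges.append((change_version, end_version))
    return ranges

# Each (lo, hi) pair then maps onto a separate CDF read, e.g.
# spark.read.option("readChangeFeed", "true")
#      .option("startingVersion", lo).option("endingVersion", hi)
#      .table("catalog.schema.shared_table")  # placeholder name
```

Whether the provider's share exposes the pre-change versions at all is a separate question worth checking first.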

GvReddy
by Visitor
  • 52 Views
  • 1 replies
  • 0 kudos

Resolved! Guidance on App Deployment in Databricks Public Marketplace

Hello Team, hope you are doing well. I am currently learning Databricks and have developed an application in my local workspace under a Databricks Partner account, where I also have Marketplace Admin access. However, I am unsure about the process of pu...

Latest Reply
anuj_lathi
Databricks Employee
  • 0 kudos

Hi — great question! Here's what you need to know. Key Thing to Know First Currently, Databricks Apps (Streamlit, Dash, Gradio, etc.) listed on the Marketplace are first-party Databricks-owned apps only. External/partner app publishing is not yet sup...

SuMiT1
by New Contributor III
  • 2287 Views
  • 4 replies
  • 0 kudos

Unable to Create Secret Scope in Databricks – “Fetch request failed due to expired user session”

I'm trying to create an Azure Key Vault-backed Secret Scope in Databricks, but when I click Create, I get this error: "Fetch request failed due to expired user session". I've already verified my login and permissions. I also tried refreshing and re-signing i...

Latest Reply
AnandGNR
New Contributor II
  • 0 kudos

Hi @SuMiT1: I'm facing the exact same issue. Were you able to figure out the root cause? I'd appreciate any pointers to resolve this!

3 More Replies
NW1000
by New Contributor III
  • 391 Views
  • 6 replies
  • 0 kudos

Shorten Classic Cluster start up time

We use R notebooks to generate workflows, so we have to use classic clusters, and we need roughly 10 additional R packages in addition to 2 PyPI packages. It takes at least 10-20 minutes to start the cluster. We found most of the time was taken by the packag...

Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Hi @NW1000 , Glad you tried my suggestion, and thanks for sharing the details. 1. Why the init script failed This message: Init script failure: Cluster scoped init script ... failed: Script exit status is non-zero really just means that something ins...

5 More Replies
ChristianRRL
by Honored Contributor
  • 161 Views
  • 4 replies
  • 3 kudos

Resolved! Passing Parameters *between* Workflow run_job steps

Hi there, I'm trying to reference a task value - let's call it `output_path` (not known until programmatically generated by the code) - that is created in a nested task (Child 1) within a run_job (Parent 1) as an input parameter - let's call it `inpu...

Latest Reply
ChristianRRL
Honored Contributor
  • 3 kudos

Quick update: my question effectively boils down to: Do Databricks workflows have "global" variables that can be set programmatically from anywhere in the workflow (e.g. a nested notebook task inside a parent run_job task) during runtime and be referenc...

3 More Replies