Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Forum Posts

Maser_AZ
by New Contributor II
  • 3864 Views
  • 1 reply
  • 0 kudos

16.2 (includes Apache Spark 3.5.2, Scala 2.12) cluster in Community Edition taking a long time

16.2 (includes Apache Spark 3.5.2, Scala 2.12) cluster in Community Edition taking a long time to start. I'm trying to launch DBR 16.2, but the single-node cluster is taking a long time to start. Is this a bug in Community Edition? Here is the u...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

The long startup time for a Databricks Runtime 16.2 (Apache Spark 3.5.2, Scala 2.12) single-node cluster in Databricks Community Edition is a known issue and not unique to your setup. Many users have reported this situation, with some clusters taking...

Abishrp
by Contributor
  • 3047 Views
  • 1 reply
  • 0 kudos

Product code of Databricks in AWS CUR report

I need to know the productCode of Databricks in the CUR report. Is the productCode the same for all users?

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

In the AWS Cost and Usage Report (CUR), the productCode for Databricks is used to identify costs attributed to Databricks usage within your AWS environment. The value that appears in the lineItem/ProductCode column for Databricks is typically "Databr...
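For a concrete illustration, here is a minimal PySpark sketch of pulling Databricks line items out of a CUR parquet export. The path, the underscore-style column names, and the product-code placeholder are assumptions; verify them against your own report's schema (and see the reply above for the actual value).

```python
from pyspark.sql import functions as F

# Hypothetical location of the CUR parquet export; adjust to your bucket.
cur = spark.read.parquet("s3://my-cur-bucket/my-cur-report/")

# Parquet CUR exports typically flatten "lineItem/ProductCode" to
# "line_item_product_code"; check your schema. The product-code value
# below is a placeholder, not the confirmed string.
databricks_cost = (
    cur.filter(F.col("line_item_product_code") == "<databricks-product-code>")
       .groupBy("line_item_usage_account_id")
       .agg(F.sum("line_item_unblended_cost").alias("databricks_cost"))
)
databricks_cost.show()
```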

Nick_Pacey
by New Contributor III
  • 3159 Views
  • 1 reply
  • 0 kudos

Foreign Catalog error connecting to SQL Server 2008 R2

Hi, is there a limitation or known issue when creating a foreign catalog to a SQL Server 2008 R2? We are successfully able to connect to this SQL Server through a JDBC connection string. To make this work, we have to switch the Java encrypt flag to fal...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

There are known limitations and issues when connecting to SQL Server 2008 R2, particularly around encryption and JDBC settings, which can manifest as errors in federated catalog operations—even though a direct JDBC connection might succeed if the "en...
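As context for the encrypt-flag workaround the poster describes, a plain JDBC read from PySpark can relax encryption in the connection string, which a foreign catalog does not expose in the same way. A minimal sketch, with placeholder host, database, table, and secret names:

```python
# encrypt=false / trustServerCertificate=true mirror the "Java encrypt flag"
# workaround described above; all identifiers here are placeholders.
jdbc_url = (
    "jdbc:sqlserver://legacy-sql.example.com:1433;"
    "databaseName=mydb;encrypt=false;trustServerCertificate=true"
)

df = (
    spark.read.format("jdbc")
    .option("url", jdbc_url)
    .option("dbtable", "dbo.my_table")
    .option("user", "my_user")
    .option("password", dbutils.secrets.get("my-scope", "sql-password"))
    .load()
)
df.show(5)
```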

Kabil
by New Contributor
  • 3249 Views
  • 1 reply
  • 0 kudos

Using DLT metadata as a runtime parameter

I have started using DLT pipelines, and I have common code that is used by multiple DLT pipelines. Now I need to read metadata such as the pipeline name and start time at run time, but since I'm using common code and pip...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

To dynamically access metadata like the pipeline name and start time at runtime in your common code for Delta Live Tables (DLT) pipelines, you should leverage runtime context and built-in metadata features provided by the DLT or related orchestrators...
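One commonly used pattern is to set key-value pairs under each pipeline's configuration settings and read them with spark.conf.get inside the shared code. A minimal sketch; the key and table names below are made up for illustration, not built-in keys:

```python
import dlt
from pyspark.sql import functions as F

# Keys defined under "configuration" in a pipeline's settings surface
# through spark.conf; "mypipeline.name" is a hypothetical key you would
# define yourself in each pipeline that shares this code.
pipeline_name = spark.conf.get("mypipeline.name", "unknown")

@dlt.table
def audited_table():
    return (
        spark.read.table("source_table")  # placeholder source
        .withColumn("pipeline_name", F.lit(pipeline_name))
        .withColumn("processed_at", F.current_timestamp())
    )
```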

TamD
by Contributor
  • 3215 Views
  • 1 reply
  • 0 kudos

ModuleNotFoundError importing function modules into DLT pipelines

Following best practice, we want to avoid duplicating code by putting commonly used transformations into function libraries and then importing and calling those functions where required. We also want to follow Databricks recommendations to use serverless ...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

You are correctly following Databricks’ recommendation to store shared code in Python files and import them into your notebooks, especially for Delta Live Tables (DLT) pipelines and serverless environments. However, import path issues are common, par...
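A common workaround, sketched here under the assumption that the shared functions live in a folder inside the repo, is to put that folder on sys.path before importing:

```python
import sys

# On serverless/DLT compute the working directory is not always the repo
# root, so shared modules can be invisible to the import system. The path
# and module names below are placeholders for your own layout.
sys.path.append("/Workspace/Repos/my_org/my_repo/src")

from my_transforms import clean_customers  # hypothetical shared module
```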

cszczotka
by New Contributor III
  • 3364 Views
  • 1 reply
  • 0 kudos

Delta Sharing (open sharing) issue accessing data on storage

Hi, I have configured Delta Sharing for an external consumer in Azure Databricks. Azure Databricks and the storage account are in a VNet with no public access. The storage account also has account key access and shared key authorization disabled. I'm running delt...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

Delta Sharing in Azure Databricks allows sharing datasets across clouds and with external consumers, but when used in a tightly controlled network environment (private endpoints, no public access, restricted storage account authentication), it behave...
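For reference, the consumer side of open Delta Sharing usually looks like the sketch below (the profile path and share coordinates are placeholders). The client exchanges the bearer token for short-lived presigned storage URLs, so the consumer's network must be able to reach the storage endpoint those URLs point at, which is exactly what a no-public-access storage account can block:

```python
import delta_sharing

# Profile file distributed by the provider; the path is a placeholder.
profile = "/dbfs/tmp/config.share"

# "<share>.<schema>.<table>" coordinates are placeholders.
table_url = profile + "#my_share.my_schema.my_table"

# This call fetches data through presigned URLs issued against the backing
# storage account, so that endpoint must be reachable from the consumer.
df = delta_sharing.load_as_pandas(table_url)
print(df.head())
```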

dc-rnc
by Contributor
  • 3143 Views
  • 2 replies
  • 2 kudos

Issue pulling Docker Image on Databricks Cluster through Azure Container Registry

Hi Community. Essentially, we're using ACR to push our custom Docker image, and then we would like to pull it to create a Databricks cluster. However, during cluster creation, we got the following error: I'm convinced we tried to authenticate in al...

(error screenshots attached)
Latest Reply
mark_ott
Databricks Employee
  • 2 kudos

You are experiencing an authentication issue when trying to use a custom Docker image from Azure Container Registry (ACR) with Databricks clusters, despite successfully using admin tokens and service principals with acrpull permissions in other conte...
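A minimal sketch of the usual fix, assuming a service principal with AcrPull on the registry: pass its client ID and secret as Docker basic auth in the cluster spec, shown here with the Databricks Python SDK. All names, node types, and credentials are placeholders:

```python
from databricks.sdk import WorkspaceClient
from databricks.sdk.service import compute

w = WorkspaceClient()

# For ACR basic auth: username = the service principal's client ID,
# password = its client secret. Everything below is a placeholder.
cluster = w.clusters.create_and_wait(
    cluster_name="custom-image-cluster",
    spark_version="15.4.x-scala2.12",
    node_type_id="Standard_DS3_v2",
    num_workers=1,
    docker_image=compute.DockerImage(
        url="myregistry.azurecr.io/my-image:latest",
        basic_auth=compute.DockerBasicAuth(
            username="<sp-client-id>",
            password="<sp-client-secret>",
        ),
    ),
)
```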

1 More Reply
jeremy98
by Honored Contributor
  • 3400 Views
  • 1 reply
  • 0 kudos

Hydra configuration and job parameters of DABs

Hello Community, I'm trying to create a job pipeline in Databricks that runs a spark_python_task, which executes a Python script configured with Hydra. The script's configuration file defines parameters, such as id. How can I pass this parameter at the...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

You can pass and override configuration parameters for Hydra in a Databricks spark_python_task by specifying job-level parameters (as arguments) and using environment variables or Hydra’s command line overrides. For accessing secrets with dbutils.sec...
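Hydra consumes key=value overrides from sys.argv, so a spark_python_task whose parameters list contains, say, "id=123" overrides cfg.id with no extra glue. A minimal sketch of the script side; the config path and field name are placeholders for your own layout:

```python
import hydra
from omegaconf import DictConfig

# A job parameter list like ["id=123"] arrives in sys.argv, where Hydra
# picks it up as an override of the id field in conf/config.yaml.
@hydra.main(config_path="conf", config_name="config", version_base=None)
def main(cfg: DictConfig) -> None:
    print(f"running with id={cfg.id}")

if __name__ == "__main__":
    main()
```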

siddharthsomni
by New Contributor
  • 2591 Views
  • 2 replies
  • 0 kudos

Databricks Asset Bundles - Notebook-based bundling alternative to the CLI approach

Hello All - I have a scenario where we want to do the entire bundling and packaging in a notebook to deploy jobs using Databricks Asset Bundles, without using the CLI or VS Code. I didn't find any material or reference that provides insight. Any input would be ...

Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

Deploying Databricks Asset Bundles entirely from a notebook—without using the CLI or VS Code—is not a standard workflow but can be orchestrated using newer features in the Databricks workspace UI and by leveraging programmatic workspace operations. D...
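One pragmatic substitute, not a true bundle deployment, is to create the job definition straight from a notebook with the Databricks Python SDK. A minimal sketch with placeholder paths and names:

```python
from databricks.sdk import WorkspaceClient
from databricks.sdk.service import jobs

# On Databricks, WorkspaceClient() picks up notebook authentication.
w = WorkspaceClient()

# Create a job directly, mirroring what a bundle would have deployed.
job = w.jobs.create(
    name="my-bundled-job",  # placeholder
    tasks=[
        jobs.Task(
            task_key="main",
            notebook_task=jobs.NotebookTask(
                notebook_path="/Workspace/Users/me@example.com/etl_notebook"
            ),
        )
    ],
)
print(f"created job {job.job_id}")
```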

1 More Reply
Marcus_S
by New Contributor
  • 2759 Views
  • 1 reply
  • 0 kudos

Change in UNRESOLVED_COLUMN error behavior in Runtime 14.3 LTS

I've noticed a change in how Databricks handles unresolved column references in PySpark when using All-purpose compute (not serverless). In Databricks Runtime 14.3 LTS, referencing a non-existent column like this: df = spark.table('default.example').se...

(screenshot attached)
Latest Reply
mark_ott
Databricks Employee
  • 0 kudos

Databricks has recently changed how unresolved column references are handled in PySpark on All-purpose compute clusters. In earlier Databricks Runtime (DBR) 14.3 LTS builds, referencing a non-existent column, such as:

df = spark.tabl...
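Since the exact error surface differs between builds, one defensive pattern is to catch AnalysisException around column resolution. A minimal sketch:

```python
from pyspark.sql.utils import AnalysisException

df = spark.table("default.example")

try:
    df.select("no_such_column").show()
except AnalysisException as e:
    # Newer runtimes report this with the UNRESOLVED_COLUMN error class;
    # the message text can vary between DBR builds.
    print(f"column resolution failed: {e}")
```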

Michał
by New Contributor III
  • 1245 Views
  • 6 replies
  • 3 kudos

How to process a streaming Lakeflow declarative pipeline in batches

Hi, I've got a problem and I have run out of ideas as to what else I can try. Maybe you can help? I've got a delta table with hundreds of millions of records on which I have to perform relatively expensive operations. I'd like to be able to process some...
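One common approach (an assumption here, not necessarily what the thread settled on) is to stream from the Delta table with a per-batch rate limit and an availableNow trigger, so each run drains a bounded chunk and checkpoints its progress. Table names and the checkpoint path are placeholders:

```python
(
    spark.readStream.format("delta")
    .option("maxBytesPerTrigger", "1g")  # bound the work per micro-batch
    .table("my_catalog.my_schema.big_table")
    .writeStream
    .option("checkpointLocation", "/Volumes/my_catalog/my_schema/chk/big_table")
    .trigger(availableNow=True)  # stop once the current backlog is drained
    .toTable("my_catalog.my_schema.big_table_processed")
)
```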

Latest Reply
Michał
New Contributor III
  • 3 kudos

thanks @mmayorga 

5 More Replies
deng_dev
by New Contributor III
  • 84 Views
  • 3 replies
  • 2 kudos

Databricks Apps pricing

Hi everyone! I was investigating Databricks Apps as a solution for my task and didn't fully understand the pricing. I have found this page, and it indicates it will cost $75 / DBU for the Premium subscription plan when using AWS. Is it the full cost, or will the...

(pricing screenshot attached)
Latest Reply
ManojkMohan
Honored Contributor
  • 2 kudos

@deng_dev The $75 per DBU Premium subscription plan price for Databricks Apps on AWS shown on the Databricks Apps pricing page reflects the charge from Databricks itself: https://www.databricks.com/product/pricing/databricks-apps. However, this is not ...

2 More Replies
JameDavi_51481
by Contributor
  • 9887 Views
  • 11 replies
  • 13 kudos

Can we add tags to Unity Catalog through Terraform?

We use Terraform to manage most of our infrastructure, and I would like to extend this to Unity Catalog. However, we are extensive users of tagging to categorize our datasets, and the only programmatic method I can find for adding tags is to use SQL ...

Latest Reply
jlieow
Databricks Employee
  • 13 kudos

In case anyone comes across this, have a look at databricks_entity_tag_assignment and see if it suits your needs.

10 More Replies
DataGirl
by New Contributor
  • 16362 Views
  • 7 replies
  • 2 kudos

Multi-value parameter on Power BI Paginated / SSRS connected to Databricks using ODBC

Hi All, I'm wondering if anyone has had any luck setting up multi-valued parameters on SSRS using an ODBC connection to Databricks? I'm getting a "Cannot add multi value query parameter" error every time I change my parameter to multi-value. In the query s...

Latest Reply
kashti123
New Contributor
  • 2 kudos

Hi, I am also trying to set multi-value parameters using a dynamic SQL expression. However, the report gives an error that multi-value parameters are not supported by the data extension. Any help on this would be highly appreciated. Thanks, Drishti

6 More Replies
kcyugesh
by New Contributor
  • 75 Views
  • 1 reply
  • 1 kudos

Resolved! Delta Live Tables not showing in workspace (Azure Databricks with Premium plan)

- I have a Premium plan and owner-level access

(screenshots attached)
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 1 kudos

Hi @kcyugesh, they changed the name from DLT to Lakeflow Declarative Pipelines, so you won't find the DLT name in the UI. Click Jobs & Pipelines and then ETL pipeline to access the declarative pipeline editor.

