Data Engineering

Forum Posts

Sorted by:

by Prasanna_N • New Contributor

03-17-2025 9:07:03 PM

2993 Views
2 replies
1 kudos

Inference table Monitoring

i have data from march1 to march 14 in the final inference table and i have given 1 week granularity. after that profile and drift table is generated and i see the window start time as like this objectstart: "2025-02-24T00:00:00.000Z"end: "2025-03-03...

Data Engineering

2993 Views
2 replies
1 kudos

03-17-2025 9:07:03 PM

View Replies

Latest Reply

AbhayPSingh
Databricks Employee

8 hours ago

1 kudos

More or less repeating what Mark said and adding some additional thoughts. Why the Window Starts from February 24 The reason you're seeing a window starting from February 24 (even though your data starts March 1) is because monitoring systems align t...

1 kudos

8 hours ago

1 More Replies

by Ronis • New Contributor

09-14-2022 2:41:24 AM

8860 Views
6 replies
1 kudos

SSRS Connect to Databricks

Hi ,I need to connect databricks query from microsoft SSRS.is it possible ? How do you make the connection?

Data Engineering

8860 Views
6 replies
1 kudos

09-14-2022 2:41:24 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

12 hours ago

1 kudos

SSRS has limited auth methods. It also is EOL so my answer is: No.This is not a limitation of Databricks but SSRS.You could define the connection as a linked server on sql server, that might open some extra (MS native) options.PS. it is best not to ...

1 kudos

12 hours ago

5 More Replies

by ManojkMohan • Honored Contributor

yesterday

39 Views
1 replies
0 kudos

Exposing Databricks API in Salesforce

Use Case:I want to expose a data bricks API URL in Salesforce, Salesforce will hit that exposed end point every time a record is created and data will be transferred from Salesforce to DatabricksWhen i try creating a serving end pointI am unable to s...

Data Engineering

39 Views
1 replies
0 kudos

yesterday

View Replies

Latest Reply

mark_ott
Databricks Employee

12 hours ago

0 kudos

When integrating Salesforce with Databricks to push data upon record creation, using a serving endpoint is not the most common or optimal approach. Although Databricks Feature Serving endpoints can be used for model or feature APIs, they are primaril...

0 kudos

12 hours ago

by gudurusreddy99 • New Contributor

yesterday

63 Views
2 replies
1 kudos

Databricks DLT Joins: Streaming table join with Delta table is reading 2 Billion records per batch

Databricks DLT Joins: Streaming table join with Delta table is reading 2 Billion records from Delta Table for each and every Micro batch.How to overcome this issue to not to read 2 Billion records for every micro batch.Your suggestions and feedback w...

Data Engineering

63 Views
2 replies
1 kudos

yesterday

View Replies

Latest Reply

ManojkMohan
Honored Contributor

16 hours ago

1 kudos

@gudurusreddy99 Any update here , did you try the above solutions ?

1 kudos

16 hours ago

1 More Replies

by 1GauravS • New Contributor III

17 hours ago

42 Views
0 replies
0 kudos

Ingesting Data from Event Hubs via Kafka API with Serverless Compute

Hi!I'm currently working on ingesting log data from Azure Event Hubs into Databricks. Initially, I was using a managed Databricks workspace, which couldn't access Event Hubs over a private endpoint. To resolve this, our DevOps team provisioned a VNet...

Data Engineering

42 Views
0 replies
0 kudos

17 hours ago

by dndeng • New Contributor II

a week ago

146 Views
3 replies
0 kudos

Query to calculate cost of task from each job by day

I am trying to find the cost per Task in each Job every time it was executed (daily) but currently getting very huge numbers due to duplicates, can someone help me ? WITH workspace AS ( SELECT account_id, workspace_id, workspace_name,...

Data Engineering

146 Views
3 replies
0 kudos

a week ago

View Replies

Latest Reply

dndeng
New Contributor II

19 hours ago

0 kudos

still costs exploded it seems there is no way to get cost per task only per job.

0 kudos

19 hours ago

2 More Replies

by JuliandaCruz • New Contributor

yesterday

107 Views
4 replies
0 kudos

Access to Databricks Volumes via Databricks Connect not working anymore

Hi all, I use the extension to debug my python code regularly and since yesterday accessing files in the Databricks Volume isn't working anymore. The situation in the UI of Databricks is as follows:When I execute a glob statement to list all zip-file...

Data Engineering

107 Views
4 replies
0 kudos

yesterday

View Replies

Latest Reply

mmayorga
Databricks Employee

yesterday

0 kudos

hi @JuliandaCruz Thank you for reaching out! I was able to reproduce your case while using Databricks Connect. The "Upload and Run file" option worked fine and returned results, which is essentially the same as running from the Databricks UI. Thou...

0 kudos

yesterday

3 More Replies

by AgusBudianto • Contributor

4 weeks ago

335 Views
5 replies
2 kudos

Resolved! Why am I getting NameError name _all_timezones_unchecked' is not defined

I defined the following local time function get datetime: def get_sysdate(): jkt_tz = pytz.timezone('Asia/Jakarta') sysdate = datetime.now(jkt_tz).strftime('%Y-%m-%d %H:%M:%S') return sysdatespark.udf.register("get_sysdate", get_sysdate)But...

Data Engineering

335 Views
5 replies
2 kudos

4 weeks ago

View Replies

Latest Reply

AgusBudianto
Contributor

yesterday

2 kudos

Hi @Khaja_Zaffer I have connected with MS Support and explained: pytz is no thread safe package, I believe it will have some issue when executor init it parallelly. Second, this is a 3rd party lib, and suggest using the built-in library from ZoneInfo...

2 kudos

yesterday

4 More Replies

by AniruddhaGI • New Contributor II

06-17-2025 11:01:39 PM

2043 Views
2 replies
1 kudos

Workspace allows dbf path to install in Databricks 16.4 LTS

Feature: Library installation using requirements.txt on DB Runtime 16.4 LTSAffected Areas: Workspace isolation, Library ManagementSteps to Reproduce:Upload a wheel file to dbfPut the requirements.txt file in the Workspace and put dbfs path in require...

Data Engineering

library

Security

Workspace

2043 Views
2 replies
1 kudos

06-17-2025 11:01:39 PM

View Replies

Latest Reply

AniruddhaGI
New Contributor II

06-17-2025 11:11:19 PM

1 kudos

I would like to know if the workspace isolation is a priority, and only Databricks 14.3 and lower allow installation via DBFS.Why should the requirements.txt allow you to install libraries or packages via dbfs path?Could someone please explain why th...

1 kudos

06-17-2025 11:11:19 PM

1 More Replies

by turagittech • Contributor

05-18-2025 4:38:42 PM

2380 Views
2 replies
0 kudos

Batch reading from sql server tables with cdc on ssql server tables

Hi all,I need to do a batch load from sql server into Databricks. I have CC enabled on some tables. The simple appears to be union CDC and regular table to get a single set of records to load, but this appears to be fraught with risk of out of sequen...

Data Engineering

2380 Views
2 replies
0 kudos

05-18-2025 4:38:42 PM

View Replies

Latest Reply

Krishna_S
Databricks Employee

yesterday

0 kudos

Yes, you can use TVFs on Databricks. Please check the following link: https://docs.databricks.com/aws/en/sql/language-manual/sql-ref-syntax-qry-select-tvf#gsc.tab=0Can you please elaborate on how you are loading the SQL Server Data into Databricks? H...

0 kudos

yesterday

1 More Replies

by AshMod • New Contributor

yesterday

57 Views
2 replies
1 kudos

Job runs on serverless eventhough Job config has cluster definitions

Hi,I am defining the job along with job cluster specification using python sdk. But when the job runs it is using the serverless compute, instead of the defined cluster. I can say the job uses serverless from the job_run log and also from the system....

Data Engineering

57 Views
2 replies
1 kudos

yesterday

View Replies

Latest Reply

AshMod
New Contributor

yesterday

1 kudos

Thanks for checking @ManojkMohan. I found the issue in the job task definition. There is a job_clusters list in the job definition, where I provide the cluster config details. But this alone is not sufficient to have the task use the cluster. The job...

1 kudos

yesterday

1 More Replies

by QuanSun • New Contributor II

04-28-2025 6:38:43 AM

1361 Views
5 replies
2 kudos

How to select performance mode for Databricks Delta Live Tables

Hi everyone,Based on the official link,For triggered pipelines, you can select the serverless compute performance mode using the Performance optimized setting in the pipeline scheduler. When this setting is disabled, the pipeline uses standard perfor...

Data Engineering

1361 Views
5 replies
2 kudos

04-28-2025 6:38:43 AM

View Replies

Latest Reply

BF7
Contributor

yesterday

2 kudos

I have learned that this parameter is not governed in the pipeline configuration itself, but in the job task that runs the pipeline. This is confusing to me and I don't like it.

2 kudos

yesterday

4 More Replies

by saab123 • New Contributor II

04-07-2025 11:41:17 AM

3204 Views
1 replies
0 kudos

Not able to connect to Neo4j Aura Db from databricks

I am trying to connect to a Neo4j AuraDb instance-f9374927. Created a free professional instance of Neo4j. I am able to connect to this instance, add nodes and relationships. Created a Databricks shared cluster 14.3 LTS (includes Apache Spark 3.5.0...

Data Engineering

3204 Views
1 replies
0 kudos

04-07-2025 11:41:17 AM

View Replies

Latest Reply

mark_ott
Databricks Employee

yesterday

0 kudos

The connection issue between your Databricks cluster and Neo4j AuraDB instance (f9374927) with the ServiceUnavailableException: No routing server available message is tied to network-level SSL configuration and connectivity rather than incorrect code...

0 kudos

yesterday

by dbxlearner • New Contributor II

05-22-2025 8:31:12 AM

2769 Views
3 replies
0 kudos

Deploying using Databricks asset bundles (DABs) in a closed network

Hello, I'm trying to deploy DBX workflows using DABs using an Azure DevOps pipeline, in a network that cannot download the required terraform databricks provider package online, due to firewall/network restrictions.I have followed this post: https://...

Data Engineering

2769 Views
3 replies
0 kudos

05-22-2025 8:31:12 AM

View Replies

Latest Reply

dbxlearner
New Contributor II

05-23-2025 4:49:11 PM

0 kudos

Another thing I noticed is, when running the 'databricks bundle debug terraform' command, it mentions these variables:I have tried setting these variables as environment variables in my ADO pipeline, specially the databricks terraform provider variab...

0 kudos

05-23-2025 4:49:11 PM

2 More Replies

by ticuss • New Contributor

Wednesday

53 Views
1 replies
0 kudos

Lakebase / Feature Store error: “Failed to get identity details for username” (service principal)

Hello,I’m running into a Lakebase / Feature Store issue related to service principal authentication when trying to log or read from the Databricks Feature Store. Migrating from the legacy online tables. Here’s the exact error:psycopg2.OperationalErr...

Data Engineering

53 Views
1 replies
0 kudos

Wednesday

View Replies

Latest Reply

mark_ott
Databricks Employee

yesterday

0 kudos

The error you’re encountering —psycopg2.OperationalError: FATAL: Failed to get identity details for username: "user_uuid" — typically arises from an OAuth identity mismatch or invalid token scope when a Databricks service principal is used to authent...

0 kudos

yesterday

Databricks Community

Forum Posts

Inference table Monitoring

SSRS Connect to Databricks

Exposing Databricks API in Salesforce

Databricks DLT Joins: Streaming table join with Delta table is reading 2 Billion records per batch

Ingesting Data from Event Hubs via Kafka API with Serverless Compute

Query to calculate cost of task from each job by day

Access to Databricks Volumes via Databricks Connect not working anymore

Resolved! Why am I getting NameError name _all_timezones_unchecked' is not defined

Workspace allows dbf path to install in Databricks 16.4 LTS

Batch reading from sql server tables with cdc on ssql server tables

Job runs on serverless eventhough Job config has cluster definitions

How to select performance mode for Databricks Delta Live Tables

Not able to connect to Neo4j Aura Db from databricks

Deploying using Databricks asset bundles (DABs) in a closed network

Lakebase / Feature Store error: “Failed to get identity details for username” (service principal)

Join Us as a Local Community Builder!

Why am I getting NameError name _all_timezones_unc...

Lakeflow Connect SchemaParseException: Illegal cha...

ModuleNotFound error when using transformWithState...

Encountering an error while setting up a single-no...

AUTO CDC API and sequence column