cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

der
by Valued Contributor
  • 56 Views
  • 4 replies
  • 2 kudos

spark.databricks.sql.excel.enabled false at cluster level

Native databricks excel data source is GAhttps://www.reddit.com/r/databricks/comments/1t4un82/native_excel_support_is_now_ga/https://docs.databricks.com/aws/en/query/formats/excelHowever, as long as it is not possible to read from another adress than...

  • 56 Views
  • 4 replies
  • 2 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 2 kudos

Hi  @der ,Most likely because spark.databricks.sql.excel.enabled is a Databricks SQL/session-level internal config, not a SparkConf setting.This specific key appears to be read from the Spark SQL session config, so setting it after the notebook sessi...

  • 2 kudos
3 More Replies
npalne
by Visitor
  • 39 Views
  • 1 replies
  • 1 kudos

Regarding Databricks Certified Data Engineer Professional recent syllabus change

Dear Team,I am planning to appear Data Engineer Professional certification in June month end. Now I wanted to know will there be any change in syllabus or exam pattern. Because I heard syllabus has been changed for Data Engineer Associate exam.Kind r...

  • 39 Views
  • 1 replies
  • 1 kudos
Latest Reply
nidhin
New Contributor II
  • 1 kudos

As far as I can see, the latest official syllabus/exam guide is available here,https://www.databricks.com/sites/default/files/2026-05/databricks-certified-data-engineer-associate-exam-guide-may-2026.pdfDatabricks updates the syllabus from time to tim...

  • 1 kudos
AnandB
by Visitor
  • 37 Views
  • 1 replies
  • 0 kudos

Access denied to course

when I tried to access Data engineering + Generative AI course on Databricks account it says access denied contact your admin.. suggest what to do next?

  • 37 Views
  • 1 replies
  • 0 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @AnandB ,Try following:1. Try the Academy portal directlyLog in directly at https://customer-academy.databricks.com/learn (or https://partner-academy.databricks.com/learn if you're a partner), clear your browser cache and cookies first, then searc...

  • 0 kudos
vedanth
by New Contributor
  • 55 Views
  • 1 replies
  • 0 kudos

Salesforce Connector - Lakeflow Connect 400 Error

HI All,I have been trying to setup Salesforce using Lakeflow Connect and followed instructions on the docshttps://docs.databricks.com/aws/en/connect/managed-ingestion#sfdcHowever I face into invalid_grant error  However login history on salesforce sh...

vedanth_0-1779009668052.png
  • 55 Views
  • 1 replies
  • 0 kudos
Latest Reply
GaneshI
New Contributor
  • 0 kudos

Hi Vedanth,The invalid_grant error usually occurs due to authentication or OAuth configuration issues between Salesforce and Databricks Lakeflow Connect.Could you please verify the following points:Ensure the Salesforce user account is not locked and...

  • 0 kudos
Yannick_B
by New Contributor
  • 66 Views
  • 2 replies
  • 0 kudos

[DELTA_CREATE_EXTERNAL_TABLE_WITHOUT_TXN_LOG]

We are testing Delta writer in our environment  to create bronze tables and recently, I just needed to add one table to the notebook code and rerun the whole notebook that failed because of this error : [DELTA_CREATE_EXTERNAL_TABLE_WITHOUT_TXN_LOG] Y...

  • 66 Views
  • 2 replies
  • 0 kudos
Latest Reply
balajij8
Contributor III
  • 0 kudos

@Yannick_B You are trying to register an external table pointing to a directory that does not contain required Delta transaction logs (_delta_log folder) and hence you see the error.When you run External Table Command, Databricks generally expects th...

  • 0 kudos
1 More Replies
GaneshI
by New Contributor
  • 76 Views
  • 1 replies
  • 0 kudos

Does enabling Change Data Feed on a Delta table affect OPTIMIZE and ZORDER performance?

Does enabling Change Data Feed on a Delta table affect OPTIMIZE and ZORDER performance?After enabling CDF on several large Delta tables, our OPTIMIZE jobs are taking noticeably longer. Is this expected, and are there any tuning parameters to minimize...

  • 76 Views
  • 1 replies
  • 0 kudos
Latest Reply
Sumit_7
Honored Contributor III
  • 0 kudos

@GaneshI Yes, some overhead is expected. Databricks recommends predicate-based OPTIMIZE for large tables and Liquid Clustering over ZORDER.

  • 0 kudos
ThiagoRosetti
by New Contributor
  • 103 Views
  • 1 replies
  • 0 kudos

Serverless Compute connectivity issues with .com.br domains vs. Classic Clusters Spark hangs

Hi everyone,I'm facing two specific issues in my Databricks Premium workspace (AWS - sa-east-1).Serverless Connectivity Issue: When using Serverless compute, I can successfully call APIs ending in .com, but calls to .com.br domains fail with connecti...

  • 103 Views
  • 1 replies
  • 0 kudos
Latest Reply
GaneshI
New Contributor
  • 0 kudos

Hi there,Great breakdown of the symptoms — these are actually two distinct issues likely sharing a common root cause in your VPC/network configuration. Let me address both:Issue 1: Serverless Compute — .com.br DNS Resolution FailureRoot CauseServerle...

  • 0 kudos
andytate
by New Contributor
  • 132 Views
  • 2 replies
  • 0 kudos

Lakebase not showing up

I am fairly new to Databricks and am learning it because a company I am working on is going to use it. One of the things they are going to use is Lakebase postgres so I thought I'd set it up on my personal account. First I don't see app switcher, sec...

  • 132 Views
  • 2 replies
  • 0 kudos
Latest Reply
rdokala
New Contributor
  • 0 kudos

If it is available, you would see at Compute->Lakebase and tabs for Provisioned and Autoscaling. This option Lakebase is next to Apps. There is another option dotted grid on the top right corner, the option just before your profile name, if you expan...

  • 0 kudos
1 More Replies
GaneshI
by New Contributor
  • 113 Views
  • 1 replies
  • 0 kudos

What is the recommended approach to enforce row-level security in Unity Catalog for external BI tool

We connect Tableau and Power BI to our Databricks SQL warehouse via OAuth tokens. Does Unity Catalog row filters apply at the SQL layer regardless of the BI tool, or do we need additional enforcement at the warehouse level?

  • 113 Views
  • 1 replies
  • 0 kudos
Latest Reply
Lu_Wang_ENB_DBX
Databricks Employee
  • 0 kudos

Unity Catalog row filters apply at the SQL/query layer, so if Tableau or Power BI is querying a Databricks SQL warehouse, the filters are enforced there — you do not need a separate warehouse-level row-filter feature. Row filters and column masks are...

  • 0 kudos
DazzaiDe
by New Contributor III
  • 148 Views
  • 2 replies
  • 1 kudos

Best Practices: 1 job per 1 target table

We’re currently designing our Medallion Architecture pipelines using Lakeflow Jobs, and I wanted to get some opinions on orchestration best practices.Right now, our approach is essentially 1 job per target table (for example, each Bronze/Silver/Gold ...

  • 148 Views
  • 2 replies
  • 1 kudos
Latest Reply
LBoydston
New Contributor II
  • 1 kudos

We typically organize our workloads with one job per catalog, and then use one or more pipelines to load tables into the appropriate schemas. As our data engineers ingest raw data, this structure is primarily applied in the Silver and Gold layers of ...

  • 1 kudos
1 More Replies
Garybary
by New Contributor III
  • 1620 Views
  • 3 replies
  • 2 kudos

Resolved! Scheduling jobs with table update triggers

Hi all,Lately I've been experimenting with the newish feature of scheduling jobs on a table update trigger. There's one thing thats blokcing me from implementing it however and I was hoping someone found a solution to it.We occasionally perform a vac...

  • 1620 Views
  • 3 replies
  • 2 kudos
Latest Reply
SteveOstrowski
Databricks Employee
  • 2 kudos

Hi @Garybary, Quick clarification on how table update triggers actually behave, because this changes the answer significantly. Table update triggers fire on data-changing operations only (writes, merges, updates, deletes). A standalone VACUUM does NO...

  • 2 kudos
2 More Replies
TalessRocha
by New Contributor II
  • 6094 Views
  • 11 replies
  • 8 kudos

Resolved! Connect to azure data lake storage using databricks free edition

Hello guys, i'm using databricks free edition (serverless) and i am trying to connect to a azure data lake storage.The problem I'm having is that in the free edition we can't configure the cluster so I tried to make the connection via notebook using ...

  • 6094 Views
  • 11 replies
  • 8 kudos
Latest Reply
pjvi
New Contributor II
  • 8 kudos

If you want to read from your Azure storage account using Databricks Free Edition, you can add a specific option when reading:spark.read.option("fs.azure.account.key.<storage-account-name>.dfs.core.windows.net",                  "your_storage_account...

  • 8 kudos
10 More Replies
maikel
by Contributor II
  • 488 Views
  • 4 replies
  • 1 kudos

Resolved! Uploading file to volume and start ingestion job

Hello Community!I am writing to you with my idea about data ingestion job which we have to implement in our project.The data which we have are in CSV file format and depending on the case it differs a little bit. Before uploading we pivoting csv file...

  • 488 Views
  • 4 replies
  • 1 kudos
Latest Reply
maikel
Contributor II
  • 1 kudos

Yeah, understood. Thank you very much once again! 

  • 1 kudos
3 More Replies
maikel
by Contributor II
  • 100 Views
  • 0 replies
  • 0 kudos

Job tasks monitoring

Hello Community,We have a case in our project that we would like to solve in an elegant and scalable manner. As always, I would really appreciate your suggestions and experience.In short:We have a multi-step job consisting of 4 stages. In one of the ...

  • 100 Views
  • 0 replies
  • 0 kudos
Danish11052000
by Contributor
  • 1234 Views
  • 7 replies
  • 1 kudos

Resolved! How should I correctly extract the full table name from request_params in audit logs?

’m trying to build a UC usage/refresh tracking table for every workspace. For each workspace, I want to know how many times a UC table was refreshed or accessed each month. To do this, I’m reading the Databricks audit logs and I need to extract only ...

  • 1234 Views
  • 7 replies
  • 1 kudos
Latest Reply
SteveOstrowski
Databricks Employee
  • 1 kudos

Hi @Danish11052000, You are on the right track with the COALESCE approach. The reason for the inconsistency is that different Unity Catalog action types populate different keys in request_params. Here is a breakdown of the key fields and which action...

  • 1 kudos
6 More Replies
Labels