Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

IONA
by New Contributor III
  • 1093 Views
  • 1 replies
  • 2 kudos

Move/Merge Catalogs

Hi, We have a couple of Unity Catalogs. In each are schemas, some used, some old, some named temp_data, etc. Within those schemas are tables with the same erratic approach. The result of a couple of years of bad housekeeping. (who not experienced th...

Latest Reply
Louis_Frolio
Databricks Employee
  • 2 kudos

For moving tables and entire schemas, consider looking at the "SYNC" command. The SYNC Command - Your Primary Migration Tool: Databricks provides a SYNC command specifically designed for this exact use case - migrating tables from one location to ano...

liu
by Contributor
  • 656 Views
  • 3 replies
  • 2 kudos

Permission issue for pandas to read local files

I can use pandas to read local files in a notebook, such as those located in tmp. However, when I run two consecutive notebooks within the same job and read files with pandas in both, I encounter a permission error in the second notebook stating that ...

Latest Reply
BS_THE_ANALYST
Esteemed Contributor III
  • 2 kudos

@liu is there anything preventing you from moving this file into a Unity Catalog Volume and then reading it from there? Based on your original message, the "permission" message makes me think it's having an issue with opening the file; pe...
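
A minimal sketch of the Volume suggestion above, assuming the file has already been copied into a Unity Catalog Volume. The helper names and the catalog, schema, and volume values are hypothetical; only the `/Volumes/<catalog>/<schema>/<volume>/...` path layout comes from Unity Catalog itself.

```python
from pathlib import PurePosixPath


def volume_file_path(catalog: str, schema: str, volume: str, relative_path: str) -> str:
    """Build the /Volumes/<catalog>/<schema>/<volume>/<path> location of a file."""
    # Strip leading slashes so the relative part cannot reset the path to the root.
    return str(
        PurePosixPath("/Volumes", catalog, schema, volume, relative_path.lstrip("/"))
    )


def read_csv_from_volume(catalog: str, schema: str, volume: str, relative_path: str):
    """Read a CSV stored in a Volume with pandas (hypothetical helper)."""
    import pandas as pd  # imported lazily so the path helper works without pandas

    return pd.read_csv(volume_file_path(catalog, schema, volume, relative_path))
```

Both notebooks in the job could then call `read_csv_from_volume("main", "raw", "landing", "in/data.csv")` (placeholder names) instead of relying on a file in the driver's local file system.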

2 More Replies
PratikRudra
by New Contributor
  • 630 Views
  • 1 replies
  • 0 kudos

unable to create table on external location

Currently trying to connect a table on an external location and it fails with the error: [UNAUTHORIZED_ACCESS] Unauthorized access: PERMISSION_DENIED: request not authorized SQLSTATE: 42501, which seems like a pretty straightforward error but unable to find ...

Latest Reply
Khaja_Zaffer
Esteemed Contributor
  • 0 kudos

Hello @PratikRudra, thank you for sharing the error. I think there is probably a component missing. Writing table metadata (for example, to the _delta_log directory) requires the CREATE EXTERNAL TABLE capability on the external location; this ...

tana_sakakimiya
by Contributor
  • 1196 Views
  • 7 replies
  • 7 kudos

Resolved! MATERIALIZED VIEW TRIGGER ON UPDATE with external table as upstream table

Goal: implement an event-driven architecture without a trigger on file arrival. I would like to know whether a materialized view can update itself when its source table, which is an external table, is updated. Given that the source external table referencing data in d...

Latest Reply
tana_sakakimiya
Contributor
  • 7 kudos

It seems that my idea is a bad one, because materialized views don't appear to support incremental update for external locations. Incremental refresh for materialized views - Azure Databricks | Microsoft Learn

6 More Replies
IONA
by New Contributor III
  • 482 Views
  • 1 replies
  • 0 kudos

Changing paths to tables

Hi, My organization has many notebooks that reference tables in schemas with the three-part path catalog.schema.tablename. With a lack of foresight we hardcoded all of these paths in the code, and now the inevitable is happening and there is a need to rest...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @IONA, Definitely, I would say that’s even a common practice. Create a feature branch and make the necessary changes there. But once a day, merge into that feature branch all the changes that have appeared on your main branch. That way, you will a...
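
One way to avoid repeating the hardcoded three-part names described in the question is to resolve the catalog and schema in a single place. The sketch below is illustrative only: the `TableNamespace` class, the `namespace_from_env` helper, and the `APP_CATALOG` / `APP_SCHEMA` variable names are hypothetical and do not come from the thread.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class TableNamespace:
    """Catalog and schema resolved once instead of being hardcoded per notebook."""

    catalog: str
    schema: str

    def table(self, name: str) -> str:
        # Backtick-quote each part so unusual characters do not break the SQL.
        return ".".join(f"`{part}`" for part in (self.catalog, self.schema, name))


def namespace_from_env(
    default_catalog: str = "dev", default_schema: str = "default"
) -> TableNamespace:
    # APP_CATALOG and APP_SCHEMA are hypothetical environment variable names.
    return TableNamespace(
        catalog=os.environ.get("APP_CATALOG", default_catalog),
        schema=os.environ.get("APP_SCHEMA", default_schema),
    )
```

A notebook would then write something like `spark.table(namespace_from_env().table("orders"))`, so a later catalog or schema move becomes a configuration change rather than a code change.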

Malthe
by Contributor III
  • 2096 Views
  • 10 replies
  • 4 kudos

Resolved! Intermittent failures with auto loader on Azure

We've been using Auto Loader to ingest data from a storage account on Azure (format "cloudFiles"). Today, we're starting to see failures during the setup of event notification: 25/09/11 19:06:28 ERROR MicroBatchExecution: Non-interrupted exception thro...

Latest Reply
Khaja_Zaffer
Esteemed Contributor
  • 4 kudos

Hello @Malthe @Saska @MehdiJafari, When it was showing this error, did you try any alternative methods, such as using Databricks service credentials? For example: stream = spark.readStream.format("cloudFiles").option("cloudFiles.format", "json").option...

9 More Replies
ManojkMohan
by Honored Contributor II
  • 1039 Views
  • 1 replies
  • 2 kudos

Resolved! Simulating Real Time Streaming in Databricks free edition

Use Case: Kafka real-time streaming of network telemetry logs. In a real use case, approx. 40 TB of data can be streamed in real time per day. Architecture: Issue encountered: when I try to simulate Kafka-like streaming in Databricks itself, as this is a free e...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 2 kudos

Hi @ManojkMohan, Unfortunately there is no workaround here. Free Edition supports only serverless compute, and serverless has the following streaming limitations, one of which you just encountered: there is no support for default or time-based trigger i...
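
The truncated reply does not show which trigger remains usable. To my knowledge, the Databricks serverless limitations list `Trigger.AvailableNow` as the supported option, so a simulation on Free Edition would likely process whatever data is available and then stop, rather than poll on an interval. A minimal sketch under that assumption (the function name, checkpoint path, and table name are hypothetical):

```python
def start_available_now(df, checkpoint_path: str, target_table: str):
    """Start a streaming write that drains the currently available data and stops.

    `df` is expected to be a streaming DataFrame, for example one returned by
    spark.readStream; it is passed in so the helper itself does not import pyspark.
    """
    return (
        df.writeStream
        .option("checkpointLocation", checkpoint_path)
        # availableNow replaces a default or processingTime trigger.
        .trigger(availableNow=True)
        .toTable(target_table)
    )
```

Re-running this on a schedule approximates continuous ingestion in small batches, since the checkpoint records what has already been processed.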

LucasAntoniolli
by New Contributor III
  • 1145 Views
  • 4 replies
  • 4 kudos

Resolved! Problems with cluster shutdown in DLT

[Issue] DLT finishes processing, but cluster remains active due to log write error. Hi everyone, I'm running into a problem with my DLT pipeline and was hoping someone here could help or has experienced something similar. Problem Description: The pipeline...

Latest Reply
nayan_wylde
Esteemed Contributor
  • 4 kudos

Can you please try one more option. If you’re on Preview, move to Current (or vice versa). Sometimes the regression only exists in one channel.

3 More Replies
IONA
by New Contributor III
  • 493 Views
  • 1 replies
  • 0 kudos

Decoupling Power BI reports from schemas

Hi, I'm sure many of you have Power BI reports that use the native Databricks connector to pull data from a schema to fuel a wonderful dashboard. If the source schema was moved to a different catalog or a table renamed, then the PBI connection would be...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @IONA, You can parametrize your data source using Power Query parameters. Just define parameters for catalog, schema, etc. I have such a setup in one of my projects. We parametrized our PBI semantic models, and when we are deploying a semantic model to a...

RevanthV
by New Contributor III
  • 747 Views
  • 3 replies
  • 6 kudos

Resolved! Issue with Auto Liquid clustering

I have written data to a table using clusterByAuto set to true, but the clustering keys are not selected automatically when I do a desc detail on the table. Screenshot below. Why are clustering columns not being selected automatically? Repro steps: Create ...

Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 6 kudos

Hi @RevanthV, As @K_Anudeep correctly suggested, it could be the case that your table is too small to benefit from liquid clustering. Another possibility is that you're using a runtime lower than 15.4 LTS.

2 More Replies
Soumenkumar
by New Contributor
  • 390 Views
  • 1 replies
  • 2 kudos
Latest Reply
K_Anudeep
Databricks Employee
  • 2 kudos

Hello @Soumenkumar, I believe OGG has a Databricks target that stages to cloud storage (ADLS or S3) and then runs MERGE into Delta tables. This is designed for UC and documented in the official Oracle Docs. In Unity Catalog, create a storage credentia...

arajshree
by New Contributor
  • 443 Views
  • 1 replies
  • 0 kudos

Cross region delta table creation in Azure

Hi Community. I have a use case as below. 1. I have an Azure Databricks [ADB] in NCUS which has Unity Catalog [UC] enabled using the NCUS region metastore. 2. The ADLS Gen2 storage is in EUS. I am trying to register an existing Delta table in UC and also cr...

Latest Reply
nayan_wylde
Esteemed Contributor
  • 0 kudos

Seems like a networking issue. In a notebook attached to the custom cluster, try dbutils.fs.ls("abfss://<container>@<account>.dfs.core.windows.net/<path>") to confirm reachability before running CREATE EXTERNAL TABLE. (If this hangs or errors, it’s ...
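
The reachability check suggested above can be wrapped so that an error is reported instead of aborting the notebook. This is a sketch only: `abfss_uri` and `check_reachable` are hypothetical helper names, and the listing function is passed in (in a notebook it would be `dbutils.fs.ls`) so the code does not depend on `dbutils` being defined.

```python
def abfss_uri(container: str, account: str, path: str = "") -> str:
    """Build an abfss:// URI for an ADLS Gen2 location."""
    return f"abfss://{container}@{account}.dfs.core.windows.net/{path.lstrip('/')}"


def check_reachable(list_fn, uri: str):
    """Try to list `uri` with `list_fn` and report success or the error raised.

    Returns a (reachable, detail) tuple. A call that hangs is not detected here;
    it would still have to be interrupted manually.
    """
    try:
        entries = list_fn(uri)
    except Exception as exc:  # report any failure rather than stopping the notebook
        return False, f"{type(exc).__name__}: {exc}"
    return True, f"{len(entries)} entries listed"
```

In a notebook this would be called as `check_reachable(dbutils.fs.ls, abfss_uri("<container>", "<account>", "<path>"))` before attempting the CREATE EXTERNAL TABLE statement.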

guptaharsh
by New Contributor III
  • 1164 Views
  • 5 replies
  • 4 kudos

Resolved! How to add webhook notification in DLT pipeline through yml

Hi team, I am trying to create a Slack webhook notification in a DLT pipeline leveraging jobs.yml. targets:  dev:    # The default target uses 'mode: development' to create a development copy.    # - Deployed resources get prefixed with '[dev my_user_...

Latest Reply
guptaharsh
New Contributor III
  • 4 kudos

@szymon_dybczak, I have gone through the Databricks docs. I have added the webhook notifications to the DLT pipeline, and it works fine for me. # The job that triggers api_data_pipeline. resources:  jobs:    api_data_job:      name: api_data_job      schedule: ...

4 More Replies
AanchalSoni
by New Contributor III
  • 787 Views
  • 4 replies
  • 0 kudos

Azure not getting listed in create external location

Hi, I'm trying to create a pipeline using Azure; however, Azure is not listed in the drop-down of Catalog Explorer -> Create External Location. I'm using the community version for practice. Please advise.

Latest Reply
nayan_wylde
Esteemed Contributor
  • 0 kudos

@AanchalSoni Yes, Databricks Free Edition has this limitation: you cannot create a customized external location. It only supports S3 for now, and that location is managed by Databricks. Look at the limitations below.

3 More Replies
