cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

thiagoawstest
by Contributor
  • 1116 Views
  • 1 replies
  • 0 kudos

Azure Devops - Entra ID - AWS Databricks

Hi, I need to integrate Azure Devops repos with AWS Databricks, but not via personal token.I need it via main service, integrated with Azure Entra ID, using Azure Databricks when I go to create main service, "Entra ID application ID" appears, but in ...

  • 1116 Views
  • 1 replies
  • 0 kudos
christian_chong
by New Contributor III
  • 1278 Views
  • 1 replies
  • 0 kudos

Resolved! unity catalog with external table and column masking

Hi everbody, I am facing a issue with spark structured steaming. here is a sample of my code:   df = spark.readStream.load(f"{bronze_table_path}") df.writeStream \ .format("delta") \ .option("checkpointLocation", f"{silver_checkpoint}") \ .option("me...

  • 1278 Views
  • 1 replies
  • 0 kudos
Latest Reply
christian_chong
New Contributor III
  • 0 kudos

My first message was not well formatted. i wrote :  df = spark.readStream.load(f"{bronze_table_path}") df.writeStream \ .format("delta") \ .option("checkpointLocation", f"{silver_checkpoint}") \ .option("mergeSchema", "true") \ .trigger(availabl...

  • 0 kudos
philipkd
by New Contributor III
  • 2247 Views
  • 1 replies
  • 0 kudos

Cannot get past Query Data tutorial for Azure Databricks

I created a new workspace on Azure Databricks, and I can't get past this first step in the tutorial: DROP TABLE IF EXISTS diamonds; CREATE TABLE diamonds USING CSV OPTIONS (path "/databricks-datasets/Rdatasets/data-001/csv/ggplot2/diamonds.csv", hea...

  • 2247 Views
  • 1 replies
  • 0 kudos
Latest Reply
dollyb
Contributor
  • 0 kudos

Struggling with this as well. So using dbfs:/ with CREATE TABLE statement works on AWS, but not Azure?

  • 0 kudos
Devsql
by New Contributor III
  • 4262 Views
  • 1 replies
  • 0 kudos

Measure size of all tables in Azure databricks

Hi Team,Currently I am trying to find size of all tables in my Azure databricks, as i am trying to get idea of current data loading trends, so i can plan for data forecast ( i.e. Last 2 months, approx 100 GB data came-in, so in next 2-3 months there ...

  • 4262 Views
  • 1 replies
  • 0 kudos
Latest Reply
Devsql
New Contributor III
  • 0 kudos

Hi @Retired_mod,1-  Regarding this issue i had found below link:https://kb.databricks.com/sql/find-size-of-table#:~:text=You%20can%20determine%20the%20size,stats%20to%20return%20the%20sizeNow to try above link, I need to decide: Delta-Table Vs Non-De...

  • 0 kudos
yvuignie
by Contributor
  • 1308 Views
  • 1 replies
  • 0 kudos

Asset Bundles webhook not working

Hello,The webhook notifications in databricks jobs defined in the asset bundles are not taken into account and therefore not created. For instance, this is not working:resources: jobs: job1: name: my_job webhook_notifications: on...

  • 1308 Views
  • 1 replies
  • 0 kudos
Latest Reply
yvuignie
Contributor
  • 0 kudos

Hello @Retired_mod ,Thank you for your help.However we did check the job configuration multiple time. If we substitue 'webhook_notifications' with 'email_notifications' it works, so the syntax is correct. Here is a sample of our configuration:For the...

  • 0 kudos
N_M
by Contributor
  • 1851 Views
  • 1 replies
  • 0 kudos

Access historical injected data of COPY INTO command

Dear Community,I'm using the COPY INTO command to automate the staging of files that I get in an S3 bucket into specific delta tables (with some transformation on the fly).The command works smoothly, and files are indeed inserted only once (writing i...

  • 1851 Views
  • 1 replies
  • 0 kudos
ChingizK
by New Contributor III
  • 3791 Views
  • 2 replies
  • 1 kudos

Resolved! Workflow Failure Alert Webhooks for OpsGenie

I'm trying to set up a Workflow Job Webhook notification to send an alert to OpsGenie REST API on job failure. We've set up Teams & Email successfully.We've created the Webhook and when I configure "On Failure" I can see it in the JSON/YAML view. How...

Screenshot 2024-04-12 at 1.15.33 PM.png Screenshot 2024-04-12 at 1.17.27 PM.png
Data Engineering
jobs
opsgenie
webhooks
Workflows
  • 3791 Views
  • 2 replies
  • 1 kudos
Latest Reply
portoedu
New Contributor III
  • 1 kudos

Hi guys,I found a workaround by creating an email integration in opsgenie and then creating a databricks notification destination with that email.

  • 1 kudos
1 More Replies
AdventureAce
by New Contributor III
  • 741 Views
  • 0 replies
  • 0 kudos

Short-live token from Unity Catalog

What is this short-lived token shared by unity-catalog in step 4 and 5 here? And how does the cloud storage authenticate the token generated by unity catalog?  

AdventureAce_0-1718918698276.png
  • 741 Views
  • 0 replies
  • 0 kudos
Pálmi
by New Contributor II
  • 1434 Views
  • 2 replies
  • 1 kudos

IoT hub with kafka connector - how to decode the enqueued timestamp and device id

I'm reading data from the default endpoint of an IoT hub in azure using the kafka connector in Databricks.  Most data items are straight forward, but the device id and the timestamp I haven't been able to properly decodeFor example, the key-value map...

  • 1434 Views
  • 2 replies
  • 1 kudos
Latest Reply
Erik
Valued Contributor III
  • 1 kudos

https://github.com/Azure/azure-event-hubs-for-kafka/issues/56#issuecomment-1432006831

  • 1 kudos
1 More Replies
aozero
by New Contributor II
  • 1633 Views
  • 3 replies
  • 0 kudos

Deleting data programmatically from databricks live delta tables

Hello all, I am relatively new in data engineering and working on a project requiring me to programmatically delete data from delta live tables. However, I found that simply stopping the streaming job and deleting rows from the delta tables caused th...

  • 1633 Views
  • 3 replies
  • 0 kudos
Latest Reply
aozero
New Contributor II
  • 0 kudos

Hi @shan_chandra Full refreshing brings back the deleted data since it exists in the pubsub source.  

  • 0 kudos
2 More Replies
Eiki
by New Contributor
  • 528 Views
  • 1 replies
  • 0 kudos

How to use the same job cluster in diferents job runs inside the one workflow

I created a Workflow with notebooks and some job runs, but I would to use only one job cluster to run every job runs, without creating a new job cluster for each job run. Because I didn't want to increase the execution time with each new job cluster ...

  • 528 Views
  • 1 replies
  • 0 kudos
Latest Reply
brockb
Databricks Employee
  • 0 kudos

Hi,If I understand correctly, you are hoping to reduce overall job execution time by reducing the Cloud Service Provider instance provisioning time. Is that correct?If so, you may want to consider: Using a Pool of instances: https://docs.databricks.c...

  • 0 kudos
diego_poggioli
by Contributor
  • 3116 Views
  • 1 replies
  • 0 kudos

Streaming foreachBatch _jdf jvm attribute not supported

I'm trying to perform a merge inside a streaming foreachbatch using the command: microBatchDF._jdf.sparkSession().sql(self.merge_query)Streaming runs fine if I use a Personal cluster while if I use a Shared cluster streaming fails with the following ...

  • 3116 Views
  • 1 replies
  • 0 kudos
Latest Reply
holly
Databricks Employee
  • 0 kudos

Can you share what runtime your cluster is using?  This error doesn't surprise me, Unity Catalog Shared clusters have many security limitations, but the list is reducing over time. https://docs.databricks.com/en/compute/access-mode-limitations.html#s...

  • 0 kudos
nehaa
by New Contributor II
  • 943 Views
  • 2 replies
  • 0 kudos

Databricks dashboard publish

How to publish the dashboard created from a notebook? I don't see publish option within the File anymoreWhen referred to the old video they seem to have an option to publish the dashboard  

nehaa_0-1718822255560.png nehaa_2-1718822335195.png
  • 943 Views
  • 2 replies
  • 0 kudos
Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

Can you share the link to the video you are referring?As per docs, no publish option is currently available, you can do a Present Dashboard to see it.https://docs.databricks.com/en/notebooks/dashboards.html

  • 0 kudos
1 More Replies
jenitjain
by New Contributor
  • 764 Views
  • 2 replies
  • 0 kudos

Certifications questions

What are the timings and days between which we can get certified? Can we purchase a certification at the location or are we supposed to purchase it beforehand?

  • 764 Views
  • 2 replies
  • 0 kudos
Latest Reply
Cert-Team
Databricks Employee
  • 0 kudos

Online exams can be purchased and taken anytime. Is this question related to DAIS?

  • 0 kudos
1 More Replies
guangyi
by Contributor III
  • 802 Views
  • 0 replies
  • 0 kudos

How to make a DLT pipeline trigger another pipeline?

For example, I have 2 DLT pipelines, one is used for computing user gender distribution, another is used for compute user location distribution. In the first pipeline, I follow the medallion architecture creating the bronze, silver, gold table one by...

  • 802 Views
  • 0 replies
  • 0 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels