Data Engineering

Forum Posts

Sorted by:

by wise_owl • New Contributor III

06-12-2024 1:55:49 PM

624 Views
0 replies
0 kudos

Help needed on Cluster Configuration since I'm confused AF ( Worker + Driver )

Supposedly there are 4 major types of cluster in Datbricks that are- General Purpose, Storage Optimized, Memory Optimized and Compute Optimized Clusters but I'm not able to find detailed information as on which cluster to choose specifically in which...

Data Engineering

624 Views
0 replies
0 kudos

06-12-2024 1:55:49 PM

by Shahe • New Contributor

06-04-2024 1:43:20 AM

3058 Views
2 replies
0 kudos

Azure Databricks Metrics to Prometheus?

What is the best method to expose Azure Databricks metrics to Prometheus specifically? And is it possible to get the underlying Spark metrics also? All I can see clearly defined in the documentation is the serving endpoint metrics:https://learn.micro...

Data Engineering

3058 Views
2 replies
0 kudos

06-04-2024 1:43:20 AM

View Replies

Latest Reply

DanielB
New Contributor II

06-12-2024 1:48:34 PM

0 kudos

HelloI don't have databricks running as pod in an aks cluster.. It's working on azure as saas.. What should I do the export the metrics to prometheus?

0 kudos

06-12-2024 1:48:34 PM

1 More Replies

by VanessaSousa_Ol • New Contributor

06-12-2024 1:26:55 PM

642 Views
0 replies
0 kudos

RLS and CLS with delta sharing

RLS and CLS is possible to apply in tables that are shared using unity catalog?

Data Engineering

642 Views
0 replies
0 kudos

06-12-2024 1:26:55 PM

by david1144 • New Contributor III

06-12-2024 1:06:37 PM

517 Views
1 replies
0 kudos

Streaming data with kafka

What is the Best way for implement streaming data flow dron kafka to databricks (delta tables)

Data Engineering

517 Views
1 replies
0 kudos

06-12-2024 1:06:37 PM

View Replies

Latest Reply

GCosta
New Contributor II

06-12-2024 1:16:43 PM

0 kudos

Structured streaming:https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html

0 kudos

06-12-2024 1:16:43 PM

by GCosta • New Contributor II

06-12-2024 1:01:41 PM

663 Views
0 replies
0 kudos

How to write data to Confluent Kafka with SchemaRegistry format on sparkstructured?

Hi There!I am to trying write a batch data to kafka topic with schema registry in databricks using pyspark, i serialize the data with pyspark to_avro function and write it to the topic, but the consumers can’t read the schema id. If they do not separ...

Data Engineering

663 Views
0 replies
0 kudos

06-12-2024 1:01:41 PM

by Mlaricobar94 • New Contributor

06-12-2024 12:54:57 PM

481 Views
0 replies
0 kudos

Collect Spark UI statistics to analyze the performance for several Spark Applications

To identify the reasons for a data process poor performance, we need to navigate and analyze the metrics in the Spark UI manually... However, replicating those steps for a giant group of spark applications would be very expensive in times...Given thi...

Data Engineering

481 Views
0 replies
0 kudos

06-12-2024 12:54:57 PM

by Nachatz • New Contributor

06-12-2024 12:52:40 PM

292 Views
0 replies
0 kudos

I’m at summit!

Data Engineering

292 Views
0 replies
0 kudos

06-12-2024 12:52:40 PM

by Serena • New Contributor

06-12-2024 12:46:18 PM

262 Views
0 replies
0 kudos

Data architecture

Check out our platform architecture italgas-from-gas-pipelines-to-data-pipelines-fueling-our-reporting-with-the-latest-innovations-7f00e20ba115?source=social.linkedin

Data Engineering

262 Views
0 replies
0 kudos

06-12-2024 12:46:18 PM

by Gaurav19 • New Contributor III

06-12-2024 6:02:33 AM

2753 Views
3 replies
1 kudos

Resolved! Databricks API - list job runs doesn't have 'task run id'

Hi all,I am calling get job run list API to get all task ids and refer them in dbt-artifacts view created by dbt job run. The question is I can see 'task run id' on screen but it doesn't come back in api response? Is there a way to get it? I checked ...

Data Engineering

2753 Views
3 replies
1 kudos

06-12-2024 6:02:33 AM

View Replies

Latest Reply

Gaurav19
New Contributor III

06-12-2024 12:39:07 PM

1 kudos

Never mind, I have found task_run_id present in getrun api https://docs.databricks.com/api/azure/workspace/jobs/getrunI overlooked at first instance as it is buried under nested json structuretasks[] > run_id.This clarifies and solves my problem!

1 kudos

06-12-2024 12:39:07 PM

2 More Replies

by Sbrfame84 • New Contributor

06-12-2024 12:32:56 PM

763 Views
1 replies
0 kudos

Delta Lake vs Iceberg ..Which is better ?

Which is better for meta data handling ?

Data Engineering

763 Views
1 replies
0 kudos

06-12-2024 12:32:56 PM

View Replies

Latest Reply

manjuGowda
New Contributor II

06-12-2024 12:39:05 PM

0 kudos

Delta lake

0 kudos

06-12-2024 12:39:05 PM

by thiagoawstest • Contributor

06-12-2024 9:34:03 AM

1609 Views
1 replies
0 kudos

Resolved! Unity Catalog mount S3

Hi, I still have some questions, I have a Databricks on AWS and I need to mount S3 bucksts.According to the documentation, it is recommended to do it through the Unity Catalog, but how would I go about reading data from a notebook that would be mount...

Data Engineering

1609 Views
1 replies
0 kudos

06-12-2024 9:34:03 AM

View Replies

Latest Reply

thiagoawstest
Contributor

06-12-2024 12:32:50 PM

0 kudos

Returning, I already understood, I'm marking it as resolved.

0 kudos

06-12-2024 12:32:50 PM

by NathanSundarara • Contributor

06-12-2024 12:05:30 PM

1396 Views
3 replies
0 kudos

I’m at DAIS

Having nice time at DaIs

Data Engineering

1396 Views
3 replies
0 kudos

06-12-2024 12:05:30 PM

View Replies

Latest Reply

mjiang
New Contributor II

06-12-2024 12:30:57 PM

0 kudos

Really enjoyed the keynote and breakout sessions!

0 kudos

06-12-2024 12:30:57 PM

2 More Replies

by TedHaley • New Contributor

06-12-2024 12:25:03 PM

231 Views
0 replies
0 kudos

Anyone interested in Person Data

Read out!

Data Engineering

231 Views
0 replies
0 kudos

06-12-2024 12:25:03 PM

by RP8 • New Contributor

06-12-2024 12:18:23 PM

382 Views
0 replies
0 kudos

Serverless Timelines

What’s the deadline to move to serverless compute

Data Engineering

382 Views
0 replies
0 kudos

06-12-2024 12:18:23 PM

by mmendez1012 • New Contributor

06-12-2024 11:24:29 AM

461 Views
0 replies
0 kudos

Workflows

Someone Can give me some advices about parquet size files whem moving data

Data Engineering

461 Views
0 replies
0 kudos

06-12-2024 11:24:29 AM

User

Count

1611

768

347

286

252

Databricks Community

Forum Posts

Help needed on Cluster Configuration since I'm confused AF ( Worker + Driver )

Azure Databricks Metrics to Prometheus?

RLS and CLS with delta sharing

Streaming data with kafka

How to write data to Confluent Kafka with SchemaRegistry format on sparkstructured?

Collect Spark UI statistics to analyze the performance for several Spark Applications

I’m at summit!

Data architecture

Resolved! Databricks API - list job runs doesn't have 'task run id'

Delta Lake vs Iceberg ..Which is better ?

Resolved! Unity Catalog mount S3

I’m at DAIS

Anyone interested in Person Data

Serverless Timelines

Workflows

Join Us as a Local Community Builder!

Unity Catalog Table in Databricks Asset Bundle

Databricks data engineer associate exam

How to delete/empty notebook output

Databricks Cluster Policies

toml file syntax highlighting