Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practices. Join the conversation today and unlock a wealth of collective wisdom to enhance your experience and drive success.
In today’s data-driven world, the role of a data engineer is critical in designing and maintaining the infrastructure that allows for the efficient collection, storage, and analysis of large volumes of data. Databricks certifications hold significan...
This is great! I have worked with Databricks for almost three years and have decided to pursue the Databricks Engineer Professional certification. This will certainly help in setting up an effective plan.
Today Databricks announced the release of the Databricks AI Security Framework (LinkedIn Post). You can download the paper (PDF) from the blog post. Anyone else download this and have thoughts? My first thought is it's a great start and has an excellent G...
When creating a new Workspace in GCP, the default GCP External Location is wrong. It's easily fixed: go to Catalog (on the left) > External Data (at the bottom) > External Locations > choose the connection and edit the URL by deleting the second BucketId af...
I created this article on LinkedIn to allow both this community and the Apache Spark user community to have access to it. It is particularly useful for data engineers who want to have a basic understanding of what Generative AI with Spark can do. Leverag...
This is an excellent step for #databricks notebooks. An integrated debugger and a CLI in the notebook terminal are a big step towards a fully functional cloud IDE.
Introduction
Financial fraud is a significant concern for businesses and consumers alike. I have written about this concern a few times in LinkedIn articles. Machine learning offers powerful tools to combat this issue by automatically identifying sus...
Looking to build a machine learning model for detecting fraudulent transactions using PySpark’s MLlib? Generating synthetic transaction data provides a dataset for model training without using sensitive real-world data and enables the creation of diverse...
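Below is a minimal sketch of that workflow: synthetic transactions fed into an MLlib classifier. The column names, fraud rate, and choice of RandomForest are illustrative assumptions, not details from the original post.

```python
import random

from pyspark.sql import SparkSession
from pyspark.ml.feature import VectorAssembler
from pyspark.ml.classification import RandomForestClassifier
from pyspark.ml.evaluation import BinaryClassificationEvaluator

spark = SparkSession.builder.appName("fraud-demo").getOrCreate()

# Generate synthetic transactions; fraudulent ones skew toward larger amounts.
rows = []
for _ in range(10_000):
    is_fraud = 1 if random.random() < 0.02 else 0
    amount = random.uniform(500, 5000) if is_fraud else random.uniform(1, 500)
    hour = random.randint(0, 23)
    rows.append((amount, hour, is_fraud))
df = spark.createDataFrame(rows, ["amount", "hour", "label"])

# MLlib expects the features packed into a single vector column.
assembled = VectorAssembler(inputCols=["amount", "hour"],
                            outputCol="features").transform(df)
train, test = assembled.randomSplit([0.8, 0.2], seed=42)

model = RandomForestClassifier(labelCol="label",
                               featuresCol="features").fit(train)
auc = BinaryClassificationEvaluator(labelCol="label").evaluate(model.transform(test))
print(f"Test AUC: {auc:.3f}")
```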
If you have thought about making your code inside Databricks notebooks more reusable and organized, and you have considered implementing a design pattern or class-level separation in Databricks, the answer is yes: I am going to tell you the deta...
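As a hedged illustration of the class-level separation idea (the module path `utils/ingest.py` and all names here are hypothetical, since the post is truncated), the reusable class lives in a workspace file rather than in inline notebook cells:

```python
# utils/ingest.py -- importable from any notebook in the same repo,
# since Databricks Repos puts workspace files on sys.path.
class BronzeIngestor:
    """Encapsulates a reusable ingest step instead of copy-pasted cells."""

    def __init__(self, spark, source_path: str, target_table: str):
        self.spark = spark
        self.source_path = source_path
        self.target_table = target_table

    def run(self) -> None:
        (self.spark.read.format("json")
             .load(self.source_path)
             .write.mode("append")
             .saveAsTable(self.target_table))


# In a notebook cell:
# from utils.ingest import BronzeIngestor
# BronzeIngestor(spark, "/mnt/raw/events", "bronze.events").run()
```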
Thanks! I have spent quite some time figuring out what the best way is. Your approach is certainly a valid one. I myself prefer to package reused classes in a JAR (we mainly code in Scala). Works fine too.
I recently saw an article from Databricks titled "Scalable Spark Structured Streaming for REST API Destinations". A great article focusing on continuous Spark Structured Streaming (SSS), about a year old. I then decided, given customer demands, to wo...
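For readers who have not seen the article, the core pattern is pushing each micro-batch to a REST endpoint via foreachBatch. This sketch assumes a placeholder endpoint and uses the built-in rate source; a production version would batch requests and add retries:

```python
import requests
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("sss-rest-demo").getOrCreate()

def post_batch(batch_df, batch_id: int) -> None:
    # POST each row of the micro-batch as a JSON document.
    for row in batch_df.toJSON().collect():
        requests.post("https://example.com/api/events",  # placeholder URL
                      data=row,
                      headers={"Content-Type": "application/json"},
                      timeout=10)

query = (spark.readStream.format("rate").option("rowsPerSecond", 5).load()
         .writeStream
         .foreachBatch(post_batch)
         .option("checkpointLocation", "/tmp/checkpoints/rest-demo")
         .start())
```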
Hi everyone!
We are redesigning the Move File and Clone File experiences. We want to make it as seamless as possible to organize your files, and would love your feedback on the designs!
Move File:
Move Option 1:
Move Option 2:
Clone File:
Cl...
Based on my experience with data partitioning, it often diminishes performance rather than enhancing it. There are exceptions, like when handling tables over 1 TB, or when EVERY single query filters on the partition column in the WHERE clause - for instance, a Pow...
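To make that exception concrete, here is a minimal sketch of partition pruning, assuming a Databricks runtime where Delta is available; the table and column names are illustrative:

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("partition-demo").getOrCreate()

# Toy table partitioned on the column every query filters by.
df = (spark.range(1_000_000)
      .withColumn("event_date",
                  F.expr("date_add('2024-01-01', cast(id % 90 as int))")))
(df.write.format("delta")
   .partitionBy("event_date")
   .mode("overwrite")
   .saveAsTable("events"))

# Because the WHERE clause hits the partition column, Spark reads only
# the matching partition directory (partition pruning).
spark.sql("SELECT count(*) FROM events WHERE event_date = '2024-03-01'").show()
```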
Want to increase your Databricks knowledge? Look no further! Here’s a guide filled with key resources you’ll need while working on the Databricks Data Intelligence Platform. Bookmark these pages for future reference, or apply these learnings during the...
We are excited to share some of the latest updates in Databricks Notebooks. From the AI-powered Databricks Assistant that automates code development to new charts with better performance, these features help you build faster.
See the latest features live...
Thousands of Databricks customers use Databricks Workflows every day to orchestrate business-critical workloads on the Databricks Lakehouse Platform. A great way to simplify those critical workloads is through modular orchestration.
This is now possi...
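As a sketch of what modular orchestration can look like, a parent job can delegate to an existing job through the Jobs API 2.1 run_job_task; the host, token, and child job_id below are placeholders:

```python
import requests

HOST = "https://<workspace>.cloud.databricks.com"   # placeholder workspace URL
TOKEN = "<personal-access-token>"                    # placeholder credential

payload = {
    "name": "parent-orchestrator",
    "tasks": [
        {
            "task_key": "run_child_pipeline",
            # Delegates to an existing job, keeping each pipeline modular.
            "run_job_task": {"job_id": 123},         # placeholder child job id
        }
    ],
}

resp = requests.post(f"{HOST}/api/2.1/jobs/create",
                     headers={"Authorization": f"Bearer {TOKEN}"},
                     json=payload)
resp.raise_for_status()
print(resp.json())  # e.g. {"job_id": 456}
```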
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.