Community Articles

by Tushar_Parekar • Databricks Employee

14 hours ago

48 Views
0 replies
0 kudos

Learning Series | Advanced Machine Learning Operations

Databricks Academy offers the free Advanced Machine Learning Operations course to help machine learning practitioners understand how to run ML projects more reliably at scale on Databricks. As the second course in the Advanced Machine Learning serie...

by Agre_Celebal • New Contributor II

yesterday

122 Views
0 replies
0 kudos

Handling Sensor Dropout in IoT Pipelines: A Quarantine Pattern with Lakeflow Declarative Pipelines

Why Your Solar Forecasting Model Doesn't Trust Every Zero. A data quality pattern for sensor dropout at IoT scale, using Lakeflow Declarative Pipelines (formerly DLT). A solar panel producing zero output at 2pm on a clear day is a maintenance ticket. A s...

by Tushar_Parekar • Databricks Employee

yesterday

141 Views
0 replies
0 kudos

Solution Accelerator Series | Social Determinants of Health

Driving seamless access to data on social determinants of health is important for helping healthcare professionals better understand health inequities across social groups. The Social Determinants of Health Solution Accelerator shows how healthcare d...

by rishav_sharma • New Contributor III

Monday

161 Views
0 replies
0 kudos

How Databricks Unity Catalog Business Semantics creates a governed layer for metrics

The problem: technically correct, but still inconsistent. Most analytics teams eventually encounter the same frustrating pattern: one dashboard reports revenue at 10.2M, another at 10.6M, and a spreadsheet says 10.4M. Each result may be technically def...

by ericka-lorenz • New Contributor III

Sunday

259 Views
1 reply
1 kudos

Databricks Partner Tiers Explained for the People Who Build the Delivery

Databricks swapped the old partner labels this year; Registered / Select / Elite / Global Elite are gone, replaced by Bronze, Silver, Gold, and Platinum. Most write-ups on this are aimed at buyers choosing a vendor. I wanted to flip it for the people...

Latest Reply

Louis_Frolio
Databricks Employee

Monday

1 kudos

@ericka-lorenz, thanks for this helpful practitioner-oriented summary. I feel it is important to add a couple of small footnotes. Based on the public Databricks partner-program pages, certifications clearly matter, though partner tiering seems to run o...

by VamsiDatabricks • New Contributor II

10-14-2025 9:44:54 AM

672 Views
1 reply
1 kudos

Validating pointer-based Delta comparison architecture using flatMapGroupsWithState in Structured St

Hi everyone, I’m leading an implementation where we’re comparing events from two real-time streams, a Source and a Target, in Databricks Structured Streaming (Scala). Our goal is to identify and emit “delta” differences between corresponding records ...

Latest Reply

iyashk-DB
Databricks Employee

Sunday

1 kudos

The overall pattern is sound, but there are a few real production risks worth calling out. Delta point reads inside the state function are your biggest bottleneck. When flatMapGroupsWithState (or its replacement transformWithState) fetches a JSON fro...

by AmitDECopilot • Contributor

Saturday

370 Views
0 replies
0 kudos

Databricks Performance Optimization: What Changed, What Still Matters, and What Should Be Automated

Performance optimization in Databricks used to follow a familiar playbook: Partition large Delta tables. Compact small files with OPTIMIZE. Apply ZORDER BY on frequently filtered columns. Run VACUUM. Collect statistics. Increase cluster size when queries r...

by Brahmareddy • Esteemed Contributor II

Thursday

193 Views
0 replies
1 kudos

Small POCs Can Become Big Data + AI Solutions

Dear Databricks Community, One thing I have learned from my data engineering experience is that big solutions do not always start big. Many times, they start with one simple question. Can we make this easier? Can we make this faster? Can we help someone m...

by TriambakR • New Contributor

Thursday

342 Views
3 replies
9 kudos

Medallion Architecture in Practice: The Design Decisions Nobody Puts in the Diagram

Every Lakehouse conversation eventually shows the same three boxes: Bronze, Silver, Gold. It's a great mental model, but on a real enterprise migration, the diagram i...

Latest Reply

Sbm_dracarys
New Contributor

Thursday

9 kudos

Great work, man! Even though I don't know much about this field, your article made me curious and motivated me to read more about it. Thanks for sharing such valuable insights. Keep posting!

2 More Replies

by TriambakR • New Contributor

Thursday

254 Views
1 reply
3 kudos

Not Every Pipeline Needs DLT — Here's How We Decided

Why We Used Delta Live Tables for Exactly One Pipeline (and Not for Silver/Gold). "Why isn't DLT used everywhere?" is one of the first questions I get when reviewing this architecture with other engineers, so I figured it was worth writing up properly...

Latest Reply

sasidharan_gs
New Contributor

Thursday

3 kudos

I must admit, this was a nagging question I never asked as a junior data engineer and have just taken for granted ever since. Thanks for addressing an oversight I didn't even know I had!

by TriambakR • New Contributor

Thursday

84 Views
0 replies
0 kudos

The Boring Truth Behind a 60% Speedup

How We Cut Refresh Time and Compute Cost by ~60%: the Mechanics, Not Just the Headline. "We reduced compute costs by 60%" is a great line in a slide deck and a mostly useless one in a technical forum, because it doesn't tell you why. Here's the actual...

by WiliamRosa • Databricks Partner

08-23-2025 4:09:10 AM

13676 Views
6 replies
8 kudos

Resolved! Databricks Machine Learning Professional Preparation

Recently I earned the Databricks Machine Learning Professional certification and wanted to share my study journey. Before the exam, I worked on a project as a data engineer alongside data scientists (ML models, LLMs, MLflow). That led me to build a p...

Latest Reply

tanek1
New Contributor

Thursday

8 kudos

Congratulations on earning the Databricks Machine Learning Professional certification! Your RAG project breakdown is genuinely helpful because it connects the concepts with real hands-on practice. Along with the official guide and labs, I also found ...

5 More Replies

by KrisJohannesen • Valued Contributor

05-26-2026 1:44:16 AM

2583 Views
4 replies
2 kudos

Introduction to Metric Views (part 1 of 3)

This is part 1 of 3 in a series where I take you through working with Metric Views. Part 1: Introduction to Metric Views. Part 2: Metric Views and the Databricks platform (AI/BI Dashboards, Genie, etc.). Part 3: Metric Views with Power BI and Tabular Edit...

Latest Reply

cartergray70543
New Contributor II

Thursday

2 kudos

Great introduction! A centralized semantic layer with Metric Views should make metrics more consistent and easier to govern across Databricks. Looking forward to seeing the real-world integrations in Part 2.

3 More Replies

by AngelShrestha • Databricks Partner

a month ago

5112 Views
5 replies
4 kudos

Getting Certified as a Databricks Generative AI Engineer Associate: Key Takeaways and Insights

I just earned my Databricks Certified Generative AI Engineer Associate Certification, and in this post, I’m sharing the key tips and resources, including what confused me, what actually worked, and the traps I nearly fell into. Why I Took This Exam: I...

Latest Reply

Mahesh18
New Contributor II

a week ago

4 kudos

Thanks for sharing. Do you know what the passing score is to clear the exam?

4 More Replies

by Rishabh_Tiwari • Community Manager

a week ago

656 Views
0 replies
3 kudos

How to Optimize Your Content for GEO: Best Practices for Writing Discoverable Community Content

Help Your Knowledge Reach More People. Every Technical Blog, Community Article, MVP article, or answer you share can help someone beyond the Databricks Community. Small improvements in how you write can make your content easier to discover through C...

Databricks Community

Forum Posts

How would you design a Spark pipeline to process b...

Refresh PBI Dataset is consuming unnecessary compu...

CI/CD on Databricks with Asset Bundles (DABs) and ...

Custom asset bundles file name

Designing a Cost-Efficient Databricks Lakehouse, P...