Community Articles

by AbhaySingh • Databricks Employee

7 hours ago

10 Views
0 replies
0 kudos

Distributed ML on Databricks Serverless

You can now run distributed ML (Spark MLlib in Python, Optuna tuning, MLflow Spark, Joblib Spark) on serverless notebooks/jobs and on standard clusters, not just dedicated ML clusters.It reuses the same Unity Catalog + Lakeguard stack you already use...

Community Articles

Reply

10 Views
0 replies
0 kudos

7 hours ago

by AbhaySingh • Databricks Employee

yesterday

45 Views
0 replies
0 kudos

Why Your Delta Lake MERGE Takes Forever (And How to Fix It)

Last month, our nightly CDC pipeline started timing out. What used to complete in 20 minutes was now crawling past the 4-hour mark—and failing. The culprit? A MERGE statement against a 2.3TB Delta table with 800 million rows that had grown steadily o...

Community Articles

Reply

45 Views
0 replies
0 kudos

yesterday

by Sourabh_13 • New Contributor II

yesterday

87 Views
0 replies
0 kudos

A Governance-First Unified Namespace:Why Manufacturers Need Unity Catalog to Scale Industry 4.0 Data

The Industrial Data ChallengeManufacturing enterprises today operate with a fundamental paradox: they're drowning in data yet starving for insights. A typical plant generates terabytes of information daily across dozens of systems—from shop floor PLC...

Community Articles

Reply

87 Views
0 replies
0 kudos

yesterday

by Abhilash_P • New Contributor II

2 weeks ago

440 Views
3 replies
4 kudos

Metric Views in Databricks: A Unified Approach to Business Metrics

Databricks has introduced a powerful feature—Metric Views—that transforms how organizations define, manage, and consume business metrics. Whether you're a data analyst, engineer, or business stakeholder, Metric Views offer a unified, governed, and re...

Community Articles

Reply

440 Views
3 replies
4 kudos

2 weeks ago

View Replies

Latest Reply

BijuThottathil
New Contributor III

Thursday

4 kudos

Any work around to publish MV to power BI workspace?

4 kudos

Thursday

2 More Replies

by BijuThottathil • New Contributor III

Wednesday

96 Views
3 replies
0 kudos

Real-Time SQL Server to Databricks Pipeline Using Debezium, Kafka, and Delta Lake

Check this medium article https://medium.com/@bijumathewt/real-time-sql-server-to-databricks-pipeline-using-debezium-kafka-and-delta-lake-26c3e191ce51?postPublishedType=initial

Community Articles

CDC

Databricks

kafka

Reply

96 Views
3 replies
0 kudos

Wednesday

View Replies

Latest Reply

Raman_Unifeye
Contributor III

Thursday

0 kudos

would you not be able to use Lakeflow Managed connector for SQL Server in this case?

0 kudos

Thursday

2 More Replies

by BijuThottathil • New Contributor III

Wednesday

55 Views
0 replies
2 kudos

🚀 Automating Databricks Unity Catalog Access with AI Agents

Tired of complex governance workflows? We’ve successfully combined the power of multi-agent AI with state-of-the-art orchestration tools to automate Databricks Unity Catalog (UC) access management!This isn’t just a basic script — it’s an intelligent,...

Community Articles

Reply

55 Views
0 replies
2 kudos

Wednesday

by BijuThottathil • New Contributor III

Wednesday

35 Views
0 replies
0 kudos

Real-World LangGraph + CrewAI Application:Intelligent Databricks Alert System with AI Prioritization

https://medium.com/@bijumathewt/real-world-langgraph-crewai-application-intelligent-databricks-alert-system-with-ai-7ba36e23f7a2You will love above article

Community Articles

Reply

35 Views
0 replies
0 kudos

Wednesday

by BijuThottathil • New Contributor III

Wednesday

42 Views
0 replies
0 kudos

Real-World LangGraph + CrewAI Application: Intelligent Databricks Alert System with AI Prioritizatio

You will love this article. Watch it. https://medium.com/@bijumathewt/real-world-langgraph-crewai-application-intelligent-databricks-alert-system-with-ai-7ba36e23f7a2

Community Articles

Reply

42 Views
0 replies
0 kudos

Wednesday

by AbhaySingh • Databricks Employee

Wednesday

73 Views
0 replies
1 kudos

Delta Lake 4.0 in the Real World

Delta Lake 4.0 is the next major open-source release aligned with Spark 4.x, adding first-class Variant for semi-structured data, safer Type Widening, improved DROP FEATURE, better transaction log handling, and a new multi-engine story via Delta Kern...

Community Articles

Reply

73 Views
0 replies
1 kudos

Wednesday

by venkat-raghavan • New Contributor III

Tuesday

112 Views
1 replies
3 kudos

PDFs to Production

Databricks just solved a huge problem - unlocking the value from unstructured data. One of the biggest challenges enterprises face when scaling agents is access to unstructured data. Nearly 80% of enterprise knowledge is trapped in PDFs, reports, and...

Community Articles

Reply

112 Views
1 replies
3 kudos

Tuesday

View Replies

Latest Reply

Advika
Databricks Employee

Wednesday

3 kudos

Thanks for sharing, @venkat-raghavan! ai_parse_document turns documents into governed, queryable assets, it's definitely the puzzle piece all needed.

3 kudos

Wednesday

by AbhaySingh • Databricks Employee

Tuesday

112 Views
1 replies
4 kudos

Python Data Source API

If you’ve ever hacked together a one-off script to pull data from some random API into Spark, you’re exactly who the new Python Data Source API is for. Databricks has made this API generally available on Apache Spark™ 4.0 with Databricks Runtime 1...

Community Articles

Reply

112 Views
1 replies
4 kudos

Tuesday

View Replies

Latest Reply

Raman_Unifeye
Contributor III

Tuesday

4 kudos

I have seen multiple glue jobs pulling data from such systems. This is certainly a solution to simplify and bring governance to it. will look forward to implement it.#Apache-4

4 kudos

Tuesday

by datadude07 • New Contributor II

a week ago

176 Views
0 replies
1 kudos

Databricks Serverless Compute: Zero Infrastructure Management

You spend hours sizing clusters, tuning autoscaling configurations, and optimizing instance types.Clusters sit idle burning money between jobs or take minutes to start when you need them immediately.Managing compute infrastructure becomes a full-time...

Community Articles

Reply

176 Views
0 replies
1 kudos

a week ago

by bianca_unifeye • New Contributor III

a week ago

149 Views
1 replies
3 kudos

Databricks + Power BI

Last night I delivered my final session on Databricks + Power BI integration and it felt like the right moment to close this chapter on a high note. Because it was the last time I was presenting this topic, I rebuilt the flow completely. Instead of ...

Community Articles

Reply

149 Views
1 replies
3 kudos

a week ago

View Replies

Latest Reply

Raman_Unifeye
Contributor III

a week ago

3 kudos

Well done Bianca. Look forward to Fabric session.

3 kudos

a week ago

by mitchellg-db • Databricks Employee

a week ago

266 Views
1 replies
8 kudos

Recap: Data & AI World Tour - Chicago, Nov 19, 2025

Yesterday in Chicago, I attended one of the final stops of the Databricks DAIWT (Data & AI World Tour). This tour brings the excitement and passion of our annual Data + AI Summit to over a dozen cities worldwide, in a condensed format. Packed into ...

Community Articles

Reply

266 Views
1 replies
8 kudos

a week ago

View Replies

Latest Reply

venkat-raghavan
New Contributor III

a week ago

8 kudos

These are great events. I had the opportunity to attend the Toronto event.

8 kudos

a week ago

by Yogesh_Verma_ • Contributor

2 weeks ago

277 Views
1 replies
2 kudos

Databricks Architecture Center

Databricks Architecture Center — Your Blueprint for Building Modern Data & AI PlatformsThe Databricks Architecture Center is a centralized knowledge hub that provides:End-to-end reference architecturesIndustry-specific patternsArchitecture decision ...

Community Articles

Reply

277 Views
1 replies
2 kudos

2 weeks ago

View Replies

Latest Reply

Raman_Unifeye
Contributor III

2 weeks ago

2 kudos

Its very useful. I suppose you missed to provide the link. Here is the link for easy accesshttps://www.databricks.com/resources/architectures

2 kudos

2 weeks ago

Databricks Community

Forum Posts

Distributed ML on Databricks Serverless

Why Your Delta Lake MERGE Takes Forever (And How to Fix It)

A Governance-First Unified Namespace:Why Manufacturers Need Unity Catalog to Scale Industry 4.0 Data

Metric Views in Databricks: A Unified Approach to Business Metrics

Real-Time SQL Server to Databricks Pipeline Using Debezium, Kafka, and Delta Lake

🚀 Automating Databricks Unity Catalog Access with AI Agents

Real-World LangGraph + CrewAI Application:Intelligent Databricks Alert System with AI Prioritization

Real-World LangGraph + CrewAI Application: Intelligent Databricks Alert System with AI Prioritizatio

Delta Lake 4.0 in the Real World

PDFs to Production

Python Data Source API

Databricks Serverless Compute: Zero Infrastructure Management

Databricks + Power BI

Recap: Data & AI World Tour - Chicago, Nov 19, 2025

Databricks Architecture Center

Join Us as a Local Community Builder!

Building an End-to-End ETL Pipeline with Data from...

My First Month Learning Databricks - Key Takeaways...

Unity Catalog Migration Strategy

🚀 Boost Databricks Performance ✅ Lazy Evaluation ...

🚀 DataFrame Caching on Delta Tables - What if und...