cancel
Showing results for 
Search instead for 
Did you mean: 
Community Articles
Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practices. Join the conversation today and unlock a wealth of collective wisdom to enhance your experience and drive success.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

rohan22sri
by New Contributor III
  • 793 Views
  • 0 replies
  • 1 kudos

From Data to Deck — Auto-Generate PowerPoint from Databricks

Creating PowerPoint decks from data is usually manual and repetitive:Export charts → take screenshots → format slides → repeat.Not anymore.You can now generate a complete PowerPoint directly from a Databricks table — in one notebook run. What this do...

ChatGPT Image Apr 9, 2026, 03_26_44 PM.png
  • 793 Views
  • 0 replies
  • 1 kudos
Brahmareddy
by Esteemed Contributor II
  • 1606 Views
  • 2 replies
  • 8 kudos

Congratulations to Matei Zaharia - CTO Databricks on the ACM Prize in Computing

When I saw the news that Matei Zaharia received the 2025 ACM Prize in Computing, I felt genuinely happy. It was not just another award announcement. It felt like a proud moment for the whole data engineering community. His work has helped shape the w...

Image 4-8-26 at 9.27 PM.jpeg
  • 1606 Views
  • 2 replies
  • 8 kudos
Latest Reply
Advika
Community Manager
  • 8 kudos

@Brahmareddy, what a beautiful tribute! It’s so inspiring to hear how that meeting at the Summit stayed with you.We’re so lucky to have contributors like you who recognize the heart behind the tech. Cheers to Matei and the whole Databricks family!

  • 8 kudos
1 More Replies
Brahmareddy
by Esteemed Contributor II
  • 626 Views
  • 1 replies
  • 2 kudos

Austin’s Practical Data + AI Meetup

Austin data community, this one looks worth attending. Databricks DevConnect Austin is happening on Tuesday, April 14, 2026, from 5:00 PM to 9:00 PM at Qubika Office in Austin. It is a technical meetup built for data and AI practitioners who want rea...

  • 626 Views
  • 1 replies
  • 2 kudos
Latest Reply
Sumit_7
Esteemed Contributor
  • 2 kudos

Great opportunity to connect with amazing and like minded minds.Would love to see more such events in India as well, specially in Lucknow, Noida, Bhopal -- Events Team

  • 2 kudos
antoalphi
by New Contributor III
  • 308 Views
  • 0 replies
  • 1 kudos

Solution Proposal (Cost-Optimized Architecture)

Core IdeaDon’t let “1 pipeline = 1 always-on cluster” become your cost trap.Instead, design for controlled parallelism + shared compute + smart grouping. A. Pipeline Sharding Strategy (Not Blind Splitting)Instead of randomly splitting 6,700 tables i...

  • 308 Views
  • 0 replies
  • 1 kudos
VinayKumarB
by Databricks Partner
  • 1894 Views
  • 1 replies
  • 6 kudos

MCP Servers on Databricks

MCP Servers on DatabricksGenerative AI is evolving rapidly, and one of the most exciting developments is standardizing how models interact with external systems. Let me walk you through how we got here and why the Model Context Protocol (MCP)—especia...

Screenshot 2026-04-08 125928.png Screenshot 2026-04-08 125951.png
  • 1894 Views
  • 1 replies
  • 6 kudos
Latest Reply
Sumit_7
Esteemed Contributor
  • 6 kudos

Insightful, detailed and incisive guide @VinayKumarB. Keep posting!

  • 6 kudos
szymon_dybczak
by Esteemed Contributor III
  • 505 Views
  • 2 replies
  • 3 kudos

Notifications for scheduled refreshes - now in Beta

If you’ve ever worked with scheduled refreshes for Materialized Views or Streaming Tables, you probably know this pain. If your DDL-scheduled MV or ST refresh failed, nothing happened. No email, no alert, no indication that your data was stale (until...

schedules_100.png
  • 505 Views
  • 2 replies
  • 3 kudos
Latest Reply
Advika
Community Manager
  • 3 kudos

Welcome back, @szymon_dybczak! That silent failure pain was real. These alerts are a massive win for everyone’s sanity. Thanks for the heads-up!

  • 3 kudos
1 More Replies
shubham_meshram
by Databricks Partner
  • 2971 Views
  • 1 replies
  • 1 kudos

When Did the Data Go Wrong? Using Delta Lake Time Travel for Investigation in Databricks

I. IntroductionData pipelines are the lifeblood of modern data-driven organizations. However, even the most robust pipelines can experience unexpected issues: data corruption, erroneous updates, or sudden data drops. When these problems occur, quickl...

shubham_meshram_0-1743459167949.png
  • 2971 Views
  • 1 replies
  • 1 kudos
Latest Reply
deepakachary9
New Contributor II
  • 1 kudos

Great thought to use delta time travel to determine when data drift starts!But this only works as long as retention policies allow it. With vacuum and stricter runtime enforcement in newer dbx versions, older snapshots may not be there when you need ...

  • 1 kudos
Louis_Frolio
by Databricks Employee
  • 1031 Views
  • 7 replies
  • 6 kudos

What's Your Biggest AI Pet Peeve?

Hey Team, in my last post I asked how much AI has actually changed your day to day, and the responses were fantastic. But let's talk about the other side for a minute. I'll go first — I've started second-guessing almost everything I see on social med...

Screenshot 2026-03-30 at 2.26.48 PM.png
  • 1031 Views
  • 7 replies
  • 6 kudos
Latest Reply
balajij8
Contributor III
  • 6 kudos

AI tools generate code & pipelines that work functionally but ignore efficiency, scalability & cloud implications. I have bumped into belowCode generated by AI does a SQL cross join as its simple for most natural language queries that works but kills...

  • 6 kudos
6 More Replies
Ashwin_DSA
by Databricks Employee
  • 3232 Views
  • 3 replies
  • 9 kudos

Databricks Multi-Table Transactions - Part 1

If you've ever worked on an insurance data warehouse, or really any warehouse where data arrives from different systems at different times, you know the pain of keeping things in sync. I spent years building data warehouses for a property and casual...

claim-wrapup-flow.png before-after-transactions.png Part1 Cover Pic.png
  • 3232 Views
  • 3 replies
  • 9 kudos
Latest Reply
Ashwin_DSA
Databricks Employee
  • 9 kudos

Link to Part 2   

  • 9 kudos
2 More Replies
Ashwin_DSA
by Databricks Employee
  • 686 Views
  • 0 replies
  • 2 kudos

Databricks Multi-Table Transactions - Part 2

In Part 1, we covered why multi-table transactions matter. Now let's build one. We'll create the tables from the claim wrap-up scenario, load sample P&C insurance data, and walk through what happens when the wrap-up succeeds, when it fails, and when...

s1-claim.png s1-wraplog.png s1-reserves.png s2-error.png
  • 686 Views
  • 0 replies
  • 2 kudos
Emil_Kaminski
by Contributor II
  • 17946 Views
  • 3 replies
  • 8 kudos

Materials to pass Databricks Data Engineering Associate Exam

Hi Guys, I have passed it already some time ago, but just recently have summarized all the materials which helped me to do it. Pay special attention to GitHub repository, which contains many great exercises prepared by Databricks teamhttps://youtu.be...

  • 17946 Views
  • 3 replies
  • 8 kudos
Latest Reply
Max_John
New Contributor III
  • 8 kudos

Cleared Databricks Data Engineering Associate recently. Practising real questions helped me a lot, and (Certs Topic) was a reliable resource.

  • 8 kudos
2 More Replies
Ashwin_DSA
by Databricks Employee
  • 1269 Views
  • 1 replies
  • 3 kudos

Solving Multi-Dimension Analytics in Databricks Dashboards with Views and Metric Views

If you've ever built a dashboard where you needed to track the same data across two different date dimensions, you know the frustration. You get the first chart working. You add the second. Then you realise cross-filtering just stopped working. I re...

sample-data.png naive_combined.png view-creation.png view-cumulative.png
  • 1269 Views
  • 1 replies
  • 3 kudos
Latest Reply
Nidhig
Databricks Partner
  • 3 kudos

Thanks for sharing great example with detailed explanation.

  • 3 kudos
Kirankumarbs
by Contributor III
  • 1627 Views
  • 0 replies
  • 1 kudos

How to actually get job_id and run_id in a Databricks Python wheel task (Avoid Hallucinations)

We needed job_id and run_id in a custom metrics Delta table so we could join to `system.lakeflow.job_run_timeline`. Tried four approaches before finding the one that works on serverless compute.What doesn't workspark.conf.get("spark.databricks.job.id...

  • 1627 Views
  • 0 replies
  • 1 kudos
Labels