cancel
Showing results for 
Search instead for 
Did you mean: 
Resources
Explore a comprehensive repository of resources on the Databricks Community. Access tutorials, guides, webinars, and more to enhance your skills in data analytics and machine learning.
cancel
Showing results for 
Search instead for 
Did you mean: 

Browse the Community

Get Started Resources

Explore essential resources to kickstart your journey with Databricks. Access tutorials, guides, and...

0 Posts

Events

Stay updated on Databricks events, including webinars, conferences, and workshops. Discover opportun...

108 Posts

Support FAQs

Find answers to common questions and troubleshoot issues with Databricks support FAQs. Access helpfu...

19 Posts

Technical Blog

Explore in-depth articles, tutorials, and insights on data analytics and machine learning in the Dat...

177 Posts

Knowledge Sharing Hub

Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practi...

123 Posts

Announcements

Stay up-to-date with the latest announcements from Databricks. Learn about product updates, new feat...

94 Posts

DatabricksTV

Community-produced videos to help you leverage Databricks in your Data & AI journey. Tune in to expl...

124 Posts

Activity in Resources

Sujitha
by Databricks Employee
  • 30 Views
  • 0 replies
  • 0 kudos

Jumpstart Your Data Journey with Databricks Get Started Days!

Are you ready to dive into the world of data engineering and analytics? Join us for Databricks Get Started Days, a half-day virtual event designed to accelerate your learning and equip you with essential Databricks skills! Here’s what makes Get Start...

Screenshot 2025-02-18 at 9.25.12 PM.png
  • 30 Views
  • 0 replies
  • 0 kudos
rakhidarshi
by Databricks Employee
  • 3453 Views
  • 4 replies
  • 12 kudos

Performance Tuning using Query Profile

Introduction In the fast-paced world of big data, optimizing performance is critical for maintaining efficiency and reducing costs. Databricks SQL (DBSQL) Warehouse is a robust feature of the Databricks platform that enables data analysts, data engin...

2.png 3.png 4.png 5.png
  • 3453 Views
  • 4 replies
  • 12 kudos
Latest Reply
saurabhmohale25
  • 12 kudos

Very detailed and informative. Thanks @rakhidarshi for sharing!

  • 12 kudos
3 More Replies
yadvendra_ksh
by > New Contributor
  • 33 Views
  • 0 replies
  • 1 kudos

The Hidden Pitfalls of Snowflake to Databricks Migrations

Everyone's rushing their Snowflake to Databricks migration, and they're setting themselves up for failure.After leading multiple enterprise migrations to Databricks last quarter, here's what shocked me: The technical lift isn't the hard part. It's th...

  • 33 Views
  • 0 replies
  • 1 kudos
Emil_Kaminski
by > Contributor
  • 8184 Views
  • 3 replies
  • 3 kudos

Materials to pass Databricks Data Engineering Associate Exam

Hi Guys, I have passed it already some time ago, but just recently have summarized all the materials which helped me to do it. Pay special attention to GitHub repository, which contains many great exercises prepared by Databricks teamhttps://youtu.be...

  • 8184 Views
  • 3 replies
  • 3 kudos
Latest Reply
annawilliam
New Contributor
  • 3 kudos

I almost gave up after failing Databricks-Certified-Data-Engineer-Associate once, but then I found Passexamhub. Their realistic practice tests helped me understand my weak areas and improve. Passed on my second attempt—such a relief!

  • 3 kudos
2 More Replies
Sujitha
by Databricks Employee
  • 1210 Views
  • 0 replies
  • 6 kudos

Introducing SAP Databricks

Today we are announcing a deep partnership with SAP which we think can be game changing for our industry. In short, it is the marriage between the most important business data for enterprises globally (SAP data) and the best data platform in the mark...

Screenshot 2025-02-13 at 9.07.48 PM.png
  • 1210 Views
  • 0 replies
  • 6 kudos
Mahavir_Teraiya
by Databricks Employee
  • 2814 Views
  • 5 replies
  • 28 kudos

Best practices for safe data experimentation with Databricks

In today’s data-centric world, experimentation is essential for developers and data scientists to create cutting-edge models, test hypotheses, and build robust data pipelines. However, giving these teams access to production data raises serious conce...

Screenshot 2024-10-30 at 10.05.06 AM.png
  • 2814 Views
  • 5 replies
  • 28 kudos
Latest Reply
Flaviodiasps
New Contributor II
  • 28 kudos

Great post!I went after more reading while reading each topics and I would like to add a few things here1. AnonymizationI wouldn't use uuid() like this.Using a hashing function would be better to ensure consistency across multiple runs.  F.sha2(F.con...

  • 28 kudos
4 More Replies
Rjt_de
by Databricks Employee
  • 55906 Views
  • 18 replies
  • 32 kudos

Metadata-Driven ETL Framework in Databricks (Part-1)

In modern data-driven enterprises, data flows like lifeblood through complex systems and repositories to drive decision-making and innovation. Each dataset, whether structured or unstructured, holds the potential to unlock insights and drive innovati...

ETL_Framework - E2EDesign (1).png ETL_Framework_ER.jpeg bronze.jpg silver.jpg
  • 55906 Views
  • 18 replies
  • 32 kudos
Latest Reply
Flaviodiasps
New Contributor II
  • 32 kudos

It is a great article, I am excited for the next parts.I am not sure about having these metadata tables in the lakehouse. It forces us to build a data pipeline for the metadata table. Isn't it better to just use a transactional database like mongo or...

  • 32 kudos
17 More Replies
Sai_Ponugoti
by Databricks Employee
  • 3010 Views
  • 2 replies
  • 5 kudos

Private Cross-Cloud Delta Sharing with Serverless Compute

As organizations increasingly adopt multi-cloud strategies to leverage the unique strengths of various cloud platforms, they face the dual challenge of maintaining robust security while enabling efficient data sharing. Balancing accessibility with p...

delta-sharing-primary-lockup-full-color-black-rgb-4000x660-2f69222.png Screenshot 2025-01-15 at 15.06.03.png image (10).png image (11).png
  • 3010 Views
  • 2 replies
  • 5 kudos
Latest Reply
Sai_Ponugoti
Databricks Employee
  • 5 kudos

Thank you for your feedback @Mantsama4!! Our solution is designed to tackle both B2B and Line of Business sharing by combining robust security with flexible, region-specific deployment. For B2B scenarios, we ensure that external partners access data ...

  • 5 kudos
1 More Replies
dkushari
by Databricks Employee
  • 881 Views
  • 0 replies
  • 2 kudos

How to Read Unity Catalog Tables from Trino via Open APIs

Databricks Unity Catalog (UC) is the industry’s only unified and open governance solution for data and AI, built into the Databricks Data Intelligence Platform. Unity Catalog provides a single source of truth for your organization’s data and AI asset...

dkushari_0-1739204177100.png dkushari_0-1739204906298.png dkushari_2-1739204300221.png dkushari_5-1739204516135.png
  • 881 Views
  • 0 replies
  • 2 kudos
tj-cycyota
by Databricks Employee
  • 343 Views
  • 0 replies
  • 1 kudos

Ray Monitoring Made Easy: Prometheus & Grafana with Ray on Databricks Clusters

Intro Ray is rapidly becoming the standard for logic-parallel computing, enabling many Databricks customers to accelerate a wide range of Python workloads. Since its general availability on Databricks in early 2024, Ray on Databricks has opened up ne...

tjcycyota_0-1738604153886.png tjcycyota_1-1738604203734.png tjcycyota_2-1738604269261.png tjcycyota_3-1738604284216.png
  • 343 Views
  • 0 replies
  • 1 kudos
MuraliTalluri
by Databricks Employee
  • 1956 Views
  • 1 replies
  • 4 kudos

Deep Dive - Streaming Deduplication

In this article we will cover in depth about streaming deduplication using watermarking with dropDuplicates and dropDuplicatesWithinWatermark, how they are different. This blog expects you to have a good understanding on how watermarking works in Spa...

MuraliTalluri_0-1736447301690.png MuraliTalluri_1-1736447301693.png MuraliTalluri_2-1736447301693.png MuraliTalluri_3-1736447301693.png
  • 1956 Views
  • 1 replies
  • 4 kudos
Latest Reply
Muhammad_Umer
New Contributor III
  • 4 kudos

Hi @MuraliTalluri,Thank you for such a detailed article.I am following dropDuplicatesWithinWatermark with the same steps as yours. The only difference is that I am using autoloader and reading CSV files as the source and writing data to a Delta table...

  • 4 kudos
MohanaBasak
by Databricks Employee
  • 1068 Views
  • 3 replies
  • 4 kudos

Unlocking Cost and Performance Insights in Databricks with Custom Dashboards

As organizations continue to scale their data infrastructure, efficient resource utilization, cost control, and operational transparency are paramount for success. With the growing adoption of Databricks, monitoring and optimizing compute usage and d...

Screenshot 2024-12-15 at 5.20.05 PM.png Screenshot 2024-12-15 at 4.56.02 PM.png Screenshot 2024-12-15 at 5.11.08 PM.png Screenshot 2024-12-15 at 5.06.35 PM.png
  • 1068 Views
  • 3 replies
  • 4 kudos
Latest Reply
Mantsama4
Contributor III
  • 4 kudos

Thank you Mohana for sharing the detail, really appreciate it.

  • 4 kudos
2 More Replies
Ajay-Pandey
by > Esteemed Contributor III
  • 821 Views
  • 1 replies
  • 1 kudos

📊 Simplifying CDC with Databricks Delta Live Tables & Snapshots 📊

In the world of data integration, synchronizing external relational databases (like Oracle, MySQL) with the Databricks platform can be complex, especially when Change Data Feed (CDF) streams aren’t available. Using snapshots is a powerful way to mana...

Pull-Based Snapshots.png
  • 821 Views
  • 1 replies
  • 1 kudos
Latest Reply
BilalHaniff1
New Contributor II
  • 1 kudos

Hi AjayCan apply changes into snapshot handle re-processing of an older snapshot? UseCase:- Source has delivered data on day T, T1 and T2.  - Consumers realise there is an error on the day T data, and make a correction in the source. The source redel...

  • 1 kudos
kamalendubiswas
by Databricks Employee
  • 94842 Views
  • 6 replies
  • 30 kudos

Top 10 query performance tuning tips for Databricks Serverless SQL

Databricks Serverless SQL (DBSQL) is the latest offering from Databricks to build data warehouses on the Lakehouse. It incorporates all the Lakehouse features like open format, unified analytics, and collaborative platforms across the different data ...

Image 04-09-2023 at 17.27.jpeg _2370d7a7-e70c-4692-a487-a84f2d2be932.jpeg
  • 94842 Views
  • 6 replies
  • 30 kudos
Latest Reply
Mantsama4
Contributor III
  • 30 kudos

This is a great solution! The post provides an in-depth, structured approach to optimizing Databricks SQL Serverless, highlighting key tips such as resource optimization, query performance improvements, and the best practices for data types and cachi...

  • 30 kudos
5 More Replies
ChsAIkrishna
by > New Contributor III
  • 541 Views
  • 1 replies
  • 2 kudos

Consideration Before Migrating Hive Tables to Unity Catalog

Databricks recommends four methods to migrate Hive tables to Unity Catalog, each with its pros and cons. The choice of method depends on specific requirements.SYNC: A SQL command that migrates schema or tables to Unity Catalog external tables. Howeve...

highresrollsafe piz.PNG
  • 541 Views
  • 1 replies
  • 2 kudos
Latest Reply
Mantsama4
Contributor III
  • 2 kudos

This is a great solution! The post effectively outlines the methods for migrating Hive tables to Unity Catalog while emphasizing the importance of not just performing a simple migration but transforming the data architecture into something more robus...

  • 2 kudos
Top Kudoed Authors