cancel
Showing results for 
Search instead for 
Did you mean: 
Community Articles
Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practices. Join the conversation today and unlock a wealth of collective wisdom to enhance your experience and drive success.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

Coffee77
by Contributor III
  • 376 Views
  • 0 replies
  • 2 kudos

🚀 Spark Caching vs Databricks Disk Caching

As promised @BS_THE_ANALYST , in this new video and summarized in post, I try to explain what Spark Caching and Databricks Disk Caching are and how Caching strategy can be leveraged by making these cool features work together: Spark Caching vs Databr...

Coffee77_1-1756394420204.png Coffee77_0-1756394294274.jpeg
  • 376 Views
  • 0 replies
  • 2 kudos
sandy311
by New Contributor III
  • 326 Views
  • 0 replies
  • 1 kudos

Parallel Model Training & Data Pipelines on Databricks (ForEach Tasks+ Asset Bundles + Pydantic)

As companies double down on machine learning (ML), one thing is obvious: a single model can’t solve every problem. Different datasets, different timelines, and different requirements make managing multiple models pretty tricky. And if you’ve ever wor...

sandy311_0-1756395092044.png sandy311_1-1756395091510.png
Community Articles
ForEach
jobs
MLOPS
pydantic
python
  • 326 Views
  • 0 replies
  • 1 kudos
WiliamRosa
by Contributor III
  • 2185 Views
  • 3 replies
  • 7 kudos

Resolved! Data Quality with PySpark and Great Expectations on Databricks

Data governance is one of the most important pillars in any modern architecture. When building pipelines that process data at scale, ensuring data quality is not just a best practice—it is a critical necessity.Tools like Great Expectations (GX) were ...

  • 2185 Views
  • 3 replies
  • 7 kudos
Latest Reply
BR_DatabricksAI
Contributor III
  • 7 kudos

@WiliamRosaWiliamRosa:  Thanks for sharing the link. I will explore. 

  • 7 kudos
2 More Replies
nayan_wylde
by Esteemed Contributor
  • 639 Views
  • 0 replies
  • 1 kudos

Tracking Query History and Optimizing Queries in Databricks

Optimizing queries in Databricks isn’t just about adding indexes or tweaking SQL syntax — it’s about visibility. You can’t improve what you can’t measure. Fortunately, Databricks provides rich telemetry around query history that you can use to analyz...

  • 639 Views
  • 0 replies
  • 1 kudos
Isi
by Honored Contributor III
  • 1010 Views
  • 1 replies
  • 4 kudos

[Blog] Databricks Serverless vs Classic: Who Wins the Cost Sprint?

Hi everyone! I wanted to share with you a post I wrote on Medium a while ago — it’s still very useful if you want to understand how to properly calculate Databricks cluster costs and get a realistic view of the differences: Databricks Serverless vs C...

  • 1010 Views
  • 1 replies
  • 4 kudos
Latest Reply
Coffee77
Contributor III
  • 4 kudos

Really interesting topic I'll take a look when possible Always interested in improving performance and saving cloud costs. Thanks for sharing.

  • 4 kudos
Coffee77
by Contributor III
  • 804 Views
  • 2 replies
  • 3 kudos

Resolved! 🚀 Boost Databricks Performance ✅ Lazy Evaluation and Spark DataFrame Caching ☕

Published new video in my recently created youtube channel about one of my favorite topics: performance.  So, here is a new video whose goal is to explain clearly what lazy evaluation is and how to use caching to boost performance.   https://studio.y...

CAFE77_4-1756040519167.png
  • 804 Views
  • 2 replies
  • 3 kudos
Latest Reply
Coffee77
Contributor III
  • 3 kudos

Not sure if previous link works fine, here is the correct one https://youtu.be/pLGAr1VXQSQ?si=QaKzaNfzrNl_0Tv9

  • 3 kudos
1 More Replies
Coffee77
by Contributor III
  • 773 Views
  • 4 replies
  • 3 kudos

Resolved! Introduction to Databricks for Beginners Video

Here is the first episode in ENGLISH VERSION of a list of simple videos on Introduction to Databricks for beginners:Introduction to Databricks for Beginners - Episode 1 It contains previous and basic concepts to master before moving forward with Data...

  • 773 Views
  • 4 replies
  • 3 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 3 kudos

Hi @Coffee77 ,Thanks for sharing with us. It looks promising. Can't wait for another episode

  • 3 kudos
3 More Replies
snehamore811
by New Contributor III
  • 1464 Views
  • 4 replies
  • 7 kudos

Resolved! Databricks for RAG: Build, Run, Evaluate

What is RAG?RAG (Retrieval-Augmented Generation) on Databricks refers to building and running AI applications that combine:Retrieval systems (like vector databases or search over documents)Generative AI models (such as LLMs like GPT)within the Databr...

  • 1464 Views
  • 4 replies
  • 7 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 7 kudos

Thanks for sharing @snehamore811 

  • 7 kudos
3 More Replies
WiliamRosa
by Contributor III
  • 6820 Views
  • 5 replies
  • 8 kudos

Resolved! Databricks Machine Learning Professional Preparation

Recently I earned the Databricks Machine Learning Professional certification and wanted to share my study journey. Before the exam, I worked on a project as a data engineer alongside data scientists (ML models, LLMs, MLflow). That led me to build a p...

WiliamRosa_0-1755947321744.png
  • 6820 Views
  • 5 replies
  • 8 kudos
Latest Reply
WiliamRosa
Contributor III
  • 8 kudos

Thanks a lot, my friend @BS_THE_ANALYST ! Really glad you found it useful . I’m sure when you dive into ML later this year, you’ll do awesome things with it. Appreciate the kind words about the project — means a lot! All the best to you too, and let’...

  • 8 kudos
4 More Replies
Coffee77
by Contributor III
  • 960 Views
  • 4 replies
  • 3 kudos

Resolved! Introduction to Databricks 🇪🇸

Here is the first episode of a serie of simple videos on Introduction to Databricks for beginners :https://youtu.be/kvglz79Ob-M?si=KnyCH74_HQ8jiO7SIt contains previous and basic concepts to master before moving forward with Databricks.

  • 960 Views
  • 4 replies
  • 3 kudos
Latest Reply
Coffee77
Contributor III
  • 3 kudos

English version is ready : INTRODUCTION to DATABRICKS in English - Episode 1 ‌ 

  • 3 kudos
3 More Replies
nayan_wylde
by Esteemed Contributor
  • 1132 Views
  • 1 replies
  • 3 kudos

Databricks Free Edition: The Announcement from Data + AI Summit 2025

The Data + AI Summit 2025 delivered several groundbreaking announcements, but none were more democratizing than the launch of the new Databricks Free Edition. Announced alongside a massive $100 million investment in training, this new offering provid...

  • 1132 Views
  • 1 replies
  • 3 kudos
Latest Reply
Advika
Databricks Employee
  • 3 kudos

Nicely outlined, @nayan_wylde! 

  • 3 kudos
NandiniN
by Databricks Employee
  • 29905 Views
  • 3 replies
  • 2 kudos

Databricks Community Edition Login - Sign Up/Sign In/Forgot Password

Sign Up Go to https://www.databricks.com/try-databricksFill in the 2 steps box on the right hand side  Note - It is important to select the Personal use section in the above step. Sign In Enter your details here https://accounts.cloud.databricks.com/...

Screenshot 2024-06-02 at 02.32.46.png Screenshot 2024-06-02 at 02.28.30.png Screenshot 2024-06-02 at 02.29.29.png Screenshot 2024-06-02 at 02.30.23.png
  • 29905 Views
  • 3 replies
  • 2 kudos
Latest Reply
siva-Manohar
New Contributor II
  • 2 kudos

Hi,I cannot signup to Community edition. When I try to sign up using this link https://www.databricks.com/try-databricks it first shows this pop up, non of these two options allows me to signup for community edition.  I don't find option 'get started...

  • 2 kudos
2 More Replies
WiliamRosa
by Contributor III
  • 856 Views
  • 5 replies
  • 7 kudos

Resolved! Generating a PostgreSQL Table Schema for ETL in Databricks

In a data migration project, I needed to generate the schema of a PostgreSQL table to use in my ETL process. I’d like to share the code snippet in case someone else needs it one day:from pyspark.sql import SparkSession import json import os from typi...

WiliamRosa_0-1755612791624.png
  • 856 Views
  • 5 replies
  • 7 kudos
Latest Reply
WiliamRosa
Contributor III
  • 7 kudos

tks so much

  • 7 kudos
4 More Replies
WiliamRosa
by Contributor III
  • 634 Views
  • 1 replies
  • 0 kudos

Resolved! Automating Notebook Documentation in Databricks with LLMs

In one of my projects, I needed to generate structured documentation for an entire directory of Databricks notebooks.This solution uses the Databricks Workspace API together with a Serving Endpoint (LLM) to automatically create HTML documentation for...

  • 634 Views
  • 1 replies
  • 0 kudos
Latest Reply
WiliamRosa
Contributor III
  • 0 kudos

Suggestions are always welcome — I hope this helps anyone looking to automate notebook documentation in Databricks.

  • 0 kudos
BS_THE_ANALYST
by Esteemed Contributor III
  • 404 Views
  • 2 replies
  • 7 kudos

Data Security at the level of columns or rows or Data masking

Hi everyone, I'm currently going through the Data Analyst learning path. I've just learned about Dynamic Views and I wanted to share the article on them: https://docs.databricks.com/aws/en/views/dynamic#before-you-begin There are some limitations on ...

BS_THE_ANALYST_0-1755383776603.png
  • 404 Views
  • 2 replies
  • 7 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 7 kudos

@BS_THE_ANALYST  Cool stuff, right!  Have read about Attribute-based Access Control (ABAC) yet? Check it out: https://docs.databricks.com/aws/en/data-governance/unity-catalog/abac/ Let me know what you think. Cheers, Louis.

  • 7 kudos
1 More Replies

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels