- 13292 Views
- 3 replies
- 7 kudos
Resolved! Data Quality with PySpark and Great Expectations on Databricks
Data governance is one of the most important pillars in any modern architecture. When building pipelines that process data at scale, ensuring data quality is not just a best practice; it is a critical necessity. Tools like Great Expectations (GX) were ...
@WiliamRosa: Thanks for sharing the link. I will explore it.
- 3486 Views
- 0 replies
- 1 kudos
Tracking Query History and Optimizing Queries in Databricks
Optimizing queries in Databricks isn’t just about adding indexes or tweaking SQL syntax — it’s about visibility. You can’t improve what you can’t measure. Fortunately, Databricks provides rich telemetry around query history that you can use to analyz...
- 3514 Views
- 1 reply
- 4 kudos
[Blog] Databricks Serverless vs Classic: Who Wins the Cost Sprint?
Hi everyone! I wanted to share with you a post I wrote on Medium a while ago — it’s still very useful if you want to understand how to properly calculate Databricks cluster costs and get a realistic view of the differences: Databricks Serverless vs C...
Really interesting topic. I'll take a look when possible. Always interested in improving performance and saving cloud costs. Thanks for sharing.
- 2229 Views
- 2 replies
- 3 kudos
Resolved! 🚀 Boost Databricks Performance ✅ Lazy Evaluation and Spark DataFrame Caching ☕
I published a new video on my recently created YouTube channel about one of my favorite topics: performance. Its goal is to explain clearly what lazy evaluation is and how to use caching to boost performance. https://studio.y...
Not sure if the previous link works; here is the correct one: https://youtu.be/pLGAr1VXQSQ?si=QaKzaNfzrNl_0Tv9
- 2182 Views
- 4 replies
- 3 kudos
Resolved! Introduction to Databricks for Beginners Video
Here is the first episode, in its ENGLISH VERSION, of a series of simple videos introducing Databricks to beginners: Introduction to Databricks for Beginners - Episode 1. It contains prerequisite and basic concepts to master before moving forward with Data...
Hi @Coffee77, thanks for sharing with us. It looks promising. Can't wait for another episode.
- 3686 Views
- 4 replies
- 7 kudos
Resolved! Databricks for RAG: Build, Run, Evaluate
What is RAG? RAG (Retrieval-Augmented Generation) on Databricks refers to building and running AI applications that combine: retrieval systems (like vector databases or search over documents) and generative AI models (such as LLMs like GPT) within the Databr...
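The excerpt above describes RAG as retrieval combined with generation. A minimal, library-free sketch of that flow could look like the following. The toy documents, the bag-of-words scoring, and the function names (`retrieve`, `build_prompt`) are illustrative assumptions, not the post author's method or any Databricks API; a real setup would typically replace the scoring with a vector index and send the resulting prompt to an LLM.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question: str, documents: list[str], k: int = 2) -> list[str]:
    """Retrieval step: return the k documents most similar to the question."""
    q_vec = Counter(tokenize(question))
    ranked = sorted(
        documents,
        key=lambda doc: cosine_similarity(q_vec, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Augmentation step: put the retrieved passages in front of the question."""
    context_block = "\n".join(f"- {passage}" for passage in context)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context_block}\n"
        f"Question: {question}"
    )


# Hypothetical toy corpus, used only to make the sketch runnable.
DOCS = [
    "The cafeteria opens at 8 am.",
    "The library closes at 6 pm.",
    "Parking is free on weekends.",
]

question = "When does the library close?"
top = retrieve(question, DOCS, k=1)
prompt = build_prompt(question, top)
# The generation step (sending `prompt` to an LLM) is intentionally left out.
print(prompt)
```

Keeping retrieval and prompt construction as separate functions makes it straightforward to swap either piece independently, for example the similarity function or the prompt template.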
Thanks for sharing @snehamore811
- 11528 Views
- 5 replies
- 8 kudos
Resolved! Databricks Machine Learning Professional Preparation
Recently I earned the Databricks Machine Learning Professional certification and wanted to share my study journey. Before the exam, I worked on a project as a data engineer alongside data scientists (ML models, LLMs, MLflow). That led me to build a p...
Thanks a lot, my friend @BS_THE_ANALYST! Really glad you found it useful. I’m sure when you dive into ML later this year, you’ll do awesome things with it. Appreciate the kind words about the project; it means a lot! All the best to you too, and let’...
- 2211 Views
- 4 replies
- 3 kudos
Resolved! Introduction to Databricks 🇪🇸
Here is the first episode of a series of simple videos on Introduction to Databricks for beginners: https://youtu.be/kvglz79Ob-M?si=KnyCH74_HQ8jiO7S It contains prerequisite and basic concepts to master before moving forward with Databricks.
The English version is ready: INTRODUCTION to DATABRICKS in English - Episode 1
- 2783 Views
- 1 reply
- 3 kudos
Databricks Free Edition: The Announcement from Data + AI Summit 2025
The Data + AI Summit 2025 delivered several groundbreaking announcements, but none were more democratizing than the launch of the new Databricks Free Edition. Announced alongside a massive $100 million investment in training, this new offering provid...
- 45296 Views
- 3 replies
- 2 kudos
Databricks Community Edition Login - Sign Up/Sign In/Forgot Password
Sign Up: Go to https://www.databricks.com/try-databricks and fill in the 2-step box on the right-hand side. Note: It is important to select the Personal use section in the above step. Sign In: Enter your details here https://accounts.cloud.databricks.com/...
Hi, I cannot sign up for Community Edition. When I try to sign up using this link https://www.databricks.com/try-databricks it first shows this pop-up, and neither of the two options allows me to sign up for Community Edition. I don't find option 'get started...
- 1701 Views
- 5 replies
- 7 kudos
Resolved! Generating a PostgreSQL Table Schema for ETL in Databricks
In a data migration project, I needed to generate the schema of a PostgreSQL table to use in my ETL process. I’d like to share the code snippet in case someone else needs it one day: from pyspark.sql import SparkSession import json import os from typi...
- 2309 Views
- 1 reply
- 0 kudos
Resolved! Automating Notebook Documentation in Databricks with LLMs
In one of my projects, I needed to generate structured documentation for an entire directory of Databricks notebooks. This solution uses the Databricks Workspace API together with a Serving Endpoint (LLM) to automatically create HTML documentation for...
Suggestions are always welcome — I hope this helps anyone looking to automate notebook documentation in Databricks.
- 779 Views
- 2 replies
- 7 kudos
Data Security at the level of columns or rows or Data masking
Hi everyone, I'm currently going through the Data Analyst learning path. I've just learned about Dynamic Views and I wanted to share the article on them: https://docs.databricks.com/aws/en/views/dynamic#before-you-begin There are some limitations on ...
@BS_THE_ANALYST Cool stuff, right! Have you read about Attribute-based Access Control (ABAC) yet? Check it out: https://docs.databricks.com/aws/en/data-governance/unity-catalog/abac/ Let me know what you think. Cheers, Louis.
- 1276 Views
- 0 replies
- 2 kudos
Databricks Asset Bundles with Python!
Databricks Asset Bundles now support Python! If you’re a Python fan, you can define jobs and pipelines in Python (or keep YAML) with DABs. You can create jobs from simple metadata, modify them during deployment with mutators, and convert existing jobs...
- 3108 Views
- 0 replies
- 1 kudos
Lakehouse vs. Lakehouse Federation - Bridging the Next Evolution in Data Platforms
Over the past few years, the Lakehouse architecture has become the gold standard for managing modern data workloads. By combining the low-cost storage of data lakes with the reliability and performance of data warehouses, Lakehouses have redefined ho...