- 4738 Views
- 6 replies
- 1 kudos
Databricks App Availability
Hi there,I recently came across this post about databricks apps that says it available for public previewhttps://www.databricks.com/blog/introducing-databricks-appsHowever, when I go to previews in the workspace, I don't see an option to enable it, i...
- 4738 Views
- 6 replies
- 1 kudos
- 1 kudos
Yea same...I got the same warning "Requested region australiasoutheast in cloud azure is not supported."Has there been any further updates?
- 1 kudos
- 2389 Views
- 3 replies
- 1 kudos
Databricks Apps Crashes Unexpectedly Without Showing any Logs
Hi All,I coded up a Databricks app using fastapi which seems to be crashing on deployment. Databricks throws the error that "app crashed at startup" but the logging page is empty. So I do not know what went wrong. Any ideas on how to debug the proble...
- 2389 Views
- 3 replies
- 1 kudos
- 1 kudos
I want to update that I have been able to solve the problem and get the app running. The problem was related to trying to import from anoter folder instead of downloading the package using its wheel file. Just for the record, had already tried the l...
- 1 kudos
- 9129 Views
- 1 replies
- 8 kudos
Connecting VS Code and GitHub Copilot to the Databricks Managed MCP Server
Recently, Databricks released a preview version of the Managed MCP Server. Upon seeing this, I immediately wanted to integrate Databricks Genie with VS Code and GitHub Copilot agent mode. Below, I will briefly share the setup process:Step 1: Prepare ...
- 9129 Views
- 1 replies
- 8 kudos
- 8 kudos
Thanks for the example this is something I am looking for
- 8 kudos
- 4144 Views
- 2 replies
- 3 kudos
Getting Started with Notebooks in Databricks
Databricks notebooks are a powerful tool for data scientists and engineers to collaborate, explore data, and build machine learning models. This guide will help you get started with creating and using notebooks in Databricks.Why Use Databricks Notebo...
- 4144 Views
- 2 replies
- 3 kudos
- 3 kudos
Thanks for sharing @bhanu_gautam. This will surely help beginners get started with Databricks Notebooks.
- 3 kudos
- 2794 Views
- 4 replies
- 4 kudos
Resolved! Unity Catalog Migration Strategy
Zero-Downtime Unity Catalog Migration for 500TB Data LakeJust completed migrating 500TB to Unity Catalog without a single minute of downtime. Here's how:The Challenge500 TB across 12,000 tables200+ concurrent usersZero tolerance for downtimeMixed Hiv...
- 2794 Views
- 4 replies
- 4 kudos
- 4 kudos
Thanks, @Khaja_Zaffer and @BS_THE_ANALYST!@Khaja_Zaffer:The toolkit has 5 main components:Pre-migration analyzer Compatibility scoringDrift monitor Real time consistency checksPermission migrator: Automated ACL copyingQuery rewriter: Hive→UC SQL con...
- 4 kudos
- 572 Views
- 0 replies
- 3 kudos
GetRunbook_Failed :: Bootstrap timeout - cluster failure during startup
After creating a new workspace, if you come across Failed to get instance bootstrap steps from the Databricks Control Plane. Please check that instances have connectivity to the Databricks Control Plane. Instance bootstrap inferred timeout reason: Ge...
- 572 Views
- 0 replies
- 3 kudos
- 6787 Views
- 17 replies
- 62 kudos
Level Up Your Databricks Game - Episode 1: Widgets
Hi all,TheOC here with my first of (hopefully!) many blogs on the Databricks Community. I'm hoping in this series to share quick, practical tips to help you get the most out of Databricks. Today's topic is: Widgets.If you're anything like me, you've ...
- 6787 Views
- 17 replies
- 62 kudos
- 62 kudos
hey @Rishabh_Tiwari,Thank you!Watch this space for the next instalment
- 62 kudos
- 2125 Views
- 4 replies
- 12 kudos
[Blog] Building a Scalable Telco CDR Processing Pipeline with Databricks Delta Live Tables - Part 1
Hey everyone! I wanted to share what I'm working with daily in the Databricks ecosystem and how amazing it is that we can achieve everything within one platform!Just published a deep dive on building a Telco CDR Processing Pipeline using: Delta Live ...
- 2125 Views
- 4 replies
- 12 kudos
- 12 kudos
@Pat here I was, cup of tea in hand, ready and eager to read the blog, only then did I discover I was redirected to another place that actually hosts the full blog .Personally, I think it'd be nice for it to be on here, especially if it's dedicated t...
- 12 kudos
- 1024 Views
- 0 replies
- 1 kudos
Technical Deep Dive
Bloom Filters + Zonemaps: The Ultimate Query Optimization ComboAfter my zonemap post last week got great feedback, several of you asked about Bloom filter integration. Here's the complete implementation!Why Bloom Filters Changed EverythingZonemaps ar...
- 1024 Views
- 0 replies
- 1 kudos
- 4566 Views
- 12 replies
- 7 kudos
Resolved! 🚀 DataFrame Caching on Delta Tables - What if underlying data is updated?
Just published new video on Databricks Performance Series to try to clearly explain how DataFrame caching over Delta Tables behaves when updates on underlying table are performed. I came across this use case in my recent project and struggled a littl...
- 4566 Views
- 12 replies
- 7 kudos
- 7 kudos
Source Code with samples available at https://github.com/CafeConData/Spark-Caching-on-Delta-Tables
- 7 kudos
- 2042 Views
- 0 replies
- 2 kudos
🚀 Spark Caching vs Databricks Disk Caching
As promised @BS_THE_ANALYST , in this new video and summarized in post, I try to explain what Spark Caching and Databricks Disk Caching are and how Caching strategy can be leveraged by making these cool features work together: Spark Caching vs Databr...
- 2042 Views
- 0 replies
- 2 kudos
- 921 Views
- 0 replies
- 1 kudos
Parallel Model Training & Data Pipelines on Databricks (ForEach Tasks+ Asset Bundles + Pydantic)
As companies double down on machine learning (ML), one thing is obvious: a single model can’t solve every problem. Different datasets, different timelines, and different requirements make managing multiple models pretty tricky. And if you’ve ever wor...
- 921 Views
- 0 replies
- 1 kudos
- 13738 Views
- 3 replies
- 7 kudos
Resolved! Data Quality with PySpark and Great Expectations on Databricks
Data governance is one of the most important pillars in any modern architecture. When building pipelines that process data at scale, ensuring data quality is not just a best practice—it is a critical necessity.Tools like Great Expectations (GX) were ...
- 13738 Views
- 3 replies
- 7 kudos
- 7 kudos
@WiliamRosaWiliamRosa: Thanks for sharing the link. I will explore.
- 7 kudos
- 3613 Views
- 0 replies
- 1 kudos
Tracking Query History and Optimizing Queries in Databricks
Optimizing queries in Databricks isn’t just about adding indexes or tweaking SQL syntax — it’s about visibility. You can’t improve what you can’t measure. Fortunately, Databricks provides rich telemetry around query history that you can use to analyz...
- 3613 Views
- 0 replies
- 1 kudos
- 3590 Views
- 1 replies
- 4 kudos
[Blog] Databricks Serverless vs Classic: Who Wins the Cost Sprint?
Hi everyone! I wanted to share with you a post I wrote on Medium a while ago — it’s still very useful if you want to understand how to properly calculate Databricks cluster costs and get a realistic view of the differences: Databricks Serverless vs C...
- 3590 Views
- 1 replies
- 4 kudos
- 4 kudos
Really interesting topic I'll take a look when possible Always interested in improving performance and saving cloud costs. Thanks for sharing.
- 4 kudos
-
Access Data
1 -
Access Delta Tables
1 -
ADF Linked Service
1 -
ADF Pipeline
1 -
Advanced Data Engineering
5 -
agent bricks
1 -
Agentic AI
3 -
AI
1 -
AI Agents
4 -
AI Readiness
1 -
AIBI
1 -
Analytics Engineering
1 -
Apache spark
3 -
Apache Spark 3.0
2 -
ApacheSpark
1 -
Architecture
1 -
Associate Certification
1 -
Audit
1 -
Auto-loader
1 -
Automation
1 -
AWSDatabricksCluster
2 -
Azure
2 -
Azure databricks
3 -
Azure Databricks Delta Table
1 -
Azure Databricks Job
2 -
Azure Delta Lake
3 -
Azure devops integration
1 -
AzureDatabricks
2 -
BI
1 -
BI Integrations
1 -
Big data
1 -
Billing and Cost Management
2 -
Blog
1 -
Caching
2 -
CDC
1 -
CICD
2 -
CICDForDatabricksWorkflows
1 -
Cluster
1 -
Cluster Policies
1 -
Cluster Pools
1 -
Collect
1 -
Community Event
1 -
CommunityArticle
2 -
Cost Optimization Effort
2 -
CostOptimization
2 -
custom compute policy
1 -
CustomLibrary
1 -
DABs
1 -
DAIS 0206
3 -
Dashboards
2 -
Data
1 -
Data Analysis with Databricks
1 -
Data Architecture
2 -
Data Driven AI Roadmap
1 -
Data Engineering
10 -
Data Governance
2 -
Data Ingestion
1 -
Data Ingestion & connectivity
1 -
data layout
1 -
Data Mesh
1 -
data optimization
1 -
Data Processing
1 -
Data Quality
1 -
Data warehouse
1 -
databricks
1 -
Databricks App
1 -
Databricks Apps
1 -
Databricks Assistant
2 -
Databricks Community
1 -
Databricks Dashboard
2 -
Databricks Delta Table
2 -
Databricks Demo Center
1 -
databricks genie
1 -
Databricks Job
2 -
Databricks Lakeflow
2 -
Databricks Lakehouse
2 -
Databricks Migration
3 -
Databricks Mlflow
1 -
Databricks News
1 -
Databricks Notebooks
1 -
Databricks Pyspark
3 -
Databricks Serverless
1 -
Databricks Support
1 -
Databricks Training
1 -
Databricks Unity Catalog
3 -
Databricks Workflows
3 -
DatabricksML
1 -
DBR Versions
1 -
Declartive Pipelines
1 -
DeepLearning
1 -
Delta Lake
9 -
Delta Live Table
1 -
Delta Live Tables
1 -
Delta Time Travel
1 -
DevOps
2 -
DimensionTables
1 -
DLT
2 -
DLT Pipelines
3 -
DLT-Meta
1 -
Dns
1 -
Dynamic
1 -
ETL Pipelines
1 -
fastapi
1 -
Free Databricks
3 -
Free Edition
1 -
GenAI agent
2 -
GenAI and LLMs
3 -
GenAIGeneration AI
2 -
Generation AI
1 -
Generative AI
1 -
Genie
3 -
Git
1 -
Google Bigquery
1 -
Google cloud
1 -
Governance
1 -
Governed Tag
1 -
hackathon
1 -
Hive metastore
1 -
Hubert Dudek
42 -
Hybrid Lakehouse
1 -
Kafka streaming
2 -
Lakeflow Pipelines
1 -
Lakehouse
2 -
Lakehouse Migration
1 -
Lazy Evaluation
1 -
Learning
1 -
Library Installation
1 -
Lineage
1 -
Live Tables CDC
1 -
Llama
1 -
LLMs
1 -
Machine Learning
1 -
mcp
2 -
Medallion Architecture
3 -
MERGE Performance
1 -
Metadata
1 -
Metric Views
2 -
Microsoft Teams
1 -
Migrations
1 -
MSExcel
3 -
Multi-Table Transactions
1 -
Multiagent
3 -
Networking
2 -
New Features
1 -
NotMvpArticle
1 -
Optimize Command
1 -
Partitioning
1 -
Partner
1 -
Performance
2 -
Performance Tuning
3 -
Powerbi
1 -
Private Link
1 -
Pyspark
4 -
Pyspark Code
1 -
Pyspark Databricks
1 -
Pytest
1 -
Python
1 -
Reading-excel
2 -
SAP
1 -
Sap Hana Driver
1 -
Scala Code
1 -
Scripting
1 -
SDK
1 -
Security
1 -
Semantic Layer
1 -
Serverless
2 -
slack
1 -
Spark
5 -
Spark Caching
1 -
Spark Performance
1 -
SparkSQL
1 -
SQL
2 -
Sql Scripts
2 -
SQL Serverless
1 -
streamlit
1 -
Structured streaming
1 -
Students
2 -
Support Ticket
1 -
Sync
1 -
Training
1 -
Tutorial
3 -
UCSD
1 -
Unit Test
1 -
Unity Catalog
9 -
Unity Catlog
1 -
University Alliance
1 -
VACUUM Command
1 -
Variant
1 -
Warehousing
1 -
Workflow Jobs
1 -
Workflows
8 -
Zerobus
1
- « Previous
- Next »
| User | Count |
|---|---|
| 85 | |
| 74 | |
| 57 | |
| 44 | |
| 44 |