- 3023 Views
- 1 replies
- 1 kudos
Installing CrowdStrike Falcon Sensor on Databricks Workers
Greetings,Does anyone here have experience deploying the CrowdStrike Falcon sensor on Databricks worker instances? For context, the cluster is deployed in AWS and we use a Databricks Ubuntu 20.04 AMI. Databricks allows adding a bootstrap/startup scri...
- 3023 Views
- 1 replies
- 1 kudos
- 2909 Views
- 6 replies
- 1 kudos
Databricks App Availability
Hi there,I recently came across this post about databricks apps that says it available for public previewhttps://www.databricks.com/blog/introducing-databricks-appsHowever, when I go to previews in the workspace, I don't see an option to enable it, i...
- 2909 Views
- 6 replies
- 1 kudos
- 1 kudos
Yea same...I got the same warning "Requested region australiasoutheast in cloud azure is not supported."Has there been any further updates?
- 1 kudos
- 1159 Views
- 3 replies
- 1 kudos
Databricks Apps Crashes Unexpectedly Without Showing any Logs
Hi All,I coded up a Databricks app using fastapi which seems to be crashing on deployment. Databricks throws the error that "app crashed at startup" but the logging page is empty. So I do not know what went wrong. Any ideas on how to debug the proble...
- 1159 Views
- 3 replies
- 1 kudos
- 1 kudos
I want to update that I have been able to solve the problem and get the app running. The problem was related to trying to import from anoter folder instead of downloading the package using its wheel file. Just for the record, had already tried the l...
- 1 kudos
- 3703 Views
- 1 replies
- 6 kudos
Connecting VS Code and GitHub Copilot to the Databricks Managed MCP Server
Recently, Databricks released a preview version of the Managed MCP Server. Upon seeing this, I immediately wanted to integrate Databricks Genie with VS Code and GitHub Copilot agent mode. Below, I will briefly share the setup process:Step 1: Prepare ...
- 3703 Views
- 1 replies
- 6 kudos
- 6 kudos
Thanks for the example this is something I am looking for
- 6 kudos
- 1570 Views
- 2 replies
- 3 kudos
Getting Started with Notebooks in Databricks
Databricks notebooks are a powerful tool for data scientists and engineers to collaborate, explore data, and build machine learning models. This guide will help you get started with creating and using notebooks in Databricks.Why Use Databricks Notebo...
- 1570 Views
- 2 replies
- 3 kudos
- 3 kudos
Thanks for sharing @bhanu_gautam. This will surely help beginners get started with Databricks Notebooks.
- 3 kudos
- 1327 Views
- 4 replies
- 4 kudos
Resolved! Unity Catalog Migration Strategy
Zero-Downtime Unity Catalog Migration for 500TB Data LakeJust completed migrating 500TB to Unity Catalog without a single minute of downtime. Here's how:The Challenge500 TB across 12,000 tables200+ concurrent usersZero tolerance for downtimeMixed Hiv...
- 1327 Views
- 4 replies
- 4 kudos
- 4 kudos
Thanks, @Khaja_Zaffer and @BS_THE_ANALYST!@Khaja_Zaffer:The toolkit has 5 main components:Pre-migration analyzer Compatibility scoringDrift monitor Real time consistency checksPermission migrator: Automated ACL copyingQuery rewriter: Hive→UC SQL con...
- 4 kudos
- 338 Views
- 0 replies
- 3 kudos
GetRunbook_Failed :: Bootstrap timeout - cluster failure during startup
After creating a new workspace, if you come across Failed to get instance bootstrap steps from the Databricks Control Plane. Please check that instances have connectivity to the Databricks Control Plane. Instance bootstrap inferred timeout reason: Ge...
- 338 Views
- 0 replies
- 3 kudos
- 2520 Views
- 17 replies
- 62 kudos
Level Up Your Databricks Game - Episode 1: Widgets
Hi all,TheOC here with my first of (hopefully!) many blogs on the Databricks Community. I'm hoping in this series to share quick, practical tips to help you get the most out of Databricks. Today's topic is: Widgets.If you're anything like me, you've ...
- 2520 Views
- 17 replies
- 62 kudos
- 62 kudos
hey @Rishabh_Tiwari,Thank you!Watch this space for the next instalment
- 62 kudos
- 1293 Views
- 4 replies
- 12 kudos
[Blog] Building a Scalable Telco CDR Processing Pipeline with Databricks Delta Live Tables - Part 1
Hey everyone! I wanted to share what I'm working with daily in the Databricks ecosystem and how amazing it is that we can achieve everything within one platform!Just published a deep dive on building a Telco CDR Processing Pipeline using: Delta Live ...
- 1293 Views
- 4 replies
- 12 kudos
- 12 kudos
@Pat here I was, cup of tea in hand, ready and eager to read the blog, only then did I discover I was redirected to another place that actually hosts the full blog .Personally, I think it'd be nice for it to be on here, especially if it's dedicated t...
- 12 kudos
- 381 Views
- 0 replies
- 1 kudos
Technical Deep Dive
Bloom Filters + Zonemaps: The Ultimate Query Optimization ComboAfter my zonemap post last week got great feedback, several of you asked about Bloom filter integration. Here's the complete implementation!Why Bloom Filters Changed EverythingZonemaps ar...
- 381 Views
- 0 replies
- 1 kudos
- 2111 Views
- 12 replies
- 7 kudos
Resolved! 🚀 DataFrame Caching on Delta Tables - What if underlying data is updated?
Just published new video on Databricks Performance Series to try to clearly explain how DataFrame caching over Delta Tables behaves when updates on underlying table are performed. I came across this use case in my recent project and struggled a littl...
- 2111 Views
- 12 replies
- 7 kudos
- 7 kudos
Source Code with samples available at https://github.com/CafeConData/Spark-Caching-on-Delta-Tables
- 7 kudos
- 538 Views
- 0 replies
- 2 kudos
🚀 Spark Caching vs Databricks Disk Caching
As promised @BS_THE_ANALYST , in this new video and summarized in post, I try to explain what Spark Caching and Databricks Disk Caching are and how Caching strategy can be leveraged by making these cool features work together: Spark Caching vs Databr...
- 538 Views
- 0 replies
- 2 kudos
- 417 Views
- 0 replies
- 1 kudos
Parallel Model Training & Data Pipelines on Databricks (ForEach Tasks+ Asset Bundles + Pydantic)
As companies double down on machine learning (ML), one thing is obvious: a single model can’t solve every problem. Different datasets, different timelines, and different requirements make managing multiple models pretty tricky. And if you’ve ever wor...
- 417 Views
- 0 replies
- 1 kudos
- 3416 Views
- 3 replies
- 7 kudos
Resolved! Data Quality with PySpark and Great Expectations on Databricks
Data governance is one of the most important pillars in any modern architecture. When building pipelines that process data at scale, ensuring data quality is not just a best practice—it is a critical necessity.Tools like Great Expectations (GX) were ...
- 3416 Views
- 3 replies
- 7 kudos
- 7 kudos
@WiliamRosaWiliamRosa: Thanks for sharing the link. I will explore.
- 7 kudos
- 810 Views
- 0 replies
- 1 kudos
Tracking Query History and Optimizing Queries in Databricks
Optimizing queries in Databricks isn’t just about adding indexes or tweaking SQL syntax — it’s about visibility. You can’t improve what you can’t measure. Fortunately, Databricks provides rich telemetry around query history that you can use to analyz...
- 810 Views
- 0 replies
- 1 kudos
Join Us as a Local Community Builder!
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now-
Access Data
1 -
ADF Linked Service
1 -
ADF Pipeline
1 -
Advanced Data Engineering
3 -
Agentic AI
2 -
AI Agents
2 -
AI Readiness
1 -
Apache spark
1 -
ApacheSpark
1 -
Associate Certification
1 -
Automation
1 -
AWSDatabricksCluster
1 -
Azure
1 -
Azure databricks
3 -
Azure devops integration
1 -
AzureDatabricks
2 -
BI Integrations
1 -
Big data
1 -
Billing and Cost Management
1 -
Blog
1 -
Caching
2 -
CICDForDatabricksWorkflows
1 -
Cluster
1 -
Cluster Policies
1 -
Cluster Pools
1 -
Community Event
1 -
Cost Optimization Effort
1 -
CostOptimization
1 -
custom compute policy
1 -
CustomLibrary
1 -
Data
1 -
Data Analysis with Databricks
1 -
Data Engineering
5 -
Data Governance
1 -
Data Ingestion & connectivity
1 -
Data Mesh
1 -
Data Processing
1 -
Data Quality
1 -
Databricks Assistant
1 -
Databricks Community
1 -
Databricks Dashboard
2 -
Databricks Delta Table
1 -
Databricks Demo Center
1 -
Databricks Job
1 -
Databricks Lakehouse
1 -
Databricks Migration
2 -
Databricks Mlflow
1 -
Databricks Notebooks
1 -
Databricks Support
1 -
Databricks Training
1 -
Databricks Unity Catalog
2 -
Databricks Workflows
1 -
DatabricksML
1 -
DBR Versions
1 -
Declartive Pipelines
1 -
DeepLearning
1 -
Delta Lake
2 -
Delta Live Table
1 -
Delta Live Tables
1 -
Delta Time Travel
1 -
Devops
1 -
DimensionTables
1 -
DLT
2 -
DLT Pipelines
3 -
DLT-Meta
1 -
Dns
1 -
Dynamic
1 -
Free Databricks
3 -
Free Edition
1 -
GenAI agent
1 -
GenAI and LLMs
2 -
GenAIGeneration AI
1 -
Generative AI
1 -
Genie
1 -
Governance
1 -
Hive metastore
1 -
Hubert Dudek
23 -
Lakeflow Pipelines
1 -
Lakehouse
1 -
Lakehouse Migration
1 -
Lazy Evaluation
1 -
Learning
1 -
Library Installation
1 -
Llama
1 -
mcp
1 -
Medallion Architecture
2 -
Metric Views
1 -
Migrations
1 -
MSExcel
2 -
Multiagent
2 -
Networking
2 -
Partner
1 -
Performance
1 -
Performance Tuning
1 -
Private Link
1 -
Pyspark
2 -
Pyspark Code
1 -
Pyspark Databricks
1 -
Pytest
1 -
Python
1 -
Reading-excel
1 -
Scala Code
1 -
Scripting
1 -
SDK
1 -
Serverless
2 -
Spark
2 -
Spark Caching
1 -
SparkSQL
1 -
SQL
1 -
SQL Serverless
1 -
Students
1 -
Support Ticket
1 -
Sync
1 -
Training
1 -
Tutorial
1 -
Unit Test
1 -
Unity Catalog
5 -
Unity Catlog
1 -
Warehousing
1 -
Workflow Jobs
1 -
Workflows
3
- « Previous
- Next »
| User | Count |
|---|---|
| 71 | |
| 54 | |
| 43 | |
| 38 | |
| 33 |