- 5498 Views
- 1 replies
- 2 kudos
9 Powerful 🚀 Spark Optimization Techniques in Databricks (With Real Examples)
IntroductionOne of our ETL pipelines used to take 10 hours to complete. After tuning and scaling in Databricks, it finished in just about 1 hour — a 90% reduction in runtime.That’s the power of Spark tuning.Databricks, built on Apache Spark, is a po...
- 5498 Views
- 1 replies
- 2 kudos
- 2 kudos
This is a fantastic breakdown of Spark optimization techniques, @savlahanish27!Definitely helpful for anyone working on performance tuning in Databricks.
- 2 kudos
- 1389 Views
- 0 replies
- 4 kudos
Semantic Modelling and Data Products using Agent
Hey Community!We have built something cool for Data Engineers in Databricks!Raw Files -> Semantic Model -> Data Products without writing ETL/ELT Code.Demo/ Guide - https://youtu.be/wjQYXrBwA-oNotebook - https://github.com/Intugle/data-tools/blob/main...
- 1389 Views
- 0 replies
- 4 kudos
- 2511 Views
- 2 replies
- 7 kudos
Data Quality at Scale: My Experience Using Databricks and AWS
Over the past few years working as a data engineer, I’ve seen how quickly companies are moving their platforms to Databricks and AWS. The flexibility and scale these platforms provide are amazing, but one challenge always comes up again and again: ho...
- 2511 Views
- 2 replies
- 7 kudos
- 7 kudos
Hi @Brahmareddy very good insights , I can summarize this as follows: Area Best Practice ExampleSchema ManagementDefine schemas in JSON/YAML, enforce with Delta LakeGovernanceUse Unity Catalog for access, lineage, and ownershipMonitoringSet up Lakeho...
- 7 kudos
- 6899 Views
- 4 replies
- 6 kudos
From Pilots to Production: Unlocking Enterprise AI Agents with Agent Bricks
Many enterprises today launch AI agents with high hopes but more often than not, those pilots never reach production. The culprit? Complexity, poor evaluations, ballooning costs, and governance gaps.Why do so many AI agent pilots never make it to pro...
- 6899 Views
- 4 replies
- 6 kudos
- 6 kudos
Agent Bricks looks solid for scaling AI, and I’ve seen platforms like Agentra.io also tackle the enterprise workflow side of this challenge.
- 6 kudos
- 11422 Views
- 4 replies
- 1 kudos
Databricks Academy Labs Coupon Instructions
To see all Databricks training and enablement offerings, please visit our Learning Library and Certifications Catalog. To use your Databricks Academy Labs coupons, please - Head to Databricks Academy Across the top navigation, select Subscriptions C...
- 11422 Views
- 4 replies
- 1 kudos
- 1 kudos
is this coupon still valid, not able to use it ?
- 1 kudos
- 1260 Views
- 1 replies
- 4 kudos
Accelerating Data Migration and Data Engineering with AI: The Future of Databricks Adoption
In today’s fast-evolving digital landscape, organizations are under immense pressure to modernize their data infrastructure for better scalability, agility, and advanced analytics. One of the most powerful shifts in recent times has been the migratio...
- 1260 Views
- 1 replies
- 4 kudos
- 4 kudos
Great summary, @JatinArora! Clear and highlights the tangible benefits perfectly.
- 4 kudos
- 15555 Views
- 13 replies
- 32 kudos
Zero to Hero - Data Analysis Certification
Hey everyone, Today I passed my Data Analyst Associate exam! 拾I'd like to say thanks to the Databricks community. The questions that people ask and interactions I've been a part of have been invaluable. Starting tomorrow, I'll be moving onto my data ...
- 15555 Views
- 13 replies
- 32 kudos
- 32 kudos
Thanks @BS_THE_ANALYST !Hoping to pick up Databricks blogging again - I love sharing my knowledge with the wider Community.
- 32 kudos
- 1336 Views
- 2 replies
- 4 kudos
What Does “Full Stack Development” Mean in the World of Databricks
Hi All, Many teams using Databricks today are referring to their work as “full stack development,” which can be a bit confusing at first. In the Databricks context, this doesn't mean a new framework — it simply means handling everything from raw data...
- 1336 Views
- 2 replies
- 4 kudos
- 4 kudos
Absolutely agree — what’s called “full stack” in Databricks truly means managing the complete data lifecycle on a unified platform,. Teams today aren’t just doing ETL; they’re orchestrating everything from real-time ingestion (Autoloader), scalable s...
- 4 kudos
- 1365 Views
- 1 replies
- 3 kudos
corrupted delta logs
ERROR: DeltaVersionsNotContiguousException: Versions (0, 2) are not contiguous. This can happen when files have been manually removed from the Delta log. Please contact Databricks support to repair the table.Cause of the error: You are getting this e...
- 1365 Views
- 1 replies
- 3 kudos
- 5721 Views
- 4 replies
- 2 kudos
Demo Deploy a Databricks Asset Bundle with Azure DevOps Pipelines
I recently wrote a blog post showing how to deploy a minimal Databricks job with notebook task using Databricks Asset Bundles and Azure DevOps pipelines. The code is available in my public GitHub repo. Thought I'd share it here!ObjectivesIn this post...
- 5721 Views
- 4 replies
- 2 kudos
- 2 kudos
HiGreat article.2 questionsHow does the deploy bundle script in the pipleine know where to pick up the bundel. Should there be a path. Mine is on my local machine becaust that where I first run it from the cli. Also can a bundle deploy resources that...
- 2 kudos
- 3604 Views
- 1 replies
- 2 kudos
Installing CrowdStrike Falcon Sensor on Databricks Workers
Greetings,Does anyone here have experience deploying the CrowdStrike Falcon sensor on Databricks worker instances? For context, the cluster is deployed in AWS and we use a Databricks Ubuntu 20.04 AMI. Databricks allows adding a bootstrap/startup scri...
- 3604 Views
- 1 replies
- 2 kudos
- 3697 Views
- 6 replies
- 1 kudos
Databricks App Availability
Hi there,I recently came across this post about databricks apps that says it available for public previewhttps://www.databricks.com/blog/introducing-databricks-appsHowever, when I go to previews in the workspace, I don't see an option to enable it, i...
- 3697 Views
- 6 replies
- 1 kudos
- 1 kudos
Yea same...I got the same warning "Requested region australiasoutheast in cloud azure is not supported."Has there been any further updates?
- 1 kudos
- 1775 Views
- 3 replies
- 1 kudos
Databricks Apps Crashes Unexpectedly Without Showing any Logs
Hi All,I coded up a Databricks app using fastapi which seems to be crashing on deployment. Databricks throws the error that "app crashed at startup" but the logging page is empty. So I do not know what went wrong. Any ideas on how to debug the proble...
- 1775 Views
- 3 replies
- 1 kudos
- 1 kudos
I want to update that I have been able to solve the problem and get the app running. The problem was related to trying to import from anoter folder instead of downloading the package using its wheel file. Just for the record, had already tried the l...
- 1 kudos
- 6209 Views
- 1 replies
- 7 kudos
Connecting VS Code and GitHub Copilot to the Databricks Managed MCP Server
Recently, Databricks released a preview version of the Managed MCP Server. Upon seeing this, I immediately wanted to integrate Databricks Genie with VS Code and GitHub Copilot agent mode. Below, I will briefly share the setup process:Step 1: Prepare ...
- 6209 Views
- 1 replies
- 7 kudos
- 7 kudos
Thanks for the example this is something I am looking for
- 7 kudos
- 2529 Views
- 2 replies
- 3 kudos
Getting Started with Notebooks in Databricks
Databricks notebooks are a powerful tool for data scientists and engineers to collaborate, explore data, and build machine learning models. This guide will help you get started with creating and using notebooks in Databricks.Why Use Databricks Notebo...
- 2529 Views
- 2 replies
- 3 kudos
- 3 kudos
Thanks for sharing @bhanu_gautam. This will surely help beginners get started with Databricks Notebooks.
- 3 kudos
-
Access Data
1 -
ADF Linked Service
1 -
ADF Pipeline
1 -
Advanced Data Engineering
3 -
agent bricks
1 -
Agentic AI
3 -
AI Agents
3 -
AI Readiness
1 -
Apache spark
3 -
Apache Spark 3.0
1 -
ApacheSpark
1 -
Associate Certification
1 -
Auto-loader
1 -
Automation
1 -
AWSDatabricksCluster
1 -
Azure
1 -
Azure databricks
3 -
Azure Databricks Job
2 -
Azure Delta Lake
2 -
Azure devops integration
1 -
AzureDatabricks
2 -
BI Integrations
1 -
Big data
1 -
Billing and Cost Management
1 -
Blog
1 -
Caching
2 -
CDC
1 -
CICDForDatabricksWorkflows
1 -
Cluster
1 -
Cluster Policies
1 -
Cluster Pools
1 -
Collect
1 -
Community Event
1 -
CommunityArticle
2 -
Cost Optimization Effort
1 -
CostOptimization
1 -
custom compute policy
1 -
CustomLibrary
1 -
Data
1 -
Data Analysis with Databricks
1 -
Data Driven AI Roadmap
1 -
Data Engineering
7 -
Data Governance
1 -
Data Ingestion
1 -
Data Ingestion & connectivity
1 -
Data Mesh
1 -
Data Processing
1 -
Data Quality
1 -
Data warehouse
1 -
databricks
1 -
Databricks Assistant
2 -
Databricks Community
1 -
Databricks Dashboard
2 -
Databricks Delta Table
1 -
Databricks Demo Center
1 -
Databricks Job
1 -
Databricks Lakehouse
1 -
Databricks Migration
3 -
Databricks Mlflow
1 -
Databricks Notebooks
1 -
Databricks Serverless
1 -
Databricks Support
1 -
Databricks Training
1 -
Databricks Unity Catalog
2 -
Databricks Workflows
1 -
DatabricksML
1 -
DBR Versions
1 -
Declartive Pipelines
1 -
DeepLearning
1 -
Delta Lake
5 -
Delta Live Table
1 -
Delta Live Tables
1 -
Delta Time Travel
1 -
Devops
1 -
DimensionTables
1 -
DLT
2 -
DLT Pipelines
3 -
DLT-Meta
1 -
Dns
1 -
Dynamic
1 -
Free Databricks
3 -
Free Edition
1 -
GenAI agent
2 -
GenAI and LLMs
2 -
GenAIGeneration AI
2 -
Generative AI
1 -
Genie
1 -
Governance
1 -
Governed Tag
1 -
Hive metastore
1 -
Hubert Dudek
42 -
Hybrid Lakehouse
1 -
Lakeflow Pipelines
1 -
Lakehouse
2 -
Lakehouse Migration
1 -
Lazy Evaluation
1 -
Learn Databricks
1 -
Learning
1 -
Library Installation
1 -
Llama
1 -
LLMs
1 -
mcp
1 -
Medallion Architecture
2 -
Metric Views
1 -
Migrations
1 -
MSExcel
3 -
Multi-Table Transactions
1 -
Multiagent
3 -
Networking
2 -
NotMvpArticle
1 -
Partitioning
1 -
Partner
1 -
Performance
2 -
Performance Tuning
2 -
Private Link
1 -
Pyspark
2 -
Pyspark Code
1 -
Pyspark Databricks
1 -
Pytest
1 -
Python
1 -
Reading-excel
2 -
Scala Code
1 -
Scripting
1 -
SDK
1 -
Serverless
2 -
Spark
4 -
Spark Caching
1 -
Spark Performance
1 -
SparkSQL
1 -
SQL
2 -
Sql Scripts
2 -
SQL Serverless
1 -
Students
1 -
Support Ticket
1 -
Sync
1 -
Training
1 -
Tutorial
1 -
Unit Test
1 -
Unity Catalog
7 -
Unity Catlog
1 -
Variant
1 -
Warehousing
1 -
Workflow Jobs
1 -
Workflows
6 -
Zerobus
1
- « Previous
- Next »
| User | Count |
|---|---|
| 85 | |
| 71 | |
| 44 | |
| 41 | |
| 41 |