Explore in-depth articles, tutorials, and insights on data analytics and machine learning in the Databricks Technical Blog. Stay updated on industry trends, best practices, and advanced techniques.
Unified governance and interoperability for unstructured data
Summary
Access unstructured data in Unity Catalog Volumes from any external tool or application using a new credential vending API that is...
You've spent the afternoon building a StatefulProcessor for your TransformWithState streaming job. It tracks per-user sessions, accumulates running totals, or deduplicates events. Now you want to know...
While data volumes are growing, the time windows to process them are getting stricter. Enterprise customers are generating more data than ever, and the speed is only accelerating with more agentic wor...
Summary
External engines — Apache Spark™ (batch and Structured Streaming), DuckDB, Apache Flink, Starburst, and StreamNative Kafka Service — can now create, read, and write Unity Catalog-managed Delta...
Introduction
LLM-powered apps have two costs that compound fast: every request costs money, and users ask the same question in many ways. "How do I reset my password?", "I need to reset my password", ...
Delta Lake has become the foundation of modern data lakehouses. Organizations worldwide rely on it to bring ACID transactions, schema enforcement, and time travel capabilities to their data lakes. Yet...
If you've worked with Delta Lake at scale, you've encountered this: your MERGE operation that once completed in seconds now takes minutes or hours, as your table grows.
The culprit? Without optimizati...
Introduction
Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) have made it possible to have natural-language conversations with data, and for many use cases, this is a genuine ...
The Use Case
Data quality questions are everywhere in a data team's day. Is this table stale? Did any pipeline fail last night? Why does this dashboard look off? How did we fix this last time? The ans...
Django and Databricks Apps
Did you know that you can run Django on Databricks Apps? If you’re a Django developer looking for a simple way to host your apps and bring them closer to your data, you've f...