Explore in-depth articles, tutorials, and insights on data analytics and machine learning in the Databricks Technical Blog. Stay updated on industry trends, best practices, and advanced techniques.
Problem Statement
Technologies used: Ray, GPUs, Unity Catalog, MLflow, XGBoost
For many data scientists, eXtreme Gradient Boosting (XGBoost) remains a popular algorithm for tackling regression and cla...
Financial institutions face a critical challenge: as member bases grow, how do you deliver personalized retirement advice at scale without proportionally increasing costs? More importantly, how do you...
Introduction
In today’s AI-native world, applications no longer rely on exact keyword matches—they understand meaning. This shift is powered by embeddings: numerical representations of text that captu...
The Hidden Story in Every Service Visit
It’s a busy Tuesday morning at a dealership. A customer pulls in for what should be a simple oil change. The technician performs the inspection, then notices so...
In enterprise GenAI deployments, prompts are the critical interface between users and AI models—yet most organizations manage them like scattered text files. This creates bottlenecks that prevent GenA...
Intro
Unlock Unity Catalog governance and performance by upgrading Hive Metastore (HMS) and AWS Glue foreign tables to Managed Tables using the new Upgrade Foreign Table workflow. Managed Tables provi...
As a global software-as-a-service (SaaS) company specializing in providing intuitive, AI-powered business solutions designed to enhance customer and employee experiences, Freshworks depends on real-ti...
IntroductionAfter Databricks' Data + AI Summit, an electrifying hackathon lit up the London HQ. Teams from across industries came together to build bold, tangible solutions using DBX’s latest features...
Introduction
Unity Catalog Metric Views provide the ability to build a semantic layer in Databricks, using YAML and SQL. This shifts the semantic layer left into your catalog, moving it closer to your...
Operationalize Your Lakehouse: Lakebase for Low-Latency Apps & APIs
The Databricks Data Intelligence Platform unifies data, AI, and governance so organizations can put all of their data to work. Until...