Community Articles

by DILIPKHANDELWAL • New Contributor

08-12-2025 12:17:30 AM

728 Views
0 replies
0 kudos

Apache Spark 4.0 — Big Data Engineering!

The latest Spark 4.0 release delivers powerful enhancements across SQL, Python, streaming, and connectivity, all aimed at making big data workloads more efficient, reliable, and developer-friendly. With Databricks Runtime 17.0, these capabilities are...

by RahulGupta • New Contributor III

08-10-2025 2:30:21 AM

5360 Views
0 replies
2 kudos

Databricks AI/BI Genie: The Future of Conversational Analytics

The Rise of AI in Data Analytics: Over the last decade, organizations have collected massive amounts of data, from customer transactions to IoT sensors, web logs, and financial records. But collecting data is just the first step. The real challenge lies...

by devipriya • New Contributor III

08-08-2025 11:09:50 PM

3030 Views
0 replies
1 kudos

Pipelines to Prompts: Getting started with Databricks and AWS

NAVIGATION: Why Data Engineering; The Role of Data Engineering in GenAI; What is Databricks? Unifying Data and AI on One Platform; Databricks on AWS: A Full-Stack Platform for GenAI; Hands-On Exercise; Future-Proofing: Why Data + AI Skills Matter Now More Than ...

by RahulGupta • New Contributor III

07-30-2025 11:00:59 AM

5920 Views
1 reply
3 kudos

Understanding Liquid Clustering in Databricks - The Next Evolution in Data Optimisation

In the world of big data, organising data smartly is just as important as collecting it. When working with large datasets in Databricks using Delta Lake, how your data is stored and ordered can greatly impact performance, especially for queries. Trad...

Latest Reply

Louis_Frolio
Databricks Employee

07-30-2025 3:26:40 PM

3 kudos

Great post, Rahul! You’ve nailed the key trade-offs perfectly. The Appeal: LC is “set it and forget it” data management—no more manual OPTIMIZE jobs or performance firefighting. The Reality Check: Single-column clustering works great for high-cardina...

by Charansai • New Contributor III

07-25-2025 10:38:53 AM

1704 Views
2 replies
0 kudos

Recommendations for Designing Cluster Policies Across Dev/QA/Prod Environments for DE and DA teams

Hi Community, We are working on implementing Databricks cluster policies across our organization and are seeking advice on best practices to enforce governance, security, and cost control across different environments. We have two main teams using Data...

Latest Reply

Charansai
New Contributor III

07-30-2025 7:48:21 AM

0 kudos

I just want to confirm one more thing: as the admin, I manage cluster creation, and no user will have access to create clusters. Please let me know how cluster policies help me in this scenario.

1 More Reply

by RiyazAliM • Honored Contributor

07-21-2025 9:09:03 PM

4109 Views
4 replies
8 kudos

The Open Source DLT Meta Framework

DLT Meta is an open-source framework developed by Databricks Labs that enables the automation of bronze and silver data pipelines through metadata configuration rather than manual code development. At its core, the framework uses a Dataflowspec - a JS...

Latest Reply

sridharplv
Valued Contributor II

07-23-2025 11:38:53 AM

8 kudos

Great article, Riyaz. Keep sharing more knowledge.

3 More Replies

by RahulGupta • New Contributor III

07-26-2025 8:21:48 AM

5716 Views
0 replies
2 kudos

Databricks Lakeflow - Redefining Data Engineering for the Modern AI Stack

Introduction to Lakeflow: At the Databricks Data + AI Summit 2025, Databricks unveiled Lakeflow, a revolutionary approach to data engineering. While many of us have used Delta Live Tables (DLT) for declarative pipeline management, Lakeflow goes beyond,...

by ayushbadhera1 • Databricks Partner

07-23-2025 11:54:48 AM

6412 Views
2 replies
6 kudos

Databricks LLM Evolution and Future Prospects

Databricks has progressed from a big-data compute engine to a full-stack AI powerhouse that designs, trains, and serves state-of-the-art large language models (LLMs). This article explores two key technica...

Latest Reply

ayushbadhera1
Databricks Partner

07-25-2025 3:04:40 AM

6 kudos

Thanks, @RiyazAliM, for checking out the blog post! More insights on Databricks LLM and Dolly are on the way in the next one. Stay tuned and keep learning! Best, Ayush

1 More Reply

by hozefa413 • Databricks Partner

07-24-2025 3:52:27 AM

6891 Views
3 replies
9 kudos

Modernizing Legacy Data Platforms to Lakehouse for AI-Readiness

As organizations increasingly migrate from legacy platforms (like on-prem SQL Server, Oracle Exadata, Teradata, Informatica, Cloudera, or Netezza) to modern cloud architectures, one critical question often arises: "Are we just lifting and shifting the s...

Latest Reply

sridharplv
Valued Contributor II

07-24-2025 10:05:56 AM

9 kudos

Great article, @hozefa413. It shows all your expertise and delivery excellence.

2 More Replies

by Traxccel • Databricks Partner

07-23-2025 6:34:25 AM

2352 Views
0 replies
0 kudos

Implementing data contracts on Databricks for industrial AI pipelines

Enforce schema consistency using declarative contracts on Databricks Lakehouse. Industrial AI is transforming how operations are optimized, from forecasting equipment failure to streamlining supply chains. But even the most advanced models are only as...

by gdschld • New Contributor

07-22-2025 9:15:43 AM

3075 Views
2 replies
3 kudos

Establishing Trust relationship for Databricks on AWS

Hello. Our Databricks is on Azure. We are trying to connect to AWS S3 as an external source from Unity Catalog. We have followed all the steps given here; is there anything additional required? https://docs.databricks.com/aws/en/connect/unity-catalog/clou...

Latest Reply

Pat
Esteemed Contributor

07-22-2025 1:00:41 PM

3 kudos

Hi @gdschld, what ID have you used here: "sts:ExternalId": "<STORAGE-CREDENTIAL-EXTERNAL-ID>"? I haven't done this for some time and got a bit confused by this STORAGE-CREDENTIAL-EXTERNAL-ID. I used to put the Databricks Account ID there. I found this, it ...

1 More Reply

by DouglasMoore • Databricks Employee

05-28-2024 11:58:36 AM

7671 Views
2 replies
1 kudos

How to enable unity catalog system tables?

Unity Catalog system tables provide lots of metadata & log data related to the operations of Databricks. System tables are organized into separate schemas containing one to a few tables owned and updated by Databricks. The storage and the cost of the...

Latest Reply

AlanD
Databricks MVP

07-21-2025 7:37:13 AM

1 kudos

It's in the Unity Catalog section of the Databricks CLI documentation (Databricks CLI commands | Databricks Documentation). metastores: Commands to manage metastores, which are the top-level container of objects in Unity Catalog: assign, create, current, delete, get, list, summ...

1 More Reply

by Pat • Esteemed Contributor

07-11-2025 12:47:21 AM

2346 Views
1 reply
3 kudos

Building DLT Pipelines with Databricks Free Edition and Amazon Q Developer

How AI-powered development accelerated my data engineering workflow. Watch the Complete Development Process. YouTube Video: See the entire 30-minute development session. This is a screen recording without voice narration showing the complete development ...

Latest Reply

Advika
Community Manager

07-21-2025 2:09:44 AM

3 kudos

This is super insightful @Pat, thanks for sharing this with the Community!

by smpa01 • Contributor

06-19-2025 7:43:02 AM

1623 Views
2 replies
0 kudos

Resolved! Databricks VS code extension to add cell title

I use the Databricks extension in VS Code for all my work. Is there any way for me to add a cell title from the extension itself? There is no point in adding it in the server version of this notebook, because when I sync the local copy to the server, it will overwr...

Latest Reply

smpa01
Contributor

07-18-2025 12:06:33 PM

0 kudos

One needs to use # DBTITLE 1,cell_title in a py file:

    # COMMAND ----------
    # DBTITLE 1,Title 1
    from pyspark.sql import SparkSession
    from delta.tables import DeltaTable
    from pyspark.sql.functions import *
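
The marker layout described in this reply can be sketched as a small generator. This is only an illustration, not part of the Databricks extension: build_notebook_source is a hypothetical helper, and the "# Databricks notebook source" first line is an assumption about how notebook .py source files usually begin. Only the "# COMMAND ----------" and "# DBTITLE 1,<title>" markers come from the reply itself.

```python
# Hypothetical helper: renders (title, code) pairs as notebook-style .py text.
NOTEBOOK_HEADER = "# Databricks notebook source"  # assumed first line of the file
CELL_SEPARATOR = "# COMMAND ----------"           # cell marker quoted in the reply


def build_notebook_source(cells):
    """Return notebook source text; a title of None leaves the cell untitled."""
    rendered = []
    for title, code in cells:
        lines = []
        if title:
            # Title marker in the form quoted in the reply.
            lines.append(f"# DBTITLE 1,{title}")
        lines.append(code.strip("\n"))
        rendered.append("\n".join(lines))
    # Cells are joined by the separator marker with blank lines around it.
    body = f"\n\n{CELL_SEPARATOR}\n\n".join(rendered)
    return f"{NOTEBOOK_HEADER}\n{body}\n"


source = build_notebook_source([
    ("Imports", "from pyspark.sql import SparkSession"),
    (None, "spark = SparkSession.builder.getOrCreate()"),
])
print(source)
```

Writing the returned text to a local .py file and syncing it should then carry the titles along with the code, assuming the workspace interprets the markers as the reply suggests.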

1 More Reply

by RiyazAliM • Honored Contributor

07-18-2025 5:39:45 AM

2323 Views
1 reply
4 kudos

The Databricks Python SDK

The Databricks SDK is a library (written in Python, in our case) which lets you control and automate actions on Databricks using the methods available in the WorkspaceClient (more about this below). Why do we need the Databricks SDK: - Automation: You can d...

Latest Reply

sridharplv
Valued Contributor II

07-18-2025 5:48:51 AM

4 kudos

Good article, @RiyazAliM.
