Databricks Advent Calendar 2025 #20
As Unity Catalog becomes an enterprise catalog, bring-your-own lineage is one of my favorite features.
In 2025, Metrics Views are becoming the standard way to define business logic once and reuse it everywhere. Instead of repeating complex SQL, teams can work with clean, consistent metrics.
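A hedged sketch of what "define once, reuse everywhere" can look like as a metric view. All object names below are placeholders, and the `WITH METRICS LANGUAGE YAML` DDL shape and YAML fields (source, dimensions, measures) should be checked against the current metric view reference for your runtime:

```python
# Hedged sketch: defining business logic once as a metric view.
# Assumes a running Spark session on Databricks; all names are placeholders.
spark.sql("""
CREATE VIEW main.gold.sales_metrics
WITH METRICS
LANGUAGE YAML
AS $$
version: 0.1
source: main.gold.orders
dimensions:
  - name: order_date
    expr: order_date
measures:
  - name: total_revenue
    expr: SUM(amount)
$$
""")

# Consumers then reference the measure instead of repeating the SQL:
spark.sql("""
    SELECT order_date, MEASURE(total_revenue)
    FROM main.gold.sales_metrics
    GROUP BY order_date
""").show()
```

The payoff is that `total_revenue` is defined exactly once; every dashboard and notebook that queries the view gets the same aggregation logic.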
Automatic file retention in Auto Loader is one of my favourite new features of 2025. It can automatically move processed cloud files to cold storage or simply delete them.
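A minimal configuration sketch of what this can look like with the `cloudFiles.cleanSource` option family. Paths, table names, and the retention window are placeholder assumptions; verify the exact option names against the Auto Loader documentation for your runtime:

```python
# Hedged sketch: Auto Loader with automatic source-file retention.
# Assumes a running Spark session on Databricks; paths are placeholders.
(spark.readStream
    .format("cloudFiles")
    .option("cloudFiles.format", "json")
    # Archive processed files instead of leaving them in the landing zone
    # ("DELETE" removes them outright instead of moving them).
    .option("cloudFiles.cleanSource", "MOVE")
    .option("cloudFiles.cleanSource.moveDestination", "s3://bucket/archive/")
    # Files are only cleaned up after this retention period has elapsed.
    .option("cloudFiles.cleanSource.retentionDuration", "30 days")
    .load("s3://bucket/landing/")
    .writeStream
    .option("checkpointLocation", "s3://bucket/_chk/events/")
    .toTable("main.bronze.events"))
```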
Thanks for sharing @Hubert-Dudek ! That's a really great feature. It simplified a lot of the data maintenance process at one of my clients.
I teach Databricks to all sorts of folks: coders, managers, everyone. What's wild is how two companies with the same setup can have totally different experiences. Usually, it's not the tech itself that's the issue, but how people see it. Databricks tr...
As someone who benefited from Louis' training, I can attest that it makes a difference to constantly keep up to date and work on a foundation of understanding. Especially when things move as fast as they do in our industry, the time to reflect and improve pa...
AI/BI dashboards now support cross-filtering, which lets you click an element in one chart to filter and update related data in other charts. Cross-filtering allows users to interactively explore relationships and patterns across multiple visu...
There now appears to be a list of capsules along the top of Databricks AI/BI dashboards indicating the filters applied. The capsules include both filter selectors and cross-filters added by clicking charts. Also, there is now a "Reset t...
Replacing records for the entire date with newly arriving data for the given date is a typical design pattern. Now, thanks to simple REPLACE USING in Databricks, it is easier than ever!
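The semantics are easy to picture: for every date present in the incoming batch, drop all existing rows for that date, then append the batch. A pure-Python sketch of that replacement logic (plain dicts stand in for Delta rows here; this is not the actual engine implementation):

```python
def replace_using(table: list[dict], batch: list[dict], key: str = "date") -> list[dict]:
    """Simulate REPLACE USING semantics: any row whose `key` value appears
    in the incoming batch is replaced wholesale by the batch's rows."""
    incoming_keys = {row[key] for row in batch}
    # Keep only rows whose key is NOT being replaced by the new batch.
    kept = [row for row in table if row[key] not in incoming_keys]
    return kept + batch

table = [
    {"date": "2025-12-19", "sales": 10},
    {"date": "2025-12-20", "sales": 99},  # stale rows for the 20th
    {"date": "2025-12-20", "sales": 1},
]
batch = [{"date": "2025-12-20", "sales": 42}]
print(replace_using(table, batch))
# → [{'date': '2025-12-19', 'sales': 10}, {'date': '2025-12-20', 'sales': 42}]
```

Note how 2025-12-19 is untouched: only dates present in the batch are rewritten, which is exactly why this pattern is safe for late-arriving daily reloads.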
Real-time mode is a breakthrough that lets Spark utilize all available CPUs to process records with single-millisecond latency, while decoupling checkpointing from per-record processing.
For many data engineers who love PySpark, the most significant improvement of 2025 was the addition of merge to the DataFrame API, so the Delta library or raw SQL is no longer needed to perform MERGE. P.S. I still prefer SQL MERGE inside spark.sql()
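Whichever API you use, a MERGE with matched-update and not-matched-insert clauses boils down to an upsert on the join key. A pure-Python sketch of those semantics (dicts keyed by the merge condition stand in for Delta tables; the DataFrame API call itself is not shown):

```python
def merge_upsert(target: dict, source: dict) -> dict:
    """Simulate MERGE semantics:
    WHEN MATCHED THEN UPDATE, WHEN NOT MATCHED THEN INSERT.
    Dict keys play the role of the merge condition; values are row payloads."""
    merged = dict(target)   # start from the existing target rows
    merged.update(source)   # matched keys are updated, new keys are inserted
    return merged

target = {1: {"name": "a"}, 2: {"name": "b"}}
source = {2: {"name": "B"}, 3: {"name": "c"}}
print(merge_upsert(target, source))
# → {1: {'name': 'a'}, 2: {'name': 'B'}, 3: {'name': 'c'}}
```

Key 2 was matched and updated, key 3 was inserted, and key 1 was left alone, which is the whole contract of an upsert.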
The new Lakebase experience is a game-changer for transactional databases. That functionality is fantastic. Autoscaling to zero makes it really cost-effective. Do you need to deploy to prod? Just branch the production database to the release branch, an...
I've been working with Unity Catalog's lineage capabilities for a while now, and I have to say—this is what lineage should have always been. Not a separate tool to configure. Not a manual process to maintain. Just automatic, real-time visibility into...
I have been using and implementing UC in various workspaces across industries, and BYOL is the one feature I am really looking forward to implementing next. Thanks @AbhaySingh for consolidating it here.
Ingestion from SharePoint is now available directly in PySpark. Just define a connection and use spark.read or, even better, spark.readStream with Auto Loader. Just specify the file type and options for that file (PDF, CSV, Excel, etc.)
Excel: The big news this week is native importing of Excel files. Write operations are also possible, and you can choose a data range. It also works with the streaming Auto Loader, currently in beta.
GPT 5.2: The same day...
ZeroBus changes the game: you can now push event data directly into Databricks, even from on-prem. No extra event layer needed. Every Unity Catalog table can act as an endpoint.
All leading LLMs are available natively in Databricks:
- ChatGPT 5.2 from the day of the premiere!
- The system catalog's AI schema in Unity Catalog has multiple LLMs ready to serve!
- OpenAI, Gemini, and Anthropic are available side by side!
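A hedged sketch of calling one of those served models straight from SQL via the `ai_query()` function. The endpoint name below is a placeholder assumption; check what is actually available in your workspace (for example under the system.ai schema or the Serving UI), and note that table and column names here are invented for illustration:

```python
# Hedged sketch: invoking a served foundation model from SQL with ai_query().
# Assumes a running Spark session on Databricks; endpoint, table, and column
# names are placeholders, not real objects.
result = spark.sql("""
    SELECT ai_query(
        'databricks-claude-sonnet',        -- placeholder endpoint name
        'Summarize in one sentence: ' || review_text
    ) AS summary
    FROM main.bronze.reviews
    LIMIT 5
""")
result.show(truncate=False)
```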
Databricks goes native on Excel. You can now ingest and query .xls/.xlsx files directly in Databricks (SQL and PySpark, batch and streaming), with automatic schema/type inference, sheet and cell-range targeting, and evaluated formulas. No extra libraries needed anymore.