Community Articles

by Ajay-Pandey • Databricks MVP

06-23-2024 10:32:18 PM

1032 Views
1 replies
2 kudos

Accelerating discovery on Unity Catalog with a revamped Catalog Explore

Discover favorite and recent UC assets in Quick Access. You'll experience a simplified navigation with the gear icon (top left) for compute, storage, credentials, connections, DBFS, and managements features. Delta Sharing, Clean Rooms, and External D...

Community Articles

unitycatalog

Reply

1032 Views
1 replies
2 kudos

06-23-2024 10:32:18 PM

View Replies

Latest Reply

RishabhTiwari07
Databricks Employee

06-26-2024 1:42:24 AM

2 kudos

Thank you for sharing this update on the Unity Catalog! @Ajay-Pandey Appreciate the detailed overview!

2 kudos

06-26-2024 1:42:24 AM

by alysson_souza • Databricks Employee

06-10-2024 4:44:43 PM

7805 Views
1 replies
5 kudos

Configuring DNS resolution for Private Databricks Workspaces (AWS)

Intro For customers on the E2 Platform, Databricks has a feature that allows them to use AWS PrivateLink to provision secure private workspaces by creating VPC endpoints to both the front-end and back-end interfaces of the Databricks infrastructure. ...

AWS Inbound DNS Endpoints for Workspaces - Copy of Page 1.png

Community Articles

Reply

7805 Views
1 replies
5 kudos

06-10-2024 4:44:43 PM

View Replies

Latest Reply

Sujitha
Databricks Employee

06-25-2024 3:10:46 AM

5 kudos

@alysson_souza love it! Thank you for sharing

5 kudos

06-25-2024 3:10:46 AM

by SashankKotta • Databricks Employee

06-14-2024 5:17:05 AM

4036 Views
0 replies
2 kudos

CICD for databricks workflow jobs

This post is to set up Databricks workflow jobs as a CI/CD. Below are the two essential components needed for a complete CI/CD setup of workflow jobs. Databricks Asset Bundles(DABs)AzureDevOps pipeline. Databricks Asset Bundle ( From local terminal )...

Community Articles

Reply

4036 Views
0 replies
2 kudos

06-14-2024 5:17:05 AM

by RamkannanA • New Contributor II

06-12-2024 11:53:32 PM

3333 Views
1 replies
3 kudos

Resolved! RamK - Certification Update

Hi Team,My name is Ram based out of Singapore. I am new to this Community . Recently I have completed my certification in Databricks starting from Data Analyst , Data Engineering and Gen AI. Looking forward to get connected in serving the Data and AI...

Community Articles

Reply

3333 Views
1 replies
3 kudos

06-12-2024 11:53:32 PM

View Replies

Latest Reply

Rishabh-Pandey
Databricks MVP

06-13-2024 2:18:44 AM

3 kudos

Happy to see you here .

3 kudos

06-13-2024 2:18:44 AM

by sudhirgarg • New Contributor II

06-02-2024 12:09:11 AM

2395 Views
0 replies
1 kudos

Free Databricks Professional Data Engineer Practice Tests

Hi All,I came across a very good set of Practice tests on Databricks Professional Data Engineer Certification.For time being It is being given for free by instructor as promotional activity . Enroll if you are planning to go for the certificationhttp...

Community Articles

Reply

2395 Views
0 replies
1 kudos

06-02-2024 12:09:11 AM

by NandiniN • Databricks Employee

06-01-2024 11:04:10 PM

1734 Views
0 replies
1 kudos

How to deal with Slow Jobs?

Definitely configure job timeouts, and configure notifications. This will help you to identify slowness due to various factors. It is crucial to also investigate and fix the issue causing the slowness. The first step is to identify the problem. This ...

Community Articles

Reply

1734 Views
0 replies
1 kudos

06-01-2024 11:04:10 PM

by NandiniN • Databricks Employee

06-01-2024 2:46:31 PM

1653 Views
0 replies
0 kudos

Monitoring a Streaming Job

If you have a streaming job, you need to check the batch metrics to be able to understand the stream progress. However, here are some other suggestions which we can use to monitor a streaming job and be stuck in a "hung" state. Streaming Listeners sp...

Community Articles

Reply

1653 Views
0 replies
0 kudos

06-01-2024 2:46:31 PM

by NandiniN • Databricks Employee

06-01-2024 2:36:13 PM

1775 Views
0 replies
0 kudos

Why configure a job timeout?

If you use Databricks Jobs for your workloads, it is possible you might have run into a situation where you find your jobs to be in "hung" state. Before cancelling the job it is important to collect the thread dump as I described here to be able to f...

Community Articles

Reply

1775 Views
0 replies
0 kudos

06-01-2024 2:36:13 PM

by MichTalebzadeh • Valued Contributor

05-21-2024 9:02:32 AM

2129 Views
1 replies
0 kudos

A handy tool called spark-column-analyser

I just wanted to share a tool I built called spark-column-analyzer. It's a Python package that helps you dig into your Spark DataFrames with ease.Ever spend ages figuring out what's going on in your columns? Like, how many null values are there, or h...

Community Articles

Generative AI

python

spark

Reply

2129 Views
1 replies
0 kudos

05-21-2024 9:02:32 AM

View Replies

Latest Reply

MichTalebzadeh
Valued Contributor

05-21-2024 10:05:32 AM

0 kudos

An example added to README in GitHubDoing analysis for column PostcodeJson formatted output{"Postcode": {"exists": true,"num_rows": 93348,"data_type": "string","null_count": 21921,"null_percentage": 23.48,"distinct_count": 38726,"distinct_percentage"...

0 kudos

05-21-2024 10:05:32 AM

by youssefmrini • Databricks Employee

05-21-2024 7:15:58 AM

1500 Views
0 replies
2 kudos

Schema evolution clause added to SQL merge syntax

You can now add the WITH SCHEMA EVOLUTION clause to a SQL merge statement to enable schema evolution for the operation. For more information: https://docs.databricks.com/en/delta/update-schema.html#sql-evo #Databricks

Community Articles

Reply

1500 Views
0 replies
2 kudos

05-21-2024 7:15:58 AM

by Hubert-Dudek • Esteemed Contributor III

05-21-2024 7:14:54 AM

1427 Views
0 replies
2 kudos

VariantType + Parse_json()

In Spark 4.0, there are no more data type mismatches when converting dynamic JSONs, as the new data type VariantType comes with a new function to parse JSONs. Stay tuned for 4.0 release.

Community Articles

Reply

1427 Views
0 replies
2 kudos

05-21-2024 7:14:54 AM

by youssefmrini • Databricks Employee

05-21-2024 6:06:03 AM

2146 Views
0 replies
1 kudos

Type widening is in Public Preview

You can now enable type widening on tables backed by Delta Lake. Tables with type widening enabled allow changing the type of columns to a wider data type without rewriting underlying data files. For more information:https://docs.databricks.co...

Community Articles

Reply

2146 Views
0 replies
1 kudos

05-21-2024 6:06:03 AM

by Yassine_bens • New Contributor

05-08-2024 3:24:48 AM

1751 Views
1 replies
0 kudos

How to convert txt files to delta tables

Hello members of Databricks's comunity,I am currently working on a project where we collect data from machines, that data is in .txt format. The data is currently in an Azure container, I need to clean the files and convert them to delta tables, how ...

Community Articles

Reply

1751 Views
1 replies
0 kudos

05-08-2024 3:24:48 AM

View Replies

Latest Reply

feiyun0112
Honored Contributor

05-08-2024 6:37:09 PM

0 kudos

https://docs.databricks.com/en/ingestion/add-data/upload-data.html

0 kudos

05-08-2024 6:37:09 PM

by Hubert-Dudek • Esteemed Contributor III

05-08-2024 8:06:23 AM

855 Views
0 replies
0 kudos

RocksDB for storing state stream

Now, you can keep the state of stateful streaming in RocksDB. For example, retrieving keys from memory to check for duplicate records inside the watermark is now faster. #databricks

Community Articles

Reply

855 Views
0 replies
0 kudos

05-08-2024 8:06:23 AM

by Hubert-Dudek • Esteemed Contributor III

05-02-2024 4:06:44 AM

916 Views
0 replies
1 kudos

State of stateful streaming

For stateful streaming in #databricks, you can now easily read what is in the state.

Community Articles

Reply

916 Views
0 replies
1 kudos

05-02-2024 4:06:44 AM

Databricks Community

Forum Posts

Accelerating discovery on Unity Catalog with a revamped Catalog Explore

Configuring DNS resolution for Private Databricks Workspaces (AWS)

CICD for databricks workflow jobs

Resolved! RamK - Certification Update

Free Databricks Professional Data Engineer Practice Tests

How to deal with Slow Jobs?

Monitoring a Streaming Job

Why configure a job timeout?

A handy tool called spark-column-analyser

Schema evolution clause added to SQL merge syntax

VariantType + Parse_json()

Type widening is in Public Preview

How to convert txt files to delta tables

RocksDB for storing state stream

State of stateful streaming

Join Us as a Local Community Builder!

My First Month Learning Databricks - Key Takeaways...

Unity Catalog Migration Strategy

🚀 Boost Databricks Performance ✅ Lazy Evaluation ...

🚀 DataFrame Caching on Delta Tables - What if und...

Data Quality with PySpark and Great Expectations o...