Knowledge Sharing Hub

by SumitSingh • Contributor

07-19-2024 8:25:47 AM

3397 Views
7 replies
9 kudos

From Associate to Professional: My Learning Plan to ace all Databricks Data Engineer Certifications

In today’s data-driven world, the role of a data engineer is critical in designing and maintaining the infrastructure that allows for the efficient collection, storage, and analysis of large volumes of data. Databricks certifications hold significan...

Latest Reply

sandeepmankikar
New Contributor III

03-12-2025 8:32:21 PM

9 kudos

As an additional tip for those working towards both the Associate and Professional certifications, I recommend avoiding a long gap between the two exams to maintain your momentum. If possible, try to schedule them back-to-back with just a few days in...

6 More Replies

by MichTalebzadeh • Valued Contributor

03-22-2024 9:45:27 AM

1481 Views
0 replies
0 kudos

Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering

I created this article on LinkedIn so that both this community and the Apache Spark user community can access it. It is particularly useful for data engineers who want a basic understanding of what Generative AI with Spark can do. Leverag...

Generative AI

spark

by Hubert-Dudek • Esteemed Contributor III

03-14-2024 5:23:06 AM

3536 Views
1 replies
3 kudos

DBR 15.0 beta

Databricks Runtime 15 is out! There are some breaking changes. More info here: https://docs.databricks.com/en/release-notes/runtime/15.0.html

Latest Reply

jose_gonzalez
Databricks Employee

03-20-2024 2:18:34 PM

3 kudos

Thanks for sharing this information @Hubert-Dudek!!!

by Hubert-Dudek • Esteemed Contributor III

03-16-2024 11:24:41 AM

2826 Views
1 replies
1 kudos

Notebook IDE

This is an excellent step for #databricks notebooks. An integrated debugger and a CLI in the notebook terminal are a big step towards a fully functional cloud IDE.

Latest Reply

jose_gonzalez
Databricks Employee

03-20-2024 2:17:35 PM

1 kudos

Thank you for sharing this @Hubert-Dudek!!!

by MichTalebzadeh • Valued Contributor

03-19-2024 4:40:39 AM

4745 Views
2 replies
0 kudos

Build a machine learning model to detect fraudulent transactions using PySpark's MLlib library

Introduction: Financial fraud is a significant concern for businesses and consumers alike. I have written about this concern a few times in LinkedIn articles. Machine learning offers powerful tools to combat this issue by automatically identifying sus...

Financial Fraud

PySpark MLlib

spark

Latest Reply

deborah621
New Contributor II

03-20-2024 2:42:27 AM

0 kudos

Looking to build a machine learning model for detecting fraudulent transactions using PySpark’s MLlib. Generate synthetic transaction data. Provides a dataset for model training without using sensitive real-world data. Enables the creation of diverse...
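
The reply is cut off here, so the following is only a rough sketch of what generating synthetic transaction data could look like, not the method used in the article. It uses plain Python so it runs anywhere. The field names, the 5% fraud rate, and the amount distributions are all illustrative assumptions.

```python
import random
from dataclasses import dataclass


@dataclass
class Transaction:
    transaction_id: int
    amount: float
    hour_of_day: int
    is_fraud: int  # 1 = fraudulent, 0 = legitimate


def generate_transactions(n, fraud_rate=0.05, seed=42):
    """Return n synthetic transactions (hypothetical schema and distributions)."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    rows = []
    for i in range(n):
        is_fraud = 1 if rng.random() < fraud_rate else 0
        # Assumption: fraudulent transactions skew toward larger amounts.
        mean_amount = 900.0 if is_fraud else 60.0
        amount = round(rng.expovariate(1.0 / mean_amount), 2)
        rows.append(Transaction(i, amount, rng.randrange(24), is_fraud))
    return rows


transactions = generate_transactions(1000)
fraud_count = sum(t.is_fraud for t in transactions)
```

Rows like these contain no real customer data and could then be loaded into a Spark DataFrame to train and evaluate an MLlib classifier.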

1 More Replies

by alexgv12 • New Contributor III

03-13-2024 9:33:41 AM

1548 Views
1 replies
2 kudos

is it possible to have a class level separation in databricks or implement a design pattern in datab

If you have thought about making your code inside Databricks notebooks more reusable and organized by implementing a design pattern or class-level separation, the answer is yes. I am going to tell you the deta...
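
The post's own approach is truncated above, so this is only a generic sketch of class-level separation, assuming each piece of logic lives in its own class that a notebook could import from a shared module. The class names (`Step`, `DropNulls`, `AddTax`, `Pipeline`) are hypothetical, and plain Python dictionaries stand in for DataFrame rows so the example runs without a cluster.

```python
from abc import ABC, abstractmethod


class Step(ABC):
    """One unit of pipeline logic, kept in its own class."""

    @abstractmethod
    def apply(self, rows):
        """Take a list of row dicts and return a new list of row dicts."""


class DropNulls(Step):
    """Remove rows whose given column is missing or None."""

    def __init__(self, column):
        self.column = column

    def apply(self, rows):
        return [r for r in rows if r.get(self.column) is not None]


class AddTax(Step):
    """Add a 'total' field equal to amount plus tax at the given rate."""

    def __init__(self, rate):
        self.rate = rate

    def apply(self, rows):
        return [{**r, "total": round(r["amount"] * (1 + self.rate), 2)} for r in rows]


class Pipeline:
    """Run the configured steps in order."""

    def __init__(self, steps):
        self.steps = list(steps)

    def run(self, rows):
        for step in self.steps:
            rows = step.apply(rows)
        return rows


pipeline = Pipeline([DropNulls("amount"), AddTax(0.2)])
result = pipeline.run([{"amount": 10.0}, {"amount": None}])
```

With this layout a notebook only assembles and runs a `Pipeline`, while each step can be unit-tested and reused on its own.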

Latest Reply

-werners-
Esteemed Contributor III

03-20-2024 1:14:43 AM

2 kudos

Thanks! I have spent quite some time figuring out what the best way is. Your approach is certainly a valid one. I myself prefer to package reused classes in a JAR (we mainly code in Scala). That works fine too.

by MichTalebzadeh • Valued Contributor

03-06-2024 1:09:33 PM

4788 Views
1 replies
1 kudos

Building Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration

I recently saw an article from Databricks titled "Scalable Spark Structured Streaming for REST API Destinations". It is a great article, about a year old, focusing on continuous Spark Structured Streaming (SSS). I then decided, given customer demands to wo...

Event-driven architecture

Flask

spark

Spark Structure Streaming

Spark Structured Streaming

by Hubert-Dudek • Esteemed Contributor III

02-22-2024 9:07:20 AM

1749 Views
0 replies
0 kudos

stored procedures

The plan for stored procedures in Databricks Spark has been announced in a few places. What might stored procedures look like in Spark SQL?

spark

by hanlinsun • Databricks Employee

02-07-2024 5:06:10 PM

584 Views
0 replies
0 kudos

Redesigned Move File & Clone File Experiences

Hi everyone! We are redesigning the Move File and Clone File experiences. We want to make it as seamless as possible to organize your files, and would love your feedback on the designs! Move File: Move Option 1, Move Option 2. Clone File: Cl...

by Hubert-Dudek • Esteemed Contributor III

07-20-2023 6:41:29 AM

1723 Views
1 replies
2 kudos

liquid partitioning

In my experience, data partitioning often diminishes performance rather than enhancing it. There are exceptions, such as tables over 1 TB, or cases where EVERY single query filters on the partition column in the WHERE clause - for instance, a Pow...

optimize

Partitions

Latest Reply

jose_gonzalez
Databricks Employee

01-10-2024 11:59:28 AM

2 kudos

Thank you for sharing this @Hubert-Dudek !!

by Taha_Hussain • Databricks Employee

11-22-2023 1:58:00 PM

2576 Views
0 replies
2 kudos

Your updated resource guide to the Databricks Data Intelligence Platform

Want to increase your Databricks knowledge? Look no further! Here’s a guide filled with key resources you’ll need while working on the Databricks Data Intelligence Platform. Bookmark these pages for future reference, or apply these learnings during the...

by Taha_Hussain • Databricks Employee

10-20-2023 11:22:38 AM

3055 Views
0 replies
0 kudos

✨New in Notebooks: AI-powered Databricks Assistant, improved visualizations, web terminal and more!

We are excited to share some of the latest updates in Databricks Notebooks. From AI-powered Databricks Assistant that automates code development to new charts with better performance, these features help you build faster. See the latest features live...

by Sujitha • Databricks Employee

09-06-2023 11:50:47 PM

10545 Views
1 replies
1 kudos

Simplify complex workflows with modular jobs

Thousands of Databricks customers use Databricks Workflows every day to orchestrate business-critical workloads on the Databricks Lakehouse Platform. A great way to simplify those critical workloads is through modular orchestration. This is now possi...

jobs

Modular Orchestration

run jobs

Workflows

Latest Reply

UiliamVenerio
New Contributor II

10-18-2023 7:31:53 AM

1 kudos

Hello, is the "if/else condition" task type available for testing?

by Sujitha • Databricks Employee

09-06-2023 11:43:44 PM

8301 Views
0 replies
0 kudos

Set up AI-driven optimizations in Databricks SQL

With Predictive I/O for reads (GA) and updates (Public Preview), Databricks SQL can now analyze historical read and write patterns to intelligently build indexes and optimize DELETE, MERGE, and UPDATE operations. What is Predictive I/O? Predictive I/...

Databricks SQL

DELETE

MERGE

Predictive IO

Update

by Sujitha • Databricks Employee

09-06-2023 11:40:21 PM

9866 Views
0 replies
0 kudos

Share views and schemas with your team

Delta Sharing is a great way to securely share data across different Unity Catalog metastores in your own Databricks account. This now includes using Delta Sharing to share views and schemas directly with other Databricks recipients from within the D...

DATA SHARING

DELTA SHARING

SCHEMAS

views

by Krause • New Contributor

06-29-2023 3:34:59 PM

1520 Views
0 replies
0 kudos

Power BI with Databricks

Connecting Power BI to Databricks is very easy. There's an extension you can use within Power BI that lets you pull in data and create charts based on your Databricks data.

Summit23
