Knowledge Sharing Hub
Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practices. Join the conversation today and unlock a wealth of collective wisdom to enhance your experience and drive success.

Forum Posts

avrm91
by New Contributor III
  • 916 Views
  • 0 replies
  • 0 kudos

GCP - Initial External Location to GCP Bucket is wrong

When creating a new workspace in GCP, the default GCP External Location is wrong. It's easily fixed via Catalog (on the left) > External Data (on the bottom) > External Locations > choose the connection and edit the URL by deleting the second BucketId af...

Emil_Kaminski
by Contributor
  • 2856 Views
  • 3 replies
  • 2 kudos

Materials to pass Databricks Data Engineering Associate Exam

Hi guys, I passed it some time ago, but I have only recently summarized all the materials that helped me do it. Pay special attention to the GitHub repository, which contains many great exercises prepared by the Databricks team. https://youtu.be...

Latest Reply
PurtiSharma
New Contributor II
  • 2 kudos

Thanks for sharing. It is indeed very useful.

2 More Replies
MichTalebzadeh
by Contributor III
  • 945 Views
  • 0 replies
  • 0 kudos

Feature article: Leveraging Generative AI with Apache Spark: Transforming Data Engineering

I created this article on LinkedIn so that both this community and the Apache Spark user community can access it. It is particularly useful for data engineers who want a basic understanding of what Generative AI with Spark can do. Leverag...

Knowledge Sharing Hub
Generative AI
spark
Hubert-Dudek
by Esteemed Contributor III
  • 2465 Views
  • 1 reply
  • 3 kudos

DBR 15.0 beta

Databricks Runtime 15 is out there! Some breaking changes. More info here: https://docs.databricks.com/en/release-notes/runtime/15.0.html

Latest Reply
jose_gonzalez
Moderator
  • 3 kudos

Thanks for sharing this information @Hubert-Dudek!!!

Hubert-Dudek
by Esteemed Contributor III
  • 2087 Views
  • 1 reply
  • 1 kudos

Notebook IDE

This is an excellent step for #databricks notebooks. An integrated debugger and a CLI in the notebook terminal are a big step towards a fully functional cloud IDE.

Latest Reply
jose_gonzalez
Moderator
  • 1 kudos

Thank you for sharing this @Hubert-Dudek!!!

MichTalebzadeh
by Contributor III
  • 2098 Views
  • 2 replies
  • 0 kudos

Build a machine learning model to detect fraudulent transactions using PySpark's MLlib library

Introduction: Financial fraud is a significant concern for businesses and consumers alike. I have written about this concern a few times in LinkedIn articles. Machine learning offers powerful tools to combat this issue by automatically identifying sus...

Knowledge Sharing Hub
Financial Fraud
PySpark MLlib
spark
Latest Reply
deborah621
New Contributor II
  • 0 kudos

Looking to build a machine learning model for detecting fraudulent transactions using PySpark’s MLlib. Generate synthetic transaction data. Provides a dataset for model training without using sensitive real-world data. Enables the creation of diverse...
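
The reply above mentions generating synthetic transaction data so that a model can be trained without sensitive real-world records. A minimal sketch of that idea in plain Python follows. It is not the article's code: the `Transaction` fields, the `generate_transactions` helper, the amount ranges, and the rule that fraud skews towards large amounts at night are all assumptions made up for illustration.

```python
import random
from dataclasses import dataclass


@dataclass
class Transaction:
    transaction_id: int
    amount: float
    hour_of_day: int
    is_fraud: int  # 1 = fraudulent, 0 = legitimate


def generate_transactions(n, fraud_rate=0.05, seed=42):
    """Return n synthetic transactions, each labelled fraud with probability fraud_rate."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    rows = []
    for i in range(n):
        is_fraud = 1 if rng.random() < fraud_rate else 0
        if is_fraud:
            # Hypothetical pattern: fraud is larger and happens at night.
            amount = round(rng.uniform(500.0, 5000.0), 2)
            hour = rng.choice([0, 1, 2, 3, 4, 23])
        else:
            amount = round(rng.uniform(1.0, 500.0), 2)
            hour = rng.randint(6, 22)
        rows.append(Transaction(i, amount, hour, is_fraud))
    return rows
```

Rows like these could then be converted with `dataclasses.asdict` and loaded into a DataFrame for training; a real dataset would need far less separable patterns than the ones assumed here.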

1 More Replies
alexgv12
by New Contributor III
  • 958 Views
  • 1 reply
  • 2 kudos

Is it possible to have a class-level separation in Databricks or implement a design pattern in datab

If you have thought about making your code inside Databricks and notebooks more reusable and organized, and about implementing a design pattern or class-level separation in Databricks, the answer is yes. I am going to tell you the deta...
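
The post's own details are cut off above, so the following is only a generic sketch of what class-level separation can look like, not the author's approach. Every name in it (`Step`, `DropNulls`, `AddTax`, `Pipeline`) is hypothetical. Each class owns one responsibility, and a small pipeline class composes them, so the classes could live in a shared module and be imported by several notebooks.

```python
from abc import ABC, abstractmethod


class Step(ABC):
    """One unit of pipeline logic with a single responsibility."""

    @abstractmethod
    def run(self, rows):
        """Take a list of dict rows and return a new list of dict rows."""


class DropNulls(Step):
    """Remove rows whose given column is missing or None."""

    def __init__(self, column):
        self.column = column

    def run(self, rows):
        return [r for r in rows if r.get(self.column) is not None]


class AddTax(Step):
    """Add a 'total' column computed from 'amount' and a tax rate."""

    def __init__(self, rate):
        self.rate = rate

    def run(self, rows):
        return [{**r, "total": round(r["amount"] * (1 + self.rate), 2)} for r in rows]


class Pipeline:
    """Run the configured steps in order, feeding each output to the next step."""

    def __init__(self, steps):
        self.steps = list(steps)

    def run(self, rows):
        for step in self.steps:
            rows = step.run(rows)
        return rows
```

A notebook would then only assemble and run the pipeline, for example `Pipeline([DropNulls("amount"), AddTax(0.2)]).run(rows)`, while the logic itself stays in testable classes.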

Latest Reply
-werners-
Esteemed Contributor III
  • 2 kudos

Thanks! I have spent quite some time figuring out what the best way is. Your approach is certainly a valid one. Myself, I prefer to package reused classes in a JAR (we mainly code in Scala). Works fine too.

MichTalebzadeh
by Contributor III
  • 3257 Views
  • 3 replies
  • 2 kudos

Resolved! Building Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration

 I recently saw an article from Databricks titled "Scalable Spark Structured Streaming for REST API Destinations". A great article focusing on continuous Spark Structured Streaming (SSS). About a year old. I then decided, given customer demands to wo...

Knowledge Sharing Hub
Event-driven architecture
Flask
spark
Spark Structure Streaming
Spark Structured Streaming
Latest Reply
Kaniz_Fatma
Community Manager
  • 2 kudos

Hi @MichTalebzadeh, Thank you for sharing your insightful article, "Building an Event-Driven Real-Time Data Processor with Spark Structured Streaming and API Integration." Your approach to integrating Apache Spark™, REST API, and Flask to create a ro...

2 More Replies
Hubert-Dudek
by Esteemed Contributor III
  • 946 Views
  • 0 replies
  • 0 kudos

stored procedures

The plan for stored procedures in Databricks Spark has been announced in a few places. What could stored procedures look like in Spark SQL?

hanlinsun
by New Contributor
  • 333 Views
  • 0 replies
  • 0 kudos

Redesigned Move File & Clone File Experiences

Hi everyone! We are redesigning the Move File and Clone File experiences. We want to make it as seamless as possible to organize your files, and would love your feedback on the designs! Move File: Move Option 1 Move Option 2: Clone File: Cl...

Hubert-Dudek
by Esteemed Contributor III
  • 1081 Views
  • 1 reply
  • 2 kudos

liquid partitioning

In my experience, data partitioning often diminishes performance rather than enhancing it. There are exceptions, like when handling tables over 1TB, or when EVERY single query uses the partition column in the WHERE clause - for instance, a Pow...

Knowledge Sharing Hub
optimize
Partitions
Latest Reply
jose_gonzalez
Moderator
  • 2 kudos

Thank you for sharing this @Hubert-Dudek !!

Taha_Hussain
by Valued Contributor II
  • 2091 Views
  • 0 replies
  • 2 kudos

Your updated resource guide to the Databricks Data Intelligence Platform

Want to increase your Databricks knowledge? Look no further! Here’s a guide filled with key resources you’ll need while working on the Databricks Data Intelligence Platform. Bookmark these pages for future reference, or apply these learnings during the...

Taha_Hussain
by Valued Contributor II
  • 2465 Views
  • 0 replies
  • 0 kudos

✨New in Notebooks: AI-powered Databricks Assistant, improved visualizations, web terminal and more!

We are excited to share some of the latest updates in Databricks Notebooks. From the AI-powered Databricks Assistant that automates code development to new charts with better performance, these features help you build faster. See the latest features live...

Sujitha
by Community Manager
  • 4846 Views
  • 1 reply
  • 1 kudos

Simplify complex workflows with modular jobs

Thousands of Databricks customers use Databricks Workflows every day to orchestrate business-critical workloads on the Databricks Lakehouse Platform. A great way to simplify those critical workloads is through modular orchestration. This is now possi...
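
To make modular orchestration concrete, here is a small sketch that builds a parent job definition in which each task triggers an existing job. It assumes the feature the post refers to is the Run Job task type, expressed in a Jobs API payload as a `run_job_task` entry with a `job_id`. The helper name `build_parent_job`, the task keys, and the job IDs in the usage line are hypothetical, and the sketch only builds the payload without sending it anywhere.

```python
import json


def build_parent_job(name, child_job_ids):
    """Build a job payload whose tasks each trigger an existing job, one after another."""
    tasks = []
    previous_key = None
    for index, job_id in enumerate(child_job_ids):
        key = f"run_child_{index}"  # hypothetical task-key naming scheme
        task = {"task_key": key, "run_job_task": {"job_id": job_id}}
        if previous_key is not None:
            # Chain the tasks so each child job starts after the previous one finishes.
            task["depends_on"] = [{"task_key": previous_key}]
        tasks.append(task)
        previous_key = key
    return {"name": name, "tasks": tasks}


if __name__ == "__main__":
    # 111 and 222 are placeholder job IDs, not real jobs.
    print(json.dumps(build_parent_job("nightly_parent", [111, 222]), indent=2))
```

Keeping each child as its own job means it can still be run and monitored independently, while the parent only expresses ordering.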

Knowledge Sharing Hub
jobs
Modular Orchestration
run jobs
Workflows
Latest Reply
UiliamVenerio
New Contributor II
  • 1 kudos

Hello, is the "if/else condition" task type available for testing?

Sujitha
by Community Manager
  • 2757 Views
  • 0 replies
  • 0 kudos

Set up AI-driven optimizations in Databricks SQL

With Predictive I/O for reads (GA) and updates (Public Preview), Databricks SQL can now analyze historical read and write patterns to intelligently build indexes and optimize DELETE, MERGE, and UPDATE operations. What is Predictive I/O? Predictive I/...

Knowledge Sharing Hub
Databricks SQL
DELETE
MERGE
Predictive IO
Update