cancel
Showing results for 
Search instead for 
Did you mean: 
Knowledge Sharing Hub
Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practices. Join the conversation today and unlock a wealth of collective wisdom to enhance your experience and drive success.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

SumitSingh
by Contributor
  • 1824 Views
  • 5 replies
  • 7 kudos

From Associate to Professional: My Learning Plan to ace all Databricks Data Engineer Certifications

In today’s data-driven world, the role of a data engineer is critical in designing and maintaining the infrastructure that allows for the efficient collection, storage, and analysis of large volumes of data. Databricks certifications holds significan...

SumitSingh_0-1721402402230.png SumitSingh_1-1721402448677.png SumitSingh_2-1721402469214.png
  • 1824 Views
  • 5 replies
  • 7 kudos
Latest Reply
SumitSingh
Contributor
  • 7 kudos

@ms_ccg You are correct. I got that error too. Seems like Databricks has removed some of these. I would suggest you to search for those separately via Databricks Academy or external resources. Let me know if you need any help.

  • 7 kudos
4 More Replies
Sourav-Kundu
by Contributor
  • 356 Views
  • 1 replies
  • 2 kudos

Use Retrieval-augmented generation (RAG) to boost performance of LLM applications

Retrieval-augmented generation (RAG) is a method that boosts the performance of large language model (LLM) applications by utilizing tailored data.It achieves this by fetching pertinent data or documents related to a specific query or task and presen...

  • 356 Views
  • 1 replies
  • 2 kudos
Latest Reply
Advika
Databricks Employee
  • 2 kudos

Thanks for sharing such valuable insight, @Sourav-Kundu . Your breakdown of how RAG enhances LLMs is spot on- clear and concise!

  • 2 kudos
Sourav-Kundu
by Contributor
  • 794 Views
  • 1 replies
  • 2 kudos

You can use Low Shuffle Merge to optimize the Merge process in Delta lake

Low Shuffle Merge in Databricks is a feature that optimizes the way data is merged when using Delta Lake, reducing the amount of data shuffled between nodes.- Traditional merges can involve heavy data shuffling, as data is redistributed across the cl...

  • 794 Views
  • 1 replies
  • 2 kudos
Latest Reply
Advika
Databricks Employee
  • 2 kudos

Great post, @Sourav-Kundu. The benefits you've outlined, especially regarding faster execution and cost efficiency, are valuable for anyone working with large-scale data processing. Thanks for sharing!

  • 2 kudos
Sourav-Kundu
by Contributor
  • 421 Views
  • 0 replies
  • 1 kudos

Databricks Asset Bundles package and deploy resources like notebooks and workflows as a single unit.

Databricks Asset Bundles help implement software engineering best practices like version control, testing and CI/CD for data and AI projects.1. They allow you to define resources such as jobs and notebooks as source files, making project structure, t...

  • 421 Views
  • 0 replies
  • 1 kudos
Sourav-Kundu
by Contributor
  • 758 Views
  • 0 replies
  • 1 kudos

How to recover Dropped Tables in Databricks

Have you ever accidentally dropped a table in Databricks, or had someone else mistakenly drop it?Databricks offers a useful feature that allows you to view dropped tables and recover them if needed.1. You need to first execute SHOW TABLES DROPPED2. T...

  • 758 Views
  • 0 replies
  • 1 kudos
SumitSingh
by Contributor
  • 1824 Views
  • 5 replies
  • 7 kudos

From Associate to Professional: My Learning Plan to ace all Databricks Data Engineer Certifications

In today’s data-driven world, the role of a data engineer is critical in designing and maintaining the infrastructure that allows for the efficient collection, storage, and analysis of large volumes of data. Databricks certifications holds significan...

SumitSingh_0-1721402402230.png SumitSingh_1-1721402448677.png SumitSingh_2-1721402469214.png
  • 1824 Views
  • 5 replies
  • 7 kudos
Latest Reply
SumitSingh
Contributor
  • 7 kudos

@ms_ccg You are correct. I got that error too. Seems like Databricks has removed some of these. I would suggest you to search for those separately via Databricks Academy or external resources. Let me know if you need any help.

  • 7 kudos
4 More Replies
SashankKotta
by Databricks Employee
  • 2387 Views
  • 6 replies
  • 4 kudos

Library Management via Custom Compute Policies and ADF Job Triggering

This guide is intended for those looking to install libraries on a cluster using a Custom Compute Policy and trigger Databricks jobs from an Azure Data Factory (ADF) linked service. While many users rely on init scripts for library installation, it i...

Screenshot 2024-06-16 at 12.34.09 PM.png Screenshot 2024-06-16 at 12.38.33 PM.png
  • 2387 Views
  • 6 replies
  • 4 kudos
Latest Reply
Wojciech_BUK
Valued Contributor III
  • 4 kudos

Hi @hassan2 I had same issue and found solution.When I created POOL i created it as On-demand (not spot) and then policy only worked when I removed  entire section "azure_attributes.spot_bid_max_price" from policy.Looks like "azure_attributes.spot_bi...

  • 4 kudos
5 More Replies
nafikazi
by New Contributor III
  • 1104 Views
  • 4 replies
  • 2 kudos

Resolved! Want to learn LakeFlow Pipelines in community edition.

Hello Everyone. I want to explore LakeFlow Pipelines in the community version but don’t have access to Azure or AWS. I had a bad experience with Azure, where I was charged $85 while just trying to learn. Is there a less expensive, step-by-step learni...

  • 1104 Views
  • 4 replies
  • 2 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 2 kudos

Hi @nafikazi ,Sorry, this is not possible in community edition. Your only option is to have AWS or Azure account. 

  • 2 kudos
3 More Replies
Ajay-Pandey
by Esteemed Contributor III
  • 3139 Views
  • 6 replies
  • 7 kudos

🚀 Databricks Custom Apps! 🚀

Whether you're a data scientist or a sales executive, Databricks is making it easier than ever to build, host, and share secure data applications. With our platform, you can now run any Python code on serverless compute, share it with non-technical c...

  • 3139 Views
  • 6 replies
  • 7 kudos
Latest Reply
PiotrU
Contributor II
  • 7 kudos

Can we somehow play with hosting, and expose this app outside?

  • 7 kudos
5 More Replies
Thusharr
by New Contributor II
  • 1024 Views
  • 5 replies
  • 1 kudos

Writing append blob files to unity catalog volum

The workspace is assigned to unity catalog, and all the access to the ADLS Gen2 is now handled via unity catalog only, means no SPN, no connection string, access keys etc. I have to create append blob files in a volume, Is this is possible in a works...

  • 1024 Views
  • 5 replies
  • 1 kudos
Latest Reply
Witold
Honored Contributor
  • 1 kudos

Now I got your point. No, you can't create Append Blob files directly in Volumes, as this is a native Azure functionality. A volume is basically just an abstraction over a native storage.You will still need to use libraries like azure-storage-blob wi...

  • 1 kudos
4 More Replies
Thusharr
by New Contributor II
  • 1574 Views
  • 1 replies
  • 0 kudos

Unity cataog

Disaster recovery is possible in Unity catalog now?Means, for data level, we have enabled with geo redundancy, what about the objects, permissions, an other components in Unity catalog ? Can we restore the unity catalog metadata in another region ?

  • 1574 Views
  • 1 replies
  • 0 kudos
Latest Reply
gchandra
Databricks Employee
  • 0 kudos

Official product release in development will be available as PrPr in a few months. 

  • 0 kudos
arjungoel1995
by New Contributor
  • 1230 Views
  • 2 replies
  • 0 kudos

Resolved! Standardized Framework to update Databricks job definition using CI/CD

Hi Databricks support, I am looking for a standardized Databricks framework to update job definition using DevOps from non-production till it get productionized. Our current process of updating the Databricks job definition is as follows:In our sourc...

  • 1230 Views
  • 2 replies
  • 0 kudos
Latest Reply
nicole_lu_PM
Databricks Employee
  • 0 kudos

Hi from the Git folders/Repos PM: DAB is the way to go, and we are working on an integration to author DABs directly in the workspace.  Here's a DAIS talk where the DAB PM and I demo'ed some recommendations for source controlling jobs: https://www.da...

  • 0 kudos
1 More Replies
ThierryBa
by New Contributor III
  • 2582 Views
  • 3 replies
  • 4 kudos

Build & Refresh a Calendar Dates Table

IntroductionMaintaining accurate and up-to-date calendar date tables is crucial for reliable reporting, yet manual updates can be time-consuming and prone to error. This fundamental component serves as the backbone for date-based analysis, enabling a...

ThierryBa_0-1726544617449.png ThierryBa_1-1726544617449.png ThierryBa_2-1726544617450.png
  • 2582 Views
  • 3 replies
  • 4 kudos
Latest Reply
may-tun
New Contributor II
  • 4 kudos

Nice article, very informative! 

  • 4 kudos
2 More Replies
MichTalebzadeh
by Valued Contributor
  • 1949 Views
  • 2 replies
  • 0 kudos

Resolved! Feature Engineering for Data Engineers: Building Blocks for ML Success

For a  UK Government Agency, I made a Comprehensive presentation titled " Feature Engineering for Data Engineers: Building Blocks for ML Success".  I made an article of it in Linkedlin together with the relevant GitHub code. In summary the code delve...

Knowledge Sharing Hub
feature engineering
ML
python
  • 1949 Views
  • 2 replies
  • 0 kudos
Latest Reply
Anushree_Tatode
Contributor III
  • 0 kudos

Hi,Excellent presentation and article! Your insights on feature engineering and practical code examples are incredibly useful for building strong ML models. Thanks for sharing! Thanks,Anushree    

  • 0 kudos
1 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group