Get Started Discussions

by chris0991 • New Contributor III

09-30-2024 11:46:50 PM

2016 Views
2 replies
1 kudos

Best practices for optimizing Spark jobs

What are some best practices for optimizing Spark jobs in Databricks, especially when dealing large datasets? Any tips or resources would be greatly appreciated! I’m trying to analyze data on restaurant menu prices so that insights would be especiall...

Get Started Discussions

Reply

2016 Views
2 replies
1 kudos

09-30-2024 11:46:50 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

10-01-2024 3:12:07 AM

1 kudos

There are so many.Here are a few:- look for data skew- shuffle as less as possible- avoid many small files- use spark and not only pure python- if using an autoscale cluster: check if you don't lose a lot of time scaling up/down

1 kudos

10-01-2024 3:12:07 AM

1 More Replies

by stucas • New Contributor II

07-14-2025 7:13:50 AM

734 Views
1 replies
0 kudos

Logging: Unable to read a /volume based file

Hi We've just started using databricks and so am a little naive into the file system, especially regarding unity catalog.The issue is that we're creating a loggeer and wanting to write the files based on a queue handler/listener pattern. The patternn...

Get Started Discussions

Reply

734 Views
1 replies
0 kudos

07-14-2025 7:13:50 AM

View Replies

Latest Reply

FedeRaimondi
Contributor II

07-15-2025 12:02:15 AM

0 kudos

When using the CLI you need to add the scheme:dbfs:/Volumes/...The rest should be fine to refer with "/Volumes/...", for more info Manage files in volumes | Databricks Documentation.Hope this solves the issue!

0 kudos

07-15-2025 12:02:15 AM

by esistfred • New Contributor III

07-14-2025 1:48:22 AM

2836 Views
3 replies
6 kudos

Resolved! How to use variable-overrides.json for environment-specific configuration in Asset Bundles?

Hi all,Could someone clarify the intended usage of the variable-overrides.json file in Databricks Asset Bundles?Let me give some context. Let's say my repository layout looks like this:databricks/ ├── notebooks/ │ └── notebook.ipynb ├── resources/ ...

Get Started Discussions

Reply

2836 Views
3 replies
6 kudos

07-14-2025 1:48:22 AM

View Replies

Latest Reply

esistfred
New Contributor III

07-14-2025 6:54:21 AM

6 kudos

It does. Thanks for the reponse. I also continued playing around with it and found a way using the variable-overrides.json file. I'll leave it here just in case anyone is interested:Repository layout:databricks/ ├── notebooks/ │ └── notebook.ipynb ...

6 kudos

07-14-2025 6:54:21 AM

2 More Replies

by Phani1 • Valued Contributor II

07-11-2025 10:30:18 AM

1109 Views
1 replies
0 kudos

Resolved! Workspace Consolidation Strategy in Databricks

Hi Team,The customer is facing a challenge related to increasing Databricks workspace maintenance costs. Apparently, every project is creating its own workspace for specific functionalities, and this has become a standard practice. As a result, the n...

Get Started Discussions

Reply

1109 Views
1 replies
0 kudos

07-11-2025 10:30:18 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

07-14-2025 1:45:39 AM

0 kudos

This is something that you should discuss with your Databricks rep imo. Even with standard tools, migrating consolidating 200 workspaces is something that needs very careful planning and testing.

0 kudos

07-14-2025 1:45:39 AM

by davidwilliam006 • New Contributor

07-13-2025 11:09:10 PM

663 Views
0 replies
0 kudos

Introduction Dario Schiraldi Deutsche Bank Executive

Dario Schiraldi Deutsche Bank Executive, known for his strong leadership in the financial and banking sector. Dario Schiraldi brings 20 years of leadership experience to major worldwide organizations where his expertise extends into both market acqui...

Get Started Discussions

Reply

663 Views
0 replies
0 kudos

07-13-2025 11:09:10 PM

by sastopy • New Contributor II

07-12-2025 10:50:54 PM

642 Views
0 replies
0 kudos

SAS TO DATABRICKS MIGRATION

SAS to PY is an AI/ML-based Accelerator designed for "SAS to Python or PySpark" code migration. This Accelerator is engineered to convert SAS legacy proprietary codes to the more flexible, open-source Python or PySpark environment with 95% automatica...

Get Started Discussions

Reply

642 Views
0 replies
0 kudos

07-12-2025 10:50:54 PM

by darioschiraldi9 • New Contributor II

07-07-2025 3:40:32 AM

612 Views
1 replies
0 kudos

Dario Schiraldi : How do I integrate Databricks with AWS?

Hi everyone,I am Dario Schiraldi, CEO of Travel Works, and I am reaching out to the community for some insights. We are in the process of integrating Databricks with AWS for a new project, and I have love to hear from anyone who has experience with t...

Get Started Discussions

Reply

612 Views
1 replies
0 kudos

07-07-2025 3:40:32 AM

View Replies

Latest Reply

Khaja_Zaffer
Contributor III

07-11-2025 2:06:11 PM

0 kudos

Hello Dario Good to meet you. You can connect with your account manager of databricks. Also Azure provides first partner assistance to databricks. you can check Azure services as well. Thank you.

0 kudos

07-11-2025 2:06:11 PM

by Alexandru • New Contributor III

04-12-2024 4:07:52 AM

4808 Views
4 replies
0 kudos

Resolved! vscode python project for development

Hi,I'm trying to set up a local development environment using python / vscode / poetry. Also, linting is enabled (Microsoft pylance extension) and the python.analysis.typeCheckingMode is set to strict.We are using python files for our code (.py) whit...

Get Started Discussions

Reply

4808 Views
4 replies
0 kudos

04-12-2024 4:07:52 AM

View Replies

Latest Reply

A_N
New Contributor II

07-11-2025 8:10:14 AM

0 kudos

How did you solve the type error checks on `pyspark.sql ` ? mypy doesn't create the missing stubs for that one?

0 kudos

07-11-2025 8:10:14 AM

3 More Replies

by chandataeng • New Contributor

07-09-2025 1:55:13 AM

1446 Views
1 replies
1 kudos

Resolved! How to trigger Power BI refresh from Databricks pipeline without keeping cluster alive?

I have a Databricks pipeline that pulls data from AWS, which takes ~90 minutes. After this, I need to refresh a series of Power BI dataflows (~45 mins) and then datasets (~45 mins).I want to trigger the Power BI refresh automatically from Databricks ...

Get Started Discussions

Reply

1446 Views
1 replies
1 kudos

07-09-2025 1:55:13 AM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

07-09-2025 2:46:05 AM

1 kudos

Hi @chandataeng ,The current Power BI task that is available in databricks workflow will wait for refresh process to return correct status (whether it succeeded or failed).But you can start refresh process by using asynchronous REST API call. The ref...

1 kudos

07-09-2025 2:46:05 AM

by KIRKQUINBAR • New Contributor III

04-03-2025 10:04:58 AM

2513 Views
2 replies
0 kudos

Resolved! information_schema not populating with columns

We started migrating databases from hive_metastore into unity catalog back in October 2024 and ive noticed that periodically the Catalog UI will not show columns or a data preview for some tables, but not all of them that were migrated. After some di...

Get Started Discussions

Reply

2513 Views
2 replies
0 kudos

04-03-2025 10:04:58 AM

View Replies

Latest Reply

KIRKQUINBAR
New Contributor III

07-07-2025 5:40:57 AM

0 kudos

this is definitely a bug related to older instances of azure databricks that were upgraded to use unity platform. after going back and forth with MS support for 2+ months, we made the decision to just spin up a new instance of azure databricks and co...

0 kudos

07-07-2025 5:40:57 AM

1 More Replies

by alex-syk • New Contributor II

08-11-2023 2:11:34 PM

7625 Views
8 replies
0 kudos

Delta Sharing - Alternative to config.share

I was recently given a credential file to access shared data via delta sharing. I am following the documentation from https://docs.databricks.com/en/data-sharing/read-data-open.html. The documentation wants the contents of the credential file in a fo...

Get Started Discussions

Reply

7625 Views
8 replies
0 kudos

08-11-2023 2:11:34 PM

View Replies

Latest Reply

Debayan
Databricks Employee

08-21-2023 11:48:31 PM

0 kudos

Hi, the most feasible way would be to convert the contents of your key file into base64 and only mention the spark config as below: credentials <base 64 encoded code>

0 kudos

08-21-2023 11:48:31 PM

7 More Replies

by Nietzsche • New Contributor III

06-28-2025 4:25:38 AM

2968 Views
3 replies
2 kudos

Resolved! is Spark UI available on the Databricks Free Edition?

Hi allI have a noob question, I am currently using the Databricks free edition, which runs on serverless compute.To access the Spark UI normally one would click on the attached compute, however, with serverless, I can not find the menu to access Spar...

Get Started Discussions

Reply

2968 Views
3 replies
2 kudos

06-28-2025 4:25:38 AM

View Replies

Latest Reply

dyusuf
New Contributor II

07-06-2025 9:27:32 PM

2 kudos

So, there is no way we can run spark in free edition as we need general purpose clusters?

2 kudos

07-06-2025 9:27:32 PM

2 More Replies

by Ganeshch • New Contributor III

06-28-2025 9:54:27 PM

3204 Views
4 replies
0 kudos

Databricks Features

Hi All, I am new to the Databricks, am using community version. So far, I have noticed some limitations , features like DBFS (File System) are restricted and Cluster Configuration is Locked. So I am thinking to use trial version, it will give 14 da...

Get Started Discussions

Reply

3204 Views
4 replies
0 kudos

06-28-2025 9:54:27 PM

View Replies

Latest Reply

Khaja_Zaffer
Contributor III

07-05-2025 7:55:00 AM

0 kudos

Translator Hello However, the DBFS file browser is often disabled by default in the user interface. It can typically be re-enabled through the admin settings. In the free edition, you would face some limitations with cluster size. However, if you...

0 kudos

07-05-2025 7:55:00 AM

3 More Replies

by SmileyVille • New Contributor III

03-27-2025 1:12:56 PM

4628 Views
7 replies
0 kudos

Capture data from a Specific SharePoint Site (List) in M365 into Azure DataBricks

Hello. We are using Azure Databricks and would like to ingest data from a specific M365 SharePoint Online Site/List. I was originally trying to use this recommendation, https://learn.microsoft.com/en-us/answers/questions/2116616/service-principal-a...

Get Started Discussions

M365

Service Principal

SharePoint Online

Reply

4628 Views
7 replies
0 kudos

03-27-2025 1:12:56 PM

View Replies

Latest Reply

Divya_Bhadauria
New Contributor III

07-05-2025 12:29:22 PM

0 kudos

We achieved the same using the SharePoint API. You can follow the steps outlined in this documentation: https://learn.microsoft.com/en-us/graph/auth-v2-service?tabs=http.Additionally, you can grant the Sites.Selected permission to the Azure AD applic...

0 kudos

07-05-2025 12:29:22 PM

6 More Replies

by ds01 • New Contributor

07-02-2025 10:35:28 PM

1026 Views
2 replies
1 kudos

Dario Schiraldi Deutsche Bank Executive : Excited to Join

I’m Dario Schiraldi Deutsche Bank Executive. During my time there, I led global institutional sales and investment businesses, honing my expertise in strategy, leadership, and financial markets. As someone who’s passionate about the transformative p...

Get Started Discussions

Reply

1026 Views
2 replies
1 kudos

07-02-2025 10:35:28 PM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

07-03-2025 4:27:11 AM

1 kudos

Hi @ds01 ,Welcome, Dario! It’s great to have someone with your deep experience in finance and leadership join the Databricks community. Looking forward to your insights and contributions!

1 kudos

07-03-2025 4:27:11 AM

1 More Replies

Databricks Community

Forum Posts

Best practices for optimizing Spark jobs

Logging: Unable to read a /volume based file

Resolved! How to use variable-overrides.json for environment-specific configuration in Asset Bundles?

Resolved! Workspace Consolidation Strategy in Databricks

Introduction Dario Schiraldi Deutsche Bank Executive

SAS TO DATABRICKS MIGRATION

Dario Schiraldi : How do I integrate Databricks with AWS?

Resolved! vscode python project for development

Resolved! How to trigger Power BI refresh from Databricks pipeline without keeping cluster alive?

Resolved! information_schema not populating with columns

Delta Sharing - Alternative to config.share

Resolved! is Spark UI available on the Databricks Free Edition?

Databricks Features

Capture data from a Specific SharePoint Site (List) in M365 into Azure DataBricks

Dario Schiraldi Deutsche Bank Executive : Excited to Join

Join Us as a Local Community Builder!

Data bricks is not mounting with storage account g...

External MCP representing user data permissions

serialized_dashboard

how to import sample notebook to azure databricks ...

Request to Extend Partner Tech Summit Lab Access