Databricks Community

HariharaSam · 09-20-2022

Hi Everyone ,I am trying to run a databricks notebook in parallel using ThreadPoolExecutor .Can anyone suggest how to reduce the time taken based on the below findings so far.Current Performance:Time taken - 25 minutes ThreadPoolExecutor max_workers ...

HariharaSam · 09-20-2022

How to enable fair scheduler from Databricks notebook using python commands?

HariharaSam · 09-15-2022

Does anyone know how to fix this ..??

HariharaSam · 09-15-2022

How to convert the rows of a spark dataframe to list without using Pandas.Input Spark Dataframe :Expected Output:[['A','B','C'],['1','2','3'],['4','5','6'],['7','8','9']]

HariharaSam · 09-09-2022

I have a scenario where I need to run same databricks notebook multiple times in parallel.What is the best approach to do this ?

HariharaSam · 03-24-2023

Hi @Hubert Dudek ,I have a similar requirement where I am trying to query a table in Databricks by passing a parameter from Power BI report builder. So I have two queries out of which one is working and the other is not working.Can you help in ident...

HariharaSam · 09-20-2022

Hi @Hubert Dudek ,You have mentioned that ThreadPoolExecutor will not help , so if I want to run a same databricks notebooks for 100 different input values and running them in sequence takes more time to complete.So how to achieve this scenario?

HariharaSam · 09-09-2022

Hi @Leszek ,After going through the link that you shared and exploring further I found that it is best suited for I/O operations.But mine is CPU bound operations where lot of computations takes place and one more thing is that I need to run my note...

HariharaSam · 09-07-2022

Hi Hubert ,As you have mentioned that it can not be used for everything , in my case also it doesn't suit as I have a lot variables declaration and having a function created for each variable doesn't look good.

HariharaSam · 01-14-2022

Hi @Hubert Dudek Your approach is working for me.Thank you.

Databricks Community

User Stats

User Activity

Performance Tuning of Databricks Notebook

Enabling Fair Scheduler from Databricks Notebook

DRIVER Garbage Collection

Converting Rows of Spark Dataframe to List

Parallel Processing of Databricks Notebook

Re: How to pass parameters in SSRS/Power BI (report builder) ?

Re: Performance Tuning of Databricks Notebook

Re: Parallel Processing of Databricks Notebook

Re: Using variables in Spark SQL

Re: To get Number of rows inserted after performing an Insert operation into a table