Hello,
We have roughly 5K to 10K transcript files in ADLS Gen2, and we use the Hugging Face GPT-2 model for training and serving. We want to distribute the inference workload across different cluster nodes while serving the model, since it currently takes around 20-30 seconds to process a single file and generate the output.
We would like to run this workload in batch mode so we can process the files concurrently and reduce the overall processing time, using T4 GPU workers with autoscaling from a minimum of 2 nodes to a maximum of 8 nodes.
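For context, below is a minimal sketch of the kind of distributed batch inference we are aiming for, assuming a Spark cluster with T4 GPU workers (min 2 / max 8 autoscaling) and the transformers and torch libraries installed on the workers. The storage path, container names, and the "text" column are placeholders, not our actual values.

```python
import pandas as pd
from pyspark.sql import SparkSession
from pyspark.sql.functions import pandas_udf
from pyspark.sql.types import StringType

spark = SparkSession.builder.getOrCreate()

# Placeholder ADLS Gen2 path; replace with the real container/account.
transcripts_path = "abfss://container@account.dfs.core.windows.net/transcripts/"

# wholetext=True reads each transcript file as a single row.
df = spark.read.text(transcripts_path, wholetext=True) \
          .withColumnRenamed("value", "text")

# Cache the pipeline per Python worker process so the model is loaded
# onto each worker's GPU only once, not once per Arrow batch.
_generator = None

@pandas_udf(StringType())
def generate(texts: pd.Series) -> pd.Series:
    global _generator
    if _generator is None:
        from transformers import pipeline
        # device=0 targets the worker's T4 GPU.
        _generator = pipeline("text-generation", model="gpt2", device=0)
    outputs = _generator(texts.tolist(), max_new_tokens=128, truncation=True)
    return pd.Series([o[0]["generated_text"] for o in outputs])

# Repartition so the files spread across all available GPU workers,
# then run inference concurrently and persist the results.
results = df.repartition(8).withColumn("generated", generate("text"))
results.write.mode("overwrite").parquet(transcripts_path + "output/")
```

The idea is that each partition is processed independently on a worker, so adding nodes (up to the 8-worker maximum) should shorten the total wall-clock time roughly in proportion, even though each individual generation still takes 20-30 seconds.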