cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Pricing Spot Instance vs New Job Cluster

MattM
New Contributor III

We are running multiple Databricks job via ADF. I was wondering which option out of the below is a cheaper route for databricks notebook processing from ADF. When I create a ADF linked service, which should I use to lower my cost.

  1. New Job Cluster option
  2. Existing instance pool.

Note: I am assuming interactive cluster is out of option as this is definitely more expensive.

Thanks,

Matt

1 ACCEPTED SOLUTION

Accepted Solutions

-werners-
Esteemed Contributor III

the instance pool will be cheaper if you use spot instances. But only if you size your instance pool correctly. (number of workers and scale down time)

AFAIK you cannot use spot instances for job clusters in ADF

View solution in original post

3 REPLIES 3

Anonymous
Not applicable

Hi, @Matt M​ - Thank you for bringing this new question to us, and thank you for your patience while we wait for other members to respond. 🙂

-werners-
Esteemed Contributor III

the instance pool will be cheaper if you use spot instances. But only if you size your instance pool correctly. (number of workers and scale down time)

AFAIK you cannot use spot instances for job clusters in ADF

MattM
New Contributor III

I agree with your input. I would also like to add that running Prod jobs on spot instances opens us to Azure VM eviction, so tread with caution on that front.

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now