cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Python Wheel in Serverless Job in DAB

georgemichael40
New Contributor III

Hey,

I am trying to run a job with serverless compute, that runs python scripts.

I need the paramiko package to get my scripts to work. I managed to get it working by doing:

environments:
- environment_key: default

# Full documentation of this spec can be found at:
spec:
client: "1"
dependencies:
- paramiko==4.0.0
 
in the yaml for the job.
 
However, to more centralize the project config and not having to specify the package per job, I want to make use of a python wheel.
 
I already set up my pyproject.toml with the dependecies.
 
I saw other scripts online adding the following to the databricks.yml:

# A set of artifacts to build before deploying
artifacts:
x:
type: whl
build: poetry build
path: .
 
Not sure if this is needed? 

Also, I am not sure what to give as arguments in the environment block, because I read that this is required for the serverless job. I assume I have to fill in the dependencies in another way there now?
 
How would this work?

Thanks!
 
1 ACCEPTED SOLUTION

Accepted Solutions

szymon_dybczak
Esteemed Contributor III

Hi @georgemichael40 ,

Put your whl file in the volume and then you can reference it in following way in your DAB file:

dependencies:
  - " /Volumes/workspace/default/my_volume/hellopkg-0.0.1-py3-none-any.whl"

https://docs.databricks.com/aws/en/compute/serverless/dependencies#-add-dependencies-to-the-notebook

View solution in original post

4 REPLIES 4

szymon_dybczak
Esteemed Contributor III

Hi @georgemichael40 ,

Put your whl file in the volume and then you can reference it in following way in your DAB file:

dependencies:
  - " /Volumes/workspace/default/my_volume/hellopkg-0.0.1-py3-none-any.whl"

https://docs.databricks.com/aws/en/compute/serverless/dependencies#-add-dependencies-to-the-notebook

Thanks for your reply.

But the documentation refers to notebooks. I am deploying a DAB that runs a serverless job to run Python scripts. Do you think that would make a difference?

Hi,

Can confirm this works! It's

dependencies:

string value of path to the wheel in the volume or workspace.

For me, workspace worked fine.

Thanks!

Cool, great that it worked 🙂

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now