I have a job with the same task to be executed twice with two sets of parameters. In each task is run after cloning a git repo then installing it locally and running a notebook from this repo. However, as each task clones the same repo, I was wondering how to do the install once and for all ?
I tried to add a first task that install the package from the cloned repo, and added a dependency to this first step for the two tasks. Basically:
Task 0:
* from git repo
* %sh
pip install poetry
poetry install ---will install locally cloned package named my_package---
Task 1 and 2:
* depends on Task 0
* same cluster
* from my_package import my_class ---got an exception that thereis no package my_package---
Adding the my_package package to the cluster config is not an option, I need to install it first when running the job