Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

How to import Python modules in a notebook?

bi123
New Contributor

I have a job with a notebook task that uses Python modules located in a different folder than the notebook itself. When I try to import a module in the notebook, it raises a ModuleNotFoundError. I solved the problem using sys.path.

But I am curious whether there is a more elegant way to handle this, maybe by setting a root path for the notebook task in the YAML configuration?
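For context (the screenshot did not survive), the sys.path workaround usually looks something like the sketch below; the `../src` layout is a hypothetical example, not taken from the original post:

```python
import os
import sys

# Hypothetical layout (an assumption for illustration): the notebook
# runs from ./notebooks and the modules live in a sibling ../src folder.
module_dir = os.path.abspath(os.path.join(os.getcwd(), "..", "src"))

# Prepend rather than append so local modules shadow same-named
# installed packages, and guard against inserting duplicates.
if module_dir not in sys.path:
    sys.path.insert(0, module_dir)
```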

3 REPLIES

szymon_dybczak
Esteemed Contributor III

Hi @bi123,

If you don't use declarative pipelines but a regular notebook task, then your approach is correct and aligned with what Databricks recommends in their documentation.

Here are some useful links to docs:

Work with Python and R modules | Databricks on AWS

Import Python modules from Git folders or workspace files | Databricks on AWS

lingareddy_Alva
Esteemed Contributor

Hi @bi123,

Good question. Your sys.path approach works, but yes, there are cleaner alternatives depending on your setup.

The Most Elegant: PYTHONPATH in the job YAML
If you're using Databricks Asset Bundles (DAB), you can set the Python path at the job or task level directly in databricks.yml:

resources:
  jobs:
    my_job:
      tasks:
        - task_key: my_notebook_task
          notebook_task:
            notebook_path: ./notebooks/my_notebook
          libraries:
            - whl: ./dist/my_utils-0.1.0-py3-none-any.whl  # a locally built wheel; pypi: entries must name a package on PyPI, not a local path

Or more directly, use job_cluster_key with a Spark environment variable:

yaml
new_cluster:
  spark_env_vars:
    PYTHONPATH: "/Workspace/path/to/src:/Workspace/path/to/shared"


This sets PYTHONPATH at the cluster level so every task on that cluster can resolve your modules without any sys.path manipulation in the notebook itself.
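As a quick sanity check inside the notebook, you can confirm the variable actually reached the Python process. This is a generic Python sketch, not a Databricks-specific API:

```python
import os
import sys

# PYTHONPATH is read once at interpreter startup, so entries set via
# spark_env_vars should already be on sys.path. This check surfaces a
# misconfiguration early instead of at the first failing import.
for entry in os.environ.get("PYTHONPATH", "").split(os.pathsep):
    if entry and entry not in sys.path:
        print(f"missing from sys.path: {entry}")
        sys.path.append(entry)  # fallback so imports still resolve
```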


Even Cleaner: Package Your Utilities

If src/ and shared/ are stable internal libraries, the proper solution is to give them a setup.py or pyproject.toml and install them as packages: 

yaml
libraries:
  - whl: ./dist/my_utils-0.1.0-py3-none-any.whl

DAB can build and deploy the wheel automatically. Then your notebook just does import my_utils with no path hacks at all.
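If you go the packaging route, the build setup is standard Python packaging rather than anything Databricks-specific. A minimal sketch, assuming a hypothetical `my_utils` package living under `src/` (names and paths are placeholders):

```toml
# pyproject.toml (minimal sketch; adjust name, version, and layout to your repo)
[project]
name = "my-utils"
version = "0.1.0"

[build-system]
requires = ["setuptools>=61"]
build-backend = "setuptools.build_meta"

[tool.setuptools.packages.find]
where = ["src"]
```

With this in place, an artifacts section in databricks.yml can build the wheel during `databricks bundle deploy`.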

In short: setting PYTHONPATH via spark_env_vars in the YAML is the most practical upgrade from what you have, and packaging is the right long-term answer if these utilities are shared across multiple jobs.

LR

shazi
New Contributor III

While this is a classic way to solve the problem, it can be brittle if your folder structure changes or if you share the notebook with others who have different file paths, especially in modern notebook environments.