I am part of a small team of data engineers that started using Databricks Asset Bundles a year ago. Our code base consists of typical ETL workloads, written primarily as Jupyter notebooks (.ipynb) and job definitions (.yaml), and spans a large number of different business domains.
Currently, our code base is a single monorepo with one large bundle containing all of our notebooks, jobs, libraries, etc.
It has grown to the point where we see the need to split that single bundle into several smaller bundles - one per business domain.
We are envisioning a setup similar to the following (simplified) structure:
monorepo/
│
├── shared_notebooks/
├── shared_libraries/
├── variables.yml
│
├── Bundle_A/
│   ├── resources/
│   ├── src/
│   └── databricks.yml
│
└── Bundle_B/
    ├── resources/
    ├── src/
    └── databricks.yml
Here the repo contains shared notebooks, libraries, and variables that may be used by any bundle in the repository.
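To make that concrete, this is roughly what we imagine Bundle_A's `databricks.yml` looking like (an illustrative sketch only - the bundle name, target, and the `../variables.yml` include are our assumptions, and whether an include may point above the bundle root is exactly what we are unsure about):

```yaml
# Bundle_A/databricks.yml (sketch - names and paths are illustrative)
bundle:
  name: bundle_a

include:
  # Bundle-local resource definitions
  - resources/*.yml
  # Shared variables one level above the bundle root -
  # this is the kind of "import" we don't know how to do
  - ../variables.yml

targets:
  dev:
    default: true
```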
Does anyone have suggestions for how this could be implemented?
- How can we "import" shared assets (notebooks, libraries, and variables) into our bundles? (A sketch of what we mean follows this list.)
- Does our approach to splitting up our mono-bundle repository seem sensible?
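For the first question, the kind of thing we would like to express in a bundle's job resources is something like the following (again just a sketch with made-up names - the relative path climbing out of the bundle root into `shared_notebooks/` is the part we don't know is possible):

```yaml
# Bundle_A/resources/example_job.yml (sketch - names and paths are illustrative)
resources:
  jobs:
    example_job:
      name: example_job
      tasks:
        - task_key: run_shared_notebook
          notebook_task:
            # Relative to this file, ../../shared_notebooks/ is the shared
            # folder at the repo root - can a task reference a notebook there?
            notebook_path: ../../shared_notebooks/common_ingest
```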
Thanks in advance for your insights!
Kaspar Hauser