10-24-2024 04:16 AM
I've created a completely fresh project with a completely empty workspace
Locally I have the databricks CLI version 0.230.0 installed
I run
databricks bundle init default-python
I have auth set up with a PAT generated by an account which has workspace admin. when I run bundle deploy it deploys the resources and jobs, and created the pipeline but does not sync the src folder and files to the workspace and it does not give an error.
The resources which do deploy when viewed in the workspace give an error saying the source code for them cant be found in the workspace.
If I add a sync block to the databricks.yml file at the top level
Warning: Pattern src\** does not match any files
at sync.include[0]
in databricks.yml:11:7
Warning: There are no files to sync, please check your .gitignore
I've tried all possible formatting of src I can think of and always get the same error, I've tried
src, src/, src/*, .src, .src/, .src/*, src/*.ipynb
nothing works and nothing I do other than manually syncing my entire directory with a manual databricks bundle sync or vscode extension sync actually pushes the files to the remote workspace.
After I manually sync the files the errors on the resources in the workspace disappear
The only thing I changed was the catalogue being used by the dlt pipeline in my_project.pipline.yml file to use an existing catalogue because the workspace is enabled for UC instead of the hive_metastore
I've also tried adding the include to the top level include mapping
a month ago
have you looked into the .gitignore file?
chances are there is an entry with /resources/*
If so, you can remove everything from .gitignore which you think should be deployed.
For sure the resources folder.
a month ago
Theres no entry for the resources folder in the git ignore, its not the resources I'm having trouble with, they are created successfully in the workspace but the notebooks they use are not copied to the workspace
I've tried deleting the gitignore file entirely, and theres no references to the src folder in the git ignore to begin with anyway
a month ago
do you not see this in your databricks workspace users:
a month ago
Yes I can see the folder structure but nothing in the src folder apart from a "my_project.egg-info" subfolder with some txt files in it:
a month ago
Hi @JonathanFlint ,
1. Could you remove the below line. Let's try to make the deployment work without any filters:
a month ago
When I first encountered the issue I did not have the
2 weeks ago
Hi, I had a similar problem today. I changed the way, that we deploy our main bundle using pull requests and in order to play around with it locally, I copied python and dbt code into the databricks src dir (that is normally done during a github workflow). To avoid accidental commits, I also added the two dirs to .gitignore on the top level of our repo (i.e. one level above the databricks dir). After that, bundle deploy stopped copying files.
Is that intended? That feels like a bug to me, tbh
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group