cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

Running jobs using notebooks in a remote Azure DevOps Services (Repos) Git repository is generating "Notebook not found" error.

DanielBarbosa
New Contributor III

By reading the documentation, we checked the possibility of running jobs in the Azure Databricks Workspace workflow using Azure DevOps Services repository source codes.

The instructions in the documentation were followed and we configured the git information menu with information from our repository.

To configure the git information menu (workflows > jobs > "test_azure_repos" [name of the job created for the test]> tasks > Source > Git Information), we checked the repository URL in the option to clone over HTTPS, as shown in figure 1 and we entered this address in the "Git repository URL" field in Figure 3. In addition, we include the branch name and provider type information.

imageFigure 1. URL address used to clone the repository over HTTPS

imageFigure 2. Inserting the value from figure 2 in the git information menu in the Databricks workspace.

In the Databricks workspace git provider configuration (User settings > Git integration), the configuration option chosen was "Azure DevOps Services (Azure Active Directory)", as shown in Figure 3.

imageFigure 3. Git provider configuration.

However, when executing the job, an error is returned indicating that the notebook was not found in the repository, as shown in Figure 4.

imageFigure 4. Error "Notebook not found".

The instructions for inserting the relative path of the repository and omitting the file extension were followed, as the documentation indicates.

The file path entered in the "Path" parameter of the task configuration was "src/test", which matches the structure of the Azure DevOps Service repository, as shown in Figure 6.

imageFigure 5. File path configuration

imageFigure 6. Repository structure.

In the "job run" we can verify that the identification of the commit of the repository is equivalent to what is in the remote repository. But still the execution generates this notebook not found error and accuses internal error, as shown in Figure 7

Tests were performed with Github and the executions were successful.

image 

Figure 7. Job run with error

Could you please help me?

3 REPLIES 3

Anonymous
Not applicable

Can you check if the file test.py is a valid Databricks notebook i.e. it starts with

# Databricks notebook source

This error would occur if you were trying to run a python script instead of a Databricks notebook.

Oh, perfect Vaibhav! That was exactly the problem. I added the above content and it worked perfectly.

Thank you very much.

Ulf
New Contributor II

I have the same challenge when integrating with Github repos. However I did not succeed including: '# Databricks notebook source' in the top of python files. Do you have any additional suggestions for solving this problem? @Vaibhav Sethi​ 

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.