09-09-2021 11:05 PM
If a registered model is linked to a notebook, the lineage breaks when you move the notebook to a different path, or even when you pull or upload a new version of the notebook.
This is a problem because development and testing are usually done in a messy way, but to get that code into production you then have to do one of the following:
- Move your code, retrain to regenerate the same model, and then promote that model to the higher environment so that the lineage is maintained, or
- Keep the dev source notebook at the same path where it was created for development.
Labels: Lineage, Source Code
09-09-2021 11:51 PM
Hi @Saurabh Verma,
As in your other topic, I suggest looking into MLflow, which is designed to handle these issues.
09-10-2021 01:45 AM
@Werner Stinckens: Hi Werner,
Thanks for the reply, but this issue is happening in MLflow tracking itself. Is there a way to mitigate it?
09-10-2021 02:05 AM
OK, I see.
I do not have a clear answer for that.
But there was a session at Spark Summit about this (more CI/CD related, but it might give you ideas):
https://databricks.com/session_na20/productionalizing-models-through-ci-cd-design-with-mlflow
The whole CI/CD setup might be too much for your needs, but maybe you can pick out the parts that are useful?
09-10-2021 12:17 PM
@Werner Stinckens - Thank you so much!
09-11-2021 04:53 PM
Hi Maverick,
I'm unable to reproduce the issue you mentioned. Where is your notebook located? Is it stored in a git-versioned Repo directory (accessed via the "Repos" icon instead of the "Workspace" icon on the navigation bar)?
09-12-2021 12:01 AM
@Jin Zhang: To reproduce this issue:
- Create a notebook in your workspace and generate a model from it.
- Go to the model stats page, where you can see the model's lineage to the original notebook.
- Create a folder in your workspace and move the notebook into that folder. Then move it back to where it was before.
- Go back to the model stats page and click the “source” link, which represents the lineage to the original notebook. It shows a “notebook not found” error even though the notebook is back on the path where it is supposed to be.
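To check what the run actually recorded, independently of the UI link, one option is to compare the run's source tags with the notebook's current path. The sketch below is only illustrative: the tag keys are assumptions about what MLflow records for runs started from a Databricks notebook, the example paths and ID are made up, and `describe_lineage` is a hypothetical helper rather than part of MLflow. On a real run the tags dict would come from something like `MlflowClient().get_run(run_id).data.tags`.

```python
# Tag keys below are assumptions about what MLflow records for a run
# launched from a Databricks notebook; verify them against your own runs.
SOURCE_NAME_TAG = "mlflow.source.name"
NOTEBOOK_PATH_TAG = "mlflow.databricks.notebookPath"
NOTEBOOK_ID_TAG = "mlflow.databricks.notebookID"


def describe_lineage(tags, current_path):
    """Hypothetical helper: summarize the notebook lineage stored in run tags.

    `tags` is a plain dict of run tags; `current_path` is where the
    notebook lives in the workspace right now.
    """
    # Prefer the Databricks-specific path tag, fall back to the generic one.
    recorded_path = tags.get(NOTEBOOK_PATH_TAG) or tags.get(SOURCE_NAME_TAG)
    return {
        "recorded_path": recorded_path,
        "notebook_id": tags.get(NOTEBOOK_ID_TAG),
        "path_matches": recorded_path is not None and recorded_path == current_path,
    }


# Made-up tags, shaped like the dict found on a run's `data.tags`.
example_tags = {
    SOURCE_NAME_TAG: "/Users/someone@example.com/train_model",
    NOTEBOOK_PATH_TAG: "/Users/someone@example.com/train_model",
    NOTEBOOK_ID_TAG: "1234567890",
}

report = describe_lineage(example_tags, "/Users/someone@example.com/train_model")
print(report)
```

If the recorded path still matches but the “source” link fails, that would suggest the link is resolved by something other than the path string, which fits the behavior in the last step above.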
09-15-2021 04:50 PM
I also cannot reproduce this with these exact steps (I think). After moving the notebook and moving it back, the link to it (and the link to the revision) still works as expected. You are using the MLflow built into Databricks, right?
09-26-2021 11:14 PM
@Sean Owen: Yes, managed MLflow on Databricks.
11-05-2021 09:15 AM
Hi @Saurabh Verma,
Did Sean's reply help you solve this issue, or are you still waiting for a solution to unblock you?
11-10-2021 01:52 AM
Hi @Jose Gonzalez,
The issue still persists. I believe it is related to the workspace version that we are using.
I ran through the exact same steps in an E2 workspace, and the issue appears to be resolved there.