I'm a data scientist creating versioned ML models. For compliance reasons, I need to be able to replicate the training data for each model version.
I've seen that you can version datasets using Delta, but the default retention period is around 30 days. If I update my training data and model monthly, and want to track models (and their data) over years, what is the best way for me to version my data?
Is Delta an appropriate solution for this?