cancel
Showing results for 
Search instead for 
Did you mean: 
Machine Learning
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

User16826994223
by Honored Contributor III
  • 565 Views
  • 1 replies
  • 0 kudos

difference between optimize and auto optimize and Optimize in delta

What would be good for me , if I should use Optimize every time or should I be using auto-optimize?

  • 565 Views
  • 1 replies
  • 0 kudos
Latest Reply
User16869510359
Esteemed Contributor
  • 0 kudos

Optimize:  Bin-packing/Compaction. Idempotent and IncrementalOptimize + Z-Order: Helps in Data Skipping; Use Range PartitioningOptimize write: Improve the write operation to the Delta table. optimization is performed before the write/during the writ...

  • 0 kudos
User15787040559
by New Contributor III
  • 933 Views
  • 1 replies
  • 0 kudos

Can we retrieve experiment results via MLflow API or is this only possible using UI?

Yes, you can use the API https://www.mlflow.org/docs/latest/python_api/index.html

  • 933 Views
  • 1 replies
  • 0 kudos
Latest Reply
Mooune_DBU
Valued Contributor
  • 0 kudos

There are many ways you can retrieve experiments results using the mlflow API (see example if you want to retrieve and display for only a specific model (assuming you have the `model_name`:best_models = mlflow.search_runs(filter_string=f'tags.model="...

  • 0 kudos
User16826994223
by Honored Contributor III
  • 531 Views
  • 0 replies
  • 0 kudos

Delta sharing Features- Share live data directly - Easily share existing, live data in your Delta Lake without copying it to another system.Support d...

Delta sharing Features-Share live data directly - Easily share existing, live data in your Delta Lake without copying it to another system.Support diverse clients - Data recipients can directly connect to Delta Shares from Pandas, Apache Sparkâ„¢, Rus...

sharing-hero-v3
  • 531 Views
  • 0 replies
  • 0 kudos
User16789201666
by Contributor II
  • 939 Views
  • 1 replies
  • 0 kudos

When would you use the Feature Store?

For example would you use a feature store on your raw data or what's is the granularity of the features in the store?

  • 939 Views
  • 1 replies
  • 0 kudos
Latest Reply
Joseph_B
New Contributor III
  • 0 kudos

I'll try to answer the broad question first, followed by the specific ones.When would you use the Feature Store?A Feature Store is primarily used to solve 2 challenges.(1) Discoverability and governance of featuresChallenge: In a large team or organi...

  • 0 kudos
User16826993440
by New Contributor III
  • 1850 Views
  • 1 replies
  • 1 kudos

What is the best practice for applying MLFlow to clustering algorithms?

What is the best practice for applying MLFlow to clustering algorithms? What are the kinds of metrics customers track?

  • 1850 Views
  • 1 replies
  • 1 kudos
Latest Reply
Joseph_B
New Contributor III
  • 1 kudos

Good question! I'll divide my suggestions into 2 parts:(1) In terms of MLflow Tracking, clustering is pretty similar to other ML workflows, so not much changes.(2) In terms of specific parameters, metrics, etc. to track, clustering is very different...

  • 1 kudos
MoJaMa
by Valued Contributor II
  • 509 Views
  • 1 replies
  • 0 kudos
  • 509 Views
  • 1 replies
  • 0 kudos
Latest Reply
MoJaMa
Valued Contributor II
  • 0 kudos

Yes.Please see Blog1: https://databricks.com/blog/2020/06/03/customer-lifetime-value-part-1-estimating-customer-lifetimes.htmlNotebook1:https://databricks.com/notebooks/CLV_Part_1_Customer_Lifetimes.htmlBlog2: https://databricks.com/blog/2020/06/17/c...

  • 0 kudos
User16826994223
by Honored Contributor III
  • 2231 Views
  • 2 replies
  • 0 kudos

Resolved! Can we delte Mlflow experiment

I am using ML flow and my need of the hour is to delete an experiment and want to create another experiment with same run.client = MlflowClient(tracking_uri=server) client.delete_experiment(1)This deletes the experiment, but when I run a new experim...

  • 2231 Views
  • 2 replies
  • 0 kudos
Latest Reply
User16826994223
Honored Contributor III
  • 0 kudos

SQL Database:This is more tricky, as there are dependencies that need to be deleted. I am using MySQL, and these commands work for me:USE mlflow_db; # the name of your database DELETE FROM experiment_tags WHERE experiment_id=ANY( SELECT experime...

  • 0 kudos
1 More Replies
User16752240150
by New Contributor II
  • 1911 Views
  • 1 replies
  • 0 kudos

What's the best way to implement long term data versioning?

I'm a data scientist creating versioned ML models. For compliance reasons, I need to be able to replicate the training data for each model version. I've seen that you can version datasets by using delta, but the default retention period is around 30 ...

  • 1911 Views
  • 1 replies
  • 0 kudos
Latest Reply
sajith_appukutt
Honored Contributor II
  • 0 kudos

Delta, as you mentioned has a feature to do time travel and by default, delta tables retain the commit history for 30 days. Operations on history of the table are parallel but will become more expensive as the log size increasesNow, in this case - s...

  • 0 kudos
MoJaMa
by Valued Contributor II
  • 585 Views
  • 1 replies
  • 0 kudos
  • 585 Views
  • 1 replies
  • 0 kudos
Latest Reply
MoJaMa
Valued Contributor II
  • 0 kudos

Yes!You will have to pip install mlflowin your environment as a first step. For more details, see: https://docs.databricks.com/applications/mlflow/access-hosted-tracking-server.html

  • 0 kudos
Anonymous
by Not applicable
  • 657 Views
  • 1 replies
  • 0 kudos

Resolved! How is Databricks AutoML different than other AutoML products out there?

How does it provide a glass box view?

  • 657 Views
  • 1 replies
  • 0 kudos
Latest Reply
Mooune_DBU
Valued Contributor
  • 0 kudos

Depending on which solution you use, GlassBox means that any interactive work you do via point & click, we automatically generate the code behind the scene and generate notebooks used for each experiment that was ran under the hood, in addition for a...

  • 0 kudos
User16752240150
by New Contributor II
  • 519 Views
  • 1 replies
  • 1 kudos

What algorithms does Databricks AutoML use?

AutoML presumably tries a few different algorithms while hyperparameter searching. What model types are considered?

  • 519 Views
  • 1 replies
  • 1 kudos
Latest Reply
sean_owen
Honored Contributor II
  • 1 kudos

At the moment, it's really just xgboost, and sklearn implemenations like random forests, logistic regression, and linear regression as applicable. More possibilities are coming.

  • 1 kudos
Labels