Machine Learning

by MoJaMa • Valued Contributor II

06-24-2021 7:21:46 PM

1163 Views
1 replies
2 kudos

Does Databricks AutoML support Time Series Forecasting?

Machine Learning

Reply

1163 Views
1 replies
2 kudos

06-24-2021 7:21:46 PM

View Replies

Latest Reply

Mooune_DBU
Valued Contributor

06-25-2021 3:02:15 PM

2 kudos

Not yet, but stay-tuned it's being cooked in the kitchen

2 kudos

06-25-2021 3:02:15 PM

by MoJaMa • Valued Contributor II

06-25-2021 1:02:53 PM

745 Views
1 replies
0 kudos

Is storage for Feature Store in Control Plane? Where does the Delta Table live?

Machine Learning

Reply

745 Views
1 replies
0 kudos

06-25-2021 1:02:53 PM

View Replies

Latest Reply

MoJaMa
Valued Contributor II

06-25-2021 1:03:34 PM

0 kudos

Data is stored in the control plane. Metadata (eg feature table descriptions, column types, etc) is stored in the control plane. The location where the Delta table is stored is determined by the database location. The customer could call CREATE DATA...

0 kudos

06-25-2021 1:03:34 PM

by User16826990884 • New Contributor III

06-25-2021 11:59:36 AM

1049 Views
1 replies
0 kudos

Rollback cluster changes

Is it possible to rollback changes made to a cluster? The problem I'm trying to solve is to recover from an accidental change made by a user on a cluster that affects interactive and job runs. Cluster policies help, but the policy still provides the ...

Machine Learning

Reply

1049 Views
1 replies
0 kudos

06-25-2021 11:59:36 AM

View Replies

Latest Reply

sajith_appukutt
Honored Contributor II

06-25-2021 12:24:54 PM

0 kudos

You could look at automating cluster creation steps and implementing this with an infra-as-code solution like the databricks terraform provider which allows rollback

0 kudos

06-25-2021 12:24:54 PM

by User16826990884 • New Contributor III

06-25-2021 12:08:06 PM

1083 Views
0 replies
1 kudos

Dev and Prod environments

Do we have general guidance around how other customers manage Dev and Prod environments in Databricks? Is it recommended to have separate workspaces for them? What are the pros and cons of using the same workspace with folder or repo level isolation?

Machine Learning

Reply

1083 Views
0 replies
1 kudos

06-25-2021 12:08:06 PM

by User16826994223 • Honored Contributor III

06-25-2021 11:22:59 AM

1723 Views
1 replies
0 kudos

Delta Lake MERGE INTO statement error

I'm trying to run Delta Lake MergeMERGE INTO source USING updates ON source.d = updates.sessionId WHEN MATCHED THEN UPDATE * WHEN NOT MATCHED THEN INSERT *I'm getting an SQL errorParseException: mismatched input 'MERGE' expecting {'(', 'SELECT', 'FR...

Machine Learning

Reply

1723 Views
1 replies
0 kudos

06-25-2021 11:22:59 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-25-2021 11:23:35 AM

0 kudos

The merge SQL support is added in Delta Lake 0.7.0. You also need to upgrade your Apache Spark to 3.0.0 and enable the integration with Apache Spark DataSourceV2 and C

0 kudos

06-25-2021 11:23:35 AM

by User16826992666 • Valued Contributor

06-25-2021 10:47:27 AM

535 Views
0 replies
0 kudos

Should I be saving my SparkML models in MLflow using MLeap?

There's a lot of different ML formats out there and I am confused about how they should be fitting together. How should I be thinking about MLflow and MLeap working together?

Machine Learning

Reply

535 Views
0 replies
0 kudos

06-25-2021 10:47:27 AM

by User16765131552 • Contributor III

06-25-2021 10:37:47 AM

1185 Views
1 replies
0 kudos

Resolved! Setup a model serving REST endpoint?

I am trying to set up a demo with a really simple spark ML model and i see this error repeated over and over in the logs in the serving UI:/databricks/chauffeur/model-runner/lib/python3.6/site-packages/urllib3/connectionpool.py:1020: InsecureRequestW...

Machine Learning

Reply

1185 Views
1 replies
0 kudos

06-25-2021 10:37:47 AM

View Replies

Latest Reply

User16765131552
Contributor III

06-25-2021 10:38:25 AM

0 kudos

Not sure how the containers for each model version work on the endpoints, but looks like Model serving endpoints use a 7.x runtime. So those would be Spark 3.0, not Spark 3.1

0 kudos

06-25-2021 10:38:25 AM

by User16826994223 • Honored Contributor III

06-25-2021 9:38:52 AM

1261 Views
1 replies
0 kudos

Using l vacuum with a dry run in Python for a Delta Lake

I can see an example on how to call the vacuum function for a Delta lake in python here. how to use the same in python %sql VACUUM delta.`dbfs:/mnt/<myfolder>` DRY RUN

Machine Learning

Reply

1261 Views
1 replies
0 kudos

06-25-2021 9:38:52 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-25-2021 9:39:11 AM

0 kudos

The dry run for non-SQL code is not yet available in Delta version 0.8. I see there is a bug that is opened with delta opensource in git . hope it get resolved soon

0 kudos

06-25-2021 9:39:11 AM

by User16826994223 • Honored Contributor III

06-25-2021 8:03:42 AM

1147 Views
1 replies
0 kudos

Resolved! where Can I find the the logs of spark job runs in Azure storage

Hi Want to find the storage bucket where all my runs' logs are stored , I want to do analytics on logs , can you please help me knowing which bucket or path I should look for

Machine Learning

Reply

1147 Views
1 replies
0 kudos

06-25-2021 8:03:42 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-25-2021 8:04:14 AM

0 kudos

The root bucket where are not directly accessible outside databricks so you need to read the logs from databricks notebook only

0 kudos

06-25-2021 8:04:14 AM

by User16826994223 • Honored Contributor III

06-25-2021 6:22:11 AM

1495 Views
1 replies
0 kudos

Resolved! Exception: Run with UUID l567845ae5a7cf04a40902ae789076093c is already active.

I'm trying to create a new experiment on mlflow but I have this problem:Exception: Run with UUID l142ae5a7cf04a40902ae9ed7326093c is already active. snippet mlflow.set_experiment("New experiment 2") mlflow.set_tracking_uri('http://mlflow:5000') ...

Machine Learning

Reply

1495 Views
1 replies
0 kudos

06-25-2021 6:22:11 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-25-2021 6:22:31 AM

0 kudos

You have to run mlflow.end_run() to finish the first experiment. Then you can create another

0 kudos

06-25-2021 6:22:31 AM

by User16826994223 • Honored Contributor III

06-25-2021 6:17:42 AM

955 Views
1 replies
0 kudos

Resolved! What all language that ML flow Support

Machine Learning

Reply

955 Views
1 replies
0 kudos

06-25-2021 6:17:42 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-25-2021 6:18:00 AM

0 kudos

MLflow supports Java, Python, R, and REST APIs.

0 kudos

06-25-2021 6:18:00 AM

by User16826994223 • Honored Contributor III

06-25-2021 6:16:56 AM

728 Views
1 replies
0 kudos

What is the preview feature for Auto ML

Machine Learning

Reply

728 Views
1 replies
0 kudos

06-25-2021 6:16:56 AM

View Replies

Latest Reply

User16826994223
Honored Contributor III

06-25-2021 6:17:16 AM

0 kudos

A - AutoML public preview featuresThe Databricks AutoML Public Preview parallelizes training over sklearn and xgboost models for classification (binary and multiclass) and regression problems. We support datasets with numerical, categorical and times...

0 kudos

06-25-2021 6:17:16 AM

by MoJaMa • Valued Contributor II

06-24-2021 7:18:58 PM

544 Views
0 replies
0 kudos

I see that Databricks supports notebook-scoped Python Libraries. Is there similar support for R?

https://docs.databricks.com/libraries/notebooks-python-libraries.html

Machine Learning

Reply

544 Views
0 replies
0 kudos

06-24-2021 7:18:58 PM

by brickster_2018 • Esteemed Contributor

06-24-2021 11:02:59 AM

1364 Views
1 replies
0 kudos

Resolved! After applying a MERGE in a previously Z-ORDERED table, Will the table loses the entire Z-ORDER optimization or only on the files touched by the MERGE?

Machine Learning

Reply

1364 Views
1 replies
0 kudos

06-24-2021 11:02:59 AM

View Replies

Latest Reply

brickster_2018
Esteemed Contributor

06-24-2021 11:04:13 AM

0 kudos

The impact will be only on the files touched by the MERGE operation. The newly created files will not be optimized and data co-locality is not ensured. However, the files which are not touched by the MERGE operation will continue to show the improvem...

0 kudos

06-24-2021 11:04:13 AM

by User16789201666 • Contributor II

06-23-2021 7:41:19 AM

706 Views
0 replies
0 kudos

What's a best practice for Hyperopt workflow?

Choose what hyperparameters are reasonable to optimizeDefine broad ranges for each of the hyperparameters (including the default where applicable)Run a small number of trialsObserve the results in an MLflow parallel coordinate plot and select the run...

Machine Learning

Reply

706 Views
0 replies
0 kudos

06-23-2021 7:41:19 AM

Databricks Community

Forum Posts

Does Databricks AutoML support Time Series Forecasting?

Is storage for Feature Store in Control Plane? Where does the Delta Table live?

Rollback cluster changes

Dev and Prod environments

Delta Lake MERGE INTO statement error

Should I be saving my SparkML models in MLflow using MLeap?

Resolved! Setup a model serving REST endpoint?

Using l vacuum with a dry run in Python for a Delta Lake

Resolved! where Can I find the the logs of spark job runs in Azure storage

Resolved! Exception: Run with UUID l567845ae5a7cf04a40902ae789076093c is already active.

Resolved! What all language that ML flow Support

What is the preview feature for Auto ML

I see that Databricks supports notebook-scoped Python Libraries. Is there similar support for R?

Resolved! After applying a MERGE in a previously Z-ORDERED table, Will the table loses the entire Z-ORDER optimization or only on the files touched by the MERGE?

What's a best practice for Hyperopt workflow?

Vectorsearch ConnectionResetError Max retries exce...

Model flavour using feature store model training l...

databricks-cli

Deployment as code pattern with double training ef...

ML model promotion from Databricks dev workspace t...