- 4490 Views
- 1 reply
- 0 kudos
What's the best way to implement long term data versioning?
I'm a data scientist creating versioned ML models. For compliance reasons, I need to be able to replicate the training data for each model version. I've seen that you can version datasets using Delta, but the default retention period is around 30 ...
Delta, as you mentioned, has a time travel feature, and by default Delta tables retain commit history for 30 days. Operations on a table's history run in parallel but become more expensive as the log size increases. Now, in this case - s...
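To make that concrete, here's a minimal sketch (PySpark) of extending a table's retention windows for longer-term reproducibility and then reading back a pinned version; the table name, path, and intervals are illustrative, and keeping history longer does increase storage cost:

```python
# Retain the commit log and the underlying data files for a year instead of
# the defaults -- table name and intervals are placeholders.
spark.sql("""
    ALTER TABLE training_data SET TBLPROPERTIES (
        'delta.logRetentionDuration' = 'interval 365 days',
        'delta.deletedFileRetentionDuration' = 'interval 365 days'
    )
""")

# Reproduce the exact training data behind a given model version.
df_v3 = (
    spark.read.format("delta")
    .option("versionAsOf", 3)        # or .option("timestampAsOf", "2021-06-01")
    .load("/delta/training_data")    # placeholder path
)
```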
- 1343 Views
- 1 reply
- 0 kudos
Yes. Please refer to our docs: https://docs.databricks.com/applications/machine-learning/manage-model-lifecycle/multiple-workspaces.html
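As a rough sketch of the pattern described there, an MLflow client can target a model registry in a different workspace via a registry URI backed by a secret scope; the scope and prefix below are placeholders you'd create per the linked docs:

```python
import mlflow

# Point this workspace's MLflow client at a central model-registry workspace.
mlflow.set_registry_uri("databricks://registry-scope:registry-prefix")

# Registration and loading then go through the remote registry, e.g.:
# mlflow.register_model("runs:/<run-id>/model", "my-model")
```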
- 2028 Views
- 1 reply
- 0 kudos
Yes! You will have to pip install mlflow in your environment as a first step. For more details, see: https://docs.databricks.com/applications/mlflow/access-hosted-tracking-server.html
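A minimal sketch of logging to the hosted tracking server from an external environment, assuming authentication is already set up (for example via databricks configure, or the DATABRICKS_HOST and DATABRICKS_TOKEN environment variables); the experiment path is a placeholder:

```python
import mlflow

# "databricks" routes logging to the Databricks-hosted tracking server.
mlflow.set_tracking_uri("databricks")
mlflow.set_experiment("/Users/you@example.com/demo-experiment")

with mlflow.start_run():
    mlflow.log_param("alpha", 0.5)
    mlflow.log_metric("rmse", 0.87)
```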

- 1718 Views
- 1 reply
- 0 kudos
Resolved! How is Databricks AutoML different from other AutoML products out there?
How does it provide a glass box view?
Compared with other solutions, glass box means that for any interactive work you do via point and click, we automatically generate the code behind the scenes and produce the notebooks used for each experiment that was run under the hood, in addition for a...
- 1768 Views
- 1 reply
- 0 kudos
What are the differences between Open Source and Hosted MLflow?
We have been using open-source MLflow; how would it benefit us to move to Databricks MLflow?
Please see https://databricks.com/product/managed-mlflow

- 1515 Views
- 1 reply
- 1 kudos
What algorithms does Databricks AutoML use?
AutoML presumably tries a few different algorithms during its hyperparameter search. What model types are considered?
At the moment, it's really just XGBoost, plus scikit-learn implementations like random forests, logistic regression, and linear regression, as applicable. More possibilities are coming.
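For reference, a typical invocation on an ML runtime looks roughly like this; the DataFrame, target column, and timeout are illustrative:

```python
from databricks import automl

# AutoML searches over the model families above and logs every trial to MLflow.
summary = automl.classify(train_df, target_col="label", timeout_minutes=30)
print(summary.best_trial.model_path)  # MLflow model path of the best trial
```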
- 1556 Views
- 1 reply
- 0 kudos
How can I use non-Spark libraries like spacy with Databricks and Spark?
I have an NLP application that I built on my local machine using spacy and pandas, but now I would like to scale my application to a large production dataset and utilize the benefits of Spark's distributed compute. How do I import and utilize a librar...
It depends on what you mean, but if you're just trying to (say) tokenize and process data with spacy in parallel, then that's trivial. Write a 'pandas UDF' function that expresses how you want to transform data using spacy, in terms of a pandas DataF...
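A minimal sketch of that pandas UDF pattern, assuming spacy and its small English model (en_core_web_sm) are installed on the cluster; the column name is illustrative:

```python
import pandas as pd
import spacy
from pyspark.sql.functions import pandas_udf
from pyspark.sql.types import ArrayType, StringType

@pandas_udf(ArrayType(StringType()))
def tokenize(texts: pd.Series) -> pd.Series:
    # The UDF receives whole batches of rows, so the model is loaded once
    # per batch rather than once per row.
    nlp = spacy.load("en_core_web_sm", disable=["parser", "ner"])
    return texts.apply(lambda t: [tok.text for tok in nlp(t)])

# Usage, where df is a Spark DataFrame with a string column "text":
# df.withColumn("tokens", tokenize("text"))
```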

- 2712 Views
- 1 reply
- 0 kudos
Resolved! Is there documentation about the feature store API and how it's architected under the hood?
I don't think we have a lot of internal docs, just high-level explanations like https://databricks.com/blog/2021/05/27/databricks-announces-the-first-feature-store-integrated-with-delta-lake-and-mlflow.html However, I don't think there's much to it. Th...

- 2084 Views
- 1 reply
- 0 kudos
The feature store has both online and offline components. The offline feature store is used for feature discovery, model training, and batch inference, and is backed by Delta tables. You could read/write to the offline store from Databricks clusters that...
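As a brief sketch of the client API (assuming an ML runtime that ships databricks.feature_store; the table and key names are illustrative):

```python
from databricks.feature_store import FeatureStoreClient

fs = FeatureStoreClient()

# Register a Spark DataFrame of features as an offline, Delta-backed table.
fs.create_table(
    name="recommender.customer_features",
    primary_keys=["customer_id"],
    df=features_df,  # computed upstream
    description="Customer features for training and batch inference",
)

# The offline store can be read back for training or batch scoring.
training_df = fs.read_table("recommender.customer_features")
```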
- 2125 Views
- 1 reply
- 1 kudos
What are the best NLP libraries to use with Spark?
Which NLP APIs give the best performance with Spark?
By far the most popular and comprehensive library for Spark-native distributed NLP, to my knowledge, is spark-nlp from John Snow Labs: https://nlp.johnsnowlabs.com/ It is open source (but with commercial support options) and has a whole lot of funct...
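For a flavor of the API, here is a minimal sketch assuming spark-nlp is installed on the cluster:

```python
import sparknlp
from sparknlp.base import DocumentAssembler
from sparknlp.annotator import Tokenizer
from pyspark.ml import Pipeline

spark = sparknlp.start()
df = spark.createDataFrame([("Spark NLP runs natively on Spark.",)], ["text"])

# spark-nlp components compose as ordinary Spark ML pipeline stages.
document = DocumentAssembler().setInputCol("text").setOutputCol("document")
tokenizer = Tokenizer().setInputCols(["document"]).setOutputCol("token")

result = Pipeline(stages=[document, tokenizer]).fit(df).transform(df)
result.select("token.result").show(truncate=False)
```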
- 3212 Views
- 1 reply
- 0 kudos
These terms are borrowed from scikit-learn, and the idea is the same. A transformer is just a component of a pipeline that transforms the data in some way. An estimator is also a transformer, but one that additionally needs to be 'fit' on data before ...
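A small illustration with Spark ML, assuming df is a DataFrame with a string column "category":

```python
from pyspark.ml.feature import StringIndexer

# StringIndexer is an estimator: it must first see the data to learn the
# label-to-index mapping.
indexer = StringIndexer(inputCol="category", outputCol="category_idx")
indexer_model = indexer.fit(df)            # fitting yields a transformer
indexed_df = indexer_model.transform(df)   # transformers just map data to data
```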

- 9972 Views
- 1 reply
- 0 kudos
Resolved! How can I embed image to my notebook?
If the image is the result of, for example, a plotting library's output, it should just render as-is. If it's not, then one simple approach is to write a markdown (%md) cell and include a link to the image. Of course, this requires t...
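For instance, a markdown cell along these lines (the URL is a placeholder):

```
%md
![pipeline diagram](https://example.com/diagram.png)
```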

- 2231 Views
- 1 reply
- 0 kudos
Resolved! Best practice for Image manipulation
Can you recommend approaches for image manipulation once you've read the data in as images? Any specific library to use?
Spark has a built-in 'image' data source which will read a directory of image files as a DataFrame: spark.read.format("image").load(...). The resulting DataFrame has the pixel data, dimensions, channels, etc. You can also read image files 'manually' ...
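A quick sketch of that data source; the path is a placeholder:

```python
# Each row of the resulting DataFrame holds the file origin, dimensions,
# channel count, and raw pixel bytes.
image_df = spark.read.format("image").load("/mnt/data/images/")
image_df.select("image.origin", "image.width", "image.height", "image.nChannels").show(5)
```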
- 6441 Views
- 2 replies
- 0 kudos
Can I access Delta tables outside of Databricks Runtime?
Is it possible to write the same table from Databricks and from OSS too? Also, what if I want to read the data from MapReduce or Hive?
Yes. The Delta client is open source, and lets you read/write Delta tables if you add it to your external application. See https://docs.delta.io/latest/index.html
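As a minimal sketch, this is roughly how open-source Spark is configured to read and write the same Delta table (the delta-spark package must be installed, and the path is a placeholder):

```python
from pyspark.sql import SparkSession

# Standard OSS Delta Lake setup: register the Delta SQL extension and catalog.
spark = (
    SparkSession.builder.appName("delta-oss")
    .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
    .config("spark.sql.catalog.spark_catalog",
            "org.apache.spark.sql.delta.catalog.DeltaCatalog")
    .getOrCreate()
)

df = spark.read.format("delta").load("/data/events")
df.write.format("delta").mode("append").save("/data/events")
```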