Machine Learning

by yopbibo • Contributor II

08-29-2022 12:44:54 AM

701 Views
2 replies
0 kudos

Sending R functions to worker nodes

Hi!If I need to use many workers to distributes regular pandas, I would use a pandas_UDF. (having regular python crunching a slice of my data, on each node, and combining all results back to the driver node)Is there something equivalent for R?Thanks,

Machine Learning

Reply

701 Views
2 replies
0 kudos

08-29-2022 12:44:54 AM

View Replies

Latest Reply

Kaniz
Community Manager

09-07-2022 1:00:43 AM

0 kudos

Hi @Philippe CRAVE, Can you please elaborate more on your question - "Is there something equivalent for R"?

0 kudos

09-07-2022 1:00:43 AM

1 More Replies

by 898495 • New Contributor

07-27-2022 8:25:00 AM

816 Views
2 replies
1 kudos

MLFlow Python Error for Forecast problem

Hi Team,I am trying to implement automl in python for my timeseries forecast problem.But, I was facing below error during the model training:AttributeError: 'StanModel' object has no attribute 'fit_class'Due to the above error, the experiment failed ...

Machine Learning

Reply

816 Views
2 replies
1 kudos

07-27-2022 8:25:00 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-06-2022 3:14:37 AM

1 kudos

Hi @Prakash Thavamurugan Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from y...

1 kudos

09-06-2022 3:14:37 AM

1 More Replies

by sameer_gupta • New Contributor

07-10-2022 10:30:52 PM

958 Views
3 replies
0 kudos

Error in importing feature_store

from databricks import feature_storeI am trying to import feature_store but it is showing this error.ImportError: cannot import name 'feature_store' from 'databricks' (/databricks/python/lib/python3.8/site-packages/databricks/__init__.py)

Machine Learning

Reply

958 Views
3 replies
0 kudos

07-10-2022 10:30:52 PM

View Replies

Latest Reply

Anonymous
Not applicable

09-06-2022 12:44:38 AM

0 kudos

Is this issue resolved completely? We are facing the same problem. this might help.

0 kudos

09-06-2022 12:44:38 AM

2 More Replies

by Benji • New Contributor II

07-25-2022 11:47:33 PM

2397 Views
5 replies
0 kudos

Error when running job in databricks

Hello, I am very new with databricks and MLflow. I faced with the problem about running job. When the job is run, it usually failed and retried itself, so it incasesed running time, i.e., from normally 6 hrs to 12-18 hrs. From the error log, it shows...

Machine Learning

Reply

2397 Views
5 replies
0 kudos

07-25-2022 11:47:33 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 6:25:18 AM

0 kudos

Hey there @Tanawat Benchasirirot Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hea...

0 kudos

09-05-2022 6:25:18 AM

4 More Replies

by jdigiovanni • New Contributor

07-21-2022 9:28:35 AM

760 Views
3 replies
0 kudos

EOFError trying to assign a model using a custom module

I'm in a Data Science Bootcamp, and the final case study includes data preprocessing (done), using a linear regression model on the data, then porting to SQL for visualization. The model build uses custom python code provided as part of the exercise....

Machine Learning

Reply

760 Views
3 replies
0 kudos

07-21-2022 9:28:35 AM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 5:54:30 AM

0 kudos

Hi @Joe DiGiovanni Just wanted to check in if you were able to resolve your issue or do you need more help? We'd love to hear from you.Thanks!

0 kudos

09-05-2022 5:54:30 AM

2 More Replies

by sameer_gupta • New Contributor

07-20-2022 10:14:43 PM

1131 Views
2 replies
0 kudos

Error in importing mlflow.sklearn

ImportError: cannot import name '_MIN_SKLEARN_VERSION' from 'mlflow.sklearn.utils' (/databricks/python/lib/python3.8/site-packages/mlflow/sklearn/utils.py)

Machine Learning

Reply

1131 Views
2 replies
0 kudos

07-20-2022 10:14:43 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 5:21:00 AM

0 kudos

Hi @Sameer Gupta Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thank...

0 kudos

09-05-2022 5:21:00 AM

1 More Replies

by fsyshawn • New Contributor II

07-18-2022 1:13:39 PM

530 Views
2 replies
0 kudos

How can we automate MLFLOW model serving in databricks?

Can we enable model serving either using cli or any other tools without go to the databricks model UI?

Machine Learning

Reply

530 Views
2 replies
0 kudos

07-18-2022 1:13:39 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 3:42:53 AM

0 kudos

Hi @Shawn Feng Does @Atanu Sarkar response answer your question? If yes, would you be happy to mark it as best so that other members can find the solution more quickly?We'd love to hear from you.Thanks!

0 kudos

09-05-2022 3:42:53 AM

1 More Replies

by yopbibo • Contributor II

09-02-2022 6:53:19 AM

1000 Views
2 replies
5 kudos

Deploy a ML model, trained and registered in Databricks to AKS

Hi,I can train, registered a ML Model in my Datbricks Workspace.Then, to deploy it on AKS, I need to register the model in Azure ML, and then, deploy to AKS.Is it possible to skip the Azure ML step?I would like to deploy directly into my AKS instance...

Machine Learning

Reply

1000 Views
2 replies
5 kudos

09-02-2022 6:53:19 AM

View Replies

Latest Reply

Debayan
Esteemed Contributor III

09-02-2022 3:18:11 PM

5 kudos

Hi, Thanks for reaching out to Databricks. Registering a model can be done, and it is not mentioned if it is optional or not in Microsoft documents. Reference : https://docs.microsoft.com/en-gb/azure/databricks/applications/mlflow/models#register-mod...

5 kudos

09-02-2022 3:18:11 PM

1 More Replies

by Somi • New Contributor III

08-23-2022 10:45:05 AM

3730 Views
10 replies
0 kudos

Resolved! How to set sparkTrials? I am receiving this TypeError: cannot pickle '_thread.lock' object

I am trying to distribute hyperparameter tuning using hyperopt on a tensorflow.keras model. I am using sparkTrials in my fmin:spark_trials = SparkTrials(parallelism=4)...best_hyperparam = fmin(fn=CNN_HOF, space=space, ...

Machine Learning

Reply

3730 Views
10 replies
0 kudos

08-23-2022 10:45:05 AM

View Replies

Latest Reply

Dooley
Valued Contributor

08-26-2022 2:48:11 PM

0 kudos

This can happen when you try to serialize a keras model with an unserializable layer. What does your model look like? Also what is in that search space variable? What are you trying to optimize on?

0 kudos

08-26-2022 2:48:11 PM

9 More Replies

by Somi • New Contributor III

06-24-2022 11:07:35 AM

598 Views
3 replies
0 kudos

No saved model after stopping the cluster.

I have saved a keras model in some directories in dbfs to load and retrain that with more data, etc. The problem is that when cluster stops and restarts, seems those directories and model are no longer available there and it starts training a new mod...

Machine Learning

Reply

598 Views
3 replies
0 kudos

06-24-2022 11:07:35 AM

View Replies

Latest Reply

Somi
New Contributor III

09-02-2022 12:58:50 PM

0 kudos

Hi @Vidula Khanna I figured it out by replacing OS library module with dbutils utilities. It looks like mre compatible with DBFS.

0 kudos

09-02-2022 12:58:50 PM

2 More Replies

by Ashley1 • Contributor

07-05-2022 10:09:48 PM

2717 Views
5 replies
5 kudos

Feature table: merge very slow

Hi All, We're just started to look at the feature store capabilities of Databricks. Our first attempt to create a feature table has resulted in very slow write. To avoid the time incurred by the feature functions I generated a dataframe with same...

Historical Spark UI for cluster 0622-013318-zoqth84b, driver 332737051535251367 - Details for Query 352

Machine Learning

Reply

2717 Views
5 replies
5 kudos

07-05-2022 10:09:48 PM

View Replies

Latest Reply

Vidula
Honored Contributor

08-31-2022 11:43:05 PM

5 kudos

Hi @Ashley Betts Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thank...

5 kudos

08-31-2022 11:43:05 PM

4 More Replies

by Giorgi • New Contributor III

08-30-2022 5:57:32 AM

551 Views
2 replies
4 kudos

Resolved! Azure Data Factory: allocate resources per Notebook

I'm using Azure Data Factory to create pipeline of Databricks notebooks, something like this:[Notebook 1 - data pre-processing ] -> [Notebook 2 - model training ] -> [Notebook 3 - performance evaluation].Can I write some config file, that would allow...

Machine Learning

Reply

551 Views
2 replies
4 kudos

08-30-2022 5:57:32 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

08-30-2022 10:07:55 AM

4 kudos

I understand that, in your case, auto-scaling will take too much time.The simplest option is to use a different cluster for another notebook (and be sure that the previous cluster is terminated instantly).Another option is to use REST API 2.0/cluster...

4 kudos

08-30-2022 10:07:55 AM

1 More Replies

by confusedIntern • New Contributor III

08-02-2022 2:20:02 PM

1625 Views
4 replies
2 kudos

Uploaded Docker image into cluster. Used cluster for MLFlow experiment, but no experiment is logged/there are no experiment runs. Why is this?

Hi! So I used this MLFlow experiment I found from the databricks website: https://docs.databricks.com/_static/notebooks/machine-learning-with-unity-catalog.htmlAnd I created this cluster using a custom Docker image I created myself: Usually when I c...

Machine Learning

Reply

1625 Views
4 replies
2 kudos

08-02-2022 2:20:02 PM

View Replies

Latest Reply

Debayan
Esteemed Contributor III

08-02-2022 2:39:38 PM

2 kudos

Have you tried the steps mentioned in the below URL:https://docs.databricks.com/clusters/custom-containers.html#step-3-launch-your-cluster

2 kudos

08-02-2022 2:39:38 PM

3 More Replies

by THIAM_HUATTAN • Valued Contributor

06-26-2022 11:26:50 PM

1118 Views
7 replies
6 kudos

Why this Databricks ML code gets stuck?

I could not paste the code here because of the some word not allowed, so I have to paste it elsewhere.Below is OK:https://justpaste.it/8xcr9But below gets stuck:https://justpaste.it/8nydtand it keeps looping and running...

Machine Learning

Reply

1118 Views
7 replies
6 kudos

06-26-2022 11:26:50 PM

View Replies

Latest Reply

Vidula
Honored Contributor

08-27-2022 12:28:12 AM

6 kudos

Hey @THIAM HUAT TAN Hope all is well! Just wanted to check in if you were able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you....

6 kudos

08-27-2022 12:28:12 AM

6 More Replies

by matebreeze • New Contributor

08-26-2022 8:20:34 AM

814 Views
0 replies
0 kudos

MLflow model serving: KeyError: 'python_function'

Hello, I am training a logistic regression on text with the help of an tf-idf vectorizer.This is done with MLflow and sklearn in databricks.The model itself is trained successfully in databricks and it is possible to accomplish predictions within the...

Machine Learning

Reply

814 Views
0 replies
0 kudos

08-26-2022 8:20:34 AM

Databricks

Forum Posts

Sending R functions to worker nodes

MLFlow Python Error for Forecast problem

Error in importing feature_store

Error when running job in databricks

EOFError trying to assign a model using a custom module

Error in importing mlflow.sklearn

How can we automate MLFLOW model serving in databricks?

Deploy a ML model, trained and registered in Databricks to AKS

Resolved! How to set sparkTrials? I am receiving this TypeError: cannot pickle '_thread.lock' object

No saved model after stopping the cluster.

Feature table: merge very slow

Resolved! Azure Data Factory: allocate resources per Notebook

Uploaded Docker image into cluster. Used cluster for MLFlow experiment, but no experiment is logged/there are no experiment runs. Why is this?

Why this Databricks ML code gets stuck?

MLflow model serving: KeyError: 'python_function'

pdb debugger on databricks

import ml.dmlc.xgboost4j.scala.spark.{XGBoostEstim...

Query ML Endpoint with R and Curl

'error_code': 'INVALID_PARAMETER_VALUE', 'message'...

AutoMl Dataset too large