Machine Learning

by moh3th1 • Visitor

5 hours ago

11 Views
0 replies
0 kudos

Optimal Cluster Configuration for Training on Billion-Row Datasets

Hello Databricks Community,I am currently facing a challenge in configuring a cluster for training machine learning models on a dataset consisting of approximately a billion rows and 40 features. Given the volume of data, I want to ensure that the cl...

Machine Learning

Reply

11 Views
0 replies
0 kudos

5 hours ago

by Shreyash • New Contributor

Tuesday

138 Views
4 replies
0 kudos

java.lang.ClassNotFoundException: com.johnsnowlabs.nlp.DocumentAssembler

I am trying to serve a pyspark model using an endpoint. I was able to load and register the model normally. I could also load that model and perform inference but while serving the model, I am getting the following error: [94fffqts54] ERROR StatusLog...

Machine Learning

Model serving

sparknlp

Reply

138 Views
4 replies
0 kudos

Tuesday

View Replies

Latest Reply

Kaniz
Community Manager

yesterday

0 kudos

Hi @Shreyash, It looks like your code is encountering a java.lang.ClassNotFoundException for the com.johnsnowlabs.nlp.DocumentAssembler class while serving your PySpark model. This error occurs when the required class is not found in the classpath. ...

0 kudos

yesterday

3 More Replies

by amal15 • New Contributor II

Monday

86 Views
1 replies
0 kudos

XGBoostEstimator is not a member of package ml.dmlc.xgboost4j.scala.spark ?

XGBoostEstimator is not a member of package ml.dmlc.xgboost4j.scala.spark ?How can I resolve this error?

Machine Learning

Reply

86 Views
1 replies
0 kudos

Monday

View Replies

Latest Reply

Kaniz
Community Manager

yesterday

0 kudos

Hi @amal15, The error message you’re encountering, “XGBoostEstimator is not a member of package ml.dmlc.xgboost4j.scala.spark,” indicates that the XGBoostEstimator class is not being recognized within the specified package. Check Dependencie...

0 kudos

yesterday

by Colombia • New Contributor

Monday

204 Views
1 replies
0 kudos

Use OF API from package enerbitdso 0.1.8 PYPI

Hello! I have code to use an API supplied in the energitdso package (This is the repository https://pypi.org/project/enerbitdso/). I changed the code adapting it to AZURE DATABRICKS in python, but although there is a connection with the API, it does ...

Machine Learning

Reply

204 Views
1 replies
0 kudos

Monday

View Replies

Latest Reply

Kaniz
Community Manager

yesterday

0 kudos

Hi @Colombia, To execute a notebook in Azure Databricks programmatically and retrieve its results, you can use the Jobs REST API. Here’s how it works: Create a new job (using the notebook_task parameter) or create a single run (also called RunSubmit...

0 kudos

yesterday

by e6exghu8 • New Contributor

Monday

153 Views
1 replies
0 kudos

Help - org.apache.spark.SparkException: Job aborted due to stage failure: Task 47 in stage 2842.0

Hello, I am training a SparkXGBRegressor model. It runs without errors if the complexity is low, however when I increase the max_depth and/or num_parallel_tree parameters, I get an error. I checked the cluster metrics during training and it doesn't l...

Machine Learning

Reply

153 Views
1 replies
0 kudos

Monday

View Replies

Latest Reply

Kaniz
Community Manager

yesterday

0 kudos

Hi @e6exghu8, Ensure that your cluster has sufficient memory to handle the increased complexity (higher max_depth and num_parallel_tree).Check the memory configuration for your Spark executors. You might need to allocate more memory to each executor...

0 kudos

yesterday

by cmilligan • Contributor II

11-23-2022 12:43:30 PM

3073 Views
3 replies
2 kudos

Issue with Multi-column In predicates are not supported in the DELETE condition.

I'm trying to delete rows from a table with the same date or id as records in another table. I'm using the below query and get the error 'Multi-column In predicates are not supported in the DELETE condition'. delete from cost_model.cm_dispatch_consol...

Machine Learning

Reply

3073 Views
3 replies
2 kudos

11-23-2022 12:43:30 PM

View Replies

Latest Reply

shubhaskar
New Contributor

yesterday

2 kudos

Had the same issue. Please check the subquery returned value there must be something wrong with that.

2 kudos

yesterday

2 More Replies

by AChang • New Contributor III

08-22-2023 1:38:44 PM

1803 Views
2 replies
1 kudos

How to fix this runtime error in this Databricks distributed training tutorial workbook

I am following along with this notebook found from this article. I am attempting to fine tune the model with a single node and multiple GPUs, so I run everything up to the "Run Local Training" section, but from there I skip to "Run distributed traini...

Machine Learning

Reply

1803 Views
2 replies
1 kudos

08-22-2023 1:38:44 PM

View Replies

Latest Reply

KYX
New Contributor

Monday

1 kudos

Hi AChang, have you eventually resolved the error? I've also having the same error.

1 kudos

Monday

1 More Replies

by amal15 • New Contributor II

Saturday

347 Views
2 replies
1 kudos

Resolved! import ml.dmlc.xgboost4j.scala.spark.{XGBoostEstimator, XGBoostClassificationModel}

how i can import : import com.microsoft.ml.spark.{LightGBMClassifier,LightGBMClassificationModel}import ml.dmlc.xgboost4j.scala.spark.{XGBoostEstimator, XGBoostClassificationModel} projet spark & scala in databricks

Machine Learning

Reply

347 Views
2 replies
1 kudos

Saturday

View Replies

Latest Reply

amal15
New Contributor II

Monday

1 kudos

XGBoostEstimator is not a member of package ml.dmlc.xgboost4j.scala.spark ?How can I resolve this error?with maven : ml.dmlc:xgboost4j-spark_2.12:2.0.3

1 kudos

Monday

1 More Replies

by chrisf_sts • New Contributor II

Sunday

224 Views
0 replies
0 kudos

Extract calculations naive bayes model

I have a naive Bayes ML model that takes call attributes and predicts if the caller is going to abandon the call while they are on hold waiting to speak to an agent. The model lives in Databricks ML flow, I have it registered. What I need to do is ex...

Machine Learning

Reply

224 Views
0 replies
0 kudos

Sunday

by Lcsp • New Contributor

a week ago

274 Views
0 replies
0 kudos

AssertionError Failed to create the catalog

getting this error when trying to setup the get-started-with-databricks-for-machine-learning LAB . Unity catalog is enabled. Validating the locally installed datasets: | listing local files...(0 seconds) | validation completed...(0 seconds total) C...

Machine Learning

Reply

274 Views
0 replies
0 kudos

a week ago

by amal15 • New Contributor II

a week ago

70 Views
0 replies
0 kudos

error: not found: type XGBoostEstimator

error: not found: type XGBoostEstimator Spark & Scala

Machine Learning

Reply

70 Views
0 replies
0 kudos

a week ago

by tanjil • New Contributor III

2 weeks ago

274 Views
2 replies
0 kudos

Import mlflow Error

Hello, I am trying to replicate this motebook in my environment: mlflow-end-to-end-example - Databricks However, I am getting the following error when I run "import mlflow": "TypeError: bases must be types"How can I solve this issue? Thank you, Tanji...

Machine Learning

Reply

274 Views
2 replies
0 kudos

2 weeks ago

View Replies

Latest Reply

Walter_C
Valued Contributor II

2 weeks ago

0 kudos

Can you share the specific cell of the notebook where you are receiving this error? Have you modified the code or it is the same? Do you have any particular libraries installed on the cluster you are using for the testing?

0 kudos

2 weeks ago

1 More Replies

by Kaizen • Contributor III

a week ago

474 Views
2 replies
0 kudos

Unity Catalog table management with multiple teams members

Hi! How are you guys managing large teams working on the same project. Each member has their own data to save in Unity Catalog.Based on my understanding there is only two ways to manage this:1) Create an individual member schea so they can store thei...

Machine Learning

Reply

474 Views
2 replies
0 kudos

a week ago

View Replies

Latest Reply

Kaizen
Contributor III

a week ago

0 kudos

Any suggestions regarding this?@s_park , @Sujitha , @Debayan

0 kudos

a week ago

1 More Replies

by MinThuraZaw • New Contributor III

a week ago

59 Views
0 replies
0 kudos

404 Page Not Found Error on Features page

We are facing this issue when accessing Features page. Our workspace is on AWS, ap-southeast-1.I think this is related to new feature for online tables and serverless. Is it because of online tables are not available yet in our region? If it not avai...

Machine Learning

Reply

59 Views
0 replies
0 kudos

a week ago

by Edna • New Contributor

a week ago

204 Views
0 replies
0 kudos

Model flavour using feature store model training log_model()

Hi I'm have succesfully registered my model using the feature engineering client with the following codes:with mlflow.start_run(): # Calculate the ratio of negative class samples to positive class samples ratio = (len(y_train) - y_train.sum()...

Machine Learning

Reply

204 Views
0 replies
0 kudos

a week ago

Databricks

Forum Posts

Optimal Cluster Configuration for Training on Billion-Row Datasets

java.lang.ClassNotFoundException: com.johnsnowlabs.nlp.DocumentAssembler

XGBoostEstimator is not a member of package ml.dmlc.xgboost4j.scala.spark ?

Use OF API from package enerbitdso 0.1.8 PYPI

Help - org.apache.spark.SparkException: Job aborted due to stage failure: Task 47 in stage 2842.0

Issue with Multi-column In predicates are not supported in the DELETE condition.

How to fix this runtime error in this Databricks distributed training tutorial workbook

Resolved! import ml.dmlc.xgboost4j.scala.spark.{XGBoostEstimator, XGBoostClassificationModel}

Extract calculations naive bayes model

AssertionError Failed to create the catalog

error: not found: type XGBoostEstimator

Import mlflow Error

Unity Catalog table management with multiple teams members

404 Page Not Found Error on Features page

Model flavour using feature store model training log_model()

import ml.dmlc.xgboost4j.scala.spark.{XGBoostEstim...

Query ML Endpoint with R and Curl

'error_code': 'INVALID_PARAMETER_VALUE', 'message'...

AutoMl Dataset too large

Github Datasets/Labs for Large Language Models: Ap...