Machine Learning

by mharrison • New Contributor II

12-17-2024 5:33:51 AM

1368 Views
2 replies
0 kudos

Feature Lookup Help

Hi, Context: I'm looking for help trying to get Unity Catalog Feature Lookup to work with my model how I need it to. I have a trained darts time series model that takes as input to its `.predict()` method both the history of the variable in question, and...

Latest Reply

mharrison
New Contributor II

12-17-2024 6:36:51 AM

0 kudos

Thanks for your response. It sounds like the 2nd approach is best for me, modifying the `predict()` method to perform the required history lookup. Is it possible to do this via the Feature Engineering client within that method, or should I simply quer...

1 More Replies

by johndoe99012 • New Contributor II

12-15-2024 11:14:14 PM

1757 Views
4 replies
1 kudos

How to serve a Unity Catalog ML model to external usage

Hello everyone, I am following this notebook tutorial: https://docs.databricks.com/en/machine-learning/manage-model-lifecycle/index.html#example-notebook Now I can register a machine learning model in Unity Catalog, but the tutorial only shows how to u...

Latest Reply

filipniziol
Esteemed Contributor

12-16-2024 10:02:33 AM

1 kudos

Hi @johndoe99012 If the answer resolved your question, please consider marking it as the solution. It helps others in the community find answers more easily.

3 More Replies

by TinSlim • New Contributor III

12-09-2024 11:18:42 AM

4077 Views
3 replies
0 kudos

Maximum wait time Databricks Model Serving

Hi, hope you are fine. I deployed a model 2 or 3 months ago using Databricks Serving and MLflow. The model worked well using GPU from Model Serving. I stopped using it for some months, and when I tried deploying it again, it had some errors. 1. [FIXED] A ...

Latest Reply

Alberto_Umana
Databricks Employee

12-10-2024 11:44:59 AM

0 kudos

Thanks, I will review it and get back. I'll DM you.

2 More Replies

by mradassaad • New Contributor III

05-03-2022 9:44:08 AM

6790 Views
3 replies
1 kudos

Resolved! Tuning `CrossValidator` spark job performance

I am running a 3-fold cross validation of an ML pipeline that utilizes `GBTClassifier` as the final step. It takes 18 hours to run and I am looking for feedback on how to improve the performance, as I expect this to go faster. For context here is the...

Latest Reply

cchalc
New Contributor III

05-11-2022 10:51:01 AM

1 kudos

Hello @Assaad Mrad, so this looks like trying to decide between putting the pipeline in the cross validator or the cross validator in the pipeline. Since you are doing the polynomial expansion as part of the pipeline, you might want to consider putt...

2 More Replies

by jonathanhodges • New Contributor II

10-30-2024 5:50:12 PM

3790 Views
4 replies
0 kudos

Training Job Failure (Driver Error)

We have a new model training job that was running fine for a few days and then started failing. I have attached images for more details. I am wondering if 'can't reach driver cluster' is a red herring. It says the driver is healthy right before execut...

Latest Reply

jonathanhodges
New Contributor II

11-29-2024 1:14:26 PM

0 kudos

In our case, we needed to correct our dependent libraries. We had an incorrect path referenced.

3 More Replies

by danielvdc • New Contributor II

11-27-2024 7:39:16 AM

3641 Views
0 replies
2 kudos

Rolling predictions with FeatureEngineeringClient

I am performing a time series analysis, using an XGBoostRegressor with rolling predictions. I am doing so using the FeatureEngineeringClient (in combination with Unity Catalog), where I create and load in my features during training and inference, as ...

by nikviz • New Contributor II

11-26-2024 3:22:08 AM

1494 Views
2 replies
0 kudos

Resolved! Vector search index stops at 45406

I am trying to create a vector search index for a table, but it stops at 45406 rows. I can see that the writeback table has all the records but the indexing stops. Is there a hard limit on index?

Latest Reply

Walter_C
Databricks Employee

11-26-2024 6:15:26 AM

0 kudos

There are some limits that you may be hitting: Row Size for Delta Sync Index: The maximum row size is 100KB. Embedding Source Column Size for Delta Sync Index: The maximum size is 32764 bytes. Bulk Upsert Request Size Limit for Direct Vector Index: The...

1 More Replies
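
A quick way to test whether the limits quoted in the reply above are the culprit is to scan the source rows before syncing. The sketch below is only an approximation: the function name `find_oversized_rows`, the column name `text`, the reading of "100KB" as 100 * 1024 bytes, and the use of JSON-encoded length as a proxy for row size are all assumptions, not something the thread or the service confirms.

```python
import json

# Limits as quoted in the reply; interpreting "100KB" as 100 * 1024 bytes is an assumption.
MAX_ROW_BYTES = 100 * 1024
MAX_EMBEDDING_SOURCE_BYTES = 32764


def find_oversized_rows(rows, text_column):
    """Return (index, reason) pairs for rows that may exceed the quoted limits."""
    problems = []
    for i, row in enumerate(rows):
        # Approximate the row size by its JSON-encoded UTF-8 length
        # (an assumption; the service may measure size differently).
        row_bytes = len(json.dumps(row, ensure_ascii=False).encode("utf-8"))
        # Size of the column that would be embedded, in UTF-8 bytes.
        text_bytes = len(str(row.get(text_column, "")).encode("utf-8"))
        if row_bytes > MAX_ROW_BYTES:
            problems.append((i, "row size"))
        if text_bytes > MAX_EMBEDDING_SOURCE_BYTES:
            problems.append((i, "embedding source column size"))
    return problems


# Hypothetical sample data: the second row has an over-long text column.
sample = [
    {"id": 1, "text": "short document"},
    {"id": 2, "text": "x" * 40000},
]
print(find_oversized_rows(sample, "text"))
```

If the first flagged index sits near the row where indexing stops, that would point toward a size limit rather than a hard cap on index length.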

by miahopman • New Contributor II

10-23-2023 9:07:55 AM

4417 Views
2 replies
1 kudos

AutoML Runs Failing

After the Data Exploration notebook runs successfully, all AutoML trials fail without providing a source notebook. I have ensured that the training data labels have no null values or any labels with 16 or fewer occurrences associated with them. I cann...

Latest Reply

rtreves
Contributor

11-26-2024 6:07:30 PM

1 kudos

@AnNg Have there been any updates on this feature?

1 More Replies

by JoeAckerman • New Contributor II

11-21-2024 4:21:47 AM

1301 Views
2 replies
0 kudos

Python running far slower than locally, even with large cluster and multiple workers

I have a notebook that is running extremely slowly even when I try to do pretty basic Python functions. It is running far slower than locally no matter what I try, in spite of using a 32 GB, 4-core cluster with 4-8 workers. For context, my data...

Latest Reply

cgrant
Databricks Employee

11-26-2024 11:33:20 AM

0 kudos

Please share more information, for example: type of data source; type of operations being executed (sharing code if possible); timings of local runs and Databricks runs.

1 More Replies

by sangramraje • New Contributor

11-22-2024 10:33:12 AM

3584 Views
0 replies
0 kudos

AutoML "need to sample" not working as expected

tl;dr: When the AutoML run realizes it needs to do sampling because the driver / worker node memory is not enough to load / process the entire dataset, it fails. A sample weight column is NOT provided by me, but I believe somewhere in the process the...

by yopbibo • Contributor II

09-02-2022 6:53:19 AM

3610 Views
3 replies
5 kudos

Deploy a ML model, trained and registered in Databricks to AKS

Hi, I can train and register an ML model in my Databricks workspace. Then, to deploy it on AKS, I need to register the model in Azure ML, and then deploy to AKS. Is it possible to skip the Azure ML step? I would like to deploy directly into my AKS instance...

Latest Reply

sidharthpradhan
New Contributor II

11-18-2024 3:11:31 PM

5 kudos

Is this still the case? Can't we serve the model in Databricks? I am new to this, so I am just wondering about the capabilities.

2 More Replies

by damselfly20 • New Contributor III

11-14-2024 10:43:02 PM

1317 Views
2 replies
1 kudos

Endpoint creation without scale-to-zero

Hi, I've got a question about deploying an endpoint for Llama 3.1 8b. The following code should create the endpoint without scale-to-zero. The endpoint is being created, but with scale-to-zero, although scale_to_zero_enabled is set to False. Instead ...

Latest Reply

damselfly20
New Contributor III

11-18-2024 1:57:03 AM

1 kudos

Thanks for the reply @Walter_C. This didn't quite work, since it used a CPU and didn't consider the max_provisioned_throughput, but I finally got it to work like this: from mlflow.deployments import get_deploy_client client = get_deploy_client("data...

1 More Replies
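
The reply above is cut off, so the following is only a sketch of the kind of configuration it seems to describe: a served entity with provisioned throughput bounds and `scale_to_zero_enabled` set to False. The helper name `build_endpoint_config`, the entity name, the version, and the throughput numbers are hypothetical placeholders. The field names follow the Databricks serving-endpoint config format as I understand it and should be checked against the current API reference before use.

```python
def build_endpoint_config(entity_name, entity_version, min_throughput, max_throughput):
    """Build a serving-endpoint config dict with scale-to-zero disabled.

    All argument values are supplied by the caller; valid throughput
    values depend on the model and are not checked here.
    """
    return {
        "served_entities": [
            {
                "entity_name": entity_name,
                "entity_version": entity_version,
                "min_provisioned_throughput": min_throughput,
                "max_provisioned_throughput": max_throughput,
                "scale_to_zero_enabled": False,
            }
        ]
    }


# Placeholder values for illustration only.
config = build_endpoint_config("main.default.my_llama_model", "1", 100, 100)
print(config)

# The dict would then be handed to the deployment client from the reply,
# along the lines of: client.create_endpoint(name="my-endpoint", config=config)
```

Building the dict in a small helper keeps the scale-to-zero flag in one place, so it is easy to confirm what is actually sent when the endpoint does not behave as expected.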

by cmilligan • Contributor II

11-23-2022 12:43:30 PM

8700 Views
5 replies
2 kudos

Issue with Multi-column In predicates are not supported in the DELETE condition.

I'm trying to delete rows from a table with the same date or id as records in another table. I'm using the below query and get the error 'Multi-column In predicates are not supported in the DELETE condition'. delete from cost_model.cm_dispatch_consol...

Latest Reply

thisisthemurph
New Contributor II

11-14-2024 3:35:11 AM

2 kudos

I seem to get this error on some DeltaTables and not others: df.createOrReplaceTempView("channels_to_delete") spark.sql(""" delete from lake.something.earnings where TenantId = :tenantId and ChannelId = in ( select ChannelId ...

4 More Replies
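
For the original question (deleting rows that share a date or an id with records in another table), one commonly suggested rewrite is to split the condition into two single-column `IN` predicates joined by `OR`. The sketch below only demonstrates the semantics of that rewrite, using Python's built-in `sqlite3` as a stand-in engine; the table names `target` and `staged` and their columns are hypothetical, and whether Delta accepts this exact form depends on the runtime version, which is not verified here.

```python
import sqlite3

# In-memory SQLite database used purely as a stand-in for the real tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE target (id INTEGER, event_date TEXT);
    CREATE TABLE staged (id INTEGER, event_date TEXT);
    INSERT INTO target VALUES (1, '2022-11-01'), (2, '2022-11-02'), (3, '2022-11-03');
    INSERT INTO staged VALUES (9, '2022-11-01'), (3, '2022-12-31');
""")

# Two single-column IN predicates joined by OR, instead of one multi-column IN.
conn.execute("""
    DELETE FROM target
    WHERE event_date IN (SELECT event_date FROM staged)
       OR id IN (SELECT id FROM staged)
""")

remaining = conn.execute("SELECT id, event_date FROM target ORDER BY id").fetchall()
print(remaining)
```

Row 1 is removed because its date appears in `staged`, row 3 because its id does, and row 2 matches neither, so it is the only row left.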

by amirA • New Contributor II

10-08-2024 6:56:36 PM

2938 Views
3 replies
1 kudos

Resolved! Extracting Topics From Text Data Using PySpark

Hi Everyone, I tried to follow the same steps in Topic from Text on similar data as the example. However, when I try to fit the model with the data I get this error: IllegalArgumentException: requirement failed: Column features must be of type equal to one of t...

Latest Reply

filipniziol
Esteemed Contributor

10-08-2024 11:32:47 PM

1 kudos

Hi @amirA, The LDA model expects the features column to be of type Vector from the pyspark.ml.linalg module, specifically either a SparseVector or DenseVector, whereas you have provided Row type. You need to convert your Row object to SparseVector. Che...

2 More Replies

by ukaplan • New Contributor III

08-19-2024 7:20:56 AM

7501 Views
15 replies
2 kudos

Serving Endpoint Container Image Creation Fails

Hello, I trained a model using MLflow, and saved the model as an artifact. I can load the model from a notebook and it works as expected (i.e. I can load the model using its URI). However, when I want to deploy it using Databricks endpoints, container...

Latest Reply

damselfly20
New Contributor III

11-12-2024 1:52:16 AM

2 kudos

@ivan_calvo The problem still exists. Surely there has to be some other option than downgrading the ML cluster to DBR 14.3 LTS ML?

14 More Replies

Databricks Community

Forum Posts

Join Us as a Local Community Builder!

Problem loading a pyfunc model in job run

Serving Endpoint Disappears After One Day

Can't use pyspark bucketizer

VLLM dependency Issues with DBR 17.0

Custom docker container for GPU compute using pyth...