Machine Learning

by notsure • New Contributor

02-09-2023 9:30:08 AM

677 Views
1 replies
1 kudos

Model serving with Serverless Real-Time Inference - How could I call the endpoint with json file consisted of raw text that need to be transformed and get the prediction?

Hi!I want to call the generated endpoint with a json file consisted of texts directly, could this endpoint take the raw texts, transform the texts into vectors and then output the prediction?Is there a way to support so?Thanks in advance!!!

Machine Learning

Reply

677 Views
1 replies
1 kudos

02-09-2023 9:30:08 AM

View Replies

Latest Reply

Debayan
Esteemed Contributor III

02-12-2023 9:21:59 PM

1 kudos

Hi, the updated document is : https://docs.databricks.com/machine-learning/model-inference/serverless/serverless-real-time-inference.html, (mentioned in the document stated above: This documentation has been retired and might not be updated. The prod...

1 kudos

02-12-2023 9:21:59 PM

by Gilg • Contributor II

02-01-2023 7:07:29 PM

1278 Views
3 replies
3 kudos

INVALID_STATE: Databricks could not access keyvault

Hi Team,Update: We are using Unity Catalog workspace. Also we are using RBAC model.I am able to create a secret scope and able to list the scope in a notebook usingdbutils.secrets.list("<scopename>")But when I try get the secret value using value = d...

Machine Learning

Reply

1278 Views
3 replies
3 kudos

02-01-2023 7:07:29 PM

View Replies

Latest Reply

Anonymous
Not applicable

02-08-2023 8:47:43 PM

3 kudos

Hi @Gil Gonong Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks!

3 kudos

02-08-2023 8:47:43 PM

2 More Replies

by anvil • New Contributor II

01-24-2023 1:14:46 PM

1400 Views
3 replies
4 kudos

Are UDFs necessary for applying models from ML libraries at scale ?

Hello,I recently finished the "scalable machine learning with apache spark" course and saw that SKLearn models could be applied faster in a distributed manner when used in pandas UDFs or with mapInPandas() method. Spark MLlib models don't need this k...

Machine Learning

Reply

1400 Views
3 replies
4 kudos

01-24-2023 1:14:46 PM

View Replies

Latest Reply

Manoj12421
Valued Contributor II

02-08-2023 11:17:49 AM

4 kudos

MlLib is in the maintenance model and udf is not used by creating model in most cases

4 kudos

02-08-2023 11:17:49 AM

2 More Replies

by fa • New Contributor III

12-07-2022 10:18:05 AM

1895 Views
6 replies
7 kudos

How are dashboards served and what would happen to them if the cluster attached to the notebook terminates?

I have two dashboards in presentation mode both from notebooks being run on the same compute cluster. Last night the cluster terminated due to idle time and in the morning one of my dashboards was fine but the other one was set to the default stab di...

Machine Learning

Reply

1895 Views
6 replies
7 kudos

12-07-2022 10:18:05 AM

View Replies

Latest Reply

Manoj12421
Valued Contributor II

02-08-2023 11:14:08 AM

7 kudos

If your query were scheduled, it's automatically started the cluster at the scheduled time Or might be possible that the portion that is still visible doesn't need to be generated so it looks like it's working but it is just left over from the prior...

7 kudos

02-08-2023 11:14:08 AM

5 More Replies

by Charley • New Contributor II

12-16-2022 1:03:58 AM

3696 Views
1 replies
1 kudos

error status 400 calling serving model endpoint invocation using personal access token on Azure Databricks

Hi all, I've deployed a model, moved it to production and served it (mlflow), but when testing it in the python notebook I get a 400 error. code/details below:import osimport requestsimport jsonimport pandas as pdimport numpy as np# Create two record...

Machine Learning

Reply

3696 Views
1 replies
1 kudos

12-16-2022 1:03:58 AM

View Replies

Latest Reply

nakany
New Contributor II

02-07-2023 5:02:54 AM

1 kudos

data_json in the score_model function should be defined as followsds_dict = {"dataframe_split": dataset.to_dict(orient='split')} if isinstance(dataset, pd.DataFrame) else create_tf_serving_json(dataset)

1 kudos

02-07-2023 5:02:54 AM

by Hubert-Dudek • Esteemed Contributor III

02-01-2023 1:19:50 AM

1071 Views
1 replies
7 kudos

Materialized views are a powerful feature soon available on databricks. Unlike traditional views, which store the query definition, materialized views...

Materialized views are a powerful feature soon available on databricks. Unlike traditional views, which store the query definition, materialized views physically store the data, making it available for faster querying. This translates to significantl...

Machine Learning

Reply

1071 Views
1 replies
7 kudos

02-01-2023 1:19:50 AM

View Replies

Latest Reply

Ajay-Pandey
Esteemed Contributor III

02-01-2023 1:41:03 AM

7 kudos

Very informative, Thanks for sharing

7 kudos

02-01-2023 1:41:03 AM

by thibault • Contributor

12-02-2022 5:56:11 AM

1147 Views
2 replies
2 kudos

Best practice for model promotion so that models are not removed from previous stage

Hi,Using Model Registry to promote models is great. However, I am facing an issue, where multiple Databricks workspaces (SIT / UAT / Prod) use a model at various stages (Staging for SIT and UAT, Production for Prod workspace).We have a workflow runni...

Machine Learning

Reply

1147 Views
2 replies
2 kudos

12-02-2022 5:56:11 AM

View Replies

Latest Reply

User16502773013
New Contributor III

01-30-2023 12:09:28 PM

2 kudos

Hello Thibault,For reusing already built model there are multiple options:Register the model from dev to QA model registry as described here ORIn this scenario only the registered model will be copiedLineage to runs is not possibleYou can copy dev's ...

2 kudos

01-30-2023 12:09:28 PM

1 More Replies

by ALIDI • New Contributor II

12-12-2022 7:02:10 AM

1014 Views
1 replies
2 kudos

Run with UUID *** is already active when running automl

Hi, I'm tried using databricks autoML API following the documentation and example notebook. The documentation and example are pretty straight forward however I encountered the following error:Exception: Run with UUID 1315376a0cbb4657b5d23fa552efba4b ...

Machine Learning

Reply

1014 Views
1 replies
2 kudos

12-12-2022 7:02:10 AM

View Replies

Latest Reply

shan_chandra
Honored Contributor III

01-31-2023 8:08:49 AM

2 kudos

@Al IDI - could you please let us know the ML runtime version you have ran into this? could you please try setting and see if it works? spark.conf.set("spark.databricks.mlflow.trackHyperopt.enabled", "false")

2 kudos

01-31-2023 8:08:49 AM

by jonathan-dufaul • Valued Contributor

01-25-2023 9:24:54 AM

757 Views
1 replies
0 kudos

how does the data science workflow change in databricks if you start with a nosql database (specifically document store) instead of something more traditional/rdbms type source?

I'm sorry if this is a bad question. The tl;dr is are there any concrete examples of a nosql data science workflows specifically in databricks and if so what are they?is it always the case that our end goal is a dataframe?For us we start as a bunch o...

Machine Learning

Reply

757 Views
1 replies
0 kudos

01-25-2023 9:24:54 AM

View Replies

Latest Reply

Nhan_Nguyen
Valued Contributor

01-31-2023 5:18:18 AM

0 kudos

Nice sharing, thanks!

0 kudos

01-31-2023 5:18:18 AM

by ryojikn • New Contributor III

01-15-2023 8:26:07 PM

878 Views
1 replies
1 kudos

Error on pandas udf usage in databricks, sc.broadcasting random forest loaded from Kedro MLFlow Logger DataSet, cannot pickle '_thread.RLock' object

I'm trying to broadcast a Random forest (sklearn 1.2.0) recently loaded from mlflow, and using Pandas UDF to predict a model.However, the same code works perfectly on Spark 2.4 + our OnPrem cluster.I thought it was due to Spark 2.4 to 3 changes, an...

Machine Learning

Reply

878 Views
1 replies
1 kudos

01-15-2023 8:26:07 PM

View Replies

Latest Reply

ryojikn
New Contributor III

01-30-2023 5:03:31 AM

1 kudos

Anyone?

1 kudos

01-30-2023 5:03:31 AM

by Sujitha • Community Manager

01-25-2023 3:04:41 AM

821 Views
5 replies
1 kudos

Latest Blog PostsJanuary 13 - 20 Did you get a chance to look at the most recent blog posts? Here are some happening content from the past week that i...

Latest Blog PostsJanuary 13 - 20Did you get a chance to look at the most recent blog posts? Here are some happening content from the past week that is worth the read. What’s New With SQL User-Defined Functions In this blog, we describe several enhanc...

Machine Learning

Reply

821 Views
5 replies
1 kudos

01-25-2023 3:04:41 AM

View Replies

Latest Reply

Chaitanya_Raju
Honored Contributor

01-25-2023 10:05:31 PM

1 kudos

Thanks @Sujitha Ramamoorthy , for sharing with the community these are worth reading and insightful.

1 kudos

01-25-2023 10:05:31 PM

4 More Replies

by lbourgeois • New Contributor III

01-03-2023 10:02:14 AM

2147 Views
9 replies
3 kudos

com.amazonaws.services.s3.model.AmazonS3Exception: The bucket is in this region: *** when using S3 Select

Hello,I have a cluster running in us-east-1 region.I hava a Spark job loading data in a DataFrame using s3select format on a bucket in eu-west-1 region.Access and Secret keys are encoded in URI s3a://$AccessKey:$SecretKey@bucket/path/to/dirJob fails ...

Machine Learning

Reply

2147 Views
9 replies
3 kudos

01-03-2023 10:02:14 AM

View Replies

Latest Reply

lbourgeois
New Contributor III

01-26-2023 12:31:48 AM

3 kudos

Hello,I tried your suggestion by setting up the peering connection between the 2 VPC but issue remains the same.The error message The bucket is in this region: .... please use this region to retry the requestmakes me think that the root cause is not ...

3 kudos

01-26-2023 12:31:48 AM

8 More Replies

by venkad • Contributor

01-25-2023 8:59:07 AM

5741 Views
1 replies
1 kudos

Unity Catalog Pricing

Hi All, I would like to understand the pricing model of the Unity Catalog. Earlier I remember there was some mention of the data lineage and a few other features that will have a cost associated with it. If that's true, what other features cost us? W...

Machine Learning

Reply

5741 Views
1 replies
1 kudos

01-25-2023 8:59:07 AM

View Replies

Latest Reply

LandanG
Honored Contributor

01-25-2023 5:24:44 PM

1 kudos

Hi @Venkadeshwaran K ,All Unity Catalog features are provided at no charge to customers, provided they are using a Premium or Enterprise SKU.

1 kudos

01-25-2023 5:24:44 PM

by KenAN • New Contributor II

10-12-2022 3:21:31 PM

1708 Views
3 replies
3 kudos

How to circumvent Py4JSecurityException for spark-nlp : Constructor public com.johnsnowlabs.nlp.***(java.lang.String) is not whitelisted.

Running into the following error on our company's cluster. py4j.security.Py4JSecurityException: Constructor public com.johnsnowlabs.nlp.DocumentAssembler(java.lang.String) is not whitelisted.For the following code(which is just tutorial code from the...

Machine Learning

Reply

1708 Views
3 replies
3 kudos

10-12-2022 3:21:31 PM

View Replies

Latest Reply

Anonymous
Not applicable

11-27-2022 4:50:21 AM

3 kudos

Hi @Kenan Spruill Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Than...

3 kudos

11-27-2022 4:50:21 AM

2 More Replies

by Dhanunjay • New Contributor II

01-24-2023 8:23:23 AM

999 Views
3 replies
3 kudos

Is it possible to access online feature store (Cosmos DB) outside databricks?

We are building an machine learning application with feature store enabled. Once the model is built, we are trying to move the model artifacts and deploy it in azure ml as online endpoint. Does it possible to access the online store in azure ml endpo...

Machine Learning

Reply

999 Views
3 replies
3 kudos

01-24-2023 8:23:23 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

01-24-2023 8:29:48 AM

3 kudos

if you want databricks to use the feature store, which is in Cosmos DB, yes, it is possible https://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/online-feature-storessuppose you want to consume a future store in Databrick...

3 kudos

01-24-2023 8:29:48 AM

2 More Replies

Databricks

Forum Posts

Model serving with Serverless Real-Time Inference - How could I call the endpoint with json file consisted of raw text that need to be transformed and get the prediction?

INVALID_STATE: Databricks could not access keyvault

Are UDFs necessary for applying models from ML libraries at scale ?

How are dashboards served and what would happen to them if the cluster attached to the notebook terminates?

error status 400 calling serving model endpoint invocation using personal access token on Azure Databricks

Materialized views are a powerful feature soon available on databricks. Unlike traditional views, which store the query definition, materialized views...

Best practice for model promotion so that models are not removed from previous stage

Run with UUID *** is already active when running automl

how does the data science workflow change in databricks if you start with a nosql database (specifically document store) instead of something more traditional/rdbms type source?

Error on pandas udf usage in databricks, sc.broadcasting random forest loaded from Kedro MLFlow Logger DataSet, cannot pickle '_thread.RLock' object

Latest Blog PostsJanuary 13 - 20 Did you get a chance to look at the most recent blog posts? Here are some happening content from the past week that i...

com.amazonaws.services.s3.model.AmazonS3Exception: The bucket is in this region: *** when using S3 Select

Unity Catalog Pricing

How to circumvent Py4JSecurityException for spark-nlp : Constructor public com.johnsnowlabs.nlp.***(java.lang.String) is not whitelisted.

Is it possible to access online feature store (Cosmos DB) outside databricks?

pdb debugger on databricks

import ml.dmlc.xgboost4j.scala.spark.{XGBoostEstim...

Query ML Endpoint with R and Curl

'error_code': 'INVALID_PARAMETER_VALUE', 'message'...

AutoMl Dataset too large