Machine Learning

by migq2 • New Contributor III

07-12-2024 12:06:06 PM

2639 Views
1 replies
0 kudos

Cannot log SparkML model to Unity Catalog due to missing output signature

I am training a Spark ML model (specifically a SynapseML LightGBM) in Databricks using MLflow and autolog. When I try to register my model in Unity Catalog, I get the following error: MlflowException: Model passed for registration contained a signature th...

by Rexe • New Contributor

07-14-2024 3:02:40 AM

772 Views
0 replies
0 kudos

TypeError: float() argument must be a string or a number, not 'StepArtifact'?

How can I get the content of a returned variable in ZenML without hitting this error: TypeError: float() argument must be a string or a number, not 'StepArtifact'?
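The error message says that float() was handed an object of type StepArtifact rather than a number or a string. The post does not show the pipeline code, so the following is only a minimal plain-Python sketch of that failure mode. The StepArtifact class here is a hypothetical stand-in (not ZenML's own class), and describe_float_failure is an illustrative helper name.

```python
class StepArtifact:
    """Hypothetical stand-in for a placeholder object (NOT ZenML's class)."""

    def __init__(self, name: str) -> None:
        self.name = name


def describe_float_failure(value: object) -> str:
    """Try float(value); return the exception type name, or 'ok' on success."""
    try:
        float(value)
    except TypeError as exc:
        # Raised when the object is neither a number nor a string
        # and defines no __float__ method.
        return type(exc).__name__
    return "ok"


placeholder = StepArtifact("accuracy")
print(describe_float_failure(placeholder))  # the placeholder cannot be cast
print(describe_float_failure("0.93"))       # a numeric string can
```

If this matches the situation in the post, the conversion would need to happen where the real value is available rather than on the placeholder object.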

by datastones • Contributor

07-08-2024 9:45:22 AM

1219 Views
1 replies
0 kudos

Deployment as code pattern with double training effort?

Hi everybody, I have a question about the deployment-as-code pattern on Databricks. I found and watched a great demo here: https://www.youtube.com/watch?v=JApPzAnbfPI My question is: in the case where I can get read access to prod data in the dev environment, the d...

by rahuja • Contributor

07-06-2024 2:38:34 PM

1170 Views
1 replies
0 kudos

Create Databricks Dashboards on MLFlow Metrics

Hello. We currently have multiple ML models running in production that log metrics and other metadata to MLflow. Is it possible to build Databricks dashboards on top of this data, and can this data be somehow avail...

Latest Reply

rahuja
Contributor

07-08-2024 7:07:31 AM

0 kudos

Hello @Retired_mod, thanks for responding. I think you are talking about using the Python API, but we don't want that. Since MLflow also uses a SQL table to store metrics, is it possible to expose those tables as part of our metastore and build da...

by datastones • Contributor

07-03-2024 8:39:09 AM

3690 Views
2 replies
0 kudos

Resolved! ML model promotion from Databricks dev workspace to prod workspace

Hi everybody. I am relatively new to Databricks. I am working on an ML model promotion process between different Databricks workspaces. I am aware that best practice should be deployment as code (e.g. export the whole training pipeline and model regi...

Latest Reply

amr
Databricks Employee

07-04-2024 10:59:34 AM

0 kudos

I am aware that models registered in Databricks Unity Catalog (UC) in the prod workspace can be loaded from dev workspace for model comparison/debugging. But to comply with best practices, we restrict access to assets in UC in the dev workspace fro...

by hadoan • New Contributor II

07-07-2024 9:37:06 PM

1122 Views
0 replies
0 kudos

Cannot use Databricks ARC as demo code

I read about Databricks ARC (https://github.com/databricks-industry-solutions/auto-data-linkage) and ran it on the DBR 12.2 LTS ML runtime on the Databricks community cloud, but I got the error below: 2024/07/08 04:25:33 INFO mlflow.tracking.fluent: E...

by adrianna2942842 • New Contributor III

06-11-2024 6:27:04 AM

3593 Views
1 replies
0 kudos

Deployment with model serving failed after entering "DEPLOYMENT_READY" state

Hi, I was trying to update the config for an endpoint by adding a new version of an entity (version 7). The new model entered the "DEPLOYMENT_READY" state, but the deployment failed with a timeout exception. I didn't get any other exception in Build or Se...

Latest Reply

Kumaran
Databricks Employee

07-05-2024 12:56:30 PM

0 kudos

Hi @adrianna2942842, Thank you for contacting the Databricks community. May I know how you are loading the model?

by ChanduBhujang • New Contributor II

06-12-2024 12:01:41 PM

1128 Views
1 replies
0 kudos

Pyspark models iterative/augmented training capability

Do PySpark tree-based models have iterative or augmented training capabilities? For example, as with the sklearn package, can an existing model artifact be loaded and then trained further on additional data? #ML_Models_Pyspark

Latest Reply

Kumaran
Databricks Employee

07-05-2024 12:36:01 PM

0 kudos

Hi @ChanduBhujang, Thank you for contacting the Databricks community. PySpark tree-based models do not have built-in iterative or augmented training capabilities like scikit-learn's partial_fit method. While there are workarounds to update the model wit...

by Solide • New Contributor

06-22-2023 4:21:20 AM

14441 Views
7 replies
6 kudos

Databricks runtime version Error

Hello, I'm following courses on Databricks Academy using the Databricks Community Edition with runtime 12.2 LTS (includes Apache Spark 3.3.2, Scala 2.12), and I believe it can't be changed. I'm following the Data engineering c...

Latest Reply

V2dha
New Contributor III

12-20-2023 6:40:53 AM

6 kudos

I was facing the same error. It can be resolved by adding the version you are currently working with to the config function in the '_common' notebook in the 'Includes' folder. (This was the case for the folder structure that I downloaded f...

by Psybelo • New Contributor II

06-19-2023 1:41:01 AM

5015 Views
4 replies
3 kudos

DE 2.2 - Providing Options for External Sources - Classroom setup error

Hi all, I am unable to run the "Classroom-Setup-02.2" setup in the Data Engineering course. It fails with the following error: FileNotFoundError: [errno 2] no such file or directory: '/dbfs/mnt/dbacademy-datasets/data-engineer-learning-path/v01/ecommerce/raw/u...

Latest Reply

Eagle78
New Contributor III

06-26-2024 12:47:16 PM

3 kudos

Inspired by https://stackoverflow.com/questions/58984925/pandas-missing-read-parquet-function-in-azure-databricks-notebook I changed df = pd.read_parquet(path = datasource_path.replace("dbfs:/", '/dbfs/')) # original, error! into df = spark.read.format(...
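The original line quoted in this reply rewrites a dbfs:/ URI into a /dbfs/ path before handing it to pandas. As a rough sketch of just that string conversion (it does not read any file), the helper below maps only a leading dbfs:/ prefix. The function name to_fuse_path and the example path are hypothetical and not taken from the course notebooks.

```python
def to_fuse_path(path: str) -> str:
    """Map a leading 'dbfs:/' to '/dbfs/'; return other paths unchanged."""
    prefix = "dbfs:/"
    if path.startswith(prefix):
        # Unlike str.replace, this only touches the start of the string.
        return "/dbfs/" + path[len(prefix):]
    return path


# Hypothetical example path, for illustration only.
example = to_fuse_path("dbfs:/mnt/example/data.parquet")
print(example)
```

The reply itself avoids this conversion by reading the data through Spark instead of pandas.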

by bbashuk • New Contributor II

06-26-2024 5:58:38 AM

3185 Views
1 replies
0 kudos

How to implement early stop in SparkXGBRegressor with Pipeline?

Trying to implement an Early Stopping mechanism in SparkXGBRegressor model with Pipeline: from pyspark.ml.feature import VectorAssembler, StringIndexer from pyspark.ml import Pipeline, PipelineModel from xgboost.spark import SparkXGBRegressor from x...

Latest Reply

bbashuk
New Contributor II

06-26-2024 8:30:39 AM

0 kudos

OK, I finally solved it: I added a column to the dataset, set validation_indicator_col='validation_0', and did not pass that column to the VectorAssembler: xgboost_regressor = SparkXGBRegressor() xgboost_regressor.setParams( gamma=0.2, max_depth=6, obje...

by simranisanewbie • New Contributor II

06-25-2024 12:31:49 AM

1784 Views
0 replies
1 kudos

Pyspark custom Transformer class -AttributeError: 'DummyMod' object has no attribute 'MyTransformer'

I am trying to create a custom transformer as a stage in my pipeline. A few of the transformations I am doing via SparkNLP and the next few using MLlib. To pass the result of SparkNLP transformation at a stage to the next MLlib transformation, I need...

Labels: Custom Transformer, MLflow

by Octavian1 • Contributor

03-20-2024 7:15:43 AM

4008 Views
3 replies
1 kudos

port undefined error in SQLDatabase.from_databricks (langchain.sql_database)

The following assignment: from langchain.sql_database import SQLDatabase dbase = SQLDatabase.from_databricks(catalog=catalog, schema=db, host=host, api_token=token,) fails with ValueError: invalid literal for int() with base 10: '' because of cls._assert_p...
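The reported ValueError is the message Python produces when int() is called on an empty string, and the thread title points at the port as the empty value. A small sketch that reproduces the message and shows one defensive way to parse such a value follows. The names parse_port and int_error_message, and the fallback of 443, are assumptions made for illustration; they are not part of langchain.

```python
def int_error_message(raw: str) -> str:
    """Return the ValueError message raised by int(raw), or '' if it parses."""
    try:
        int(raw)
    except ValueError as exc:
        return str(exc)
    return ""


def parse_port(raw: str, default: int = 443) -> int:
    """Parse a port string, using `default` (an assumed value) when it is empty."""
    text = raw.strip()
    if not text:
        return default
    return int(text)


print(int_error_message(""))  # same message as in the post
print(parse_port(""))         # falls back to the assumed default
print(parse_port("8443"))     # parses a real value
```

Whether supplying a port explicitly resolves the issue in the post is not confirmed by the thread excerpt.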

Latest Reply

vburam
New Contributor II

06-21-2024 12:44:35 AM

1 kudos

I am also facing the same issue. I am not able to connect even after using SQLAlchemy.

by Betul • New Contributor

06-11-2024 4:08:30 PM

1120 Views
1 replies
0 kudos

How to do cicd with different models/versions using databricks resources?

Generally speaking, what are some tips for improving a CI/CD process that handles different models and versions?

Latest Reply

robbe
Contributor

06-20-2024 8:44:20 AM

0 kudos

Hi @Betul, I think that there are different ways, but it really depends on what you mean by different models and versions. One simple option is to use Databricks Asset Bundles to create multiple workflows (one for each model) and use the champion-ch...

by rasgaard • New Contributor

06-19-2024 1:54:26 AM

2827 Views
1 replies
0 kudos

Model Serving Endpoints - Build configuration and Interactive access

Hi there, I have used Databricks Model Serving Endpoints to serve a model that depends on some config files and a custom library. The library has been included by logging the model with the `code_path` argument in `mlflow.pyfunc.log_model` and it...

Latest Reply

robbe
Contributor

06-20-2024 8:38:35 AM

0 kudos

Hi @rasgaard, one way to achieve that without inspecting the container is to use MLflow artifacts. Artifacts allow you to log files together with your models and reference them inside the endpoint. For example, let's assume that you need to include a ...

Databricks Community
