Machine Learning
Dive into the world of machine learning on the Databricks platform. Explore discussions on algorithms, model training, deployment, and more. Connect with ML enthusiasts and experts.

Forum Posts

Orianh
by Valued Contributor II
  • 1570 Views
  • 2 replies
  • 0 kudos

TF SummaryWriter flush() doesn't send any buffered data to storage.

Hey guys, I'm training a TF model in Databricks and logging to TensorBoard using SummaryWriter. At the end of each epoch, SummaryWriter.flush() is called, which should send any buffered data to storage. But I can't see the TensorBoard files while th...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hi @orian hindi​ Hope everything is going great. Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark an answer as best so that other members can find the solution more quickly? If not, please tell us so w...

  • 0 kudos
1 More Replies
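A hedged workaround sketch for the question above, assuming a Databricks notebook where dbutils is available: write the TensorBoard event files to local disk and copy them to DBFS after each flush so they become visible in storage. The paths, epoch count, and train_one_epoch helper are placeholders, not taken from the thread.

    import tensorflow as tf

    local_log_dir = "/tmp/tensorboard/run1"            # local disk on the driver (example)
    dbfs_log_dir = "dbfs:/FileStore/tensorboard/run1"  # target storage path (example)

    writer = tf.summary.create_file_writer(local_log_dir)

    for epoch in range(10):                            # placeholder epoch count
        loss = train_one_epoch()                       # hypothetical training step
        with writer.as_default():
            tf.summary.scalar("loss", loss, step=epoch)
        writer.flush()                                 # flush buffered events to local disk
        # copy the local event files to DBFS so they can be read outside the cluster
        dbutils.fs.cp(f"file:{local_log_dir}", dbfs_log_dir, recurse=True)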
Eero_H
by New Contributor
  • 3143 Views
  • 2 replies
  • 1 kudos

Is there a way to change the default artifact store path on Databricks MLflow?

I have cloud storage mounted to Databricks and I would like to store all of the model artifacts there without specifying it when creating a new experiment. Is there a way to configure the Databricks workspace to save all of the model artifacts to a ...

Latest Reply
Anonymous
Not applicable
  • 1 kudos

Hi @Eero Hiltunen​ Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs. Please help us select the best solution by clicking on "Select As Best" if it does. Your feedbac...

  • 1 kudos
1 More Replies
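A minimal sketch related to the question above: MLflow lets you pass an explicit artifact_location when an experiment is created, which keeps artifacts on mounted storage without touching each run. The experiment name and mount path below are examples.

    import mlflow

    experiment_name = "/Users/someone@example.com/my-experiment"    # example workspace path
    artifact_location = "dbfs:/mnt/model-artifacts/my-experiment"   # example mounted storage

    if mlflow.get_experiment_by_name(experiment_name) is None:
        mlflow.create_experiment(experiment_name, artifact_location=artifact_location)
    mlflow.set_experiment(experiment_name)

    with mlflow.start_run():
        mlflow.log_param("alpha", 0.5)
        # artifacts logged in this run are written under artifact_location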
Kaan
by New Contributor
  • 2835 Views
  • 1 replies
  • 1 kudos

Resolved! Using Databricks in multi-cloud, and querying data from the same instance.

I'm looking for a good product to use across two clouds at once for data engineering, data modeling, and governance. I currently have a GCP platform, but most of my data and future data goes through Azure, and is currently transferred to GCS/BQ. Cu...

Latest Reply
Anonymous
Not applicable
  • 1 kudos

@Karl Andrén​: Databricks is a great option for data engineering, data modeling, and governance across multiple clouds. It supports integrations with multiple cloud providers, including Azure, AWS, and GCP, and provides a unified interface to access ...

  • 1 kudos
Hubert-Dudek
by Esteemed Contributor III
  • 1162 Views
  • 1 replies
  • 7 kudos

Have you heard about Databricks' latest open-source language model, Dolly? It's a ChatGPT-like model that uses the tatsu-lab/alpaca dataset with ...

Have you heard about Databricks' latest open-source language model, Dolly? It's a ChatGPT-like model that uses the tatsu-lab/alpaca dataset with examples of questions and answers. To train Dolly, you can combine this dataset (simple solution on ...

Latest Reply
Anonymous
Not applicable
  • 7 kudos

Thanks for posting this! I am so excited about the possibilities this opens up for us. It's an exciting development in the natural language processing field, and it has the potential to be a valuable tool for businesses looking to implement chatb...

  • 7 kudos
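For readers who want to try the model mentioned above, a hedged sketch of loading a Dolly checkpoint from the Hugging Face Hub for inference; the checkpoint id and generation settings are illustrative assumptions, not taken from the post.

    import torch
    from transformers import pipeline

    generate = pipeline(
        task="text-generation",
        model="databricks/dolly-v2-3b",   # example checkpoint
        torch_dtype=torch.bfloat16,
        trust_remote_code=True,           # Dolly ships a custom instruction-following pipeline
        device_map="auto",
    )

    print(generate("Explain what Delta Lake is in one sentence.")[0]["generated_text"])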
alisher_pwc
by New Contributor II
  • 3454 Views
  • 2 replies
  • 1 kudos

Model serving with GPU cluster

Hello Databricks community! We have a strong need to serve some public models and our own private models on GPU clusters, and we have several requirements: 1) We'd like to be able to start/stop the endpoints (ideally on a schedule) to avoid excess consum...

Latest Reply
Vartika
Databricks Employee
  • 1 kudos

Hi @Alisher Akh​ Does @Debayan Mukherjee​'s answer help? If yes, would you be happy to mark the answer as best so that other members can find the solution more quickly? If not, please tell us so we can help you further. Cheers!

  • 1 kudos
1 More Replies
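A hedged, illustrative sketch for requirement 1) above: endpoint configuration can be updated over the Databricks REST API, for example from a scheduled job. The endpoint name, registered model, workload type, and environment-variable handling are assumptions, and whether scale-to-zero is available for a given GPU workload type depends on the platform.

    import os
    import requests

    host = os.environ["DATABRICKS_HOST"]      # e.g. https://<workspace>.cloud.databricks.com
    token = os.environ["DATABRICKS_TOKEN"]
    endpoint_name = "my-gpu-endpoint"         # hypothetical endpoint

    resp = requests.put(
        f"{host}/api/2.0/serving-endpoints/{endpoint_name}/config",
        headers={"Authorization": f"Bearer {token}"},
        json={
            "served_models": [
                {
                    "model_name": "my_registered_model",   # hypothetical registered model
                    "model_version": "1",
                    "workload_type": "GPU_SMALL",          # GPU workload size (assumption)
                    "workload_size": "Small",
                    "scale_to_zero_enabled": True,         # release compute when idle
                }
            ]
        },
        timeout=60,
    )
    resp.raise_for_status()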
fuselessmatt
by Contributor
  • 14274 Views
  • 5 replies
  • 7 kudos

Resolved! What does "Command exited with code 50" mean and how do you solve it?

Hi! We have this dbt model that generates a table with user activity in the previous days, but we get this vague error message in the Databricks SQL Warehouse: Job aborted due to stage failure: Task 3 in stage 4267.0 failed 4 times, most recent failure...

Latest Reply
shan_chandra
Databricks Employee
  • 7 kudos

@Mattias P​ - For the executor lost failure, is it trying to bring in a large data volume? Can you please reduce the date range and try? Or run the workload on a bigger DBSQL warehouse than the current one.

  • 7 kudos
4 More Replies
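A hedged illustration of the reply's suggestion to reduce the date range, sketched in PySpark rather than the original dbt SQL: process the activity window in smaller chunks so each stage handles less data. Table names, columns, and dates are placeholders, and spark is assumed to be the notebook's session.

    from datetime import date, timedelta

    start, end = date(2023, 1, 1), date(2023, 3, 31)   # placeholder window
    chunk = timedelta(days=7)

    current = start
    while current <= end:
        upper = min(current + chunk, end + timedelta(days=1))
        (spark.table("analytics.user_activity")                     # placeholder source table
              .where(f"activity_date >= '{current}' AND activity_date < '{upper}'")
              .groupBy("user_id")
              .count()
              .write.mode("append")
              .saveAsTable("analytics.user_activity_summary"))      # placeholder target table
        current = upper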
Ajay-Pandey
by Esteemed Contributor III
  • 2370 Views
  • 2 replies
  • 5 kudos

Share information between tasks in a Databricks job  You can use task values to pass arbitrary parameters between tasks in a Databricks job. You pass ...

Share information between tasks in a Databricks job: You can use task values to pass arbitrary parameters between tasks in a Databricks job. You pass task values using the taskValues subutility in Databricks Utilities. The taskValues subutility provide...

Latest Reply
newforesee
New Contributor II
  • 5 kudos

We urgently hope for this feature, but to date, we have found that it is only available in Python. Do you have any plans to support Scala?

  • 5 kudos
1 More Replies
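A minimal sketch of the taskValues subutility described above; task and key names are examples. The set call runs in one task of a job, and the get call runs in a downstream task of the same run.

    # In the upstream task (e.g. a task named "ingest"):
    dbutils.jobs.taskValues.set(key="row_count", value=42)

    # In a downstream task of the same job run:
    row_count = dbutils.jobs.taskValues.get(
        taskKey="ingest",    # name of the upstream task (example)
        key="row_count",
        default=0,           # returned if the key was never set
        debugValue=0,        # used when running the notebook outside a job
    )
    print(row_count)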
apatel
by New Contributor III
  • 7501 Views
  • 2 replies
  • 0 kudos

Resolved! How to resolve this error "Error: cannot create global init script: default auth: cannot configure default credentials"

I'm trying to set the global init script via my Terraform deployment. I did a thorough Google search and can't seem to find guidance here. I'm using a very generic call to set these scripts in my TF deployment. terraform { required_providers { data...

Latest Reply
apatel
New Contributor III
  • 0 kudos

OK, in case this helps anyone else, I've managed to resolve it. I confirmed in this documentation that the Databricks CLI is required locally, wherever this is being executed: https://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/cluster-note...

  • 0 kudos
1 More Replies
Koliya
by New Contributor II
  • 20886 Views
  • 4 replies
  • 7 kudos

The Python process exited with exit code 137 (SIGKILL: Killed). This may have been caused by an OOM error. Check your command's memory usage.

I am running a Hugging Face model on a GPU cluster (g4dn.xlarge, 16 GB memory, 4 cores). I run the same model in four different notebooks with different data sources. I created a workflow to run one model after the other. These notebooks run fine indi...

Latest Reply
fkemeth
New Contributor II
  • 7 kudos

You might accumulate gradients when running your Hugging Face model, which typically leads to out-of-memory errors after some iterations. If you use it for inference only, do:
with torch.no_grad():
    # The code where you apply the model

  • 7 kudos
3 More Replies
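A hedged, runnable sketch of the reply above: run Hugging Face inference without storing gradients and release cached GPU memory afterwards. The checkpoint and inputs are illustrative, not from the original thread.

    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    model_name = "distilbert-base-uncased-finetuned-sst-2-english"   # example checkpoint
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(model_name).to("cuda").eval()

    texts = ["Databricks makes this easy", "this job keeps getting killed"]
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt").to("cuda")

    with torch.no_grad():                 # no gradient buffers are kept during inference
        logits = model(**batch).logits

    predictions = logits.argmax(dim=-1).tolist()

    # free as much GPU memory as possible before the next notebook/task starts
    del model, batch, logits
    torch.cuda.empty_cache()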
Tilo
by New Contributor
  • 3742 Views
  • 3 replies
  • 3 kudos

Resolved! MLflow: How to load a saved model and continue training

I'd like to continue / fine-tune training of an existing Keras/TensorFlow model. We use MLflow to store the model. How can I load the weights from an existing model into the model and continue "fit", preferably with a different learning rate? Just loading ...

Latest Reply
Anonymous
Not applicable
  • 3 kudos

Hi @Tilo Wünsche​ Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you. Thank...

  • 3 kudos
2 More Replies
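A hedged sketch of one way to do this: load the logged Keras model back from MLflow, recompile it with a new learning rate, and keep fitting. The run id, loss, and training data are placeholders; depending on the MLflow version, mlflow.keras.load_model may be the appropriate loader instead.

    import mlflow
    import tensorflow as tf

    model_uri = "runs:/<previous_run_id>/model"        # placeholder run id
    model = mlflow.tensorflow.load_model(model_uri)    # returns the original Keras model

    # recompile with a smaller learning rate for fine-tuning
    model.compile(
        optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4),
        loss="sparse_categorical_crossentropy",        # placeholder loss
        metrics=["accuracy"],
    )

    with mlflow.start_run():
        model.fit(x_train, y_train, epochs=5)          # x_train / y_train assumed to exist
        mlflow.tensorflow.log_model(model, artifact_path="model")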
NSRBX
by Contributor
  • 4502 Views
  • 4 replies
  • 5 kudos

Resolved! Error loading model from mlflow: java.io.StreamCorruptedException: invalid type code: 00

Hello, I'm using Databricks Connect version 9.1 LTS ML in my IDE to connect to a Databricks cluster with Spark version 3.1 and download a Spark model that's been trained and saved using MLflow. So it seems like it's able to find and copy the model, but ...

Latest Reply
NSRBX
Contributor
  • 5 kudos

Hi @Kaniz Fatma​ and @Shanmugavel Chandrakasu​, it works after putting hadoop.dll into the C:\Windows\System32 folder. I have Hadoop version 3.3.1. I already had winutils.exe in the Hadoop bin folder. Regards, Nath

  • 5 kudos
3 More Replies
Mike_sb
by New Contributor III
  • 3750 Views
  • 7 replies
  • 4 kudos
Latest Reply
Amit_352107
New Contributor III
  • 4 kudos

Hi @Mike M​ Kindly clear the cache, and your issue will be resolved.

  • 4 kudos
6 More Replies
bruno_valero
by New Contributor II
  • 6037 Views
  • 2 replies
  • 2 kudos

How to download a .csv or .pkl file from Databricks?

When I save files to "dbfs:/FileStore/shared_uploads/brunofvn6@gmail.com/", they don't appear anywhere in my workspace. I've tried copying the path of the workspace with the right mouse button and pasting it into ("my pandas dataframe").to_csv('path'), but wh...

Latest Reply
bruno_valero
New Contributor II
  • 2 kudos

I think I discovered how to do this. It's in the tab called Data in the left menu of the Databricks environment; at the top left of the menu there are two tabs, "Database Tables" and "DBFS", where "Database Tables" is the default. So it is just...

  • 2 kudos
1 More Replies
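A hedged sketch related to the thread above: pandas can write through the /dbfs/ local-filesystem view, and files placed under /dbfs/FileStore can typically be downloaded from the workspace's /files/ URL. The file name and workspace URL are examples.

    import pandas as pd

    df = pd.DataFrame({"a": [1, 2, 3], "b": ["x", "y", "z"]})

    # /dbfs/... is the driver's local-filesystem view of dbfs:/
    df.to_csv("/dbfs/FileStore/my_export.csv", index=False)

    # The file can then usually be downloaded in a browser from:
    #   https://<workspace-url>/files/my_export.csv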
Data_Analytics1
by Contributor III
  • 1591 Views
  • 1 replies
  • 2 kudos

Resolved! File not found error. Does OPTIMIZE delete initial versions of the Delta table?

df = (spark.readStream.format("delta")
      .option("readChangeFeed", "true")
      .option("startingVersion", 1)
      .table("CatalogName.SchemaName.TableName"))
display(df)
A file referenced in the transaction l...

Latest Reply
swethaNandan
Databricks Employee
  • 2 kudos

Had you run VACUUM on the table? VACUUM can clean up data files that are marked for removal and are older than the retention period. OPTIMIZE compacts files and marks the small files for removal, but does not physically remove the data files.

  • 2 kudos
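To make the distinction in the reply concrete, a small hedged example run from a Databricks notebook; the table name is a placeholder, and VACUUM removes only files older than the retention window (168 hours by default).

    # OPTIMIZE rewrites small files into larger ones and marks the originals for removal
    spark.sql("OPTIMIZE CatalogName.SchemaName.TableName")

    # VACUUM physically deletes files that are marked for removal and older than the retention window
    spark.sql("VACUUM CatalogName.SchemaName.TableName RETAIN 168 HOURS")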
User16461610613
by New Contributor II
  • 2255 Views
  • 1 replies
  • 2 kudos

Free Databricks Training on AWS, Azure, or Google Cloud Good news! You can now access free, in-depth Databricks training on AWS, Azure or Google Cloud...

Free Databricks Training on AWS, Azure, or Google Cloud. Good news! You can now access free, in-depth Databricks training on AWS, Azure or Google Cloud. Our on-demand training series walks through how to: Streamline data ingest and management to build ...

Latest Reply
jose_gonzalez
Databricks Employee
  • 2 kudos

Thank you for sharing this!!

  • 2 kudos
