Machine Learning

by Avi711 • New Contributor III

11-09-2022 11:47:40 PM

4020 Views
3 replies
5 kudos

Resolved! Access denied error to S3 bucket while running Kinesis spark streaming.

I get this below error while trying to simulate kinesis streams as mentioned in Databricks documentation at https://docs.databricks.com/getting-started/streaming.htmlError:java.nio.file.AccessDeniedException:Amazon S3; Status Code: 403; Error Code: A...

Machine Learning

Reply

4020 Views
3 replies
5 kudos

11-09-2022 11:47:40 PM

View Replies

Latest Reply

jsuarezg
New Contributor II

01-09-2024 1:21:16 PM

5 kudos

If you do spark.sparkContext._jsc.hadoopConfiguration().set("fs.s3a.access.key", AWS_ACCESS_KEY_ID) + secret with any other secret that has less access than your default one this sometimes happens, so running those commands but with your normal secre...

5 kudos

01-09-2024 1:21:16 PM

2 More Replies

by ptawil • New Contributor III

07-07-2022 8:49:48 AM

2363 Views
2 replies
4 kudos

Runtime error using MLFlow and Spark on databricks

Here is some model I created:class SomeModel(mlflow.pyfunc.PythonModel): def predict(self, context, input): # do fancy ML stuff # log results pandas_df = pd.DataFrame(...insert predictions here...) spark_df = spark...

Machine Learning

Reply

2363 Views
2 replies
4 kudos

07-07-2022 8:49:48 AM

View Replies

Latest Reply

Nikhil3107
New Contributor III

06-07-2023 8:08:01 AM

4 kudos

Any updates on this? I am running into the same issue@Patrick Tawil were you able to solve this problem? If so, do you mind sharing?

4 kudos

06-07-2023 8:08:01 AM

1 More Replies

by vittal • New Contributor

01-24-2023 10:35:44 PM

912 Views
1 replies
0 kudos

Getting errors in DLT Pipeline while using ML Model

I am getting the following error when I try to run ML Models in Delta live Table Pipeline File "/local_disk0/.ephemeral_nfs/repl_tmp_data/ReplId-55c61-9b898-2c4b6-d/mlflow/envs/virtualenv_envs/mlflow-888f8c9b966409e6bddca3894244b4df9d1f94c1/lib/pyth...

Machine Learning

Reply

912 Views
1 replies
0 kudos

01-24-2023 10:35:44 PM

View Replies

Latest Reply

shan_chandra
Esteemed Contributor

04-27-2023 9:17:21 AM

0 kudos

@Vittal Pai - In general, please follow the below steps for the mlflow CLI error,Step 1: set up API token and create secrets as mentioned in the below documenthttps://docs.databricks.com/machine-learning/manage-model-lifecycle/multiple-workspaces.h...

0 kudos

04-27-2023 9:17:21 AM

by notsure • New Contributor

02-20-2023 3:46:12 AM

2111 Views
3 replies
2 kudos

Error with calling a machine learning serving endpoint

Hi!I have registered a spark model and generated a serving endpoint based on that.I am calling the endpoint with the relevant dataframe, somehow I got below errors. Could anyone show me how to tackle it, please? "Exception: Request failed with status...

Machine Learning

Reply

2111 Views
3 replies
2 kudos

02-20-2023 3:46:12 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-21-2023 10:08:26 PM

2 kudos

Hi @mavis chen Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers yo...

2 kudos

04-21-2023 10:08:26 PM

2 More Replies

by DebK • New Contributor III

04-04-2023 1:13:00 AM

3445 Views
6 replies
6 kudos

Resolved! MLFlow is throwing error for the shape of input

I am running the code for prediction which will take the model from mlflow deployment. Code I have copied from the example given by mlflow experiment tab.import mlflow logged_model = 'runs:/id/model' # Load model as a PyFuncModel. loaded_model = ml...

Machine Learning

Reply

3445 Views
6 replies
6 kudos

04-04-2023 1:13:00 AM

View Replies

Latest Reply

Anonymous
Not applicable

04-04-2023 11:24:42 PM

6 kudos

Hi @Koushik Deb Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers y...

6 kudos

04-04-2023 11:24:42 PM

5 More Replies

by NSRBX • Contributor

10-06-2022 6:38:43 AM

3361 Views
6 replies
6 kudos

Resolved! Error loading model from mlflow: java.io.StreamCorruptedException: invalid type code: 00

Hello,I'm using, in my IDE, Databricks Connect version 9.1LTS ML to connect to a databricks cluster with spark version 3.1 and download a spark model that's been trained and saved using mlflow.So it seems like it's able to find a copy the model, but ...

Machine Learning

Reply

3361 Views
6 replies
6 kudos

10-06-2022 6:38:43 AM

View Replies

Latest Reply

NSRBX
Contributor

10-17-2022 4:41:34 AM

6 kudos

Hi @Kaniz Fatma and @Shanmugavel Chandrakasu,It works after putting hadoop.dll into C:\Windows\System32 folder.I have hadoop version 3.3.1.I already had winutils.exe in the Hadoop bin folder.RegardsNath

6 kudos

10-17-2022 4:41:34 AM

5 More Replies

by ryojikn • New Contributor III

01-15-2023 8:26:07 PM

1280 Views
1 replies
1 kudos

Error on pandas udf usage in databricks, sc.broadcasting random forest loaded from Kedro MLFlow Logger DataSet, cannot pickle '_thread.RLock' object

I'm trying to broadcast a Random forest (sklearn 1.2.0) recently loaded from mlflow, and using Pandas UDF to predict a model.However, the same code works perfectly on Spark 2.4 + our OnPrem cluster.I thought it was due to Spark 2.4 to 3 changes, an...

Machine Learning

Reply

1280 Views
1 replies
1 kudos

01-15-2023 8:26:07 PM

View Replies

Latest Reply

ryojikn
New Contributor III

01-30-2023 5:03:31 AM

1 kudos

Anyone?

1 kudos

01-30-2023 5:03:31 AM

by PNegro • New Contributor III

11-17-2022 11:08:42 AM

2608 Views
4 replies
4 kudos

conda-env: error: unrecognized arguments: 'virtualenv': 'python_env.yaml'

I have registered an experiment as model in the model registry and when I start serving the model I get the following error:usage: conda-env [-h] {create,export,list,remove,update,config} ...conda-env: error: unrecognized arguments: 'virtualenv': 'py...

Machine Learning

Reply

2608 Views
4 replies
4 kudos

11-17-2022 11:08:42 AM

View Replies

Latest Reply

PNegro
New Contributor III

12-13-2022 10:15:23 AM

4 kudos

Hi Follks, Is there any new on this?.What should I do?ThanksBestPablo

4 kudos

12-13-2022 10:15:23 AM

3 More Replies

by jonathan-dufaul • Valued Contributor

11-28-2022 2:42:59 PM

1519 Views
3 replies
1 kudos

How does mlflow determine if a pyfunc model uses SparkContext?

I've been getting this error pretty regularly while working with mlflow:"It appears that you are attempting to reference SparkContext from a broadcast variable, action, or transformation. SparkContext can only be used on the driver, not in code that ...

Machine Learning

Reply

1519 Views
3 replies
1 kudos

11-28-2022 2:42:59 PM

View Replies

Latest Reply

Anonymous
Not applicable

11-28-2022 4:30:10 PM

1 kudos

I checked the page and it looks like there is no integration with Datarobot and Datarobot doesn't contribute to mlflow. https://mlflow.org/ has all the integrations listed

1 kudos

11-28-2022 4:30:10 PM

2 More Replies

by Mado • Valued Contributor II

11-19-2022 12:03:26 AM

9273 Views
5 replies
17 kudos

Resolved! Error when reading Excel file: "java.lang.NoClassDefFoundError: shadeio/poi/schemas/vmldrawing/XmlDocument"

Hi, I want to read an Excel file by:filepath_xlsx = "dbfs:/FileStore/data.xlsx" sampleDF = (spark.read.format("com.crealytics.spark.excel") .option("Header", "true") .option("inferSchema", "false") .option("treatEmptyValuesAsNulls", ...

Machine Learning

Reply

9273 Views
5 replies
17 kudos

11-19-2022 12:03:26 AM

View Replies

Latest Reply

Mado
Valued Contributor II

11-19-2022 4:56:34 AM

17 kudos

For this dataset, I also tried binary file reading as below: xldf_xlsx = ( spark.read.format("binaryFile") .option("pathGlobFilter", "*.xls*") .load(filepath_xlsx) ) excel_content = xldf_xlsx.head(1)[0].content file_like_obj = io.BytesIO(excel...

17 kudos

11-19-2022 4:56:34 AM

4 More Replies

by Mado • Valued Contributor II

11-19-2022 9:51:18 PM

3116 Views
1 replies
4 kudos

Resolved! Error when reading Excel file: "org.apache.poi.ooxml.POIXMLException: Strict OOXML isn't currently supported, please see bug #57699"

Hi,I want to read an Excel "xlsx" file. The excel file has several sheets and multi-row header. The original file format was "xlsm" and I changed the extension to "xlsx". I try the following code:filepath_xlsx = "dbfs:/FileStore/Sample_Excel/data.xl...

Machine Learning

Reply

3116 Views
1 replies
4 kudos

11-19-2022 9:51:18 PM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

11-20-2022 9:14:01 PM

4 kudos

Hi @Mohammad Saber, The error says, Don't save your spreadsheet in "strict OOXML" format.For example, in Excel use.Save As --> "Excel Workbook (.xlsx)" instead ofSave As --> "Strict Open XML Spreadsheet (.xlsx)"

4 kudos

11-20-2022 9:14:01 PM

by eshaanpathak • New Contributor III

11-07-2022 4:12:27 PM

2695 Views
3 replies
4 kudos

AttributeError: 'NoneType' object has no attribute 'enum_types_by_name'

I run into this error while using MLFlow: AttributeError: 'NoneType' object has no attribute 'enum_types_by_name'Here is the relevant stack trace:/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/mlflow/tracking/fluent....

Machine Learning

Reply

2695 Views
3 replies
4 kudos

11-07-2022 4:12:27 PM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

11-09-2022 2:09:22 AM

4 kudos

Hi @Eshaan Pathak , We haven’t heard from you on the last response from @Debayan Mukherjee, and I was checking back to see if their suggestions helped you. Or else, If you have any solution, please do share that with the community as it can be hel...

4 kudos

11-09-2022 2:09:22 AM

2 More Replies

by brendanmckenna • New Contributor III

11-03-2022 9:41:00 AM

2195 Views
4 replies
4 kudos

Resolved! How to avoid an error when using the automl python api on a classification problem

I am working through a basic example to get familiar with databricks automl. When I run classify, I hit an mlflow error. How can I avoid this error? My code:summary = databricks.automl.classify(train_df, target_col='new_cases', data_dir='dbfs:/automl...

Machine Learning

Reply

2195 Views
4 replies
4 kudos

11-03-2022 9:41:00 AM

View Replies

Latest Reply

Kaniz_Fatma
Community Manager

11-04-2022 8:43:25 AM

4 kudos

Hi @Brendan McKenna , We haven’t heard from you since the last response from @Debayan Mukherjee. Or else, If you have any solution, please share it with the community, as it can be helpful to others. Also, Please don't forget to click on the "Selec...

4 kudos

11-04-2022 8:43:25 AM

3 More Replies

by sameer_gupta • New Contributor

07-10-2022 10:30:52 PM

1336 Views
3 replies
0 kudos

Error in importing feature_store

from databricks import feature_storeI am trying to import feature_store but it is showing this error.ImportError: cannot import name 'feature_store' from 'databricks' (/databricks/python/lib/python3.8/site-packages/databricks/__init__.py)

Machine Learning

Reply

1336 Views
3 replies
0 kudos

07-10-2022 10:30:52 PM

View Replies

Latest Reply

Anonymous
Not applicable

09-06-2022 12:44:38 AM

0 kudos

Is this issue resolved completely? We are facing the same problem. this might help.

0 kudos

09-06-2022 12:44:38 AM

2 More Replies

by Benji • New Contributor II

07-25-2022 11:47:33 PM

3353 Views
5 replies
0 kudos

Error when running job in databricks

Hello, I am very new with databricks and MLflow. I faced with the problem about running job. When the job is run, it usually failed and retried itself, so it incasesed running time, i.e., from normally 6 hrs to 12-18 hrs. From the error log, it shows...

Machine Learning

Reply

3353 Views
5 replies
0 kudos

07-25-2022 11:47:33 PM

View Replies

Latest Reply

Vidula
Honored Contributor

09-05-2022 6:25:18 AM

0 kudos

Hey there @Tanawat Benchasirirot Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hea...

0 kudos

09-05-2022 6:25:18 AM

4 More Replies

Databricks Community

Forum Posts

Resolved! Access denied error to S3 bucket while running Kinesis spark streaming.

Runtime error using MLFlow and Spark on databricks

Getting errors in DLT Pipeline while using ML Model

Error with calling a machine learning serving endpoint

Resolved! MLFlow is throwing error for the shape of input

Resolved! Error loading model from mlflow: java.io.StreamCorruptedException: invalid type code: 00

Error on pandas udf usage in databricks, sc.broadcasting random forest loaded from Kedro MLFlow Logger DataSet, cannot pickle '_thread.RLock' object

conda-env: error: unrecognized arguments: 'virtualenv': 'python_env.yaml'

How does mlflow determine if a pyfunc model uses SparkContext?

Resolved! Error when reading Excel file: "java.lang.NoClassDefFoundError: shadeio/poi/schemas/vmldrawing/XmlDocument"

Resolved! Error when reading Excel file: "org.apache.poi.ooxml.POIXMLException: Strict OOXML isn't currently supported, please see bug #57699"

AttributeError: 'NoneType' object has no attribute 'enum_types_by_name'

Resolved! How to avoid an error when using the automl python api on a classification problem

Error in importing feature_store

Error when running job in databricks