cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

py4j.security.Py4JSecurityException: Constructor public org.apache.spark.ml.feature.VectorAssembler(java.lang.String) is not whitelisted.

DataRabbit
New Contributor II

Hello, i have a problem.

When I try to run the MLlib Assembler (from pyspark.ml.feature import VectorAssembler) I get this error and I don't know what to do anymore. Please help.

1 ACCEPTED SOLUTION

Accepted Solutions

Hubert-Dudek
Esteemed Contributor III

Is it High Concurrency cluster with credential passthrough enabled? In that case, you can use a different cluster mode.

https://docs.azuredatabricks.net/spark/latest/data-sources/azure/adls-passthrough.html

This exception is thrown when you have accessed a method that Azure Databricks has not explicitly marked as safe for Azure Data Lake Storage credential passthrough clusters. In most cases, the technique could allow a user on an Azure Data Lake Storage credential passthrough cluster to access another user’s credential.

View solution in original post

2 REPLIES 2

Hubert-Dudek
Esteemed Contributor III

Is it High Concurrency cluster with credential passthrough enabled? In that case, you can use a different cluster mode.

https://docs.azuredatabricks.net/spark/latest/data-sources/azure/adls-passthrough.html

This exception is thrown when you have accessed a method that Azure Databricks has not explicitly marked as safe for Azure Data Lake Storage credential passthrough clusters. In most cases, the technique could allow a user on an Azure Data Lake Storage credential passthrough cluster to access another user’s credential.

Thanks you so much!