Data Governance
Join discussions on data governance practices, compliance, and security within the Databricks Community. Exchange strategies and insights to ensure data integrity and regulatory compliance.

Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for cluster

Christine
Contributor II

I have been using "rdd.flatMap(lambda x: x)" for a while to create lists from columns. However, after I changed the cluster to Shared access mode (to use Unity Catalog), I get the following error:

py4j.security.Py4JSecurityException: Method public org.apache.spark.rdd.RDD org.apache.spark.api.java.JavaRDD.rdd() is not whitelisted on class class org.apache.spark.api.java.JavaRDD

I have tried to solve the error by adding:

"spark.databricks.pyspark.enablePy4JSecurity false"

however I then get the following error:

"spark.databricks.pyspark.enablePy4JSecurity is not allowed when choosing an access mode"

Does anybody know how to use RDDs on a cluster enabled for Unity Catalog?

Thank you!

19 REPLIES

rahuja
New Contributor III

Was this resolved?

him_agg
New Contributor II

I was having a similar issue using .rdd.map(). I solved it by adding two key-value pairs to the cluster's Spark config:

spark.databricks.pyspark.enablePy4JSecurity false

spark.databricks.pyspark.trustedFilesystems org.apache.spark.api.java.JavaRDD

 

After this I was able to read the schema of the JSON from the column that was read as a string:

    json_schema = spark.read.json(df.rdd.map(lambda row: row.preferences)).schema
    print(json_schema)

Did you try this on a UC-enabled cluster?

rahuja
New Contributor III

In my case, the problem was that we were trying to use SparkXGBoostRegressor, whose docs say it does not work on clusters with autoscaling enabled. We simply disabled autoscaling on the interactive cluster where we were testing the model, and it worked like a charm 🙂

 

Hope it helps

de-qrosh
New Contributor II

Hello,
In the past I used

rdd.mapPartitions(lambda ...)

to call functions that access third-party APIs (such as Azure AI text translation), batching the calls to the API and returning the batched results.

How would one do this now?
