<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Cannot use RDD and cannot set &amp;quot;spark.databricks.pyspark.enablePy4JSecurity false&amp;quot; for in Data Governance</title>
    <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/61980#M1703</link>
    <description>&lt;P&gt;Faced this issue multiple times.&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;Solution:&lt;/STRONG&gt;&lt;/U&gt;&lt;/P&gt;&lt;P&gt;1. Don't use Shared Cluster or cluster without Unity Catalog enabled for running 'rdd' queries on Databricks.&lt;/P&gt;&lt;P&gt;2. Instead create a Personal Cluster (Single User) with basic configuration and with Unity Catalog enabled.&lt;/P&gt;&lt;P&gt;3. Also for the new compute cluster in Advanced Options set the following parameters:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Under Spark Config:&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;spark.databricks.driver.disableScalaOutput true &lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;spark.databricks.delta.preview.enabled true&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;Under Environment Variables:&lt;/SPAN&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;PYSPARK_PYTHON=/databricks/python3/bin/python3&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Re-run your rdd queries with new compute cluster. It works perfectly well for me.&lt;/P&gt;</description>
    <pubDate>Mon, 26 Feb 2024 13:09:44 GMT</pubDate>
    <dc:creator>KandyKad</dc:creator>
    <dc:date>2024-02-26T13:09:44Z</dc:date>
    <item>
      <title>Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for cluster</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8296#M255</link>
      <description>&lt;P&gt;I have been using "rdd.flatMap(lambda x:x)" for a while to create lists from columns however after I have changed the cluster to a Shared acess mode (to use unity catalog) I get the following error: &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;py4j.security.Py4JSecurityException: Method public org.apache.spark.rdd.RDD org.apache.spark.api.java.JavaRDD.rdd() is not whitelisted on class class org.apache.spark.api.java.JavaRDD&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I have tried to solve the error by adding: &lt;/P&gt;&lt;P&gt;"spark.databricks.pyspark.enablePy4JSecurity&amp;nbsp;false"&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;however I then get the following error:&lt;/P&gt;&lt;P&gt;"spark.databricks.pyspark.enablePy4JSecurity is not allowed when chossing an access mode"&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Does anybody know how to use RDD when using a cluster for unity catalouge? &lt;/P&gt;&lt;P&gt;Thank you! &lt;/P&gt;</description>
      <pubDate>Fri, 03 Mar 2023 13:32:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8296#M255</guid>
      <dc:creator>Christine</dc:creator>
      <dc:date>2023-03-03T13:32:09Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for cluster</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8297#M256</link>
      <description>&lt;P&gt;@Christine Pedersen​&amp;nbsp;: Would you like to start migrating to dataframes? The DataFrame API is a more modern and optimized way to work with structured data in Spark.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The error you are encountering is related to Py4J security settings in Apache Spark. In Shared access mode, Py4J security is enabled by default for security reasons, which restricts certain methods from being called on the Spark RDD object.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Mar 2023 03:51:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8297#M256</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-03-09T03:51:24Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for cluster</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8298#M257</link>
      <description>&lt;P&gt;Hi @Suteja Kanuri​,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In this case I am using pyspark dataframe, but I am trying to get alle values from a column in that dataframe and create a list. I am using this list to filter columns in another dataframe. (see example below):&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;value_list = pysparkDF.select(&amp;lt;column_name&amp;gt;).distinct().rdd.flatMap(lambda x: x).collect()&lt;/P&gt;&lt;P&gt;filtered_table = DF2.filter(DF2.&amp;lt;column_name&amp;gt;.isin(value_list))&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;But I will try to search for ways to avoid lists and keep it in dataframe format. &lt;/P&gt;</description>
      <pubDate>Thu, 09 Mar 2023 07:16:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8298#M257</guid>
      <dc:creator>Christine</dc:creator>
      <dc:date>2023-03-09T07:16:29Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for cluster</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8299#M258</link>
      <description>&lt;P&gt;@Christine Pedersen​&amp;nbsp;:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;You can achieve this without collecting data into a list using Spark's built-in DataFrame operations.&lt;/P&gt;&lt;P&gt;You can use the  join operation to filter  DF2 based on the distinct values in the column from  pysparkDF . Here's an example:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;filtered_table = DF2.join(
    pysparkDF.select(&amp;lt;column_name&amp;gt;).distinct(),
    on=DF2.&amp;lt;column_name&amp;gt; == pysparkDF.&amp;lt;column_name&amp;gt;,
    how='inner'
)&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;This code will perform an inner join on DF2 and  pysparkDF using the column name, which will effectively filter DF2 based on the distinct values of that column in  pysparkDF. Note that this approach will return a new DataFrame rather than a list, which should be more efficient for larger datasets&lt;/P&gt;</description>
      <pubDate>Mon, 13 Mar 2023 06:59:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8299#M258</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-03-13T06:59:45Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for cluster</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8300#M259</link>
      <description>&lt;P&gt;@Suteja Kanuri​&amp;nbsp;&lt;/P&gt;&lt;P&gt;let me know if I have to do this rdd.map on a column having json data, and then read it as a json string in pyspark!&lt;/P&gt;&lt;P&gt;how can I do that!!&lt;/P&gt;&lt;P&gt;Sample Syantx for what I'm trying to achieve on a shared cluster with the same error related to "&lt;B&gt;spark.databricks.pyspark.enablePy4JSecurity"&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;Syntax: &lt;U&gt;spark.read.json(df.rdd.map(lambda x:x[0]))&lt;/U&gt;&lt;/B&gt;&lt;/P&gt;&lt;P&gt;&lt;B&gt;what will be the optimal alternative for the same!!&lt;/B&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Jun 2023 05:14:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/8300#M259</guid>
      <dc:creator>Shivanshu_</dc:creator>
      <dc:date>2023-06-06T05:14:36Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/39078#M1125</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;I have the exact same issue as &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/70328"&gt;@Shivanshu_&lt;/a&gt;&amp;nbsp;any help would be highly appreciated.&lt;/P&gt;</description>
      <pubDate>Fri, 04 Aug 2023 05:38:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/39078#M1125</guid>
      <dc:creator>Barmat</dc:creator>
      <dc:date>2023-08-04T05:38:10Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/39259#M1130</link>
      <description>&lt;P&gt;Try this:&lt;/P&gt;&lt;P&gt;# Change column_name to the actual column name:&lt;BR /&gt;placeholder_list = spark.sql("select column from table").collect()&lt;BR /&gt;desired_list = [row.column_name for row in placeholder_list]&lt;BR /&gt;print(desired_list)&lt;/P&gt;</description>
      <pubDate>Mon, 07 Aug 2023 14:19:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/39259#M1130</guid>
      <dc:creator>Haiyangl104</dc:creator>
      <dc:date>2023-08-07T14:19:03Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/41041#M1164</link>
      <description>&lt;P&gt;Try setting below configuration in databricks notebook, then retry. It should work.&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;spark.conf.&lt;/SPAN&gt;&lt;SPAN&gt;set&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;"spark.jvm.class.allowlist"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"spark.databricks.pyspark.enablePy4JSecurity"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Tue, 22 Aug 2023 16:37:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/41041#M1164</guid>
      <dc:creator>Sumit_Kumar</dc:creator>
      <dc:date>2023-08-22T16:37:24Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/51038#M1453</link>
      <description>&lt;P&gt;I get the same error while using repartition command in a shared cluster, works fine with single user cluster. Is there an alternative for that. Any issues with continuing single user cluster&lt;/P&gt;</description>
      <pubDate>Mon, 13 Nov 2023 09:35:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/51038#M1453</guid>
      <dc:creator>Nithya_r</dc:creator>
      <dc:date>2023-11-13T09:35:09Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/56324#M1517</link>
      <description>&lt;P&gt;this configuration does not work for me. please suggest any other solution. i do need to use rdd.mapPartitions for a data framework created from unity catalog data&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;df_unity_catalog.rdd.&lt;/SPAN&gt;&lt;SPAN&gt;mapPartitions&lt;/SPAN&gt;&lt;SPAN&gt;(an_function)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 03 Jan 2024 02:36:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/56324#M1517</guid>
      <dc:creator>283513</dc:creator>
      <dc:date>2024-01-03T02:36:09Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/60972#M1681</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/4670"&gt;@283513&lt;/a&gt;&amp;nbsp;were you able to solve this? I am facing the same issue with using vectorAssembler with unity cluster&lt;/P&gt;</description>
      <pubDate>Sun, 18 Feb 2024 01:05:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/60972#M1681</guid>
      <dc:creator>sue01</dc:creator>
      <dc:date>2024-02-18T01:05:01Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/61980#M1703</link>
      <description>&lt;P&gt;Faced this issue multiple times.&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;Solution:&lt;/STRONG&gt;&lt;/U&gt;&lt;/P&gt;&lt;P&gt;1. Don't use Shared Cluster or cluster without Unity Catalog enabled for running 'rdd' queries on Databricks.&lt;/P&gt;&lt;P&gt;2. Instead create a Personal Cluster (Single User) with basic configuration and with Unity Catalog enabled.&lt;/P&gt;&lt;P&gt;3. Also for the new compute cluster in Advanced Options set the following parameters:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Under Spark Config:&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;spark.databricks.driver.disableScalaOutput true &lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;spark.databricks.delta.preview.enabled true&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;Under Environment Variables:&lt;/SPAN&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;PYSPARK_PYTHON=/databricks/python3/bin/python3&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;Re-run your rdd queries with new compute cluster. It works perfectly well for me.&lt;/P&gt;</description>
      <pubDate>Mon, 26 Feb 2024 13:09:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/61980#M1703</guid>
      <dc:creator>KandyKad</dc:creator>
      <dc:date>2024-02-26T13:09:44Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/64355#M1732</link>
      <description>&lt;P&gt;faced with the same issue and working for a company, it is not possible to create a new cluster. do you have any other solution for this issue?&lt;/P&gt;</description>
      <pubDate>Thu, 21 Mar 2024 23:32:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/64355#M1732</guid>
      <dc:creator>mervekilincer</dc:creator>
      <dc:date>2024-03-21T23:32:48Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/68179#M1793</link>
      <description>&lt;P&gt;was this resolved?&lt;/P&gt;</description>
      <pubDate>Mon, 06 May 2024 09:21:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/68179#M1793</guid>
      <dc:creator>rahuja</dc:creator>
      <dc:date>2024-05-06T09:21:29Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/68822#M1804</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;Can you use &lt;SPAN&gt;&amp;nbsp;json.loads instead? Example below -&lt;/SPAN&gt;&lt;/P&gt;
&lt;LI-CODE lang="python"&gt;from pyspark.sql import Row
import json 
# Sample JSON data as a list of dictionaries (similar to JSON objects)
json_data_str = response.text
json_data = [json.loads(json_data_str)]

# Convert dictionaries to Row objects
rows = [Row(**json_dict) for json_dict in json_data]
# Create DataFrame from list of Row objects
df = spark.createDataFrame(rows)
# Show the DataFrame
df.display()
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 11 May 2024 19:30:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/68822#M1804</guid>
      <dc:creator>dkushari</dc:creator>
      <dc:date>2024-05-11T19:30:41Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/71051#M1848</link>
      <description>&lt;P&gt;I was having a similar issue in using .rdd.map()&lt;BR /&gt;Solved it by adding two key value pairs in the spark config for the cluster&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;spark.databricks.pyspark.enablePy4JSecurity false&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;spark.databricks.pyspark.trustedFilesystems org.apache.spark.api.java.JavaRDD&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;After this I was able to read the schema of the json from the column that was read as string&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; json_schema &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; spark.read.&lt;/SPAN&gt;&lt;SPAN&gt;json&lt;/SPAN&gt;&lt;SPAN&gt;(df.rdd.&lt;/SPAN&gt;&lt;SPAN&gt;map&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;lambda&lt;/SPAN&gt; &lt;SPAN&gt;row&lt;/SPAN&gt;&lt;SPAN&gt;: row.preferences)).schema&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; print&lt;/SPAN&gt;&lt;SPAN&gt;(json_schema)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 29 May 2024 17:44:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/71051#M1848</guid>
      <dc:creator>him_agg</dc:creator>
      <dc:date>2024-05-29T17:44:16Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/72805#M1869</link>
      <description>&lt;P&gt;Did you tried this in a UC enabled cluster?&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jun 2024 15:20:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/72805#M1869</guid>
      <dc:creator>Shivanshu_</dc:creator>
      <dc:date>2024-06-12T15:20:48Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/73785#M1877</link>
      <description>&lt;P&gt;In my case the problem was that we were trying to use SparkXGBoostRegressor and in the docs it says that it does not work on clusters with autoscaling enabled. So we just disabled autoscaling for the interactive cluster where we were testing the model and it worked like a charm &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hope it helps&lt;/P&gt;</description>
      <pubDate>Thu, 13 Jun 2024 11:44:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/73785#M1877</guid>
      <dc:creator>rahuja</dc:creator>
      <dc:date>2024-06-13T11:44:09Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/91056#M2141</link>
      <description>&lt;P&gt;Thanks, that solved me the issue!&lt;/P&gt;</description>
      <pubDate>Thu, 19 Sep 2024 12:01:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/91056#M2141</guid>
      <dc:creator>Makal</dc:creator>
      <dc:date>2024-09-19T12:01:42Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot use RDD and cannot set "spark.databricks.pyspark.enablePy4JSecurity false" for</title>
      <link>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/97412#M2231</link>
      <description>&lt;P&gt;Hello,&lt;BR /&gt;In the past I used&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;rdd.mapPartitions(lambda ...)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;to call functions that access third party APIs like azure ai translate text to batch call the API and return the batched data.&lt;BR /&gt;&lt;BR /&gt;How would one do this now?&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 03 Nov 2024 14:18:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-governance/cannot-use-rdd-and-cannot-set-quot-spark-databricks-pyspark/m-p/97412#M2231</guid>
      <dc:creator>de-qrosh</dc:creator>
      <dc:date>2024-11-03T14:18:04Z</dc:date>
    </item>
  </channel>
</rss>

