<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: StringIndexer method fails with shared compute in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/stringindexer-method-fails-with-shared-compute/m-p/94828#M38975</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;The issue you're encountering with the StringIndexer method from the MLflow library failing on a Unity Catalog-enabled Databricks cluster with Shared access mode is likely due to the limitations associated with Shared access mode in Unity Catalog&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Shared Access Mode Limitations on Unity Catalog:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;- Databricks Runtime ML and Spark Machine Learning Library (MLlib) are not supported in Shared access mode on Unity Catalog. &lt;EM&gt;This limitation could directly impact the functionality of the StringIndexer method, which is part of the Spark MLlib.&lt;/EM&gt;&lt;BR /&gt;- Spark-submit jobs are not supported in Shared access mode on Unity Catalog.&lt;BR /&gt;- PySpark UDFs cannot access Git folders, workspace files, or volumes to import modules in Databricks Runtime 14.2 and below.&lt;BR /&gt;- DBFS root and mounts do not support FUSE in Shared access mode.&lt;/P&gt;
&lt;P&gt;For more understanding check: &lt;A href="https://docs.databricks.com/en/compute/access-mode-limitations.html" target="_blank"&gt;https://docs.databricks.com/en/compute/access-mode-limitations.html&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 18 Oct 2024 10:59:15 GMT</pubDate>
    <dc:creator>shashank853</dc:creator>
    <dc:date>2024-10-18T10:59:15Z</dc:date>
    <item>
      <title>StringIndexer method fails with shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/stringindexer-method-fails-with-shared-compute/m-p/94825#M38974</link>
      <description>&lt;P&gt;Dear Team&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;StringIndexer method of mlflow library upon running code on &lt;/SPAN&gt;&lt;SPAN&gt;No Isolation Shared access mode data bricks&amp;nbsp;cluster it works but it is failing on Unity catalog enabled data bricks cluster having Shared access mode. &lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;here is the library name:&amp;nbsp;from pyspark.ml.feature import StringIndexer.&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Error is py4j.security. Py4JSecurityException: Constructor public org.apache.spark.ml.feature.&lt;/SPAN&gt;&lt;SPAN&gt;StringIndexer(java.lang.String) is not whitelisted.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 18 Oct 2024 10:53:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stringindexer-method-fails-with-shared-compute/m-p/94825#M38974</guid>
      <dc:creator>sandeephenkel23</dc:creator>
      <dc:date>2024-10-18T10:53:43Z</dc:date>
    </item>
    <item>
      <title>Re: StringIndexer method fails with shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/stringindexer-method-fails-with-shared-compute/m-p/94828#M38975</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;The issue you're encountering with the StringIndexer method from the MLflow library failing on a Unity Catalog-enabled Databricks cluster with Shared access mode is likely due to the limitations associated with Shared access mode in Unity Catalog&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Shared Access Mode Limitations on Unity Catalog:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;- Databricks Runtime ML and Spark Machine Learning Library (MLlib) are not supported in Shared access mode on Unity Catalog. &lt;EM&gt;This limitation could directly impact the functionality of the StringIndexer method, which is part of the Spark MLlib.&lt;/EM&gt;&lt;BR /&gt;- Spark-submit jobs are not supported in Shared access mode on Unity Catalog.&lt;BR /&gt;- PySpark UDFs cannot access Git folders, workspace files, or volumes to import modules in Databricks Runtime 14.2 and below.&lt;BR /&gt;- DBFS root and mounts do not support FUSE in Shared access mode.&lt;/P&gt;
&lt;P&gt;For more understanding check: &lt;A href="https://docs.databricks.com/en/compute/access-mode-limitations.html" target="_blank"&gt;https://docs.databricks.com/en/compute/access-mode-limitations.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 18 Oct 2024 10:59:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stringindexer-method-fails-with-shared-compute/m-p/94828#M38975</guid>
      <dc:creator>shashank853</dc:creator>
      <dc:date>2024-10-18T10:59:15Z</dc:date>
    </item>
    <item>
      <title>Re: StringIndexer method fails with shared compute</title>
      <link>https://community.databricks.com/t5/data-engineering/stringindexer-method-fails-with-shared-compute/m-p/150826#M53526</link>
      <description>&lt;P&gt;so can we not run Spark ML in the Databricks Free edition that uses only serverless compute?&lt;/P&gt;</description>
      <pubDate>Fri, 13 Mar 2026 13:58:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/stringindexer-method-fails-with-shared-compute/m-p/150826#M53526</guid>
      <dc:creator>vivadiva1981</dc:creator>
      <dc:date>2026-03-13T13:58:00Z</dc:date>
    </item>
  </channel>
</rss>

