<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Insufficient privileges:User does not have permission SELECT on any file in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/69107#M9527</link>
    <description>&lt;P&gt;Hello &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/99364"&gt;@Hkesharwani&lt;/a&gt; ,&lt;/P&gt;&lt;P&gt;thanks for replying.&lt;/P&gt;&lt;P&gt;Indeed, as I stated in the beginning of my post, the issue occurs only with shared cluster usage (single user cluster all is fine). Since I *have to* switch to shared cluster (rowlevel security is only available there atm.), it would be great if someone provides any insights of what is causing this issue on shared clusters.&lt;/P&gt;</description>
    <pubDate>Wed, 15 May 2024 20:34:54 GMT</pubDate>
    <dc:creator>GeKo</dc:creator>
    <dc:date>2024-05-15T20:34:54Z</dc:date>
    <item>
      <title>Insufficient privileges:User does not have permission SELECT on any file</title>
      <link>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/68718#M9525</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;after switching to "shared cluster" usage a python job is failing with error message:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;Py4JJavaError: An error occurred while calling o877.load.
: org.apache.spark.SparkSecurityException: [INSUFFICIENT_PERMISSIONS] Insufficient privileges:
User does not have permission SELECT on any file.&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This error happens on the attempt of reading messages from a Kafka topic, according to the stacktrace (in the spark method spark_.read) =&amp;gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;    288 else:
    289     raw_df = (
    290         self.spark_.read.format("kafka")
    291         .option(
    292             "kafka.bootstrap.servers",
    293             self.kafka_secrets.kafka_bootstrap_servers,
    294         )
    295         .option("subscribe", topic.topic)
    296         .option("groupIdPrefix", topic.consumer_group_prefix)
    297         .option("startingOffsets", "earliest")
    298         .option("failOnDataLoss", "false")
    299         .option("includeHeaders", "true")
    300         .options(**self.sasl_ssl_auth_options)
    301         .options(**spark_opts)
--&amp;gt; 302         .load()
    303     ).drop("timestampType")&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;The job runs fine if "streaming" is enabled, means we use &lt;STRONG&gt;spark_.readStream&lt;/STRONG&gt; instead.&lt;/P&gt;&lt;P&gt;What exactly is raising the "INSUFFICIENT_PERMISSIONS" error, at using "spark_.read" methon , and how to get rid of it ?!?!&lt;/P&gt;&lt;P&gt;Usually this error is thrown if someone wants to access data on DBFS or has tableACLs enabled, but both of them is not the case here.&lt;/P&gt;&lt;P&gt;Context:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;using shared cluster&lt;/LI&gt;&lt;LI&gt;everything is managed via UnityCatalog&lt;/LI&gt;&lt;LI&gt;no Hive metastore is in use, table ACLs are disabled&lt;/LI&gt;&lt;LI&gt;the job does not interact with any data from DBFS (it simply wants to read from Kafka), also potential checkpoints of Kafka are configured to use UC Volume&lt;/LI&gt;&lt;LI&gt;I know that the statement "grant select on any file..." would solve the problem, but I don't want to use it, since I explicitly do not want to allow something on DBFS which I do not want to use anyways, neither Hive metastore related stuff&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Since the difference in behaviour is between using &lt;STRONG&gt;spark_.read vs spark_.readStream&lt;/STRONG&gt; my guess is, that the spark_.read is internally trying to access/interact with Hive-Metastore&lt;/P&gt;&lt;P&gt;Any hint how to eliminate this issue is highly appreciated &lt;span class="lia-unicode-emoji" title=":grinning_face_with_smiling_eyes:"&gt;😄&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 10 May 2024 09:57:51 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/68718#M9525</guid>
      <dc:creator>GeKo</dc:creator>
      <dc:date>2024-05-10T09:57:51Z</dc:date>
    </item>
    <item>
      <title>Re: Insufficient privileges:User does not have permission SELECT on any file</title>
      <link>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/69099#M9526</link>
      <description>&lt;P&gt;Hi, The reason for this issue could be shared cluster, Unity catalog best supports with personal cluster or job clusters.&lt;BR /&gt;I would suggest try using personal cluster.&lt;BR /&gt;Check out the below article this might help&lt;BR /&gt;&lt;A href="https://community.databricks.com/t5/data-engineering/create-table-using-a-location/td-p/68725" target="_blank"&gt;https://community.databricks.com/t5/data-engineering/create-table-using-a-location/td-p/68725&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 15 May 2024 17:47:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/69099#M9526</guid>
      <dc:creator>Hkesharwani</dc:creator>
      <dc:date>2024-05-15T17:47:32Z</dc:date>
    </item>
    <item>
      <title>Re: Insufficient privileges:User does not have permission SELECT on any file</title>
      <link>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/69107#M9527</link>
      <description>&lt;P&gt;Hello &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/99364"&gt;@Hkesharwani&lt;/a&gt; ,&lt;/P&gt;&lt;P&gt;thanks for replying.&lt;/P&gt;&lt;P&gt;Indeed, as I stated in the beginning of my post, the issue occurs only with shared cluster usage (single user cluster all is fine). Since I *have to* switch to shared cluster (rowlevel security is only available there atm.), it would be great if someone provides any insights of what is causing this issue on shared clusters.&lt;/P&gt;</description>
      <pubDate>Wed, 15 May 2024 20:34:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/69107#M9527</guid>
      <dc:creator>GeKo</dc:creator>
      <dc:date>2024-05-15T20:34:54Z</dc:date>
    </item>
    <item>
      <title>Re: Insufficient privileges:User does not have permission SELECT on any file</title>
      <link>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/72105#M9528</link>
      <description>&lt;P&gt;hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/86174"&gt;@GeKo&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;did you get any solution ?&lt;/P&gt;</description>
      <pubDate>Sat, 08 Jun 2024 05:01:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/72105#M9528</guid>
      <dc:creator>sravs_227</dc:creator>
      <dc:date>2024-06-08T05:01:38Z</dc:date>
    </item>
    <item>
      <title>Re: Insufficient privileges:User does not have permission SELECT on any file</title>
      <link>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/77161#M9529</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/106554"&gt;@sravs_227&lt;/a&gt; ,&lt;BR /&gt;the issue was, that the checkpoint directory (while reading from kafka) was set to a dbfs folder. We switched this now to also UC volume&lt;/P&gt;</description>
      <pubDate>Mon, 08 Jul 2024 14:30:51 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/77161#M9529</guid>
      <dc:creator>GeKo</dc:creator>
      <dc:date>2024-07-08T14:30:51Z</dc:date>
    </item>
    <item>
      <title>Re: Insufficient privileges:User does not have permission SELECT on any file</title>
      <link>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/105608#M9530</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/86174"&gt;@GeKo&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;The checkpoint directory, is that set on cluster level or how do we set that ? Can you please help me with this ?&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jan 2025 16:21:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/insufficient-privileges-user-does-not-have-permission-select-on/m-p/105608#M9530</guid>
      <dc:creator>Uj337</dc:creator>
      <dc:date>2025-01-14T16:21:12Z</dc:date>
    </item>
  </channel>
</rss>

