<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks AI Genie - Data Security and Thrid Party Platform in Generative AI</title>
    <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/125375#M1019</link>
    <description>&lt;P&gt;Genie allows you to use serverless as well as classic compute (sql warehouse). Classic compute runs within your tenant using your vpc/vnet. Irrespective of the type of compute you use, data needs to be read into it and this applies to any other solution as well. In case of databricks sql warehouse (serverless) you have an option to use a) NCC &amp;gt; Private Link to your storage accounts ie secure ingress control to read data b) Network polcies ie to prevent data exfiltration or have an egress f/w. Further more you are allowed to bring your own encryption keys aka CMK to encrypt the metadata / queries saved within the genie space. In short there are several options available to utilize genie and not compromise your security posture and that’s the true value of our platform “optionality”.&lt;/P&gt;</description>
    <pubDate>Tue, 15 Jul 2025 21:18:27 GMT</pubDate>
    <dc:creator>k1chi</dc:creator>
    <dc:date>2025-07-15T21:18:27Z</dc:date>
    <item>
      <title>Databricks AI Genie - Data Security and Thrid Party Platform</title>
      <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/100852#M654</link>
      <description>&lt;P&gt;I am currently exploring the possibility of using Databricks AI Genie to allow layman users to ask questions and retrieve data on their own.&lt;/P&gt;&lt;P&gt;We would like to keep the data in our warehouse (e.g., Snowflake or local). I read the documentation, but it seems like the data must be uploaded to the Databricks server to use Genie. I'm wondering if, rather than uploading the data to Databricks, is there a way for Genie to read the data that on another platform—such as by using a Snowflake connector or even accessing it from a local host. Also, how secure is Genie AI? Thank you!&lt;/P&gt;</description>
      <pubDate>Wed, 04 Dec 2024 06:43:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/100852#M654</guid>
      <dc:creator>ChrisChan</dc:creator>
      <dc:date>2024-12-04T06:43:50Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks AI Genie - Data Security and Thrid Party Platform</title>
      <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/100866#M655</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/134890"&gt;@ChrisChan&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;You’re absolutely right that the data used with Genie needs to be managed under Unity Catalog. However, if you want Genie to query data in Snowflake, you can use lakehouse federation. I’ve personally tried this method, and it worked successfully for me.&lt;/P&gt;&lt;P&gt;Additionally, this documentation might be helpful regarding Genie's security features. Apologies if you’re already familiar with it:&lt;BR /&gt;&lt;A href="https://docs.databricks.com/en/genie/index.html#privacy-and-security" target="_new" rel="noopener"&gt;&lt;SPAN&gt;https&lt;/SPAN&gt;&lt;SPAN&gt;://docs&lt;/SPAN&gt;&lt;SPAN&gt;.databricks&lt;/SPAN&gt;&lt;SPAN&gt;.com&lt;/SPAN&gt;&lt;SPAN&gt;/en&lt;/SPAN&gt;&lt;SPAN&gt;/genie&lt;/SPAN&gt;&lt;SPAN&gt;/index&lt;/SPAN&gt;&lt;SPAN&gt;.html&lt;/SPAN&gt;&lt;SPAN&gt;#privacy&lt;/SPAN&gt;&lt;SPAN&gt;-and&lt;/SPAN&gt;&lt;SPAN&gt;-security&lt;/SPAN&gt;&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Dec 2024 08:30:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/100866#M655</guid>
      <dc:creator>Takuya-Omi</dc:creator>
      <dc:date>2024-12-04T08:30:34Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks AI Genie - Data Security and Thrid Party Platform</title>
      <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/100998#M656</link>
      <description>&lt;P&gt;Many thanks for your advice! It works well with lakehouse federation.&lt;/P&gt;</description>
      <pubDate>Thu, 05 Dec 2024 03:14:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/100998#M656</guid>
      <dc:creator>ChrisChan</dc:creator>
      <dc:date>2024-12-05T03:14:41Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks AI Genie - Data Security and Thrid Party Platform</title>
      <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/101008#M658</link>
      <description>&lt;P&gt;Sorry, one more question. I successfully use Genie to query data via &lt;SPAN&gt;lakehouse federation&lt;/SPAN&gt;, but I also see there is a limitation that&amp;nbsp;Single user access mode is only available for users owning the connection. From your experience, is that means the user must have the ownership of the connection like (edit, remove etc). Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 05 Dec 2024 05:51:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/101008#M658</guid>
      <dc:creator>ChrisChan</dc:creator>
      <dc:date>2024-12-05T05:51:49Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks AI Genie - Data Security and Thrid Party Platform</title>
      <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/101025#M659</link>
      <description>&lt;P&gt;If this is about access permissions to data within Genie, I thought the following documentation might be helpful&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/en/genie/index.html#required-permissions" target="_blank" rel="noopener"&gt;https://docs.databricks.com/en/genie/index.html#required-permissions&lt;/A&gt;&lt;/P&gt;&lt;UL class=""&gt;&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;Data access permissions:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;Any user who interacts with the space needs at least&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;SELECT&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;privileges on the data used in a space.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P data-unlink="true"&gt;&lt;STRONG&gt;Genie space permissions:&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;Users need CAN RUN permissions on the Genie space to interact with Genie and the data used in the space. See&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Genie space ACLs&lt;/SPAN&gt;&amp;nbsp;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;for a complete mapping of privileges and abilities for a Genie space.&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Thu, 05 Dec 2024 06:55:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/101025#M659</guid>
      <dc:creator>Takuya-Omi</dc:creator>
      <dc:date>2024-12-05T06:55:01Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks AI Genie - Data Security and Thrid Party Platform</title>
      <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/101254#M663</link>
      <description>&lt;P&gt;I'll take a stab at "&lt;SPAN&gt;Also, how secure is Genie AI?" since I've dug into this for our own uses.&amp;nbsp;There aren't many moving parts to Genie, it's really just a fine-tuned LLM and the rest is the same stuff you use in your notebooks.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The most insecure part of Genie I could find is that it uses serverless compute, and serverless compute is hosted in Databricks' tenant, not yours.&amp;nbsp; This means for a brief period of time, the prompt and metadata exist in the memory of a VM hosted outside your realm.&amp;nbsp; Per the docs, serverless compute nodes are isolated from one another but to me there is an "ick factor" when I make the statement "our data never leaves our environment" to the business and then I have to explain this technicality to InfoSec.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Genie itself does not access any of your data directly.&amp;nbsp; The prompt and your metadata are sent to the Genie model, which then generates a SQL statement.&amp;nbsp; This SQL is then executed on the serverless compute engine against the data stored in your tenant, same as if you were using a notebook or DLT job with serverless compute.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;The LLM behind Genie is currently the Azure Open AI Model, which is Microsoft's hosted version of the LLM behind ChatGPT, and Databricks opted in to "&lt;SPAN&gt;exemption from abuse monitoring and human review program, under which Microsoft does not store any prompts and completions sent to the Azure OpenAI service&lt;/SPAN&gt;" (see&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/genie/#privacy-and-security" target="_blank"&gt;Work with an AI/BI Genie space - Azure Databricks | Microsoft Learn&lt;/A&gt;).&amp;nbsp; If you're on AWS or GCP I'd expect the models are different but I didn't check.&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;My reply here is what I understand at this time, but security is fight club so solid answers are difficult to come by.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2024 15:44:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/101254#M663</guid>
      <dc:creator>Rjdudley</dc:creator>
      <dc:date>2024-12-06T15:44:41Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks AI Genie - Data Security and Thrid Party Platform</title>
      <link>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/125375#M1019</link>
      <description>&lt;P&gt;Genie allows you to use serverless as well as classic compute (sql warehouse). Classic compute runs within your tenant using your vpc/vnet. Irrespective of the type of compute you use, data needs to be read into it and this applies to any other solution as well. In case of databricks sql warehouse (serverless) you have an option to use a) NCC &amp;gt; Private Link to your storage accounts ie secure ingress control to read data b) Network polcies ie to prevent data exfiltration or have an egress f/w. Further more you are allowed to bring your own encryption keys aka CMK to encrypt the metadata / queries saved within the genie space. In short there are several options available to utilize genie and not compromise your security posture and that’s the true value of our platform “optionality”.&lt;/P&gt;</description>
      <pubDate>Tue, 15 Jul 2025 21:18:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/databricks-ai-genie-data-security-and-thrid-party-platform/m-p/125375#M1019</guid>
      <dc:creator>k1chi</dc:creator>
      <dc:date>2025-07-15T21:18:27Z</dc:date>
    </item>
  </channel>
</rss>

