<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Genie Agent Mode Visibility vs Standard Mode Monitoring in Generative AI</title>
    <link>https://community.databricks.com/t5/generative-ai/genie-agent-mode-visibility-vs-standard-mode-monitoring/m-p/157519#M1826</link>
    <description>&lt;P&gt;&lt;SPAN&gt;In standard Genie Space chat, space managers can use the Monitoring tab to review prompts/conversations. However, in Agent Mode, we see the warning:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;“Genie agent responses may contain results obtained using other users’ credentials and are hidden from space managers.”&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;A few questions:&lt;/SPAN&gt;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;SPAN&gt;Why is Agent Mode treated differently from standard Genie chat if both are ultimately querying the same underlying tables/data assets?&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;Is there currently any admin/workspace setting, governance control, or future roadmap item that would allow space managers to have fuller visibility into Agent Mode conversations/results for testing and governance purposes?&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;Do benchmarks, thumbs up/down feedback, and saved benchmark questions improve/tune both standard mode and Agent Mode behavior equally, or are they handled separately?&lt;/SPAN&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;SPAN&gt;Would appreciate any clarification from anyone who has implemented governance/testing processes around Genie Agent Mode. Thank you!!&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 22 May 2026 18:14:17 GMT</pubDate>
    <dc:creator>simmitil</dc:creator>
    <dc:date>2026-05-22T18:14:17Z</dc:date>
    <item>
      <title>Genie Agent Mode Visibility vs Standard Mode Monitoring</title>
      <link>https://community.databricks.com/t5/generative-ai/genie-agent-mode-visibility-vs-standard-mode-monitoring/m-p/157519#M1826</link>
      <description>&lt;P&gt;&lt;SPAN&gt;In standard Genie Space chat, space managers can use the Monitoring tab to review prompts/conversations. However, in Agent Mode, we see the warning:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;“Genie agent responses may contain results obtained using other users’ credentials and are hidden from space managers.”&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;A few questions:&lt;/SPAN&gt;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;SPAN&gt;Why is Agent Mode treated differently from standard Genie chat if both are ultimately querying the same underlying tables/data assets?&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;Is there currently any admin/workspace setting, governance control, or future roadmap item that would allow space managers to have fuller visibility into Agent Mode conversations/results for testing and governance purposes?&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;Do benchmarks, thumbs up/down feedback, and saved benchmark questions improve/tune both standard mode and Agent Mode behavior equally, or are they handled separately?&lt;/SPAN&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;SPAN&gt;Would appreciate any clarification from anyone who has implemented governance/testing processes around Genie Agent Mode. Thank you!!&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 22 May 2026 18:14:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/genie-agent-mode-visibility-vs-standard-mode-monitoring/m-p/157519#M1826</guid>
      <dc:creator>simmitil</dc:creator>
      <dc:date>2026-05-22T18:14:17Z</dc:date>
    </item>
    <item>
      <title>Re: Genie Agent Mode Visibility vs Standard Mode Monitoring</title>
      <link>https://community.databricks.com/t5/generative-ai/genie-agent-mode-visibility-vs-standard-mode-monitoring/m-p/157522#M1827</link>
      <description>&lt;H3&gt;Answers to your questions:&lt;/H3&gt;
&lt;OL&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;Why Agent Mode is treated differently&lt;/STRONG&gt;&lt;BR /&gt;Because &lt;STRONG&gt;Agent Mode can generate synthesized text/report answers from multi-step reasoning&lt;/STRONG&gt;, and internal product guidance says those answers may contain data outside the reviewing manager’s own &lt;STRONG&gt;RLS/CLS&lt;/STRONG&gt; scope, so managers may see the prompt but not open the answer by default.&lt;BR /&gt;By contrast, standard/chat-mode governance is more aligned with manager review and rerun workflows using the manager’s own credentials.&lt;/P&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;Current admin/governance control / roadmap&lt;/STRONG&gt;&lt;BR /&gt;The main control is &lt;STRONG&gt;Genie conversation/chat sharing&lt;/STRONG&gt; (Beta / workspace preview setting). When enabled, &lt;STRONG&gt;new conversations default to “Reviewable by space managers”&lt;/STRONG&gt;; when not enabled, conversations are &lt;STRONG&gt;Private&lt;/STRONG&gt;.&lt;BR /&gt;Enabling &lt;STRONG&gt;Genie Chat Sharing&lt;/STRONG&gt; is the current way to let space managers inspect &lt;STRONG&gt;Agent Mode&lt;/STRONG&gt; conversations/results; it applies to conversations created &lt;STRONG&gt;after&lt;/STRONG&gt; the setting is turned on, unless the user makes the conversation &lt;STRONG&gt;Private&lt;/STRONG&gt;.&lt;BR /&gt;The documented controls are&amp;nbsp;&lt;STRONG&gt;sharing&lt;/STRONG&gt; and &lt;STRONG&gt;request review&lt;/STRONG&gt;.&lt;/P&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;STRONG&gt;Benchmarks / thumbs up-down / saved benchmark questions: same or separate?&lt;/STRONG&gt;&lt;BR /&gt;&lt;STRONG&gt;Benchmarks support both Chat and Agent Mode&lt;/STRONG&gt;, but they are handled &lt;STRONG&gt;separately&lt;/STRONG&gt;: &lt;STRONG&gt;Chat mode&lt;/STRONG&gt; benchmarks compare against gold SQL, while &lt;STRONG&gt;Agent mode&lt;/STRONG&gt; benchmarks use the same multi-step reasoning as Agent Mode and are graded by an &lt;STRONG&gt;LLM judge&lt;/STRONG&gt;.&lt;BR /&gt;Also, benchmarks are for &lt;STRONG&gt;evaluation, not tuning&lt;/STRONG&gt;: the docs explicitly say benchmark questions and example SQL in benchmarks &lt;STRONG&gt;do not improve Genie’s context&lt;/STRONG&gt;.&lt;BR /&gt;&lt;STRONG&gt;Thumbs up/down&lt;/STRONG&gt; and review feedback help space managers &lt;STRONG&gt;refine the space&lt;/STRONG&gt; (instructions, examples, suggested SQL snippets), which can improve both modes indirectly, but that is &lt;STRONG&gt;space curation&lt;/STRONG&gt;, not automatic model tuning.&lt;BR /&gt;Saved representative answers/messages can be turned into &lt;STRONG&gt;benchmark questions&lt;/STRONG&gt;, which helps testing coverage, but again that is &lt;STRONG&gt;evaluation asset creation&lt;/STRONG&gt;, not direct tuning.&lt;/P&gt;
&lt;/LI&gt;
&lt;/OL&gt;
&lt;H3&gt;Practical governance pattern to use&lt;/H3&gt;
&lt;P&gt;Enable &lt;STRONG&gt;Genie Chat Sharing&lt;/STRONG&gt;, keep new conversations &lt;STRONG&gt;Reviewable by space managers&lt;/STRONG&gt;, use &lt;STRONG&gt;Request review&lt;/STRONG&gt; for edge cases, and run &lt;STRONG&gt;separate Chat-mode and Agent-mode benchmark suites&lt;/STRONG&gt; for validation.&lt;/P&gt;</description>
      <pubDate>Sat, 23 May 2026 05:14:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/genie-agent-mode-visibility-vs-standard-mode-monitoring/m-p/157522#M1827</guid>
      <dc:creator>Lu_Wang_ENB_DBX</dc:creator>
      <dc:date>2026-05-23T05:14:45Z</dc:date>
    </item>
  </channel>
</rss>

