cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Genie Agent Mode Visibility vs Standard Mode Monitoring

simmitil
New Contributor

In standard Genie Space chat, space managers can use the Monitoring tab to review prompts/conversations. However, in Agent Mode, we see the warning:

โ€œGenie agent responses may contain results obtained using other usersโ€™ credentials and are hidden from space managers.โ€

A few questions:

  1. Why is Agent Mode treated differently from standard Genie chat if both are ultimately querying the same underlying tables/data assets?
  2. Is there currently any admin/workspace setting, governance control, or future roadmap item that would allow space managers to have fuller visibility into Agent Mode conversations/results for testing and governance purposes?
  3. Do benchmarks, thumbs up/down feedback, and saved benchmark questions improve/tune both standard mode and Agent Mode behavior equally, or are they handled separately?

Would appreciate any clarification from anyone who has implemented governance/testing processes around Genie Agent Mode. Thank you!!

1 REPLY 1

Lu_Wang_ENB_DBX
Databricks Employee
Databricks Employee

Answers to your questions:

  1. Why Agent Mode is treated differently
    Because Agent Mode can generate synthesized text/report answers from multi-step reasoning, and internal product guidance says those answers may contain data outside the reviewing managerโ€™s own RLS/CLS scope, so managers may see the prompt but not open the answer by default.
    By contrast, standard/chat-mode governance is more aligned with manager review and rerun workflows using the managerโ€™s own credentials.

  2. Current admin/governance control / roadmap
    The main control is Genie conversation/chat sharing (Beta / workspace preview setting). When enabled, new conversations default to โ€œReviewable by space managersโ€; when not enabled, conversations are Private.
    Enabling Genie Chat Sharing is the current way to let space managers inspect Agent Mode conversations/results; it applies to conversations created after the setting is turned on, unless the user makes the conversation Private.
    The documented controls are sharing and request review.

  3. Benchmarks / thumbs up-down / saved benchmark questions: same or separate?
    Benchmarks support both Chat and Agent Mode, but they are handled separately: Chat mode benchmarks compare against gold SQL, while Agent mode benchmarks use the same multi-step reasoning as Agent Mode and are graded by an LLM judge.
    Also, benchmarks are for evaluation, not tuning: the docs explicitly say benchmark questions and example SQL in benchmarks do not improve Genieโ€™s context.
    Thumbs up/down and review feedback help space managers refine the space (instructions, examples, suggested SQL snippets), which can improve both modes indirectly, but that is space curation, not automatic model tuning.
    Saved representative answers/messages can be turned into benchmark questions, which helps testing coverage, but again that is evaluation asset creation, not direct tuning.

Practical governance pattern to use

Enable Genie Chat Sharing, keep new conversations Reviewable by space managers, use Request review for edge cases, and run separate Chat-mode and Agent-mode benchmark suites for validation.