yesterday
Hi everyone,
I’m curious if anyone has successfully implemented Databricks Genie (chat/agent) for production use.
Currently, we’ve enabled a few Genie instances for power users who are comfortable working with data outside of the data team. However, we’re evaluating whether AI analytics tools like Genie are mature enough to be rolled out more broadly across the organization.
From my experience, even as a data team leader, I still find it necessary to carefully validate both prompts and outputs, especially for more complex questions that require connecting multiple business domains. But what Genie recommended confused me, screenshot below
Has anyone deployed Genie analytics at scale and opened access to a wider audience?
Would love to hear how others are approaching this. Thanks!
yesterday
Hi, I think you forgot to attach a screenshot. I'd be keen to see what you're seeing.
yesterday
Hi @Emma
The recommendation was from Genie code as attached below.
To me, this is not curated issue, it's 1. maturity of LLM and 2. user prompt quality. I observed several times that users asked questions with confusing perspective, Genie could decode it in a wrong way and provide wrong result
8 hours ago
Hi Julie, thanks for sharing the screenshot. There are a few things we'd recommend to customers when they are starting out on a Genie journey, we defintiely see enterprises creating Genie spaces at scale but it needs some thought and guardrails. Some recommendations for you, from one of our internal resources:
The goal of curating a space is to ensure your users can answer their questions accurately and consistently. Genie spaces are equipped with best-in-class models that are capable of generating sophisticated queries and have general knowledge about the world at large to interpret user questions. However, most business questions are domain-specific. Consequently, the role of a space curator is to fill these gaps through a combination of metadata and instructions. This requires iteration and practice, but this document aims to capture best practices and principles to create an effective space.
You can the key is to test and iterate, observing the history of what has been asked and what Genie delivered.
I hope this helps.
Thanks,
Emma
2 hours ago
Hi @emma_s
Thanks for the feedback, really appreciate the perspectives shared.
To provide a bit more context, we’ve already built several Genie instances and granted access to a group of power users internally, and use Genie agent production analysis for a while.
My question is more about the longer-term impact of AI analytics. From our experience, Genie doesn’t seem to add significant value if it’s primarily used to answer straightforward questions like “last month’s sales by X dimension YoY growth”. These types of standardized metrics are already well served by our BI tool.
When it comes to analytics, I tend to see two distinct layers:
Standardized metrics → live in BI tools, where users can easily access consistent
Exploration / deep research → where the real value lies, especially when connecting multiple domains to uncover insights
In our case, we’ve been leveraging Genie chat more for the second layer — enabling analysis that correlates multiple business signals, such as: marketing spend, traffic by channel, MTA attribution, onsite engagement, conversion and sales... etc
This kind of cross-domain exploration allows us to answer more complex “what if” questions and significantly reduce the time required (often replacing work that would otherwise take 10–20x more manual effort).
That’s why I’m particularly interested in how others are using Genie — especially the Genie agent capabilities for deeper research and exploratory analysis.
At the end of the day, I don’t think the true impact of analytics comes from making standardized metrics available. It comes from answering the hardest questions: Where should we invest next? Where are the hidden opportunities? What risks are we not seeing yet?
Given that, I’m curious — based on what you've seen, does it still make sense to limit Genie primarily to power users, or have you seen success scaling this type of deep exploratory analytics more broadly across the organization?
Thanks again, and looking forward to hearing your thoughts.
