<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Databricks Dashboard Optimization in Warehousing &amp; Analytics</title>
    <link>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141224#M2411</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/168412"&gt;@nanditakrishnan&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;There's already something like that in databricks dashboards, but some conditions need to be fulfilled (i.e queries need to share same group by). One of dataset optimization techniques that databricks team implemented is doing following:&lt;/P&gt;&lt;P&gt;"&lt;SPAN&gt;For visualization queries sent to the backend, separate visualization queries against the same dataset that share the same&amp;nbsp;&lt;/SPAN&gt;GROUP BY&lt;SPAN&gt;&amp;nbsp;clauses and filter predicates are combined into a single query for processing. In this case, users may see a combined query in the query history that fetches results for multiple visualizations or filters."&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;There are also other optimization tricks that they're doing - you can read about them below:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/dashboards/caching#dataset-optimizations" target="_blank" rel="noopener"&gt;Dataset optimization and caching - Azure Databricks | Microsoft Learn&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 05 Dec 2025 08:44:18 GMT</pubDate>
    <dc:creator>szymon_dybczak</dc:creator>
    <dc:date>2025-12-05T08:44:18Z</dc:date>
    <item>
      <title>Databricks Dashboard Optimization</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141204#M2410</link>
      <description>&lt;P&gt;I have trouble understanding why, for every report in a dashboard that refers to the same data source, the query re-runs each time&amp;nbsp;&lt;/P&gt;&lt;P&gt;Ideally I would want queries being used to fuel the tables for the dashboard run exactly once, and then have the filters/parameters run on top of this already built-table&lt;/P&gt;&lt;P&gt;Is that an incorrect expectation? Or is there a better way around this?&lt;/P&gt;</description>
      <pubDate>Fri, 05 Dec 2025 03:28:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141204#M2410</guid>
      <dc:creator>nanditakrishnan</dc:creator>
      <dc:date>2025-12-05T03:28:17Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Dashboard Optimization</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141224#M2411</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/168412"&gt;@nanditakrishnan&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;There's already something like that in databricks dashboards, but some conditions need to be fulfilled (i.e queries need to share same group by). One of dataset optimization techniques that databricks team implemented is doing following:&lt;/P&gt;&lt;P&gt;"&lt;SPAN&gt;For visualization queries sent to the backend, separate visualization queries against the same dataset that share the same&amp;nbsp;&lt;/SPAN&gt;GROUP BY&lt;SPAN&gt;&amp;nbsp;clauses and filter predicates are combined into a single query for processing. In this case, users may see a combined query in the query history that fetches results for multiple visualizations or filters."&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;There are also other optimization tricks that they're doing - you can read about them below:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/dashboards/caching#dataset-optimizations" target="_blank" rel="noopener"&gt;Dataset optimization and caching - Azure Databricks | Microsoft Learn&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Dec 2025 08:44:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141224#M2411</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2025-12-05T08:44:18Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Dashboard Optimization</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141284#M2414</link>
      <description>&lt;P&gt;I see multiple queries in my query history instead of a combined one. Thanks for sharing the doc though - my data being greater than 100K rows and dashboard being parameterized could be the reason.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Dec 2025 17:58:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141284#M2414</guid>
      <dc:creator>nanditakrishnan</dc:creator>
      <dc:date>2025-12-05T17:58:50Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Dashboard Optimization</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141288#M2415</link>
      <description>&lt;P&gt;Your expectation is reasonable, but most dashboards are designed to recompute queries per visual. The best workaround is to precompute the base dataset in a table or view, and have all visuals query that instead. This reduces query load and ensures consistent results across visuals.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Dec 2025 18:08:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141288#M2415</guid>
      <dc:creator>Poorva21</dc:creator>
      <dc:date>2025-12-05T18:08:48Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks Dashboard Optimization</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141295#M2416</link>
      <description>&lt;P&gt;The base table/query for all of my visuals is the same - it's a single query coming from the Data Tab&lt;/P&gt;&lt;P&gt;Unless you mean a precomputed table that actually lives in a schema, in which case the problem is I can't give users the flexibility of choosing the date ranges&lt;/P&gt;</description>
      <pubDate>Fri, 05 Dec 2025 21:40:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-dashboard-optimization/m-p/141295#M2416</guid>
      <dc:creator>nanditakrishnan</dc:creator>
      <dc:date>2025-12-05T21:40:16Z</dc:date>
    </item>
  </channel>
</rss>

