<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Databricks Data Warehousing Announcements— July 2024 in Announcements</title>
    <link>https://community.databricks.com/t5/announcements/databricks-data-warehousing-announcements-july-2024/m-p/83571#M225</link>
    <description>&lt;H1 id="89cc" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;Predictive Optimisation&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://www.databricks.com/blog/announcing-general-availability-predictive-optimization" target="_blank" rel="noopener ugc nofollow"&gt;Predictive Optimisation is in GA&lt;/A&gt;, which uses AI to understand the maintenance operations required from Unity Catalog (eg: data access patterns) and automatically runs optimisations on your data layouts to improve query performance. This removes manual overhead of scheduling optimisation jobs with considerations around frequency, type of optimisation, tables are automatically managed&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;&amp;nbsp;&lt;/P&gt;
&lt;H1 id="e63c" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;Cost Management Dashboards&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/account-settings/usage.html#import-a-usage-dashboard" target="_blank" rel="noopener ugc nofollow"&gt;This is in Public Preview&lt;/A&gt;. Account admins can now import dashboards to monitor costs at either an account level or on a workspace level. Use the dashboard to view the metrics below, with the option to fully customise the dashboard&lt;/P&gt;
&lt;UL class=""&gt;
&lt;LI id="e745" class="ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage breakdown by SKU name&lt;/LI&gt;
&lt;LI id="642d" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage analysis based on custom tags&lt;/LI&gt;
&lt;LI id="fe8b" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage analysis on the most expensive usage&lt;/LI&gt;
&lt;LI id="1745" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage breakdown by billing origin product&lt;/LI&gt;
&lt;/UL&gt;
&lt;FIGURE class="vo vp vq vr vs vt vl vm paragraph-image"&gt;
&lt;DIV class="vu vv fl vw bg vx" tabindex="0" role="button"&gt;
&lt;DIV class="vl vm vn"&gt;&lt;PICTURE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 640w, https://miro.medium.com/v2/resize:fit:720/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 720w, https://miro.medium.com/v2/resize:fit:750/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 750w, https://miro.medium.com/v2/resize:fit:786/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 786w, https://miro.medium.com/v2/resize:fit:828/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 828w, https://miro.medium.com/v2/resize:fit:1100/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 1400w" type="image/webp" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px"&gt;&lt;/SOURCE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/1*oG9yko9HmU-6te02u9U1Aw.png 640w, https://miro.medium.com/v2/resize:fit:720/1*oG9yko9HmU-6te02u9U1Aw.png 720w, https://miro.medium.com/v2/resize:fit:750/1*oG9yko9HmU-6te02u9U1Aw.png 750w, https://miro.medium.com/v2/resize:fit:786/1*oG9yko9HmU-6te02u9U1Aw.png 786w, https://miro.medium.com/v2/resize:fit:828/1*oG9yko9HmU-6te02u9U1Aw.png 828w, https://miro.medium.com/v2/resize:fit:1100/1*oG9yko9HmU-6te02u9U1Aw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/1*oG9yko9HmU-6te02u9U1Aw.png 1400w" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px" data-testid="og"&gt;&lt;/SOURCE&gt;&lt;/PICTURE&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Beatrice_Liew_0-1724141840687.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/10447i906EEC2ADC87B53E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Beatrice_Liew_0-1724141840687.png" alt="Beatrice_Liew_0-1724141840687.png" /&gt;&lt;/span&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/FIGURE&gt;
&lt;H1 class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;&amp;nbsp;&lt;/H1&gt;
&lt;H1 id="5848" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;System Table updates&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;There are various updates around system tables, which is Databricks storage of operational data for observability:&lt;/P&gt;
&lt;UL class=""&gt;
&lt;LI id="ae73" class="ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/assistant.html" target="_blank" rel="noopener ugc nofollow"&gt;Databricks Assistant system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in public preview: Track the usage of Databricks assistant through system.access.assistant_events table, which will record the workspace, datetime, and the email of the user initiating a message on assistant.&lt;/LI&gt;
&lt;LI id="1405" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/compute.html#node-timeline" target="_blank" rel="noopener ugc nofollow"&gt;Node timeline system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;are in public preview: The node timeline table provides node level utilisation at minute granularity. Monitor metrics such as node type, cpu &amp;amp; memory utilisation, as well as network traffic sent in bytes.&lt;/LI&gt;
&lt;LI id="9a33" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/query-history.html" target="_blank" rel="noopener ugc nofollow"&gt;Query history system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in public preview: The system.query.history table records every SQL statement that has ran via SQL warehouses, where metrics such as the SQL statement, the warehouse id, execution duration, bytes read etc are available.&lt;/LI&gt;
&lt;LI id="8143" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/billing.html" target="_blank" rel="noopener ugc nofollow"&gt;Billing system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;are enabled by default in all Unity catalog workspaces. Billing tables allow you to get an overview of usage by SKU, duration etc&lt;/LI&gt;
&lt;LI id="22d9" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/jobs.html" target="_blank" rel="noopener ugc nofollow"&gt;Workflows system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in public preview: There are 4 tables in the system.workflow schema, which allows you to monitor:&lt;/LI&gt;
&lt;LI id="10f3" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;jobs: tracks creation, deletion &amp;amp; basic information of all jobs&lt;/LI&gt;
&lt;LI id="fece" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;job_tasks: tracks creation, deletion &amp;amp; basic information of all job tasks&lt;/LI&gt;
&lt;LI id="d18b" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;jobs_run_timeline: records the start, end and resulting state of job runs&lt;/LI&gt;
&lt;LI id="fe10" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;job_task_run_timeline: records the start, end, and resulting state of job tasks&lt;/LI&gt;
&lt;/UL&gt;
&lt;H1 class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;&amp;nbsp;&lt;/H1&gt;
&lt;H1 id="229a" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;Primary Key and Foreign Key constraints are GA and now enable faster queries&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;Primary keys (PK) and foreign keys (FK) can be defined for Unity Catalog tables for data modeling purposes. You can define it as a constraint during table creation or with modification. Do note that primary and foreign key constraints are currently not enforced. These are mainly used to indicate data integrity relationship, which also gives end users the ability to view the constraints in Unity Catalog via an Entity Relationship Diagram (ERD)&lt;/P&gt;
&lt;FIGURE class="vo vp vq vr vs vt vl vm paragraph-image"&gt;
&lt;DIV class="vu vv fl vw bg vx" tabindex="0" role="button"&gt;
&lt;DIV class="vl vm vz"&gt;&lt;PICTURE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 640w, https://miro.medium.com/v2/resize:fit:720/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 720w, https://miro.medium.com/v2/resize:fit:750/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 750w, https://miro.medium.com/v2/resize:fit:786/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 786w, https://miro.medium.com/v2/resize:fit:828/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 828w, https://miro.medium.com/v2/resize:fit:1100/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 1400w" type="image/webp" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px"&gt;&lt;/SOURCE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/1*QLy9ijAz-t0UIrao9hHtDw.png 640w, https://miro.medium.com/v2/resize:fit:720/1*QLy9ijAz-t0UIrao9hHtDw.png 720w, https://miro.medium.com/v2/resize:fit:750/1*QLy9ijAz-t0UIrao9hHtDw.png 750w, https://miro.medium.com/v2/resize:fit:786/1*QLy9ijAz-t0UIrao9hHtDw.png 786w, https://miro.medium.com/v2/resize:fit:828/1*QLy9ijAz-t0UIrao9hHtDw.png 828w, https://miro.medium.com/v2/resize:fit:1100/1*QLy9ijAz-t0UIrao9hHtDw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/1*QLy9ijAz-t0UIrao9hHtDw.png 1400w" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px" data-testid="og"&gt;&lt;/SOURCE&gt;&lt;/PICTURE&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Beatrice_Liew_1-1724141839537.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/10446iCA2E8A5DE471E9F9/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Beatrice_Liew_1-1724141839537.png" alt="Beatrice_Liew_1-1724141839537.png" /&gt;&lt;/span&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/FIGURE&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;For valid primary keys, using the RELY option allows you to enable optimisations based on constraints as Databricks will factor in data integrity of the primary key declared into query plans to optimize queries&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;One optimization RELY enables is that it can eliminate unnecessary aggregates based on the primary key constraints. For example, if a distinct operation is ran over the table with a primary key using RELY, the unnecessary distinct operation is removed, which speeds up the query by 2x&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;Another optimization from RELY is removing unnecessary joins. If a query joins a table which is only referenced in the join condition, the primary key constraint present would indicate that the join will output one row, which in turn would help the query optimizer identify instances where it can eliminate the join from the query entirely. In the blog example, the optimization sped up the query from 1.5 minutes to 6 seconds!&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;Read the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A class="af ld" href="https://www.databricks.com/blog/primary-key-and-foreign-key-constraints-are-ga-and-now-enable-faster-queries" target="_blank" rel="noopener ugc nofollow"&gt;full blog post&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;here!&lt;/P&gt;</description>
    <pubDate>Tue, 20 Aug 2024 08:18:23 GMT</pubDate>
    <dc:creator>Beatrice_Liew</dc:creator>
    <dc:date>2024-08-20T08:18:23Z</dc:date>
    <item>
      <title>Databricks Data Warehousing Announcements— July 2024</title>
      <link>https://community.databricks.com/t5/announcements/databricks-data-warehousing-announcements-july-2024/m-p/83571#M225</link>
      <description>&lt;H1 id="89cc" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;Predictive Optimisation&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://www.databricks.com/blog/announcing-general-availability-predictive-optimization" target="_blank" rel="noopener ugc nofollow"&gt;Predictive Optimisation is in GA&lt;/A&gt;, which uses AI to understand the maintenance operations required from Unity Catalog (eg: data access patterns) and automatically runs optimisations on your data layouts to improve query performance. This removes manual overhead of scheduling optimisation jobs with considerations around frequency, type of optimisation, tables are automatically managed&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;&amp;nbsp;&lt;/P&gt;
&lt;H1 id="e63c" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;Cost Management Dashboards&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/account-settings/usage.html#import-a-usage-dashboard" target="_blank" rel="noopener ugc nofollow"&gt;This is in Public Preview&lt;/A&gt;. Account admins can now import dashboards to monitor costs at either an account level or on a workspace level. Use the dashboard to view the metrics below, with the option to fully customise the dashboard&lt;/P&gt;
&lt;UL class=""&gt;
&lt;LI id="e745" class="ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage breakdown by SKU name&lt;/LI&gt;
&lt;LI id="642d" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage analysis based on custom tags&lt;/LI&gt;
&lt;LI id="fe8b" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage analysis on the most expensive usage&lt;/LI&gt;
&lt;LI id="1745" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;Usage breakdown by billing origin product&lt;/LI&gt;
&lt;/UL&gt;
&lt;FIGURE class="vo vp vq vr vs vt vl vm paragraph-image"&gt;
&lt;DIV class="vu vv fl vw bg vx" tabindex="0" role="button"&gt;
&lt;DIV class="vl vm vn"&gt;&lt;PICTURE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 640w, https://miro.medium.com/v2/resize:fit:720/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 720w, https://miro.medium.com/v2/resize:fit:750/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 750w, https://miro.medium.com/v2/resize:fit:786/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 786w, https://miro.medium.com/v2/resize:fit:828/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 828w, https://miro.medium.com/v2/resize:fit:1100/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/format:webp/1*oG9yko9HmU-6te02u9U1Aw.png 1400w" type="image/webp" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px"&gt;&lt;/SOURCE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/1*oG9yko9HmU-6te02u9U1Aw.png 640w, https://miro.medium.com/v2/resize:fit:720/1*oG9yko9HmU-6te02u9U1Aw.png 720w, https://miro.medium.com/v2/resize:fit:750/1*oG9yko9HmU-6te02u9U1Aw.png 750w, https://miro.medium.com/v2/resize:fit:786/1*oG9yko9HmU-6te02u9U1Aw.png 786w, https://miro.medium.com/v2/resize:fit:828/1*oG9yko9HmU-6te02u9U1Aw.png 828w, https://miro.medium.com/v2/resize:fit:1100/1*oG9yko9HmU-6te02u9U1Aw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/1*oG9yko9HmU-6te02u9U1Aw.png 1400w" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px" data-testid="og"&gt;&lt;/SOURCE&gt;&lt;/PICTURE&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Beatrice_Liew_0-1724141840687.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/10447i906EEC2ADC87B53E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Beatrice_Liew_0-1724141840687.png" alt="Beatrice_Liew_0-1724141840687.png" /&gt;&lt;/span&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/FIGURE&gt;
&lt;H1 class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;&amp;nbsp;&lt;/H1&gt;
&lt;H1 id="5848" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;System Table updates&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;There are various updates around system tables, which is Databricks storage of operational data for observability:&lt;/P&gt;
&lt;UL class=""&gt;
&lt;LI id="ae73" class="ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/assistant.html" target="_blank" rel="noopener ugc nofollow"&gt;Databricks Assistant system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in public preview: Track the usage of Databricks assistant through system.access.assistant_events table, which will record the workspace, datetime, and the email of the user initiating a message on assistant.&lt;/LI&gt;
&lt;LI id="1405" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/compute.html#node-timeline" target="_blank" rel="noopener ugc nofollow"&gt;Node timeline system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;are in public preview: The node timeline table provides node level utilisation at minute granularity. Monitor metrics such as node type, cpu &amp;amp; memory utilisation, as well as network traffic sent in bytes.&lt;/LI&gt;
&lt;LI id="9a33" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/query-history.html" target="_blank" rel="noopener ugc nofollow"&gt;Query history system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in public preview: The system.query.history table records every SQL statement that has ran via SQL warehouses, where metrics such as the SQL statement, the warehouse id, execution duration, bytes read etc are available.&lt;/LI&gt;
&lt;LI id="8143" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/billing.html" target="_blank" rel="noopener ugc nofollow"&gt;Billing system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;are enabled by default in all Unity catalog workspaces. Billing tables allow you to get an overview of usage by SKU, duration etc&lt;/LI&gt;
&lt;LI id="22d9" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;&lt;A class="af ld" href="https://docs.databricks.com/en/admin/system-tables/jobs.html" target="_blank" rel="noopener ugc nofollow"&gt;Workflows system tables&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in public preview: There are 4 tables in the system.workflow schema, which allows you to monitor:&lt;/LI&gt;
&lt;LI id="10f3" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;jobs: tracks creation, deletion &amp;amp; basic information of all jobs&lt;/LI&gt;
&lt;LI id="fece" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;job_tasks: tracks creation, deletion &amp;amp; basic information of all job tasks&lt;/LI&gt;
&lt;LI id="d18b" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;jobs_run_timeline: records the start, end and resulting state of job runs&lt;/LI&gt;
&lt;LI id="fe10" class="ua ub pd uc b ud vg uf ug uh vh uj uk ul vi un uo up vj ur us ut vk uv uw ux vd ve vf bj" data-selectable-paragraph=""&gt;job_task_run_timeline: records the start, end, and resulting state of job tasks&lt;/LI&gt;
&lt;/UL&gt;
&lt;H1 class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;&amp;nbsp;&lt;/H1&gt;
&lt;H1 id="229a" class="tc td pd be te tf tg th ti tj tk tl tm tn to tp tq tr ts tt tu tv tw tx ty tz bj" data-selectable-paragraph=""&gt;Primary Key and Foreign Key constraints are GA and now enable faster queries&lt;/H1&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud ue uf ug uh ui uj uk ul um un uo up uq ur us ut uu uv uw ux jp bj" data-selectable-paragraph=""&gt;Primary keys (PK) and foreign keys (FK) can be defined for Unity Catalog tables for data modeling purposes. You can define it as a constraint during table creation or with modification. Do note that primary and foreign key constraints are currently not enforced. These are mainly used to indicate data integrity relationship, which also gives end users the ability to view the constraints in Unity Catalog via an Entity Relationship Diagram (ERD)&lt;/P&gt;
&lt;FIGURE class="vo vp vq vr vs vt vl vm paragraph-image"&gt;
&lt;DIV class="vu vv fl vw bg vx" tabindex="0" role="button"&gt;
&lt;DIV class="vl vm vz"&gt;&lt;PICTURE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 640w, https://miro.medium.com/v2/resize:fit:720/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 720w, https://miro.medium.com/v2/resize:fit:750/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 750w, https://miro.medium.com/v2/resize:fit:786/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 786w, https://miro.medium.com/v2/resize:fit:828/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 828w, https://miro.medium.com/v2/resize:fit:1100/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/format:webp/1*QLy9ijAz-t0UIrao9hHtDw.png 1400w" type="image/webp" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px"&gt;&lt;/SOURCE&gt;&lt;SOURCE srcset="https://miro.medium.com/v2/resize:fit:640/1*QLy9ijAz-t0UIrao9hHtDw.png 640w, https://miro.medium.com/v2/resize:fit:720/1*QLy9ijAz-t0UIrao9hHtDw.png 720w, https://miro.medium.com/v2/resize:fit:750/1*QLy9ijAz-t0UIrao9hHtDw.png 750w, https://miro.medium.com/v2/resize:fit:786/1*QLy9ijAz-t0UIrao9hHtDw.png 786w, https://miro.medium.com/v2/resize:fit:828/1*QLy9ijAz-t0UIrao9hHtDw.png 828w, https://miro.medium.com/v2/resize:fit:1100/1*QLy9ijAz-t0UIrao9hHtDw.png 1100w, https://miro.medium.com/v2/resize:fit:1400/1*QLy9ijAz-t0UIrao9hHtDw.png 1400w" sizes="(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px" data-testid="og"&gt;&lt;/SOURCE&gt;&lt;/PICTURE&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Beatrice_Liew_1-1724141839537.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/10446iCA2E8A5DE471E9F9/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Beatrice_Liew_1-1724141839537.png" alt="Beatrice_Liew_1-1724141839537.png" /&gt;&lt;/span&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;/FIGURE&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;For valid primary keys, using the RELY option allows you to enable optimisations based on constraints as Databricks will factor in data integrity of the primary key declared into query plans to optimize queries&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;One optimization RELY enables is that it can eliminate unnecessary aggregates based on the primary key constraints. For example, if a distinct operation is ran over the table with a primary key using RELY, the unnecessary distinct operation is removed, which speeds up the query by 2x&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;Another optimization from RELY is removing unnecessary joins. If a query joins a table which is only referenced in the join condition, the primary key constraint present would indicate that the join will output one row, which in turn would help the query optimizer identify instances where it can eliminate the join from the query entirely. In the blog example, the optimization sped up the query from 1.5 minutes to 6 seconds!&lt;/P&gt;
&lt;P class="pw-post-body-paragraph ua ub pd uc b ud uy uf ug uh uz uj uk ul va un uo up vb ur us ut vc uv uw ux jp bj" data-selectable-paragraph=""&gt;Read the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A class="af ld" href="https://www.databricks.com/blog/primary-key-and-foreign-key-constraints-are-ga-and-now-enable-faster-queries" target="_blank" rel="noopener ugc nofollow"&gt;full blog post&lt;/A&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;here!&lt;/P&gt;</description>
      <pubDate>Tue, 20 Aug 2024 08:18:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/announcements/databricks-data-warehousing-announcements-july-2024/m-p/83571#M225</guid>
      <dc:creator>Beatrice_Liew</dc:creator>
      <dc:date>2024-08-20T08:18:23Z</dc:date>
    </item>
  </channel>
</rss>

