<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: instance_id in compute.node_timelines in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106372#M42463</link>
    <description/>
    <pubDate>Mon, 20 Jan 2025 17:20:05 GMT</pubDate>
    <dc:creator>Alberto_Umana</dc:creator>
    <dc:date>2025-01-20T17:20:05Z</dc:date>
    <item>
      <title>instance_id in compute.node_timelines</title>
      <link>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106367#M42462</link>
      <description>&lt;P&gt;I am trying to fetch the active worker nodes from system tables using a query like the one below:&lt;/P&gt;&lt;P&gt;select count(distinct instance_id)&lt;/P&gt;&lt;P&gt;from system.compute.node_timeline where cluster_id = "xx"&lt;/P&gt;&lt;P&gt;group by instance_id, start_time, end_time&lt;/P&gt;&lt;P&gt;It returns a value like 20, but in my cluster configuration the maximum number of worker nodes is set to 15. How is this possible? Does instance_id refer to the worker nodes?&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jan 2025 16:34:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106367#M42462</guid>
      <dc:creator>pranitha</dc:creator>
      <dc:date>2025-01-20T16:34:05Z</dc:date>
    </item>
    <item>
      <title>Re: instance_id in compute.node_timelines</title>
      <link>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106372#M42463</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/144790"&gt;@pranitha&lt;/a&gt;,&lt;/P&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;The &lt;CODE&gt;instance_id&lt;/CODE&gt; field refers to the specific instance of a node, which can include both driver and worker nodes. In your query, you are counting distinct &lt;CODE&gt;instance_id&lt;/CODE&gt;s, which might include instances that are not currently active or are driver nodes, leading to a higher count than the maximum number of worker nodes set in your cluster configuration.&lt;/P&gt;
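&lt;P class="_1t7bu9h1 paragraph"&gt;As a rough sketch of how to separate these cases, the query below counts worker instances per minute instead of across the cluster's whole history. It assumes the table is &lt;CODE&gt;system.compute.node_timeline&lt;/CODE&gt; with a boolean &lt;CODE&gt;driver&lt;/CODE&gt; column and a &lt;CODE&gt;start_time&lt;/CODE&gt; timestamp, and 'xx' is the placeholder cluster ID from your question; adjust it if your schema differs.&lt;/P&gt;
&lt;PRE&gt;
```sql
-- Sketch only: count distinct worker instances for each one-minute bucket.
-- Assumes system.compute.node_timeline exposes cluster_id, instance_id,
-- driver (boolean) and start_time (timestamp).
SELECT
  date_trunc('MINUTE', start_time) AS minute_start,
  COUNT(DISTINCT instance_id)      AS worker_count
FROM system.compute.node_timeline
WHERE cluster_id = 'xx'   -- placeholder cluster ID
  AND driver = false      -- exclude the driver node
GROUP BY date_trunc('MINUTE', start_time)
ORDER BY minute_start;
```
&lt;/PRE&gt;
&lt;P class="_1t7bu9h1 paragraph"&gt;If the largest &lt;CODE&gt;worker_count&lt;/CODE&gt; stays at or below your configured maximum, the higher total most likely comes from instances being replaced over time.&lt;/P&gt;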
&lt;P class="_1t7bu9h1 paragraph"&gt;&lt;SPAN&gt;Additionally, the &lt;CODE&gt;node_timeline&lt;/CODE&gt; table captures node-level resource utilization data at minute granularity, and each record contains data for a given minute of time per instance. This means that if nodes are frequently added and removed (e.g., due to autoscaling), the count of distinct &lt;CODE&gt;instance_id&lt;/CODE&gt;s over a period of time can exceed the maximum number of worker nodes set for the cluster at any single point in time.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jan 2025 17:20:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106372#M42463</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-01-20T17:20:05Z</dc:date>
    </item>
    <item>
      <title>Re: instance_id in compute.node_timelines</title>
      <link>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106373#M42464</link>
      <description>&lt;P class="p1"&gt;To ensure you are only counting active worker nodes, you can use:&lt;/P&gt;
&lt;P class="p1"&gt;SELECT COUNT(DISTINCT instance_id)&lt;/P&gt;
&lt;P class="p1"&gt;FROM system.compute.node_timeline&lt;/P&gt;
&lt;P class="p1"&gt;WHERE cluster_id = 'xx'&lt;/P&gt;
&lt;P class="p1"&gt;&amp;nbsp;&amp;nbsp;AND driver = false&lt;/P&gt;
&lt;P class="p1"&gt;&amp;nbsp;&amp;nbsp;AND end_time IS NULL&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jan 2025 17:20:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106373#M42464</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-01-20T17:20:47Z</dc:date>
    </item>
    <item>
      <title>Re: instance_id in compute.node_timelines</title>
      <link>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106374#M42465</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/106294"&gt;@Alberto_Umana&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;Thanks for replying.&lt;/P&gt;&lt;P&gt;Even if we add the driver node, the count should be around 16-17, right? Not 20. I checked all the clusters, and for every cluster there is a difference of 5-7 nodes between the max_worker count and count(distinct instance_id) for a specific minute.&lt;/P&gt;</description>
      <pubDate>Mon, 20 Jan 2025 17:30:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/instance-id-in-compute-node-timelines/m-p/106374#M42465</guid>
      <dc:creator>pranitha</dc:creator>
      <dc:date>2025-01-20T17:30:55Z</dc:date>
    </item>
  </channel>
</rss>

