<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Massive increase in the number of &quot;GetBlobProperties&quot; operations in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/massive-increase-in-the-number-of-quot-getblobproperties-quot/m-p/122599#M46822</link>
    <description>&lt;P&gt;A massive increase in "GetBlobProperties" operations in your Azure Blob Storage account could be due to any of the following:&lt;/P&gt;&lt;P&gt;1. Delta Tables and _delta_log Metadata Access&lt;BR /&gt;If you're using Delta Lake, Databricks reads blob properties (e.g., last-modified time, size) from the _delta_log directory to check for updates. A new notebook, workflow, or user action that frequently reads or lists a Delta table can lead to repeated GetBlobProperties operations on the JSON commit files and checkpoints in that directory.&lt;/P&gt;&lt;P&gt;Check whether:&lt;BR /&gt;New notebooks or dashboards recently started querying Delta tables more often.&lt;BR /&gt;Auto-refresh features in notebooks or dashboards are enabled.&lt;BR /&gt;VACUUM, DESCRIBE HISTORY, or schema evolution commands are being run more frequently.&lt;/P&gt;&lt;P&gt;2. Databricks File System (DBFS) Listing&lt;BR /&gt;Any dbutils.fs.ls() or similar calls made recursively over folders in DBFS backed by Azure Blob Storage can trigger many GetBlobProperties calls. Such calls are common in diagnostics, logging, or monitoring code.&lt;/P&gt;&lt;P&gt;3. Unity Catalog or External Table Audits&lt;BR /&gt;If you're using Unity Catalog or have external Hive Metastore tables that point to blob-backed data, background jobs or access control audits can trigger metadata reads.&lt;/P&gt;&lt;P&gt;Widgets or jobs that query tables repeatedly, especially in a multi-user workspace, can inadvertently lead to a flood of GetBlobProperties calls.&lt;BR /&gt;&lt;BR /&gt;I hope this gives you some clarity!&lt;/P&gt;</description>
    <pubDate>Tue, 24 Jun 2025 02:48:36 GMT</pubDate>
    <dc:creator>Vidhi_Khaitan</dc:creator>
    <dc:date>2025-06-24T02:48:36Z</dc:date>
    <item>
      <title>Massive increase in the number of "GetBlobProperties" operations</title>
      <link>https://community.databricks.com/t5/data-engineering/massive-increase-in-the-number-of-quot-getblobproperties-quot/m-p/113633#M44585</link>
      <description>&lt;P&gt;I had a massive increase in the volume of "GetBlobProperties" operations in my Azure Blob Storage account. The storage logs indicate that all the extra operations come from IPs attributed to my Databricks resource group. I haven't made any changes to my resource configurations, so I'm confused about what could have led to this increase. I appreciate any input on this issue.&lt;/P&gt;</description>
      <pubDate>Wed, 26 Mar 2025 03:22:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/massive-increase-in-the-number-of-quot-getblobproperties-quot/m-p/113633#M44585</guid>
      <dc:creator>syazwansuhaimi</dc:creator>
      <dc:date>2025-03-26T03:22:35Z</dc:date>
    </item>
    <item>
      <title>Re: Massive increase in the number of "GetBlobProperties" operations</title>
      <link>https://community.databricks.com/t5/data-engineering/massive-increase-in-the-number-of-quot-getblobproperties-quot/m-p/122599#M46822</link>
      <description>&lt;P&gt;A massive increase in "GetBlobProperties" operations in your Azure Blob Storage account could be due to any of the following:&lt;/P&gt;&lt;P&gt;1. Delta Tables and _delta_log Metadata Access&lt;BR /&gt;If you're using Delta Lake, Databricks reads blob properties (e.g., last-modified time, size) from the _delta_log directory to check for updates. A new notebook, workflow, or user action that frequently reads or lists a Delta table can lead to repeated GetBlobProperties operations on the JSON commit files and checkpoints in that directory.&lt;/P&gt;&lt;P&gt;Check whether:&lt;BR /&gt;New notebooks or dashboards recently started querying Delta tables more often.&lt;BR /&gt;Auto-refresh features in notebooks or dashboards are enabled.&lt;BR /&gt;VACUUM, DESCRIBE HISTORY, or schema evolution commands are being run more frequently.&lt;/P&gt;&lt;P&gt;2. Databricks File System (DBFS) Listing&lt;BR /&gt;Any dbutils.fs.ls() or similar calls made recursively over folders in DBFS backed by Azure Blob Storage can trigger many GetBlobProperties calls. Such calls are common in diagnostics, logging, or monitoring code.&lt;/P&gt;&lt;P&gt;3. Unity Catalog or External Table Audits&lt;BR /&gt;If you're using Unity Catalog or have external Hive Metastore tables that point to blob-backed data, background jobs or access control audits can trigger metadata reads.&lt;/P&gt;&lt;P&gt;Widgets or jobs that query tables repeatedly, especially in a multi-user workspace, can inadvertently lead to a flood of GetBlobProperties calls.&lt;BR /&gt;&lt;BR /&gt;I hope this gives you some clarity!&lt;/P&gt;</description>
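To narrow down which of the causes in the reply above applies, one option is to group the storage log entries by caller and by path. The following is only a rough sketch: it assumes the log records have already been exported as a list of dictionaries, and the field names OperationName, CallerIpAddress, and ObjectKey are assumptions about that export rather than something stated in this thread. The helper names and the sample records are hypothetical.

```python
from collections import Counter


def classify_path(object_key):
    """Bucket a blob path by the likely cause from the reply above."""
    if "/_delta_log/" in object_key:
        return "delta_log"  # cause 1: Delta commit files and checkpoints
    return "other"  # recursive listings, external tables, anything else


def summarize_get_blob_properties(records):
    """Count GetBlobProperties calls per (caller IP, path bucket)."""
    counts = Counter()
    for rec in records:
        # Ignore every other operation type (GetBlob, PutBlob, ...).
        if rec.get("OperationName") != "GetBlobProperties":
            continue
        caller = rec.get("CallerIpAddress", "unknown")
        bucket = classify_path(rec.get("ObjectKey", ""))
        counts[(caller, bucket)] += 1
    return counts


# Hypothetical sample records; the field names are assumed, not confirmed.
sample_records = [
    {"OperationName": "GetBlobProperties", "CallerIpAddress": "10.0.0.4",
     "ObjectKey": "/acct/container/sales/_delta_log/00000000000000000010.json"},
    {"OperationName": "GetBlobProperties", "CallerIpAddress": "10.0.0.4",
     "ObjectKey": "/acct/container/sales/_delta_log/_last_checkpoint"},
    {"OperationName": "GetBlobProperties", "CallerIpAddress": "10.0.0.5",
     "ObjectKey": "/acct/container/raw/2025/file.parquet"},
    {"OperationName": "GetBlob", "CallerIpAddress": "10.0.0.5",
     "ObjectKey": "/acct/container/raw/2025/file.parquet"},
]

summary = summarize_get_blob_properties(sample_records)
for (caller, bucket), n in summary.most_common():
    print(caller, bucket, n)
```

If most of the calls land in the delta_log bucket, the first cause (frequent Delta table reads or refreshes) is the likelier suspect; a spread across many unrelated paths points more toward recursive listings or background metadata reads.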
      <pubDate>Tue, 24 Jun 2025 02:48:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/massive-increase-in-the-number-of-quot-getblobproperties-quot/m-p/122599#M46822</guid>
      <dc:creator>Vidhi_Khaitan</dc:creator>
      <dc:date>2025-06-24T02:48:36Z</dc:date>
    </item>
  </channel>
</rss>

