<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: system.access.table_lineage table missing data in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75664#M35019</link>
    <description>&lt;P&gt;Does all of your ETL reference tables by their full name (i.e. catalog.schema.table)? If you query Delta files directly, for example, lineage metadata will not be captured.&lt;/P&gt;</description>
    <pubDate>Tue, 25 Jun 2024 07:22:06 GMT</pubDate>
    <dc:creator>jacovangelder</dc:creator>
    <dc:date>2024-06-25T07:22:06Z</dc:date>
    <item>
      <title>system.access.table_lineage table missing data</title>
      <link>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75365#M34950</link>
      <description>&lt;P&gt;I am using the&amp;nbsp;system.access.table_lineage table to find out which tables are accessed by SQL queries, along with the corresponding queries. However, I often find data or values missing from this table.&lt;/P&gt;&lt;P&gt;For example, for SQL queries executed by our DBT jobs, system.access.table_lineage has an entry, but the entity_run_id (which should be the query ID in this case) is NULL, even though the Query History API and the UI both show the corresponding queries. Why is the entity_run_id not populated in this case?&lt;/P&gt;&lt;P&gt;I am also seeing entries for some reads missing entirely. Our DBT jobs read from a few tables once every hour, but system.access.table_lineage often has only 20-22 entries per day for those tables instead of 24, even though the Query History API and the UI show all 24 corresponding queries.&lt;/P&gt;&lt;P&gt;This looks like a bug to me. Can someone help explain why this would happen?&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jun 2024 17:16:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75365#M34950</guid>
      <dc:creator>aranjan99</dc:creator>
      <dc:date>2024-06-21T17:16:44Z</dc:date>
    </item>
    <item>
      <title>Re: system.access.table_lineage table missing data</title>
      <link>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75647#M35013</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;I have access to the&amp;nbsp;system.access.table_lineage&amp;nbsp;table, and I can see some data in it. My question is specifically about incorrect and missing data in this table.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jun 2024 23:44:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75647#M35013</guid>
      <dc:creator>aranjan99</dc:creator>
      <dc:date>2024-06-24T23:44:23Z</dc:date>
    </item>
    <item>
      <title>Re: system.access.table_lineage table missing data</title>
      <link>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75664#M35019</link>
      <description>&lt;P&gt;Does all of your ETL reference tables by their full name (i.e. catalog.schema.table)? If you query Delta files directly, for example, lineage metadata will not be captured.&lt;/P&gt;</description>
      <pubDate>Tue, 25 Jun 2024 07:22:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75664#M35019</guid>
      <dc:creator>jacovangelder</dc:creator>
      <dc:date>2024-06-25T07:22:06Z</dc:date>
    </item>
    <item>
      <title>Re: system.access.table_lineage table missing data</title>
      <link>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75976#M35122</link>
      <description>&lt;P&gt;Yes, it references the&amp;nbsp;&lt;SPAN&gt;full table name,&amp;nbsp;and these are all SQL tables, not&amp;nbsp;queries against Delta files.&amp;nbsp;&lt;BR /&gt;If I run the exact same query via a Databricks job, the entity_run_id values are populated. But if I run it via DBT jobs, the&amp;nbsp;entity_run_id values are always NULL.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jun 2024 22:04:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/75976#M35122</guid>
      <dc:creator>aranjan99</dc:creator>
      <dc:date>2024-06-27T22:04:10Z</dc:date>
    </item>
    <item>
      <title>Re: system.access.table_lineage table missing data</title>
      <link>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/100775#M40421</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105621"&gt;@aranjan99&lt;/a&gt;&amp;nbsp;did you ever get an answer or reach a conclusion about the limitations of Unity Catalog with regard to tracking access via SQL?&lt;/P&gt;</description>
      <pubDate>Tue, 03 Dec 2024 15:03:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/system-access-table-lineage-table-missing-data/m-p/100775#M40421</guid>
      <dc:creator>goldenmountain</dc:creator>
      <dc:date>2024-12-03T15:03:57Z</dc:date>
    </item>
  </channel>
</rss>

