<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic DLT + Unity Catalogue Issue accessing Dataset not defined in the pipeline in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/dlt-unity-catalogue-issue-accessing-dataset-not-defined-in-the/m-p/36856#M26203</link>
    <description>&lt;P&gt;I have 2 different schemas [silver and gold] under the same Unity Catalog.&lt;/P&gt;&lt;P&gt;We are trying to incrementally ingest data in both silver and gold layers.&lt;/P&gt;&lt;P&gt;The silver tables were created as streaming DLT tables using&amp;nbsp;dlt.create_streaming_table(....) and the apply_merge(....) which is working perfectly&lt;/P&gt;&lt;P&gt;I have a DLT pipeline reading from the silver streaming DLT table and try to ingest to gold using the same concept:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;dlt.create_streaming_table(GOLD_TABLE)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;then&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;apply_merge({UNITY_CATALOG_NAME}.{SCHEMA_SILVER}.{SOURCE_SILVER_TABLE} &amp;gt;&amp;gt;&amp;gt;&amp;gt; {GOLD_TABLE} . . . )&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;and the pipeline is throwing this error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;Failed to read dataset '{UNITY_CATALOG}.{SCHEMA_SILVER}.{TABLE_NAME}'. Dataset is not defined in the pipeline&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;Configuration:&lt;/STRONG&gt;&lt;/U&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;The Catalog was created by specifying an ADL storage for it.&lt;/LI&gt;&lt;LI&gt;Schema was created without specifying any storage for it (because I faced another issue with the DLT pipeline, not allowing to execute pipeline for specified SCHEMA Location)&lt;/LI&gt;&lt;LI&gt;Schema details:&lt;UL&gt;&lt;LI&gt;Catalog Type: MANAGED_CATALOG&lt;/LI&gt;&lt;LI&gt;Securable Type: SCHEMA&lt;/LI&gt;&lt;LI&gt;Securable Kind: SCHEMA_STANDARD&lt;/LI&gt;&lt;LI&gt;Browse Only: false&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;Catalog Details:&lt;UL&gt;&lt;LI&gt;Storage root: abfss://{CONTAINER}@{STORAGE_ACCOUNT}.dfs.core.windows.net/{FOLDER_NAME}&lt;BR /&gt;Storage location: abfss://{CONTAINER}@{STORAGE_ACCOUNT}.dfs.core.windows.net/{FOLDER_NAME}/__unitystorage/catalogs/be07xxxxx-xxxxxxxxxxxx&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;PS: The same was working perfectly before using the Unity Catalog, by simply using the hive_metastore.&lt;/P&gt;</description>
    <pubDate>Mon, 03 Jul 2023 15:28:25 GMT</pubDate>
    <dc:creator>Fz1</dc:creator>
    <dc:date>2023-07-03T15:28:25Z</dc:date>
    <item>
      <title>DLT + Unity Catalogue Issue accessing Dataset not defined in the pipeline</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-unity-catalogue-issue-accessing-dataset-not-defined-in-the/m-p/36856#M26203</link>
      <description>&lt;P&gt;I have 2 different schemas [silver and gold] under the same Unity Catalog.&lt;/P&gt;&lt;P&gt;We are trying to incrementally ingest data in both silver and gold layers.&lt;/P&gt;&lt;P&gt;The silver tables were created as streaming DLT tables using&amp;nbsp;dlt.create_streaming_table(....) and the apply_merge(....) which is working perfectly&lt;/P&gt;&lt;P&gt;I have a DLT pipeline reading from the silver streaming DLT table and try to ingest to gold using the same concept:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;dlt.create_streaming_table(GOLD_TABLE)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;then&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;apply_merge({UNITY_CATALOG_NAME}.{SCHEMA_SILVER}.{SOURCE_SILVER_TABLE} &amp;gt;&amp;gt;&amp;gt;&amp;gt; {GOLD_TABLE} . . . )&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;and the pipeline is throwing this error:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;Failed to read dataset '{UNITY_CATALOG}.{SCHEMA_SILVER}.{TABLE_NAME}'. Dataset is not defined in the pipeline&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;U&gt;&lt;STRONG&gt;Configuration:&lt;/STRONG&gt;&lt;/U&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;The Catalog was created by specifying an ADL storage for it.&lt;/LI&gt;&lt;LI&gt;Schema was created without specifying any storage for it (because I faced another issue with the DLT pipeline, not allowing to execute pipeline for specified SCHEMA Location)&lt;/LI&gt;&lt;LI&gt;Schema details:&lt;UL&gt;&lt;LI&gt;Catalog Type: MANAGED_CATALOG&lt;/LI&gt;&lt;LI&gt;Securable Type: SCHEMA&lt;/LI&gt;&lt;LI&gt;Securable Kind: SCHEMA_STANDARD&lt;/LI&gt;&lt;LI&gt;Browse Only: false&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;Catalog Details:&lt;UL&gt;&lt;LI&gt;Storage root: abfss://{CONTAINER}@{STORAGE_ACCOUNT}.dfs.core.windows.net/{FOLDER_NAME}&lt;BR /&gt;Storage location: abfss://{CONTAINER}@{STORAGE_ACCOUNT}.dfs.core.windows.net/{FOLDER_NAME}/__unitystorage/catalogs/be07xxxxx-xxxxxxxxxxxx&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;PS: The same was working perfectly before using the Unity Catalog, by simply using the hive_metastore.&lt;/P&gt;</description>
      <pubDate>Mon, 03 Jul 2023 15:28:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-unity-catalogue-issue-accessing-dataset-not-defined-in-the/m-p/36856#M26203</guid>
      <dc:creator>Fz1</dc:creator>
      <dc:date>2023-07-03T15:28:25Z</dc:date>
    </item>
  </channel>
</rss>

