<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic DLT SQL schema definition in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75770#M35056</link>
    <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;While defining a schema in creating a table using Autoloader and DLT using SQL, I am getting schema mismatch error between the defined schema and inferred schema.&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;CREATE&lt;/SPAN&gt; &lt;SPAN&gt;OR&lt;/SPAN&gt;&lt;SPAN&gt; REFRESH STREAMING &lt;/SPAN&gt;&lt;SPAN&gt;TABLE&lt;/SPAN&gt;&lt;SPAN&gt; csv_test&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;(a0 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a1 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a2 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a3 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a4 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a5 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a6 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a7 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a8 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a9 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,rescue_data STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;AS&lt;/SPAN&gt; &lt;SPAN&gt;SELECT&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;*&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;FROM&lt;/SPAN&gt;&lt;SPAN&gt; cloud_files(&lt;/SPAN&gt;&lt;SPAN&gt;"s3://Bucket/test_data/"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"csv"&lt;/SPAN&gt;&lt;SPAN&gt;, map(&lt;/SPAN&gt;&lt;SPAN&gt;"delimiter"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"|"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"header"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"false"&lt;/SPAN&gt;&lt;SPAN&gt;))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;Is there a known limitation or am I missing something here?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Sudheer_DB_0-1719375711422.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/8945iF37EB4CAF8B9DDBB/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="Sudheer_DB_0-1719375711422.png" alt="Sudheer_DB_0-1719375711422.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 26 Jun 2024 04:28:39 GMT</pubDate>
    <dc:creator>Sudheer_DB</dc:creator>
    <dc:date>2024-06-26T04:28:39Z</dc:date>
    <item>
      <title>DLT SQL schema definition</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75770#M35056</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;&lt;P&gt;While defining a schema in creating a table using Autoloader and DLT using SQL, I am getting schema mismatch error between the defined schema and inferred schema.&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;CREATE&lt;/SPAN&gt; &lt;SPAN&gt;OR&lt;/SPAN&gt;&lt;SPAN&gt; REFRESH STREAMING &lt;/SPAN&gt;&lt;SPAN&gt;TABLE&lt;/SPAN&gt;&lt;SPAN&gt; csv_test&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;(a0 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a1 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a2 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a3 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a4 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a5 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a6 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a7 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a8 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,a9 STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;,rescue_data STRING&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;AS&lt;/SPAN&gt; &lt;SPAN&gt;SELECT&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;*&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;FROM&lt;/SPAN&gt;&lt;SPAN&gt; cloud_files(&lt;/SPAN&gt;&lt;SPAN&gt;"s3://Bucket/test_data/"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"csv"&lt;/SPAN&gt;&lt;SPAN&gt;, map(&lt;/SPAN&gt;&lt;SPAN&gt;"delimiter"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"|"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"header"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;"false"&lt;/SPAN&gt;&lt;SPAN&gt;))&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;Is there a known limitation or am I missing something here?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Sudheer_DB_0-1719375711422.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/8945iF37EB4CAF8B9DDBB/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="Sudheer_DB_0-1719375711422.png" alt="Sudheer_DB_0-1719375711422.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 26 Jun 2024 04:28:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75770#M35056</guid>
      <dc:creator>Sudheer_DB</dc:creator>
      <dc:date>2024-06-26T04:28:39Z</dc:date>
    </item>
    <item>
      <title>Re: DLT SQL schema definition</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75773#M35057</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/104792"&gt;@Sudheer_DB&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;In your schema there's a column named&amp;nbsp;&lt;EM&gt;&lt;SPAN&gt;rescue_data&lt;/SPAN&gt;&lt;/EM&gt;, while the default autoloader column name for faulty data is &lt;EM&gt;&lt;STRONG&gt;&lt;FONT color="#FF0000"&gt;_&lt;/FONT&gt;&lt;/STRONG&gt;rescue&lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;d&lt;/STRONG&gt;&lt;/FONT&gt;_data&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 26 Jun 2024 06:24:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75773#M35057</guid>
      <dc:creator>daniel_sahal</dc:creator>
      <dc:date>2024-06-26T06:24:21Z</dc:date>
    </item>
    <item>
      <title>Re: DLT SQL schema definition</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75887#M35087</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/79106"&gt;@daniel_sahal&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Thank you for your response. The whole idea is to define my own column names. Shouldn't I rename the rescued_data column?&lt;/P&gt;</description>
      <pubDate>Wed, 26 Jun 2024 19:49:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75887#M35087</guid>
      <dc:creator>Sudheer_DB</dc:creator>
      <dc:date>2024-06-26T19:49:08Z</dc:date>
    </item>
    <item>
      <title>Re: DLT SQL schema definition</title>
      <link>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75919#M35098</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/104792"&gt;@Sudheer_DB&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;You can specify your own _rescued_data column name by setting up&amp;nbsp;&lt;EM&gt;rescuedDataColumn&amp;nbsp;&lt;/EM&gt;option.&lt;BR /&gt;&lt;A href="https://docs.databricks.com/en/ingestion/auto-loader/schema.html#what-is-the-rescued-data-column" target="_blank"&gt;https://docs.databricks.com/en/ingestion/auto-loader/schema.html#what-is-the-rescued-data-column&lt;/A&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jun 2024 06:26:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/dlt-sql-schema-definition/m-p/75919#M35098</guid>
      <dc:creator>daniel_sahal</dc:creator>
      <dc:date>2024-06-27T06:26:20Z</dc:date>
    </item>
  </channel>
</rss>

