<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Data Engineering - CTAS - External Tables - Limitations of CTAS for external tables - can or cannot use options and location in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16158#M10370</link>
    <description>&lt;P&gt;Glad I could help&lt;/P&gt;</description>
    <pubDate>Mon, 19 Dec 2022 00:11:56 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2022-12-19T00:11:56Z</dc:date>
    <item>
      <title>Data Engineering - CTAS - External Tables - Limitations of CTAS for external tables - can or cannot use options and location</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16155#M10367</link>
      <description>&lt;P&gt;Data Engineering - CTAS - External Tables&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Can someone help me understand why, in chapter 3.3, we cannot directly use CTAS with OPTIONS and LOCATION to specify the delimiter and location of the CSV?&lt;/LI&gt;&lt;LI&gt;Or have I misunderstood?&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Details:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;In &lt;A href="https://customer-academy.databricks.com/learn/course/1266/play/7860/de-33-set-up-delta-tables" alt="https://customer-academy.databricks.com/learn/course/1266/play/7860/de-33-set-up-delta-tables" target="_blank"&gt;Data Engineering with Databricks V3&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;Video DE 2.2 - Providing options&lt;UL&gt;&lt;LI&gt;We create an external table with USING CSV, passing the delimiter through OPTIONS and the path through LOCATION&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;PRE&gt;&lt;CODE&gt;CREATE TABLE IF NOT EXISTS sales_csv
  (order_id LONG, email STRING, transactions_timestamp LONG, total_item_quantity INTEGER, purchase_revenue_in_usd DOUBLE, unique_items INTEGER, items STRING)
USING CSV
OPTIONS (
  header = "true",
  delimiter = "|"
)
LOCATION "${DA.paths.sales_csv}"&lt;/CODE&gt;&lt;/PRE&gt;&lt;UL&gt;&lt;LI&gt;Video DE 3.3 - Set up Delta Tables&lt;UL&gt;&lt;LI&gt;The video says that CTAS does not support additional file OPTIONS. The example shows that we cannot define a pipe delimiter, so the data is not ingested correctly.&lt;/LI&gt;&lt;LI&gt;But this example seems to contradict what was shown in chapter 2.2.&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;PRE&gt;&lt;CODE&gt;CREATE OR REPLACE TABLE sales_unparsed AS
SELECT * FROM csv.`${da.paths.datasets}/ecommerce/raw/sales-csv`;&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;Please help me understand.&lt;/P&gt;</description>
      <pubDate>Sun, 18 Dec 2022 01:17:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16155#M10367</guid>
      <dc:creator>Kopal</dc:creator>
      <dc:date>2022-12-18T01:17:10Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering - CTAS - External Tables - Limitations of CTAS for external tables - can or cannot use options and location</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16156#M10368</link>
      <description>&lt;P&gt;The second statement, the CTAS, cannot be told how to parse the CSV (for example, the header row or the pipe delimiter) because its FROM clause simply points to the files. It is closer to a traditional SQL statement with SELECT and FROM, and it creates a Delta table. That is just how the syntax works: the AS SELECT form does not accept OPTIONS.&lt;/P&gt;</description>
      <pubDate>Sun, 18 Dec 2022 20:15:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16156#M10368</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-12-18T20:15:00Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering - CTAS - External Tables - Limitations of CTAS for external tables - can or cannot use options and location</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16157#M10369</link>
      <description>&lt;P&gt;Ahh, the devil was in the details. The statements differ: &lt;B&gt;&lt;I&gt;USING CSV&lt;/I&gt;&lt;/B&gt; -&amp;gt; &lt;B&gt;external table&lt;/B&gt;, and &lt;B&gt;&lt;I&gt;CTAS + SELECT FROM&lt;/I&gt;&lt;/B&gt; -&amp;gt; &lt;B&gt;managed table&lt;/B&gt;. I hope I got it right.&lt;/P&gt;&lt;P&gt;Thanks a lot for the explanation. Cheers!&lt;/P&gt;</description>
      <pubDate>Sun, 18 Dec 2022 20:53:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16157#M10369</guid>
      <dc:creator>Kopal</dc:creator>
      <dc:date>2022-12-18T20:53:32Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering - CTAS - External Tables - Limitations of CTAS for external tables - can or cannot use options and location</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16158#M10370</link>
      <description>&lt;P&gt;Glad I could help&lt;/P&gt;</description>
      <pubDate>Mon, 19 Dec 2022 00:11:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-ctas-external-tables-limitations-of-ctas-for/m-p/16158#M10370</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-12-19T00:11:56Z</dc:date>
    </item>
  </channel>
</rss>

