<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Creating an external table reference vs creating a view in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/creating-an-external-table-reference-vs-creating-a-view/m-p/9827#M5126</link>
    <description>&lt;P&gt;@Nicholas Mead​&amp;nbsp;:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;External tables are metadata definitions that map to files in a data lake, and the files are read when queries are executed against the table. This means that external tables can benefit from partitioning, indexing, and caching to improve query performance. External tables can also be used in joins, and they can be queried using SQL and Spark APIs.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Views, on the other hand, are virtual tables that are defined by a SQL query. When a view is queried, the SQL query is executed against the data lake files. Views can be useful for simplifying complex queries or hiding details about the underlying data. However, they do not support partitioning, indexing, or caching, and they cannot be used in joins.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In terms of performance, external tables are generally faster than views because they can take advantage of partitioning and caching.Overall, creating an external table is usually the better option if you want to optimize query performance and take advantage of advanced features like partitioning and caching. However, if you just need a simple view to simplify a complex query, then creating a view can be a good option&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Wed, 08 Mar 2023 07:02:44 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2023-03-08T07:02:44Z</dc:date>
    <item>
      <title>Creating an external table reference vs creating a view</title>
      <link>https://community.databricks.com/t5/data-engineering/creating-an-external-table-reference-vs-creating-a-view/m-p/9826#M5125</link>
      <description>&lt;P&gt;In a practical sense, what is the difference between creating an external table;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;create table my_catalog.my_schema.my_favourite_table &lt;/P&gt;&lt;P&gt;location 'abfss://path/to/my/data&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;versus creating a view that references the same dataset;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;create view my_catalog.my_schema.my_favourite_table as&lt;/P&gt;&lt;P&gt;select *&lt;/P&gt;&lt;P&gt;from parquet.'abfss://path/to/my/data&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Neither option is materializing the data itself.  Both are just pointers to a location in the datalake.  Are there differences under the hood from databricks perspective that might impact on performance?&lt;/P&gt;</description>
      <pubDate>Tue, 07 Feb 2023 23:00:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/creating-an-external-table-reference-vs-creating-a-view/m-p/9826#M5125</guid>
      <dc:creator>najmead</dc:creator>
      <dc:date>2023-02-07T23:00:27Z</dc:date>
    </item>
    <item>
      <title>Re: Creating an external table reference vs creating a view</title>
      <link>https://community.databricks.com/t5/data-engineering/creating-an-external-table-reference-vs-creating-a-view/m-p/9827#M5126</link>
      <description>&lt;P&gt;@Nicholas Mead​&amp;nbsp;:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;External tables are metadata definitions that map to files in a data lake, and the files are read when queries are executed against the table. This means that external tables can benefit from partitioning, indexing, and caching to improve query performance. External tables can also be used in joins, and they can be queried using SQL and Spark APIs.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Views, on the other hand, are virtual tables that are defined by a SQL query. When a view is queried, the SQL query is executed against the data lake files. Views can be useful for simplifying complex queries or hiding details about the underlying data. However, they do not support partitioning, indexing, or caching, and they cannot be used in joins.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In terms of performance, external tables are generally faster than views because they can take advantage of partitioning and caching.Overall, creating an external table is usually the better option if you want to optimize query performance and take advantage of advanced features like partitioning and caching. However, if you just need a simple view to simplify a complex query, then creating a view can be a good option&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 08 Mar 2023 07:02:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/creating-an-external-table-reference-vs-creating-a-view/m-p/9827#M5126</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-03-08T07:02:44Z</dc:date>
    </item>
    <item>
      <title>Re: Creating an external table reference vs creating a view</title>
      <link>https://community.databricks.com/t5/data-engineering/creating-an-external-table-reference-vs-creating-a-view/m-p/9828#M5127</link>
      <description>&lt;P&gt;Hi @Nicholas Mead​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you for your question! To assist you better, please take a moment to review the answer and let me know if it best fits your needs.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Please help us select the best solution by clicking on "Select As Best" if it does.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Your feedback will help us ensure that we are providing the best possible service to you. Thank you!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 09 Apr 2023 06:59:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/creating-an-external-table-reference-vs-creating-a-view/m-p/9828#M5127</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2023-04-09T06:59:05Z</dc:date>
    </item>
  </channel>
</rss>

