<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Unmanaged Table - Newly added data directories are not reflected in the table in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/unmanaged-table-newly-added-data-directories-are-not-reflected/m-p/22130#M15123</link>
    <description>&lt;P&gt;We created an unmanaged, partitioned table at a DBFS location using SQL.&lt;/P&gt;&lt;P&gt;Example:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;%sql
CREATE TABLE EnterpriseDailyTrafficSummarytest (EnterpriseID String, ServiceLocationID String, ReportDate String) USING parquet PARTITIONED BY (ReportDate) LOCATION "dbfs:/mnt/enterprisecalldatabase/reports/EnterpriseDailyTrafficSummary"&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;After creating the table, we run MSCK REPAIR TABLE via SQL to register the partitions in the Hive metastore:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;%sql
MSCK REPAIR TABLE default.EnterpriseDailyTrafficSummarytest&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;After that command, queries return the data in the table.&lt;/P&gt;&lt;P&gt;When we add new files, they are not reflected in the table, even if we run REFRESH TABLE.&lt;/P&gt;&lt;P&gt;The newly added files show up only after we run MSCK REPAIR TABLE again.&lt;/P&gt;</description>
    <pubDate>Tue, 26 Apr 2022 11:20:29 GMT</pubDate>
    <dc:creator>kjoth</dc:creator>
    <dc:date>2022-04-26T11:20:29Z</dc:date>
    <item>
      <title>Unmanaged Table - Newly added data directories are not reflected in the table</title>
      <link>https://community.databricks.com/t5/data-engineering/unmanaged-table-newly-added-data-directories-are-not-reflected/m-p/22130#M15123</link>
      <description>&lt;P&gt;We created an unmanaged, partitioned table at a DBFS location using SQL.&lt;/P&gt;&lt;P&gt;Example:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;%sql
CREATE TABLE EnterpriseDailyTrafficSummarytest (EnterpriseID String, ServiceLocationID String, ReportDate String) USING parquet PARTITIONED BY (ReportDate) LOCATION "dbfs:/mnt/enterprisecalldatabase/reports/EnterpriseDailyTrafficSummary"&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;After creating the table, we run MSCK REPAIR TABLE via SQL to register the partitions in the Hive metastore:&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;%sql
MSCK REPAIR TABLE default.EnterpriseDailyTrafficSummarytest&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;After that command, queries return the data in the table.&lt;/P&gt;&lt;P&gt;When we add new files, they are not reflected in the table, even if we run REFRESH TABLE.&lt;/P&gt;&lt;P&gt;The newly added files show up only after we run MSCK REPAIR TABLE again.&lt;/P&gt;</description>
      <pubDate>Tue, 26 Apr 2022 11:20:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/unmanaged-table-newly-added-data-directories-are-not-reflected/m-p/22130#M15123</guid>
      <dc:creator>kjoth</dc:creator>
      <dc:date>2022-04-26T11:20:29Z</dc:date>
    </item>
  </channel>
</rss>

