<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: COPY INTO size limit in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/copy-into-size-limit/m-p/98377#M39710</link>
    <description>&lt;P&gt;Thanks for replying, i'm running the command on sql warehouse.&lt;/P&gt;</description>
    <pubDate>Mon, 11 Nov 2024 22:12:29 GMT</pubDate>
    <dc:creator>DBUser2</dc:creator>
    <dc:date>2024-11-11T22:12:29Z</dc:date>
    <item>
      <title>COPY INTO size limit</title>
      <link>https://community.databricks.com/t5/data-engineering/copy-into-size-limit/m-p/98368#M39707</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;I'm using the COPY INTO command to ingest data into a delta table in my Azure Databricks instance. Sometime I get a timeout error running this command. Is there a limit on the size of the data that can be ingested using "COPY INTO" or limit on the number of files that can be ingested at a time?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 11 Nov 2024 19:21:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/copy-into-size-limit/m-p/98368#M39707</guid>
      <dc:creator>DBUser2</dc:creator>
      <dc:date>2024-11-11T19:21:57Z</dc:date>
    </item>
    <item>
      <title>Re: COPY INTO size limit</title>
      <link>https://community.databricks.com/t5/data-engineering/copy-into-size-limit/m-p/98371#M39708</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/88514"&gt;@DBUser2&lt;/a&gt;,&lt;/P&gt;
&lt;P class="p1"&gt;The COPY INTO command does not have a specific documented limit on the size of the data or the number of files that can be ingested at a time. Timeout errors can occur due to network issues, resource limitations, or long-running operations. Are you running the commands on a warehouse or over notebook?&lt;/P&gt;
&lt;P class="p1"&gt;Looking at cluster/warehouse metrics would be a good way to start investigating.&lt;/P&gt;
&lt;P class="p1"&gt;Also, do you have the statementID is executed via warehouse?&lt;/P&gt;</description>
      <pubDate>Mon, 11 Nov 2024 20:16:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/copy-into-size-limit/m-p/98371#M39708</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2024-11-11T20:16:21Z</dc:date>
    </item>
    <item>
      <title>Re: COPY INTO size limit</title>
      <link>https://community.databricks.com/t5/data-engineering/copy-into-size-limit/m-p/98377#M39710</link>
      <description>&lt;P&gt;Thanks for replying, i'm running the command on sql warehouse.&lt;/P&gt;</description>
      <pubDate>Mon, 11 Nov 2024 22:12:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/copy-into-size-limit/m-p/98377#M39710</guid>
      <dc:creator>DBUser2</dc:creator>
      <dc:date>2024-11-11T22:12:29Z</dc:date>
    </item>
  </channel>
</rss>

