<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Cannot reserve additional contiguous bytes in the vectorized reader (requested xxxxxxxxx bytes). in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/cannot-reserve-additional-contiguous-bytes-in-the-vectorized/m-p/13774#M8391</link>
    <description>&lt;P&gt;I got the below error when running a streaming workload from a source Delta table&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;Caused by: java.lang.RuntimeException: Cannot reserve additional contiguous bytes in the vectorized reader (requested xxxxxxxxx bytes). As a workaround, you can reduce the vectorized reader batch size, or disable the vectorized reader, or disable spark.sql.sources.bucketing.enabled if you read from bucket table. For Parquet file format, refer to spark.sql.parquet.columnarReaderBatchSize (default 4096) and spark.sql.parquet.enableVectorizedReader; for ORC file format, refer to spark.sql.orc.columnarReaderBatchSize (default 4096) and spark.sql.orc.enableVectorizedReader&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;could you please let us know how to mitigate the issue?&lt;/P&gt;</description>
    <pubDate>Mon, 11 Oct 2021 16:46:16 GMT</pubDate>
    <dc:creator>shan_chandra</dc:creator>
    <dc:date>2021-10-11T16:46:16Z</dc:date>
    <item>
      <title>Cannot reserve additional contiguous bytes in the vectorized reader (requested xxxxxxxxx bytes).</title>
      <link>https://community.databricks.com/t5/data-engineering/cannot-reserve-additional-contiguous-bytes-in-the-vectorized/m-p/13774#M8391</link>
      <description>&lt;P&gt;I got the below error when running a streaming workload from a source Delta table&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;Caused by: java.lang.RuntimeException: Cannot reserve additional contiguous bytes in the vectorized reader (requested xxxxxxxxx bytes). As a workaround, you can reduce the vectorized reader batch size, or disable the vectorized reader, or disable spark.sql.sources.bucketing.enabled if you read from bucket table. For Parquet file format, refer to spark.sql.parquet.columnarReaderBatchSize (default 4096) and spark.sql.parquet.enableVectorizedReader; for ORC file format, refer to spark.sql.orc.columnarReaderBatchSize (default 4096) and spark.sql.orc.enableVectorizedReader&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;could you please let us know how to mitigate the issue?&lt;/P&gt;</description>
      <pubDate>Mon, 11 Oct 2021 16:46:16 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cannot-reserve-additional-contiguous-bytes-in-the-vectorized/m-p/13774#M8391</guid>
      <dc:creator>shan_chandra</dc:creator>
      <dc:date>2021-10-11T16:46:16Z</dc:date>
    </item>
    <item>
      <title>Re: Cannot reserve additional contiguous bytes in the vectorized reader (requested xxxxxxxxx bytes).</title>
      <link>https://community.databricks.com/t5/data-engineering/cannot-reserve-additional-contiguous-bytes-in-the-vectorized/m-p/13775#M8392</link>
      <description>&lt;P&gt;This is happening because the delta/parquet source has one or more of the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;a huge number of columns&lt;/LI&gt;&lt;LI&gt;huge strings in one or more columns&lt;/LI&gt;&lt;LI&gt;huge arrays/map, possibly nested in each other&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In order to mitigate this issue, could you please reduce &lt;B&gt;spark.sql.parquet.columnarReaderBatchSize&lt;/B&gt;&amp;nbsp;from default value - 4096 ?&lt;/P&gt;</description>
      <pubDate>Mon, 11 Oct 2021 16:49:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/cannot-reserve-additional-contiguous-bytes-in-the-vectorized/m-p/13775#M8392</guid>
      <dc:creator>shan_chandra</dc:creator>
      <dc:date>2021-10-11T16:49:57Z</dc:date>
    </item>
  </channel>
</rss>

