<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: IOStream.flush Timed Out in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/iostream-flush-timed-out/m-p/72813#M34612</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105326"&gt;@dbengineer516&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;From my research, this looks like an IPython cache error. Your Python REPL may be getting throttled by too many requests.&lt;/P&gt;
&lt;P&gt;Please check:&amp;nbsp;&lt;A href="https://github.com/ipython/ipykernel/issues/334" target="_blank"&gt;https://github.com/ipython/ipykernel/issues/334&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;This comment seems to be a possible solution:&amp;nbsp;&lt;A href="https://github.com/ipython/ipykernel/issues/334#issuecomment-1357140493" target="_blank"&gt;https://github.com/ipython/ipykernel/issues/334#issuecomment-1357140493&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;I hope it helps &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
</description>
    <pubDate>Wed, 12 Jun 2024 15:42:19 GMT</pubDate>
    <dc:creator>raphaelblg</dc:creator>
    <dc:date>2024-06-12T15:42:19Z</dc:date>
    <item>
      <title>IOStream.flush Timed Out</title>
      <link>https://community.databricks.com/t5/data-engineering/iostream-flush-timed-out/m-p/72791#M34605</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I'm encountering an issue with a Python script/notebook that I developed and use in a daily job run in Databricks. It worked perfectly fine for months, but now it fails consistently. After digging a little deeper, I found that running the notebook the job is connected to throws an "IOStream.flush timed out" warning, after which the notebook keeps running indefinitely without performing the remaining operations in the script. All my script does is call the Databricks API for Query History to obtain the previous day's data, walk through each page of query history data, create a data frame from the data, and store it in a table. Typically, I'd see this complete in 5-10 minutes even on heavier-volume days, but now it runs for hours on end and won't fail unless we set timeout limits. I'm assuming it has something to do with memory/resources, but I'm not sure how to resolve it.&lt;/P&gt;&lt;P&gt;Thank you!&lt;/P&gt;</description>
      <pubDate>Wed, 12 Jun 2024 14:06:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/iostream-flush-timed-out/m-p/72791#M34605</guid>
      <dc:creator>dbengineer516</dc:creator>
      <dc:date>2024-06-12T14:06:13Z</dc:date>
    </item>
    <item>
      <title>Re: IOStream.flush Timed Out</title>
      <link>https://community.databricks.com/t5/data-engineering/iostream-flush-timed-out/m-p/72813#M34612</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/105326"&gt;@dbengineer516&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;From my research, this looks like an IPython cache error. Your Python REPL may be getting throttled by too many requests.&lt;/P&gt;
&lt;P&gt;Please check:&amp;nbsp;&lt;A href="https://github.com/ipython/ipykernel/issues/334" target="_blank"&gt;https://github.com/ipython/ipykernel/issues/334&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;This comment seems to be a possible solution:&amp;nbsp;&lt;A href="https://github.com/ipython/ipykernel/issues/334#issuecomment-1357140493" target="_blank"&gt;https://github.com/ipython/ipykernel/issues/334#issuecomment-1357140493&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;I hope it helps &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
</description>
      <pubDate>Wed, 12 Jun 2024 15:42:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/iostream-flush-timed-out/m-p/72813#M34612</guid>
      <dc:creator>raphaelblg</dc:creator>
      <dc:date>2024-06-12T15:42:19Z</dc:date>
    </item>
  </channel>
</rss>

