<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: ConnectException: Connection refused (Connection refused) This is often caused by an OOM error in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12543#M7343</link>
    <description>&lt;P&gt;hi @RN mj​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Could you provide more details? how do you read your JSON file? are you using an autoscaling cluster? what is the full error stack-trace?&lt;/P&gt;</description>
    <pubDate>Tue, 26 Oct 2021 20:33:36 GMT</pubDate>
    <dc:creator>jose_gonzalez</dc:creator>
    <dc:date>2021-10-26T20:33:36Z</dc:date>
    <item>
      <title>ConnectException: Connection refused (Connection refused) This is often caused by an OOM error</title>
      <link>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12540#M7340</link>
      <description>&lt;P&gt;I am trying to run a python code where a json file is flattened to pipe separated file . The code works with smaller files but for huge files of 2.4 GB I get below error:&lt;/P&gt;&lt;P&gt;ConnectException: Connection refused (Connection refused)&lt;/P&gt;&lt;P&gt;Error while obtaining a new communication channel&lt;/P&gt;&lt;P&gt;ConnectException error: This is often caused by an OOM error that causes the connection to the Python REPL to be closed. Check your query's memory usage.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Databricks version 9.1 LTS&lt;/P&gt;&lt;P&gt;The  cluster is 5 node Standard_DS4_V2&lt;/P&gt;</description>
      <pubDate>Mon, 25 Oct 2021 12:25:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12540#M7340</guid>
      <dc:creator>Rnmj</dc:creator>
      <dc:date>2021-10-25T12:25:36Z</dc:date>
    </item>
    <item>
      <title>Re: ConnectException: Connection refused (Connection refused) This is often caused by an OOM error</title>
      <link>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12542#M7342</link>
      <description>&lt;P&gt;Can you check this topic?&lt;/P&gt;&lt;P&gt;It might be what you are looking for:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.databricks.com/s/question/0D53f00001Q0Rq9CAF/bufferholder-exceeded-on-json-flattening" alt="https://community.databricks.com/s/question/0D53f00001Q0Rq9CAF/bufferholder-exceeded-on-json-flattening" target="_blank"&gt;https://community.databricks.com/s/question/0D53f00001Q0Rq9CAF/bufferholder-exceeded-on-json-flattening&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 25 Oct 2021 13:40:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12542#M7342</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2021-10-25T13:40:55Z</dc:date>
    </item>
    <item>
      <title>Re: ConnectException: Connection refused (Connection refused) This is often caused by an OOM error</title>
      <link>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12543#M7343</link>
      <description>&lt;P&gt;hi @RN mj​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Could you provide more details? how do you read your JSON file? are you using an autoscaling cluster? what is the full error stack-trace?&lt;/P&gt;</description>
      <pubDate>Tue, 26 Oct 2021 20:33:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12543#M7343</guid>
      <dc:creator>jose_gonzalez</dc:creator>
      <dc:date>2021-10-26T20:33:36Z</dc:date>
    </item>
    <item>
      <title>Re: ConnectException: Connection refused (Connection refused) This is often caused by an OOM error</title>
      <link>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12544#M7344</link>
      <description>&lt;P&gt;Hi @Jose Gonzalez​&amp;nbsp;, @Werner Stinckens​&amp;nbsp; @Kaniz Fatma​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Thanks for your response .Appreciate a lot. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The issue was in the code, it was a python /panda code running on Spark. Due to this only driver node was being used. i did validate this by increasing the driver configuration. The next steps is to revisit the code and use RDD/dataframes so code has some parallel processing &lt;/P&gt;</description>
      <pubDate>Fri, 29 Oct 2021 03:58:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/connectexception-connection-refused-connection-refused-this-is/m-p/12544#M7344</guid>
      <dc:creator>Rnmj</dc:creator>
      <dc:date>2021-10-29T03:58:14Z</dc:date>
    </item>
  </channel>
</rss>

