<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: is command stuck? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29578#M21301</link>
    <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Today at some point I created new cluster again.&lt;/P&gt;
&lt;P&gt;Suddenly everything got much faster. It is back to 270 - 330 sec.&lt;/P&gt;
&lt;P&gt;My question still stands - &lt;I&gt;&lt;B&gt;how do I know what is server doing/why is it slow/stuck&lt;/B&gt;&lt;/I&gt;?&lt;/P&gt;
&lt;P&gt;btw, how long does it take to moderate question?&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 05 Aug 2016 20:16:30 GMT</pubDate>
    <dc:creator>DejanSunderic</dc:creator>
    <dc:date>2016-08-05T20:16:30Z</dc:date>
    <item>
      <title>is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29574#M21297</link>
      <description>&lt;P&gt;&lt;/P&gt;&lt;P&gt;I created some ETL using DataFrames in python. It used to run 180 sec. But it is not taking ~ 1200 sec. I have been changing it, so it could be something that I introduced, or something in the environment.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Part of the process is appending results into a file on S3.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I a looking at Apache Jobs and I cannot see that any of them is active.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;While I was writing this, I got: org.apache.spark.SparkException: Job aborted. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Command took 1274.63s -- by xxxxxxxx@gmail.com&lt;/P&gt;&lt;P&gt; at 8/4/2016, 12:44:17 PM on def4 (150 GB) &lt;/P&gt;&lt;P&gt;I have attached output that I got:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;A href="https://forums.databricks.com/storage/attachments/312-command-output.txt" alt="https://forums.databricks.com/storage/attachments/312-command-output.txt" target="_blank"&gt;command-output.txt&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I assume that I should be able to see in Spark UI what is active. I was surprised that Active Tasks on all executors was 0. Should I look at something else? &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I tried to restart the cluster, but it was the same before and after. I used the same version of Spark 1.6.2 (Hadoop 2).&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 17:49:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29574#M21297</guid>
      <dc:creator>DejanSunderic</dc:creator>
      <dc:date>2016-08-04T17:49:04Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29575#M21298</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;While I waiting for some response (I had lunch and then) I decided to do something else on this notebook, so I cloned it...&lt;/P&gt;
&lt;P&gt;I have some initialization code in the notebook. It was taking 60 sec before and after cloning 1.4 sec. Wow!&lt;/P&gt;
&lt;P&gt;Did you (Databricks support) do something on the cluster?&lt;/P&gt;
&lt;P&gt;I am going to run my etl command. &lt;/P&gt;
&lt;P&gt;It was running very fast and then it got "stuck" again. I do not see any Spark job running.&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 18:43:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29575#M21298</guid>
      <dc:creator>DejanSunderic</dc:creator>
      <dc:date>2016-08-04T18:43:33Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29576#M21299</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;In the meanwhile I got an idea to look into driver log. I've found this:&lt;/P&gt;
&lt;P&gt; &lt;B&gt;2016-08-&lt;/B&gt;&lt;B&gt;04T19:19:57.980+0000: [GC (Allocation Failure) [PSYoungGen: 6827008K-&amp;gt;52511K(7299584K)] 7660819K-&amp;gt;886330K(22848000K), 0.0142959 secs] [Times: user=0.08 sys=0.01, real=0.01 secs] &lt;/B&gt;&lt;/P&gt;
&lt;P&gt;&lt;B&gt;...&lt;/B&gt;&lt;/P&gt;
&lt;P&gt;&lt;B&gt;04T19:27:03.294+0000: [GC (Allocation Failure) [PSYoungGen: 7270001K-&amp;gt;134234K(7454208K)] 8103861K-&amp;gt;968093K(23002624K), 0.0509207 secs] [Times: user=0.33 sys=0.00, real=0.05 secs] &lt;/B&gt;&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 19:35:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29576#M21299</guid>
      <dc:creator>DejanSunderic</dc:creator>
      <dc:date>2016-08-04T19:35:29Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29577#M21300</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;the process finally finished after 3600 sec (3x slower then long duration that i was complaining about).&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 04 Aug 2016 20:02:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29577#M21300</guid>
      <dc:creator>DejanSunderic</dc:creator>
      <dc:date>2016-08-04T20:02:23Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29578#M21301</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Today at some point I created new cluster again.&lt;/P&gt;
&lt;P&gt;Suddenly everything got much faster. It is back to 270 - 330 sec.&lt;/P&gt;
&lt;P&gt;My question still stands - &lt;I&gt;&lt;B&gt;how do I know what is server doing/why is it slow/stuck&lt;/B&gt;&lt;/I&gt;?&lt;/P&gt;
&lt;P&gt;btw, how long does it take to moderate question?&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Aug 2016 20:16:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29578#M21301</guid>
      <dc:creator>DejanSunderic</dc:creator>
      <dc:date>2016-08-05T20:16:30Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29579#M21302</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Was this issue resolved? I'm also getting the same problem on my spark cluster.&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 05 Mar 2019 06:36:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29579#M21302</guid>
      <dc:creator>amanpreetkaur</dc:creator>
      <dc:date>2019-03-05T06:36:06Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29580#M21303</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I have a similar issue. Several times per week I experience very slow (5 minutes +) of "running command" on a cell that should take sub 1 second to execute. It usually solves the problem to restart the cluster, but still a major inconvenience. &lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Oct 2019 16:38:42 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29580#M21303</guid>
      <dc:creator>NickStudenski</dc:creator>
      <dc:date>2019-10-23T16:38:42Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29581#M21304</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Check for GC (garbage collection) errors in standard out for the cluster. &lt;/P&gt;
&lt;P&gt;&lt;A href="https://databricks.com/blog/2015/05/28/tuning-java-garbage-collection-for-spark-applications.html" target="test_blank"&gt;https://databricks.com/blog/2015/05/28/tuning-java-garbage-collection-for-spark-applications.html&lt;/A&gt;&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 06 Dec 2019 17:13:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29581#M21304</guid>
      <dc:creator>datadro</dc:creator>
      <dc:date>2019-12-06T17:13:24Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29582#M21305</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;I am getting this same issue. Occasionally a cell will display "Running Command" for as long as an hour. This can happen even for simple commands that ordinarily run in less than a second. I have tried restarting the cluster, attaching to a different cluster. Nothing seems to help.&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jan 2020 18:04:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29582#M21305</guid>
      <dc:creator>NickStudenski</dc:creator>
      <dc:date>2020-01-14T18:04:22Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29583#M21306</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;Facing same issue. Does anyone found the solution?&lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 20 Apr 2020 01:19:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29583#M21306</guid>
      <dc:creator>sandeep8530</dc:creator>
      <dc:date>2020-04-20T01:19:09Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29584#M21307</link>
      <description>&lt;P&gt;&lt;/P&gt;
&lt;P&gt;Mm, probably yes &lt;/P&gt; 
&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 19 May 2020 16:21:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29584#M21307</guid>
      <dc:creator>Risingi</dc:creator>
      <dc:date>2020-05-19T16:21:43Z</dc:date>
    </item>
    <item>
      <title>Re: is command stuck?</title>
      <link>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29585#M21308</link>
      <description>&lt;P&gt;I am having a problem very similar. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Since yesterday, without a known reason, some commands that used to run daily are now stuck in a "Running command" state. Commands like: &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;dataframe.show(n=1) &lt;/P&gt;&lt;P&gt;dataframe.toPandas() &lt;/P&gt;&lt;P&gt;dataframe.description() &lt;/P&gt;&lt;P&gt;dataframe.write.format("csv").save(location) &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;are now stuck also for quite small dataframes with 28 rows and 5 columns, for example. I would appreciate any help since the problem is also in important daily jobs. &lt;/P&gt;</description>
      <pubDate>Thu, 28 Apr 2022 16:09:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/is-command-stuck/m-p/29585#M21308</guid>
      <dc:creator>Carneiro</dc:creator>
      <dc:date>2022-04-28T16:09:29Z</dc:date>
    </item>
  </channel>
</rss>

