<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Unable to start Cluster in Databricks because of `BOOTSTRAP_TIMEOUT` in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/unable-to-start-cluster-in-databricks-because-of-bootstrap/m-p/38611#M26684</link>
    <description>&lt;P&gt;Unable to start the Cluster in AWS-hosted Databricks because of the below reason&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;{
  "reason": {
    "code": "BOOTSTRAP_TIMEOUT",
    "parameters": {
      "databricks_error_message": "[id: InstanceId(i-0634ee9c2d420edc8), status: INSTANCE_INITIALIZING, workerEnvId:WorkerEnvId(workerenv-1653812411865398-41432fd9-c426-43ee-84fd-f161341ac1db), lastStatusChangeTime: 1690489336310, groupIdOpt Some(0),requestIdOpt Some(0717-085009-xgjqxcqe-70e3f1a0-0524-43cc-b),version 1] with threshold 700 seconds timed out after 701058 milliseconds. Please check network connectivity from the data plane to the control plane.",
      "instance_id": "i-0634ee9c2d420edc8"
    }
  },
  "add_node_failure_details": {
    "failure_count": 2,
    "resource_type": "container",
    "will_retry": false
  }
}&amp;nbsp;&amp;nbsp;&lt;/PRE&gt;&lt;P&gt;It all started when we arranged a Peering connection between Databricks default VPC and out VPC. Rolled back all the changes, but the problem still persists. In AWS, I can see the EC2 instances are initialized and running but something wrong other than that.&lt;/P&gt;&lt;P&gt;&amp;nbsp;Any help would be greatly appreciated.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 27 Jul 2023 20:57:28 GMT</pubDate>
    <dc:creator>Sahha_Krishna</dc:creator>
    <dc:date>2023-07-27T20:57:28Z</dc:date>
    <item>
      <title>Unable to start Cluster in Databricks because of `BOOTSTRAP_TIMEOUT`</title>
      <link>https://community.databricks.com/t5/data-engineering/unable-to-start-cluster-in-databricks-because-of-bootstrap/m-p/38611#M26684</link>
      <description>&lt;P&gt;Unable to start the Cluster in AWS-hosted Databricks because of the below reason&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;PRE&gt;{
  "reason": {
    "code": "BOOTSTRAP_TIMEOUT",
    "parameters": {
      "databricks_error_message": "[id: InstanceId(i-0634ee9c2d420edc8), status: INSTANCE_INITIALIZING, workerEnvId:WorkerEnvId(workerenv-1653812411865398-41432fd9-c426-43ee-84fd-f161341ac1db), lastStatusChangeTime: 1690489336310, groupIdOpt Some(0),requestIdOpt Some(0717-085009-xgjqxcqe-70e3f1a0-0524-43cc-b),version 1] with threshold 700 seconds timed out after 701058 milliseconds. Please check network connectivity from the data plane to the control plane.",
      "instance_id": "i-0634ee9c2d420edc8"
    }
  },
  "add_node_failure_details": {
    "failure_count": 2,
    "resource_type": "container",
    "will_retry": false
  }
}&amp;nbsp;&amp;nbsp;&lt;/PRE&gt;&lt;P&gt;It all started when we arranged a Peering connection between Databricks default VPC and out VPC. Rolled back all the changes, but the problem still persists. In AWS, I can see the EC2 instances are initialized and running but something wrong other than that.&lt;/P&gt;&lt;P&gt;&amp;nbsp;Any help would be greatly appreciated.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 27 Jul 2023 20:57:28 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/unable-to-start-cluster-in-databricks-because-of-bootstrap/m-p/38611#M26684</guid>
      <dc:creator>Sahha_Krishna</dc:creator>
      <dc:date>2023-07-27T20:57:28Z</dc:date>
    </item>
    <item>
      <title>Re: Unable to start Cluster in Databricks because of `BOOTSTRAP_TIMEOUT`</title>
      <link>https://community.databricks.com/t5/data-engineering/unable-to-start-cluster-in-databricks-because-of-bootstrap/m-p/49934#M28659</link>
      <description>&lt;P&gt;Hi, Sahha:&lt;/P&gt;
&lt;P&gt;Thanks for contacting Databricks Support.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;This is the common type of error, which indicates that the bootstrap failed due to a misconfigured data plane network. Databricks requested EC2 instances for a new cluster, but encountered a long delay while waiting for the EC2 instance to bootstrap, and connect to the control plane. The cluster manager terminates the instances, and reports this error.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Please go to AWS console and download the EC2 system log by following the instructions:&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;
&lt;OL&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;SPAN&gt;Open the Amazon EC2 console&amp;nbsp;&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;SPAN&gt;In the left navigation pane, choose&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;Instances&lt;/STRONG&gt;&lt;SPAN&gt;, and select the instance using the instance ID. &lt;/SPAN&gt;&lt;SPAN&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;The instance ID, which starts with &lt;/SPAN&gt;&lt;SPAN&gt;i-xxxxxx&lt;/SPAN&gt;&lt;SPAN&gt;, will be printed in the &lt;/SPAN&gt;&lt;STRONG&gt;Event Log&lt;/STRONG&gt;&lt;SPAN&gt; section of the cluster details page. Note that the instance must be terminated within the last hour; otherwise, it will not show up in the list. If the cluster creation failure happened a long time ago, restart the cluster to reproduce the error first.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;SPAN&gt;Choose&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;Actions&lt;/STRONG&gt;&lt;SPAN&gt; &amp;gt; &lt;/SPAN&gt;&lt;STRONG&gt;Monitor and troubleshoot&lt;/STRONG&gt;&lt;SPAN&gt; &amp;gt; &lt;/SPAN&gt;&lt;STRONG&gt;Get System Log&lt;/STRONG&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;LI style="font-weight: 400;" aria-level="2"&gt;&lt;SPAN&gt;Click the &lt;/SPAN&gt;&lt;STRONG&gt;Download&lt;/STRONG&gt;&lt;SPAN&gt; button to download the system log. It may take a few minutes for the system log to show up if the cluster is just started.&lt;/SPAN&gt;&lt;/LI&gt;
&lt;/OL&gt;
&lt;P&gt;&lt;SPAN&gt;Check&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;the &lt;/SPAN&gt;&lt;SPAN&gt;system log, look for&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;messages starting with the prefix:&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;[&lt;/SPAN&gt;&lt;I&gt;&lt;SPAN&gt;timestamp&lt;/SPAN&gt;&lt;/I&gt;&lt;SPAN&gt;, Bootstrap Event].&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;S&lt;/SPAN&gt;&lt;SPAN&gt;earch for &lt;/SPAN&gt;&lt;SPAN&gt;FAILED_MESSAGE&lt;/SPAN&gt;&lt;SPAN&gt;, and &lt;/SPAN&gt;&lt;SPAN&gt;use a &lt;/SPAN&gt;&lt;A href="https://www.base64decode.org/" target="_blank"&gt;&lt;SPAN&gt;B&lt;/SPAN&gt;&lt;SPAN&gt;ase64 decode tool&lt;/SPAN&gt;&lt;/A&gt;&lt;SPAN&gt; to decode the message. The message should give the reason why bootstrap failed.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;Regards,&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 26 Oct 2023 17:48:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/unable-to-start-cluster-in-databricks-because-of-bootstrap/m-p/49934#M28659</guid>
      <dc:creator>User16539034020</dc:creator>
      <dc:date>2023-10-26T17:48:23Z</dc:date>
    </item>
  </channel>
</rss>

