I have a large Docker image in our AWS ECR repo. The image is 27.4 GB locally and 11539.79 MB compressed in ECR.
The error from the Event Log is:
Failed to add 2 containers to the compute. Will attempt retry: true. Reason: Docker image pull failure
JSON:
{
"reason": {
"code": "DOCKER_IMAGE_PULL_FAILURE",
"type": "SERVICE_FAULT",
"parameters": {
"instance_id": "i-0172cf9b70a25df47",
"databricks_error_message": "Downloading docker image has timed out"
}
},
"add_node_failure_details": {
"failure_count": 2,
"resource_type": "container",
"will_retry": true
}
}