<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: preloaded_docker_images: how do they work? in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105494#M9497</link>
    <description>&lt;P&gt;Sure, I will inform the team in charge of it to review it.&lt;/P&gt;</description>
    <pubDate>Mon, 13 Jan 2025 20:48:20 GMT</pubDate>
    <dc:creator>Alberto_Umana</dc:creator>
    <dc:date>2025-01-13T20:48:20Z</dc:date>
    <item>
      <title>preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105449#M9491</link>
      <description>&lt;P&gt;At my org, when we start a Databricks cluster, it often takes a while to become available (due to (1) instance provisioning, (2) library loading, and (3) init script execution). I'm exploring whether an instance pool could be a viable strategy for improving cluster execution time.&lt;/P&gt;&lt;P&gt;I see there's a feature called "preloaded_docker_images" (&lt;A href="https://docs.databricks.com/api/workspace/instancepools/get#preloaded_docker_images" target="_blank"&gt;https://docs.databricks.com/api/workspace/instancepools/get#preloaded_docker_images&lt;/A&gt;), but the docs are limited. Is there canonical documentation that explains:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;When are the Docker images loaded in the lifecycle of the instance?&lt;/LI&gt;&lt;LI&gt;Given that you can supply N images, how does container isolation work? I only need to load one container, but it's unclear to me whether my Spark job will run "inside" that container. (Given that I'm using the container to preload jars, I'm pretty sure the Spark job needs to be able to see those jars!)&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Mon, 13 Jan 2025 16:44:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105449#M9491</guid>
      <dc:creator>mrstevegross</dc:creator>
      <dc:date>2025-01-13T16:44:15Z</dc:date>
    </item>
    <item>
      <title>Re: preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105453#M9492</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/142679"&gt;@mrstevegross&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;About your cluster startup time, how long does it take to come up?&lt;/P&gt;
&lt;UL class="ul1"&gt;
&lt;UL class="ul1"&gt;
&lt;LI class="li1"&gt;&lt;STRONG&gt;1. &lt;/STRONG&gt;Docker images specified in the preloaded_docker_images field are loaded when the instance pool is created or when instances are added to the pool. This means that the images are pulled and cached on the instances before they are used for running jobs.&lt;/LI&gt;
&lt;/UL&gt;
&lt;/UL&gt;
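&lt;P&gt;As a rough sketch of point 1, the request body for creating a pool with one preloaded image might be assembled as follows. The field names are taken from the Instance Pools API reference; the pool name, node type, and image URL are hypothetical placeholders, and the sketch only builds the payload without sending it.&lt;/P&gt;
&lt;PRE&gt;
```python
import json

# Hypothetical placeholder; substitute your own registry path.
IMAGE_URL = "my-registry.example.com/spark-with-jars:1.0"


def build_pool_request(image_url):
    """Build an instance pool create payload that preloads one Docker image."""
    return {
        "instance_pool_name": "preloaded-image-pool",  # placeholder name
        "node_type_id": "i3.xlarge",  # placeholder node type
        "min_idle_instances": 1,
        # Images listed here are the ones cached on pool instances, as described above.
        "preloaded_docker_images": [{"url": image_url}],
    }


pool_request = build_pool_request(IMAGE_URL)
print(json.dumps(pool_request, indent=2))
```
&lt;/PRE&gt;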
&lt;P class="p1"&gt;When you specify a Docker image for your Databricks cluster, the entire cluster runs within that Docker container. This means that all Spark jobs executed on the cluster will run inside the specified Docker container.&lt;/P&gt;
&lt;UL class="ul1"&gt;
&lt;UL class="ul1"&gt;
&lt;LI class="li1"&gt;&lt;STRONG&gt;2. &lt;/STRONG&gt;Since your Spark job runs inside the Docker container, it will have access to any jars or libraries that are preloaded within that container. This ensures that your Spark job can see and use the preloaded jars as expected.&lt;/LI&gt;
&lt;/UL&gt;
&lt;/UL&gt;
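&lt;P&gt;For point 2, a cluster that should run inside a given container names that image in its own create request. A minimal sketch, assuming the docker_image and instance_pool_id fields from the Clusters API reference; the pool ID, cluster name, Spark version, and image URL are hypothetical placeholders, and nothing is sent to the API here.&lt;/P&gt;
&lt;PRE&gt;
```python
import json

# Hypothetical placeholders; none of these values come from a real workspace.
POOL_ID = "pool-1234"
IMAGE_URL = "my-registry.example.com/spark-with-jars:1.0"


def build_cluster_request(pool_id, image_url):
    """Build a create-cluster payload that draws nodes from a pool and names its image."""
    return {
        "cluster_name": "docker-cluster-from-pool",  # placeholder name
        "spark_version": "15.4.x-scala2.12",  # placeholder runtime version
        "num_workers": 2,
        "instance_pool_id": pool_id,
        # The image the cluster runs in; jars baked into it are visible to Spark jobs.
        "docker_image": {"url": image_url},
    }


cluster_request = build_cluster_request(POOL_ID, IMAGE_URL)
print(json.dumps(cluster_request, indent=2))
```
&lt;/PRE&gt;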
&lt;P&gt;Please be aware of some limitations:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/compute/custom-containers.html" target="_blank"&gt;https://docs.databricks.com/en/compute/custom-containers.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jan 2025 16:59:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105453#M9492</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-01-13T16:59:09Z</dc:date>
    </item>
    <item>
      <title>Re: preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105455#M9493</link>
      <description>&lt;P&gt;&amp;gt;&lt;SPAN&gt;When you specify a Docker image for your Databricks cluster, the entire cluster runs within that Docker container.&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Just to clarify: are you saying that the Databricks job request itself says&amp;nbsp;&lt;EM&gt;which&lt;/EM&gt;&lt;EM&gt; container to use&lt;/EM&gt;?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;gt;Please be aware of some limitations:&amp;nbsp;&lt;A href="https://docs.databricks.com/en/compute/custom-containers.html" target="_blank" rel="nofollow noopener noreferrer"&gt;https://docs.databricks.com/en/compute/custom-containers.html&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Roger that, reading docs now.&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jan 2025 17:02:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105455#M9493</guid>
      <dc:creator>mrstevegross</dc:creator>
      <dc:date>2025-01-13T17:02:20Z</dc:date>
    </item>
    <item>
      <title>Re: preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105461#M9494</link>
      <description>&lt;P&gt;&amp;gt;&lt;SPAN&gt;Just to clarify: are you saying that the Databricks job request itself says&amp;nbsp;&lt;/SPAN&gt;&lt;EM&gt;which&lt;/EM&gt;&lt;EM&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;container to use&lt;/EM&gt;&lt;SPAN&gt;?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;I see here (&lt;A href="https://docs.databricks.com/api/workspace/clusters/create#docker_image" target="_blank"&gt;https://docs.databricks.com/api/workspace/clusters/create#docker_image&lt;/A&gt;) that the create-cluster request can include an image-to-load. How does that interact with the instance pool's "preloaded_docker_images" feature?&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jan 2025 17:25:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105461#M9494</guid>
      <dc:creator>mrstevegross</dc:creator>
      <dc:date>2025-01-13T17:25:18Z</dc:date>
    </item>
    <item>
      <title>Re: preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105479#M9495</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/142679"&gt;@mrstevegross&lt;/a&gt;, not exactly. It should come with the API request.&lt;/P&gt;
&lt;P class="p1"&gt;When you create a cluster using an instance pool with preloaded Docker images, the cluster can use one of the preloaded images if it matches the docker_image specified in the create-cluster request. If the specified docker_image is not preloaded in the instance pool, the cluster will load the specified image, which may take additional time.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jan 2025 19:39:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105479#M9495</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-01-13T19:39:43Z</dc:date>
    </item>
    <item>
      <title>Re: preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105485#M9496</link>
      <description>&lt;P&gt;&amp;gt;&lt;SPAN&gt;&amp;nbsp;if it matches the docker_image specified in the create-cluster request.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Aha, good to know. Can y'all update the reference docs to clarify these semantics?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jan 2025 19:48:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105485#M9496</guid>
      <dc:creator>mrstevegross</dc:creator>
      <dc:date>2025-01-13T19:48:14Z</dc:date>
    </item>
    <item>
      <title>Re: preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105494#M9497</link>
      <description>&lt;P&gt;Sure, I will inform the team in charge of it to review it.&lt;/P&gt;</description>
      <pubDate>Mon, 13 Jan 2025 20:48:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/105494#M9497</guid>
      <dc:creator>Alberto_Umana</dc:creator>
      <dc:date>2025-01-13T20:48:20Z</dc:date>
    </item>
    <item>
      <title>Re: preloaded_docker_images: how do they work?</title>
      <link>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/116963#M9897</link>
      <description>&lt;P&gt;Hello, when we specify a Docker image with credentials in the instance pool configuration, should we also specify the credentials in the cluster configuration, given that the image has already been pulled onto the pool instance?&lt;/P&gt;</description>
      <pubDate>Tue, 29 Apr 2025 12:15:15 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/preloaded-docker-images-how-do-they-work/m-p/116963#M9897</guid>
      <dc:creator>naha3456</dc:creator>
      <dc:date>2025-04-29T12:15:15Z</dc:date>
    </item>
  </channel>
</rss>

