<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Managing libraries in workflows with multiple tasks - need to configure a list of libs for all t in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78437#M35526</link>
    <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/102604"&gt;@brian999&lt;/a&gt;&lt;/P&gt;&lt;P&gt;It should be doable.&lt;/P&gt;&lt;P&gt;In requirements.txt you can specify the path to the .whl file.&lt;/P&gt;</description>
    <pubDate>Fri, 12 Jul 2024 05:29:52 GMT</pubDate>
    <dc:creator>daniel_sahal</dc:creator>
    <dc:date>2024-07-12T05:29:52Z</dc:date>
    <item>
      <title>Managing libraries in workflows with multiple tasks - need to configure a list of libs for all tasks</title>
      <link>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78155#M35479</link>
      <description>&lt;P&gt;I have workflows with multiple tasks, each of which needs 5 different libraries to run. When I have to update those libraries, I have to go in and make the update in each and every task. So for one workflow I have 20 different places where I have to update the libraries.&lt;/P&gt;&lt;P&gt;I need to be able to designate a list of libraries to be available on the job cluster for all the tasks that use it, so that I only have to update the libraries in one place.&lt;/P&gt;&lt;P&gt;But from what I can tell, an entirely new cluster definition gets created for job compute every time the workflow runs, so I don't have a single cluster to configure. What am I missing?&lt;/P&gt;</description>
      <pubDate>Wed, 10 Jul 2024 17:03:19 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78155#M35479</guid>
      <dc:creator>brian999</dc:creator>
      <dc:date>2024-07-10T17:03:19Z</dc:date>
    </item>
    <item>
      <title>Re: Managing libraries in workflows with multiple tasks - need to configure a list of libs for all t</title>
      <link>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78256#M35491</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/102604"&gt;@brian999&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;Creating a requirements.txt would be a solution for you.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/en/libraries/workspace-files-libraries.html#add-dependent-libraries-to-workflow-tasks-from-workspace-files" target="_blank"&gt;https://docs.databricks.com/en/libraries/workspace-files-libraries.html#add-dependent-libraries-to-workflow-tasks-from-workspace-files&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2024 07:01:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78256#M35491</guid>
      <dc:creator>daniel_sahal</dc:creator>
      <dc:date>2024-07-11T07:01:56Z</dc:date>
    </item>
    <item>
      <title>Re: Managing libraries in workflows with multiple tasks - need to configure a list of libs for all t</title>
      <link>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78340#M35507</link>
      <description>&lt;P&gt;The libs I need to install are all private and not on PyPI. They are .whl files in repo folders. Can that be done with a requirements.txt file?&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2024 12:34:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78340#M35507</guid>
      <dc:creator>brian999</dc:creator>
      <dc:date>2024-07-11T12:34:05Z</dc:date>
    </item>
    <item>
      <title>Re: Managing libraries in workflows with multiple tasks - need to configure a list of libs for all t</title>
      <link>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78368#M35511</link>
      <description>&lt;P&gt;Actually I think I found most of a solution here in one of the replies:&amp;nbsp;&lt;A href="https://community.databricks.com/t5/administration-architecture/installing-libraries-on-job-clusters/m-p/37365/highlight/true#M245" target="_blank"&gt;https://community.databricks.com/t5/administration-architecture/installing-libraries-on-job-clusters/m-p/37365/highlight/true#M245&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;It seems like I only have to define libs for the first task, and as long as all other tasks use the same job compute, I'm good to go. I'm assuming tasks within a workflow share compute by default?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jul 2024 15:22:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78368#M35511</guid>
      <dc:creator>brian999</dc:creator>
      <dc:date>2024-07-11T15:22:53Z</dc:date>
    </item>
    <item>
      <title>Re: Managing libraries in workflows with multiple tasks - need to configure a list of libs for all t</title>
      <link>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78437#M35526</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/102604"&gt;@brian999&lt;/a&gt;&lt;/P&gt;&lt;P&gt;It should be doable.&lt;/P&gt;&lt;P&gt;In requirements.txt you can specify the path to the .whl file.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Jul 2024 05:29:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/78437#M35526</guid>
      <dc:creator>daniel_sahal</dc:creator>
      <dc:date>2024-07-12T05:29:52Z</dc:date>
    </item>
    <item>
      <title>Re: Managing libraries in workflows with multiple tasks - need to configure a list of libs for all t</title>
      <link>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/130460#M48801</link>
      <description>&lt;P&gt;But is requirements.txt always executed before we run a job in Databricks? I have a similar issue: the dependency has to be in the context before the job executes the task, otherwise it won't be able to find the .py file.&lt;/P&gt;</description>
      <pubDate>Tue, 02 Sep 2025 08:06:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/managing-libraries-in-workflows-with-multiple-tasks-need-to/m-p/130460#M48801</guid>
      <dc:creator>pablogarcia</dc:creator>
      <dc:date>2025-09-02T08:06:50Z</dc:date>
    </item>
  </channel>
</rss>

