<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Create a Workflow Schedule with varying Parameters in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/57839#M8350</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I don't think this really helps in my use case. Ideally I would have 1 workflow with the following settings:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Schedule 1:&lt;UL&gt;&lt;LI&gt;Runs every hour&lt;/LI&gt;&lt;LI&gt;job_parameter = "SAP" to import class SAP, which gets the data related to that ERP system&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;Schedule 2:&lt;UL&gt;&lt;LI&gt;Runs every 10 minutes&lt;/LI&gt;&lt;LI&gt;job_parameter = "Workday" to import class Workday, which gets the data related to that ERP system&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;We could have 5-6 different workflows, all running the very same notebook but with varying parameters/schedules.&lt;/P&gt;&lt;P&gt;A little picture to illustrate &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Skjermbilde 2024-01-18 214302.png" style="width: 506px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5873i8676A776AAFDC6E4/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999" role="button" title="Skjermbilde 2024-01-18 214302.png" alt="Skjermbilde 2024-01-18 214302.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 18 Jan 2024 20:44:30 GMT</pubDate>
    <dc:creator>marcuskw</dc:creator>
    <dc:date>2024-01-18T20:44:30Z</dc:date>
    <item>
      <title>Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/56844#M8345</link>
      <description>&lt;P&gt;We aim to keep the number of notebooks we create to a minimum and instead make them fairly flexible. Therefore we have a Factory setup that takes in a parameter to vary the logic.&lt;/P&gt;&lt;P&gt;However, when it comes to Workflows we are forced to create multiple workflows that do more or less the same thing:&lt;BR /&gt;- Run notebook with Parameter X&lt;BR /&gt;- Run notebook with Parameter Y&lt;BR /&gt;- Run notebook with Parameter Z&lt;/P&gt;&lt;P&gt;Is there any development ongoing to support multiple schedules per Workflow, where each schedule could come with its own parameter input?&lt;/P&gt;&lt;P&gt;That way we would have only 1 Workflow with 3 different schedules, using parameters X, Y and Z.&lt;/P&gt;</description>
      <pubDate>Wed, 10 Jan 2024 10:05:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/56844#M8345</guid>
      <dc:creator>marcuskw</dc:creator>
      <dc:date>2024-01-10T10:05:21Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/56970#M8346</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/88429"&gt;@marcuskw&lt;/a&gt;, could you share more details on your use case? It would be helpful to know why you need multiple schedules per Workflow.&lt;/P&gt;</description>
      <pubDate>Thu, 11 Jan 2024 17:18:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/56970#M8346</guid>
      <dc:creator>Lakshay</dc:creator>
      <dc:date>2024-01-11T17:18:24Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/57072#M8347</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&amp;nbsp;&lt;BR /&gt;We have a Factory logic that looks something like this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;class Factory:
    def __init__(self, job_parameter: str):
        self.job_parameter = job_parameter

    def set_objects(self):
        # Import lazily so only the module for the selected parameter is loaded.
        if self.job_parameter == "A":
            from path.A import LogicClass
        elif self.job_parameter == "B":
            from path.B import LogicClass
        else:
            # Fail early rather than hit an UnboundLocalError on return.
            raise ValueError(f"Unsupported job_parameter: {self.job_parameter!r}")

        return LogicClass&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The aim is to have a generic notebook that would then look like this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;from pyspark.dbutils import DBUtils
from path.factory import Factory

# Read the parameter passed in by the job, then pick the matching logic class.
job_parameter = dbutils.widgets.get("job_parameter")
LogicClass = Factory(job_parameter).set_objects()
LogicClass.run_business_logic()&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;When we want to use orchestration we are forced to create multiple jobs:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Job "A" which runs the generic notebook with "job_parameter" = "A" with a schedule&lt;/LI&gt;&lt;LI&gt;Job "B" which runs the generic notebook with "job_parameter" = "B" with a schedule&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;A solution where we only have 1 job would help us here where we have:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Job "Run generic notebook"&lt;/LI&gt;&lt;UL&gt;&lt;LI&gt;Schedule 1 with "job_parameter" = "A"&lt;/LI&gt;&lt;LI&gt;Schedule 2 with "job_parameter" = "B"&lt;/LI&gt;&lt;/UL&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 12 Jan 2024 14:40:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/57072#M8347</guid>
      <dc:creator>marcuskw</dc:creator>
      <dc:date>2024-01-12T14:40:11Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/57192#M8348</link>
      <description>&lt;P&gt;As far as scheduling is concerned, you should be able to combine the two schedules into one cron schedule.&lt;/P&gt;
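&lt;P&gt;One possible reading of the combined schedule, as a minimal sketch rather than a confirmed Databricks feature: trigger the single job at the finest cadence needed and let the notebook decide which job_parameter values are due on each run. The names INTERVAL_MINUTES and due_parameters, and the 60 and 10 minute cadences, are assumptions made up for illustration.&lt;/P&gt;

```python
from datetime import datetime, timezone

# Hypothetical cadence per job_parameter, in minutes; in practice this
# mapping could live in a small lookup table instead of in code.
INTERVAL_MINUTES = {"A": 60, "B": 10}


def due_parameters(run_time, intervals=INTERVAL_MINUTES):
    """Return the parameters whose interval boundary falls on run_time."""
    minutes_since_midnight = run_time.hour * 60 + run_time.minute
    return sorted(
        name
        for name, every in intervals.items()
        if minutes_since_midnight % every == 0
    )


# A run triggered at 09:00 UTC handles both parameters, a run at 09:10 only "B".
print(due_parameters(datetime(2024, 1, 12, 9, 0, tzinfo=timezone.utc)))
print(due_parameters(datetime(2024, 1, 12, 9, 10, tzinfo=timezone.utc)))
```

&lt;P&gt;This sketch assumes the function receives the scheduled trigger time, rounded to the minute, rather than the actual start time of the run, which can drift past an interval boundary.&lt;/P&gt;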
&lt;P&gt;To pass different parameters, you can store them in a small table and fetch the values based on a condition, for example whether the run is a delta load or a historical load.&lt;/P&gt;</description>
      <pubDate>Fri, 12 Jan 2024 19:16:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/57192#M8348</guid>
      <dc:creator>Lakshay</dc:creator>
      <dc:date>2024-01-12T19:16:55Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/57839#M8350</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I don't think this really helps in my use case. Ideally I would have 1 workflow with the following settings:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Schedule 1:&lt;UL&gt;&lt;LI&gt;Runs every hour&lt;/LI&gt;&lt;LI&gt;job_parameter = "SAP" to import class SAP, which gets the data related to that ERP system&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;LI&gt;Schedule 2:&lt;UL&gt;&lt;LI&gt;Runs every 10 minutes&lt;/LI&gt;&lt;LI&gt;job_parameter = "Workday" to import class Workday, which gets the data related to that ERP system&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;We could have 5-6 different workflows, all running the very same notebook but with varying parameters/schedules.&lt;/P&gt;&lt;P&gt;A little picture to illustrate &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Skjermbilde 2024-01-18 214302.png" style="width: 506px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/5873i8676A776AAFDC6E4/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999" role="button" title="Skjermbilde 2024-01-18 214302.png" alt="Skjermbilde 2024-01-18 214302.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jan 2024 20:44:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/57839#M8350</guid>
      <dc:creator>marcuskw</dc:creator>
      <dc:date>2024-01-18T20:44:30Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61270#M8351</link>
      <description>&lt;P&gt;Did you figure out if this was possible?&lt;/P&gt;&lt;P&gt;I too find that we have too many workflows, and I would rather have them combined, with different parts of the workflow running on different schedules.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Feb 2024 14:50:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61270#M8351</guid>
      <dc:creator>AlexVB</dc:creator>
      <dc:date>2024-02-20T14:50:14Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61271#M8352</link>
      <description>&lt;P&gt;Unfortunately not. Our current solution is to have multiple workflows that run the same notebooks with varying input parameters.&lt;/P&gt;&lt;P&gt;This results in a bit of workflow bloat, both in the UI and in the CI/CD process.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Feb 2024 14:54:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61271#M8352</guid>
      <dc:creator>marcuskw</dc:creator>
      <dc:date>2024-02-20T14:54:25Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61272#M8353</link>
      <description>&lt;P&gt;Thanks for the quick reply. Sorry to hear that. I think we will quickly grow tired of the workflow bloat. We're also considering using the Databricks MLOps offering, where ML jobs etc. are workflows, which will add further noise to the workflows tab. Kinda sad. The UI is kinda shit as it is already.&lt;/P&gt;&lt;P&gt;I have an idea but haven't tried it: you can do conditional tasks in workflows. Do you think those could be used so that only parts of a workflow run when the workflow is triggered?&lt;/P&gt;&lt;P&gt;I have also considered creating a run table and querying it to determine which parts of the workflow to run, but that quickly becomes work I don't want to do. I really feel like this would be an easy feature for Databricks to implement, since they already log workflow runs.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Feb 2024 14:58:13 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61272#M8353</guid>
      <dc:creator>AlexVB</dc:creator>
      <dc:date>2024-02-20T14:58:13Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61274#M8354</link>
      <description>&lt;P&gt;I imagine Databricks would have to alter the schema of their Jobs API to implement a solution where a schedule also has its own Id, instead of there being just a Job Id. I imagine it would be possible to have a lookup table, append which parameters were run, and then infer what the next parameter should be, but that would increase ETL time.&lt;/P&gt;&lt;P&gt;Our team hasn't seen the need to implement more complicated workflows; all our workflows have 1 task, which is to run a notebook. That one notebook runs different endpoints/logic/methods using parallel/async logic, which is our way of implementing multiple "tasks".&lt;BR /&gt;We build solutions where ETL time is an important factor, and here multiple tasks also create an issue. For example, if you create a chain Task1 -&amp;gt; Task2 -&amp;gt; Task3 where each task does a simple print(1), you will see an overhead of approximately 7 seconds between tasks.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Feb 2024 15:17:14 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/61274#M8354</guid>
      <dc:creator>marcuskw</dc:creator>
      <dc:date>2024-02-20T15:17:14Z</dc:date>
    </item>
    <item>
      <title>Re: Create a Workflow Schedule with varying Parameters</title>
      <link>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/90976#M8355</link>
      <description>&lt;P&gt;We're also running into this issue on my team, where having multiple cron schedules would be handy. We have some pipelines that we want to run on multiple schedules to refresh data, say "Run every Sunday at midnight" and "Run on the first day of the month at midnight". For now we ended up building our own logic for handling concurrent runs and placing the sub-workflow inside two different master workflows that run on those schedules.&lt;/P&gt;</description>
      <pubDate>Wed, 18 Sep 2024 23:49:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/create-a-workflow-schedule-with-varying-parameters/m-p/90976#M8355</guid>
      <dc:creator>SamAdams</dc:creator>
      <dc:date>2024-09-18T23:49:46Z</dc:date>
    </item>
  </channel>
</rss>

