<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Scheduling and triggering jobs based on time and frequency precedence in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/scheduling-and-triggering-jobs-based-on-time-and-frequency/m-p/136580#M50594</link>
    <description>&lt;P&gt;I have a table in Databricks that stores job information, including fields such as job_name, job_id, frequency, scheduled_time, and last_run_time.&lt;/P&gt;&lt;P&gt;I want to run a query every 10 minutes that checks this table and triggers a job if the scheduled_time is less than or equal to the current time.&lt;/P&gt;&lt;P&gt;Some jobs have multiple frequencies, for example, the same job might run daily and monthly. In such cases, I want the lower-frequency job (e.g., monthly) to take precedence, meaning only the monthly job should trigger and the higher-frequency job (daily) should be skipped when both are due.&lt;/P&gt;&lt;P&gt;What is the best way to implement this scheduling and job-triggering logic in Databricks?&lt;/P&gt;</description>
    <pubDate>Wed, 29 Oct 2025 16:08:50 GMT</pubDate>
    <dc:creator>Anoora</dc:creator>
    <dc:date>2025-10-29T16:08:50Z</dc:date>
    <item>
      <title>Scheduling and triggering jobs based on time and frequency precedence</title>
      <link>https://community.databricks.com/t5/data-engineering/scheduling-and-triggering-jobs-based-on-time-and-frequency/m-p/136580#M50594</link>
      <description>&lt;P&gt;I have a table in Databricks that stores job information, including fields such as job_name, job_id, frequency, scheduled_time, and last_run_time.&lt;/P&gt;&lt;P&gt;I want to run a query every 10 minutes that checks this table and triggers a job if the scheduled_time is less than or equal to the current time.&lt;/P&gt;&lt;P&gt;Some jobs have multiple frequencies, for example, the same job might run daily and monthly. In such cases, I want the lower-frequency job (e.g., monthly) to take precedence, meaning only the monthly job should trigger and the higher-frequency job (daily) should be skipped when both are due.&lt;/P&gt;&lt;P&gt;What is the best way to implement this scheduling and job-triggering logic in Databricks?&lt;/P&gt;</description>
      <pubDate>Wed, 29 Oct 2025 16:08:50 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/scheduling-and-triggering-jobs-based-on-time-and-frequency/m-p/136580#M50594</guid>
      <dc:creator>Anoora</dc:creator>
      <dc:date>2025-10-29T16:08:50Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling and triggering jobs based on time and frequency precedence</title>
      <link>https://community.databricks.com/t5/data-engineering/scheduling-and-triggering-jobs-based-on-time-and-frequency/m-p/136596#M50596</link>
      <description>&lt;P&gt;You could add a job with a scheduled based trigger that runs every 10 minutes. The task at the start of the job runs a SQL query against the job information table and uses the logic you described above to output a boolean value. Then feed that boolean value into a conditional task and if True then have a job task that executes your main pipeline.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.databricks.com/aws/en/jobs/run-if" target="_blank"&gt;https://docs.databricks.com/aws/en/jobs/run-if&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 29 Oct 2025 18:14:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/scheduling-and-triggering-jobs-based-on-time-and-frequency/m-p/136596#M50596</guid>
      <dc:creator>SamAdams</dc:creator>
      <dc:date>2025-10-29T18:14:09Z</dc:date>
    </item>
    <item>
      <title>Re: Scheduling and triggering jobs based on time and frequency precedence</title>
      <link>https://community.databricks.com/t5/data-engineering/scheduling-and-triggering-jobs-based-on-time-and-frequency/m-p/136606#M50599</link>
      <description>&lt;P&gt;Yeah, that’s basically what I’m doing now, but it’s becoming pretty difficult to manage since there are around 120 pipelines in total. Maintaining a separate scheduled job and conditional logic for each one adds a lot of overhead, and it’s getting hard to scale or keep track of everything. That’s why I was exploring whether there’s a cleaner way to handle this kind of multi-schedule orchestration directly.&lt;/P&gt;</description>
      <pubDate>Wed, 29 Oct 2025 19:11:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/scheduling-and-triggering-jobs-based-on-time-and-frequency/m-p/136606#M50599</guid>
      <dc:creator>Anoora</dc:creator>
      <dc:date>2025-10-29T19:11:09Z</dc:date>
    </item>
  </channel>
</rss>

