<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How does Task Orchestration compare to Airflow (for Databricks-only jobs)? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-does-task-orchestration-compare-to-airflow-for-databricks/m-p/16017#M10242</link>
    <description>&lt;P&gt;We have a setup with both types of orchestration. The split comes down to ease of use. For example ETL jobs are running via Airflow, but some notebooks that users just want to schedule for themselves are done within Databricks.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It's just so much more easier to click "schedule" and choose a cluster and be done with it instead of writing an Airflow task or a DAG. Data scientists don't want to learn how to use specific scheduling software - they just want to have their notebook run every morning.&lt;/P&gt;</description>
    <pubDate>Thu, 09 Sep 2021 19:23:17 GMT</pubDate>
    <dc:creator>Lauri</dc:creator>
    <dc:date>2021-09-09T19:23:17Z</dc:date>
    <item>
      <title>How does Task Orchestration compare to Airflow (for Databricks-only jobs)?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-does-task-orchestration-compare-to-airflow-for-databricks/m-p/16015#M10240</link>
      <description>&lt;P&gt;One of my clients has been orchestration Databricks notebooks using Airflow + REST API. They're curious about the pros/cons of switching these jobs to Databricks jobs with Task Orchestration.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;I know there are all sorts of considerations - for example, if they're already running Airflow for non-Databricks jobs, they'll most likely continue using Airflow to centralize workflow management. But I'm curious about people's experiences with Task Orchestration, and what features or benefits it might have over Airflow for jobs that are 100% Databricks anyway.&lt;/P&gt;</description>
      <pubDate>Wed, 25 Aug 2021 16:00:36 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-does-task-orchestration-compare-to-airflow-for-databricks/m-p/16015#M10240</guid>
      <dc:creator>arthur_wang</dc:creator>
      <dc:date>2021-08-25T16:00:36Z</dc:date>
    </item>
    <item>
      <title>Re: How does Task Orchestration compare to Airflow (for Databricks-only jobs)?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-does-task-orchestration-compare-to-airflow-for-databricks/m-p/16017#M10242</link>
      <description>&lt;P&gt;We have a setup with both types of orchestration. The split comes down to ease of use. For example ETL jobs are running via Airflow, but some notebooks that users just want to schedule for themselves are done within Databricks.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It's just so much more easier to click "schedule" and choose a cluster and be done with it instead of writing an Airflow task or a DAG. Data scientists don't want to learn how to use specific scheduling software - they just want to have their notebook run every morning.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Sep 2021 19:23:17 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-does-task-orchestration-compare-to-airflow-for-databricks/m-p/16017#M10242</guid>
      <dc:creator>Lauri</dc:creator>
      <dc:date>2021-09-09T19:23:17Z</dc:date>
    </item>
    <item>
      <title>Re: How does Task Orchestration compare to Airflow (for Databricks-only jobs)?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-does-task-orchestration-compare-to-airflow-for-databricks/m-p/16018#M10243</link>
      <description>&lt;P&gt;@Kaniz Fatma​&amp;nbsp;&lt;/P&gt;&lt;P&gt;Hello Kaniz, I'm currently working with a major Enterprise Client looking to make the choice between the Airflow vs Databricks for Jobs scheduling. Our Entire code base is in Databricks and we are trying to figure out the complexities that might come up if we go ahead with Databricks. Could you please provide any details this issue - on Databricks vs Airflow ? We are essentially looking for retries, notifications, task dependencies and Real-time Use case scenarios? If you can please include how it might benefit/not  benefit different employees Like a Data Analyst/ Data Scientist etc. ?&lt;/P&gt;</description>
      <pubDate>Tue, 06 Sep 2022 14:34:33 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-does-task-orchestration-compare-to-airflow-for-databricks/m-p/16018#M10243</guid>
      <dc:creator>Shourya</dc:creator>
      <dc:date>2022-09-06T14:34:33Z</dc:date>
    </item>
  </channel>
</rss>

