<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Using AutoML to predict completion dates of a project management dataset in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/using-automl-to-predict-completion-dates-of-a-project-management/m-p/60712#M3014</link>
    <description>&lt;P&gt;Hello! I am fairly new to Databricks. I'm trying to do a proof of concept with AutoML in Databricks at my organization, and the dataset I am using is a project management dataset. Here's a sample:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE border="0" width="640" cellspacing="0" cellpadding="0"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD width="64" height="20"&gt;project_id&lt;/TD&gt;&lt;TD width="64"&gt;market&lt;/TD&gt;&lt;TD width="64"&gt;general_contractor&lt;/TD&gt;&lt;TD width="64"&gt;project_type&lt;/TD&gt;&lt;TD width="64"&gt;permit_date&lt;/TD&gt;&lt;TD width="64"&gt;permit_status&lt;/TD&gt;&lt;TD width="64"&gt;construction_date&lt;/TD&gt;&lt;TD width="64"&gt;construction_status&lt;/TD&gt;&lt;TD width="64"&gt;completion_date&lt;/TD&gt;&lt;TD width="64"&gt;completion_status&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;project_1&lt;/TD&gt;&lt;TD&gt;NY&lt;/TD&gt;&lt;TD&gt;acme inc&lt;/TD&gt;&lt;TD&gt;rehab&lt;/TD&gt;&lt;TD&gt;2/1/2024&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;TD&gt;3/1/2024&lt;/TD&gt;&lt;TD&gt;projected&lt;/TD&gt;&lt;TD&gt;4/1/2024&lt;/TD&gt;&lt;TD&gt;projected&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;project_2&lt;/TD&gt;&lt;TD&gt;LA&lt;/TD&gt;&lt;TD&gt;xyz inc&lt;/TD&gt;&lt;TD&gt;build to suit&lt;/TD&gt;&lt;TD&gt;1/1/2020&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;TD&gt;2/2/2023&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;TD&gt;3/4/2023&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;So based on this dataset, I want to be able to see how I can reduce completion_date period. For example, if I use acme inc in LA, will that reduce my completion date and if so, by how much? or for example if I reduce my permit_date by 2 days, how big of an impact will it have on completion_date? Of course I only have to rely on historical data so all the status fields must be set to "complete".&lt;/P&gt;&lt;P&gt;How do I go about doing this? Also, is there a way to output the result in a way for stakeholder to analyze, using a visual tool like tableau or powerbi?&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 17 Feb 2024 15:05:30 GMT</pubDate>
    <dc:creator>User100024</dc:creator>
    <dc:date>2024-02-17T15:05:30Z</dc:date>
    <item>
      <title>Using AutoML to predict completion dates of a project management dataset</title>
      <link>https://community.databricks.com/t5/machine-learning/using-automl-to-predict-completion-dates-of-a-project-management/m-p/60712#M3014</link>
      <description>&lt;P&gt;Hello! I am fairly new to Databricks. I'm trying to do a proof of concept with AutoML in Databricks at my organization, and the dataset I am using is a project management dataset. Here's a sample:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;TABLE border="0" width="640" cellspacing="0" cellpadding="0"&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD width="64" height="20"&gt;project_id&lt;/TD&gt;&lt;TD width="64"&gt;market&lt;/TD&gt;&lt;TD width="64"&gt;general_contractor&lt;/TD&gt;&lt;TD width="64"&gt;project_type&lt;/TD&gt;&lt;TD width="64"&gt;permit_date&lt;/TD&gt;&lt;TD width="64"&gt;permit_status&lt;/TD&gt;&lt;TD width="64"&gt;construction_date&lt;/TD&gt;&lt;TD width="64"&gt;construction_status&lt;/TD&gt;&lt;TD width="64"&gt;completion_date&lt;/TD&gt;&lt;TD width="64"&gt;completion_status&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;project_1&lt;/TD&gt;&lt;TD&gt;NY&lt;/TD&gt;&lt;TD&gt;acme inc&lt;/TD&gt;&lt;TD&gt;rehab&lt;/TD&gt;&lt;TD&gt;2/1/2024&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;TD&gt;3/1/2024&lt;/TD&gt;&lt;TD&gt;projected&lt;/TD&gt;&lt;TD&gt;4/1/2024&lt;/TD&gt;&lt;TD&gt;projected&lt;/TD&gt;&lt;/TR&gt;&lt;TR&gt;&lt;TD height="20"&gt;project_2&lt;/TD&gt;&lt;TD&gt;LA&lt;/TD&gt;&lt;TD&gt;xyz inc&lt;/TD&gt;&lt;TD&gt;build to suit&lt;/TD&gt;&lt;TD&gt;1/1/2020&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;TD&gt;2/2/2023&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;TD&gt;3/4/2023&lt;/TD&gt;&lt;TD&gt;complete&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;So based on this dataset, I want to be able to see how I can reduce completion_date period. For example, if I use acme inc in LA, will that reduce my completion date and if so, by how much? or for example if I reduce my permit_date by 2 days, how big of an impact will it have on completion_date? Of course I only have to rely on historical data so all the status fields must be set to "complete".&lt;/P&gt;&lt;P&gt;How do I go about doing this? Also, is there a way to output the result in a way for stakeholder to analyze, using a visual tool like tableau or powerbi?&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 17 Feb 2024 15:05:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/using-automl-to-predict-completion-dates-of-a-project-management/m-p/60712#M3014</guid>
      <dc:creator>User100024</dc:creator>
      <dc:date>2024-02-17T15:05:30Z</dc:date>
    </item>
    <item>
      <title>Re: Using AutoML to predict completion dates of a project management dataset</title>
      <link>https://community.databricks.com/t5/machine-learning/using-automl-to-predict-completion-dates-of-a-project-management/m-p/61143#M3031</link>
      <description>&lt;P&gt;Hello Kaniz, Thank you so much for your reply!! I am trying to follow your steps, but however I cannot seem to select Completion Date as my target. It is only showing SYS_ID's (which are numeric in nature):&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="DBricks1.jpg" style="width: 999px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/6316iF191BD66AE62DC16/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999" role="button" title="DBricks1.jpg" alt="DBricks1.jpg" /&gt;&lt;/span&gt;What should I add to "Time Column for Training/Validation/Testing Split"? Is that where completion_date goes?&lt;/P&gt;&lt;P&gt;One more question if you don't mind. On the right side, it lists all the columns that I have available with an "Impute with" function:&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-center" image-alt="DBricks2.jpg" style="width: 999px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/6317iD3A320F69B34A502/image-size/large/is-moderation-mode/true?v=v2&amp;amp;px=999" role="button" title="DBricks2.jpg" alt="DBricks2.jpg" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;Is there where I select what columns I need in my dataset? I was not sure what Impute With means here.&lt;/P&gt;&lt;P&gt;I appreciate all your help. Thank you so much &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Feb 2024 15:50:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/using-automl-to-predict-completion-dates-of-a-project-management/m-p/61143#M3031</guid>
      <dc:creator>User100024</dc:creator>
      <dc:date>2024-02-19T15:50:01Z</dc:date>
    </item>
  </channel>
</rss>

