<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: AutoML Forecast fails when using feature_store_lookups with timestamp key in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/automl-forecast-fails-when-using-feature-store-lookups-with/m-p/134599#M4348</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/170858"&gt;@ostae911&lt;/a&gt;&amp;nbsp;, are you still facing this issue?&lt;/P&gt;
&lt;P&gt;It looks like your usage of the timestamp column is correct. It can be used as a primary key on the time series feature table. Is it possible that there are other duplicate columns between the training dataframe and the `example_corona_features` table?&lt;/P&gt;
&lt;P&gt;You could try &lt;A href="https://docs.databricks.com/aws/en/machine-learning/feature-store/train-models-with-feature-store#create-a-training-dataset" target="_blank" rel="noopener"&gt;creating the training set&lt;/A&gt; first and then pass it to `automl.forecast` without the list of feature lookups. You also may be able to &lt;A href="https://api-docs.databricks.com/python/feature-store/latest/feature_store.feature_lookup.html#:~:text=rename_outputs%20%E2%80%93%20If%20provided%2C%20renames%20features%20in%20the%20TrainingSet%20returned%20by%20of%20FeatureStoreClient.create_training_set." target="_blank" rel="noopener"&gt;rename features&lt;/A&gt; if there are indeed duplicate column names.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I hope that helps, but feel free to ask any&amp;nbsp;follow up questions. If this reply does resolve the issue, please click the "Accept Solution" button to let us know!&lt;/P&gt;
&lt;P&gt;-James&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 11 Oct 2025 00:05:22 GMT</pubDate>
    <dc:creator>jamesl</dc:creator>
    <dc:date>2025-10-11T00:05:22Z</dc:date>
    <item>
      <title>AutoML Forecast fails when using feature_store_lookups with timestamp key</title>
      <link>https://community.databricks.com/t5/machine-learning/automl-forecast-fails-when-using-feature-store-lookups-with/m-p/122295#M4122</link>
      <description>&lt;P&gt;We are running &lt;STRONG&gt;AutoML Forecast&lt;/STRONG&gt; on Databricks Runtime &lt;STRONG&gt;15.4 ML LTS&lt;/STRONG&gt; and &lt;STRONG&gt;16.4 ML LTS&lt;/STRONG&gt;, using a time series dataset with temporal covariates from the Feature Store (e.g. a corona_dummy feature). We use feature_store_lookups with lookup_key and timestamp_lookup_key.&lt;/P&gt;&lt;P&gt;Our feature table is defined like this:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;fs.create_table(
  name="...features.example_corona_features",
  primary_keys=["Monat", "Produkt", "Vertriebstyp_Art"],
  df=...,
  timestamp_keys="Monat",
  ...
)&lt;/LI-CODE&gt;&lt;P&gt;And we call AutoML with:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;feature_store_lookups=[{
  "table_name": "...features.example_corona_features",
  "lookup_key": ["Produkt", "Vertriebstyp_Art"],
  "timestamp_lookup_key": "Monat"
}]&lt;/LI-CODE&gt;&lt;H3&gt;&lt;span class="lia-unicode-emoji" title=":white_heavy_check_mark:"&gt;✅&lt;/span&gt; Expected:&lt;/H3&gt;&lt;P&gt;AutoML performs a temporal join between the dataset and the feature table (via timestamp and keys) and proceeds with training including the covariate corona_dummy.&lt;/P&gt;&lt;HR /&gt;&lt;H3&gt;&lt;span class="lia-unicode-emoji" title=":cross_mark:"&gt;❌&lt;/span&gt; Actual:&lt;/H3&gt;&lt;P&gt;AutoML proceeds with the run, but fails &lt;STRONG&gt;during internal applyInPandas() or .toPandas() conversion&lt;/STRONG&gt;, throwing:&lt;/P&gt;&lt;LI-CODE lang="python"&gt;ValueError: Length mismatch: Expected axis has 8 elements, new values have 11 elements&lt;/LI-CODE&gt;&lt;P&gt;This crash occurs &lt;EM&gt;after&lt;/EM&gt; joining features and loading the training set — i.e., during execution of AutoML’s internal training loop.&lt;/P&gt;&lt;H3&gt;&lt;span class="lia-unicode-emoji" title=":magnifying_glass_tilted_left:"&gt;🔍&lt;/span&gt; Observations:&lt;/H3&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;When we &lt;STRONG&gt;remove the feature_store_lookups&lt;/STRONG&gt;, AutoML completes without errors.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;The issue appears only when the &lt;STRONG&gt;timestamp column (Monat) is both&lt;/STRONG&gt;:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;present in primary_keys, and&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;passed again as timestamp_lookup_key&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Can you confirm if this is a known issue, and what the &lt;STRONG&gt;correct contract&lt;/STRONG&gt; is for using feature_store_lookups with timestamp_lookup_key in AutoML?&lt;/P&gt;</description>
      <pubDate>Fri, 20 Jun 2025 00:44:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/automl-forecast-fails-when-using-feature-store-lookups-with/m-p/122295#M4122</guid>
      <dc:creator>ostae911</dc:creator>
      <dc:date>2025-06-20T00:44:11Z</dc:date>
    </item>
    <item>
      <title>Re: AutoML Forecast fails when using feature_store_lookups with timestamp key</title>
      <link>https://community.databricks.com/t5/machine-learning/automl-forecast-fails-when-using-feature-store-lookups-with/m-p/134599#M4348</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/170858"&gt;@ostae911&lt;/a&gt;&amp;nbsp;, are you still facing this issue?&lt;/P&gt;
&lt;P&gt;It looks like your usage of the timestamp column is correct. It can be used as a primary key on the time series feature table. Is it possible that there are other duplicate columns between the training dataframe and the `example_corona_features` table?&lt;/P&gt;
&lt;P&gt;You could try &lt;A href="https://docs.databricks.com/aws/en/machine-learning/feature-store/train-models-with-feature-store#create-a-training-dataset" target="_blank" rel="noopener"&gt;creating the training set&lt;/A&gt; first and then pass it to `automl.forecast` without the list of feature lookups. You also may be able to &lt;A href="https://api-docs.databricks.com/python/feature-store/latest/feature_store.feature_lookup.html#:~:text=rename_outputs%20%E2%80%93%20If%20provided%2C%20renames%20features%20in%20the%20TrainingSet%20returned%20by%20of%20FeatureStoreClient.create_training_set." target="_blank" rel="noopener"&gt;rename features&lt;/A&gt; if there are indeed duplicate column names.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I hope that helps, but feel free to ask any&amp;nbsp;follow up questions. If this reply does resolve the issue, please click the "Accept Solution" button to let us know!&lt;/P&gt;
&lt;P&gt;-James&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 11 Oct 2025 00:05:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/automl-forecast-fails-when-using-feature-store-lookups-with/m-p/134599#M4348</guid>
      <dc:creator>jamesl</dc:creator>
      <dc:date>2025-10-11T00:05:22Z</dc:date>
    </item>
  </channel>
</rss>

