<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Lakehouse Monitoring of Inference Table in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/lakehouse-monitoring-of-inference-table/m-p/108849#M3946</link>
    <description>&lt;P&gt;All,&lt;/P&gt;&lt;P&gt;I'm trying to setup a lakehouse monitoring process for the WineQuality model that is widely available. While setting up the Serving Endpoint, I enabled "Inference Table" option for which the inference table was created automatically. The columns in the winequality_payload table are as follows:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;client_request_id&lt;/LI&gt;&lt;LI&gt;databricks_request_id&lt;/LI&gt;&lt;LI&gt;date&lt;/LI&gt;&lt;LI&gt;timestamp_ms&lt;/LI&gt;&lt;LI&gt;status_code&lt;/LI&gt;&lt;LI&gt;execution_time_ms&lt;/LI&gt;&lt;LI&gt;request&lt;/LI&gt;&lt;LI&gt;response&lt;/LI&gt;&lt;LI&gt;sampling_fraction&lt;/LI&gt;&lt;LI&gt;request_metadata (MAP type)&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;The request_metadata contains "model_name","endpoint_name" &amp;amp; "model_version".&lt;/P&gt;&lt;P&gt;While configuring the monitor against the inference table, I selected request_metadata as the model_id column but it is erroring out with the below error:&lt;/P&gt;&lt;PRE&gt;The given `DataMonitorInfo` is invalid for the following reason(s): - For table `dev_tst_mls.winequality_uc.winequality_payload`: The specified `model_id_col` (`request_metadata`) must be a groupable column, but instead it is a MAP type. Please check that all referenced columns exist in the table(s) and have compatible column type.&lt;/PRE&gt;&lt;P&gt;As you can see, there is no other column that I can pick for the model_id column. Why am I getting this error and what alternatives do I have?&lt;/P&gt;&lt;P&gt;Now, based on Databricks Assistant, I created a view adding the model_name and model_version and used this view to create the monitor. Things go through but the dashboard shows no data at all despite my making several scoring attempts.&lt;/P&gt;&lt;P&gt;I tried troubleshooting and found that the datetime in the window column of the profile_metrics table is way off:&lt;/P&gt;&lt;PRE&gt;profile_metrics table --&amp;gt; Column window  
start: "+057064-08-22T07:05:00.000Z" 
end: "+057064-08-22T07:10:00.000Z"&lt;/PRE&gt;&lt;P&gt;As you can see, I tried it today (02/03/2025), but the date is light years ahead!!.&lt;/P&gt;&lt;P&gt;Based on the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/inference-tables#unity-catalog-inference-table-schema" target="_blank" rel="nofollow noopener noreferrer"&gt;link&lt;/A&gt;, the date column is "The UTC date on which the model serving request was received." and the timestamp_ms column is "The timestamp in epoch milliseconds on when the model serving request was received."&lt;/P&gt;&lt;P&gt;I checked the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;date column&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in the inference table and it correctly show&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;2025-02-03&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;but the timestamp_ms shows values like&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;1738620594270&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;which is&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;"2024-12-31 23:09:54.270"&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Am I doing something wrong? Has anyone experienced this before?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;grajee&lt;/P&gt;</description>
    <pubDate>Tue, 04 Feb 2025 20:58:07 GMT</pubDate>
    <dc:creator>grajee</dc:creator>
    <dc:date>2025-02-04T20:58:07Z</dc:date>
    <item>
      <title>Lakehouse Monitoring of Inference Table</title>
      <link>https://community.databricks.com/t5/machine-learning/lakehouse-monitoring-of-inference-table/m-p/108849#M3946</link>
      <description>&lt;P&gt;All,&lt;/P&gt;&lt;P&gt;I'm trying to setup a lakehouse monitoring process for the WineQuality model that is widely available. While setting up the Serving Endpoint, I enabled "Inference Table" option for which the inference table was created automatically. The columns in the winequality_payload table are as follows:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;client_request_id&lt;/LI&gt;&lt;LI&gt;databricks_request_id&lt;/LI&gt;&lt;LI&gt;date&lt;/LI&gt;&lt;LI&gt;timestamp_ms&lt;/LI&gt;&lt;LI&gt;status_code&lt;/LI&gt;&lt;LI&gt;execution_time_ms&lt;/LI&gt;&lt;LI&gt;request&lt;/LI&gt;&lt;LI&gt;response&lt;/LI&gt;&lt;LI&gt;sampling_fraction&lt;/LI&gt;&lt;LI&gt;request_metadata (MAP type)&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;The request_metadata contains "model_name","endpoint_name" &amp;amp; "model_version".&lt;/P&gt;&lt;P&gt;While configuring the monitor against the inference table, I selected request_metadata as the model_id column but it is erroring out with the below error:&lt;/P&gt;&lt;PRE&gt;The given `DataMonitorInfo` is invalid for the following reason(s): - For table `dev_tst_mls.winequality_uc.winequality_payload`: The specified `model_id_col` (`request_metadata`) must be a groupable column, but instead it is a MAP type. Please check that all referenced columns exist in the table(s) and have compatible column type.&lt;/PRE&gt;&lt;P&gt;As you can see, there is no other column that I can pick for the model_id column. Why am I getting this error and what alternatives do I have?&lt;/P&gt;&lt;P&gt;Now, based on Databricks Assistant, I created a view adding the model_name and model_version and used this view to create the monitor. Things go through but the dashboard shows no data at all despite my making several scoring attempts.&lt;/P&gt;&lt;P&gt;I tried troubleshooting and found that the datetime in the window column of the profile_metrics table is way off:&lt;/P&gt;&lt;PRE&gt;profile_metrics table --&amp;gt; Column window  
start: "+057064-08-22T07:05:00.000Z" 
end: "+057064-08-22T07:10:00.000Z"&lt;/PRE&gt;&lt;P&gt;As you can see, I tried it today (02/03/2025), but the date is light years ahead!!.&lt;/P&gt;&lt;P&gt;Based on the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/inference-tables#unity-catalog-inference-table-schema" target="_blank" rel="nofollow noopener noreferrer"&gt;link&lt;/A&gt;, the date column is "The UTC date on which the model serving request was received." and the timestamp_ms column is "The timestamp in epoch milliseconds on when the model serving request was received."&lt;/P&gt;&lt;P&gt;I checked the&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;date column&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;in the inference table and it correctly show&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;2025-02-03&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;but the timestamp_ms shows values like&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;1738620594270&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;which is&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;"2024-12-31 23:09:54.270"&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Am I doing something wrong? Has anyone experienced this before?&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;grajee&lt;/P&gt;</description>
      <pubDate>Tue, 04 Feb 2025 20:58:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/lakehouse-monitoring-of-inference-table/m-p/108849#M3946</guid>
      <dc:creator>grajee</dc:creator>
      <dc:date>2025-02-04T20:58:07Z</dc:date>
    </item>
    <item>
      <title>Re: Lakehouse Monitoring of Inference Table</title>
      <link>https://community.databricks.com/t5/machine-learning/lakehouse-monitoring-of-inference-table/m-p/137452#M4403</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/147808"&gt;@grajee&lt;/a&gt;&amp;nbsp;,&amp;nbsp; I can see you're dealing with two separate issues here. Let me address both:&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Issue 1: The model_id column (request_metadata MAP type)&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;You're correct that request_metadata is a MAP type and can't be directly used as the model_id column in Lakehouse Monitoring. Your approach of creating a view that extracts model_name and model_version from the request_metadata MAP is the right solution. You can create a view like this:&lt;/P&gt;
&lt;P&gt;```sql&lt;BR /&gt;CREATE OR REPLACE VIEW your_catalog.your_schema.winequality_inference_view AS&lt;BR /&gt;SELECT &lt;BR /&gt;*,&lt;BR /&gt;request_metadata['model_name'] AS model_name,&lt;BR /&gt;request_metadata['model_version'] AS model_version,&lt;BR /&gt;CONCAT(request_metadata['model_name'], '_', request_metadata['model_version']) AS model_id&lt;BR /&gt;FROM your_catalog.your_schema.winequality_payload&lt;BR /&gt;```&lt;/P&gt;
&lt;P&gt;Then create the monitor on this view using the model_id column.&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Issue 2: The timestamp and profile_metrics date problem&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;I noticed something important in your post. The timestamp_ms value 1738620594270 actually converts to **2025-02-03 22:09:54** (not 2024-12-31 as you mentioned). So your timestamp_ms column is correct and matches your date column.&lt;/P&gt;
&lt;P&gt;The real issue is that the profile_metrics table is showing dates in year 57064, which suggests Lakehouse Monitoring may be misinterpreting your timestamp_ms column. This could happen if:&lt;/P&gt;
&lt;P&gt;1. The monitoring is treating timestamp_ms as seconds instead of milliseconds&lt;BR /&gt;2. There's a unit mismatch in how the timestamp column is being processed&lt;BR /&gt;3. The timestamp column being used for windowing isn't correctly specified&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;Troubleshooting steps:&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;1. Verify your monitor configuration explicitly specifies timestamp_ms as the timestamp column and that it's correctly formatted as epoch milliseconds (LONG type)&lt;BR /&gt;2. Check if there are any timezone configuration issues in your monitor setup&lt;BR /&gt;3. Try recreating the monitor and ensure you're using the InferenceLog profile type (not TimeSeries) for inference tables&lt;BR /&gt;4. Confirm your inference table schema matches the expected format with timestamp_ms as a LONG type&lt;/P&gt;
&lt;P&gt;You might also want to run a query directly on your inference table to validate the timestamp_ms values are reasonable:&lt;/P&gt;
&lt;P&gt;```sql&lt;BR /&gt;SELECT &lt;BR /&gt;date,&lt;BR /&gt;timestamp_ms,&lt;BR /&gt;FROM_UNIXTIME(timestamp_ms/1000) as converted_timestamp&lt;BR /&gt;FROM your_catalog.your_schema.winequality_payload&lt;BR /&gt;LIMIT 10&lt;BR /&gt;```&lt;/P&gt;
&lt;P&gt;Hope this helps, Louis.&lt;/P&gt;</description>
      <pubDate>Mon, 03 Nov 2025 20:51:47 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/lakehouse-monitoring-of-inference-table/m-p/137452#M4403</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2025-11-03T20:51:47Z</dc:date>
    </item>
  </channel>
</rss>

