<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to Reduce Log Latency for AI Gateway-Enabled Inference Tables in Model Serving? in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/how-to-reduce-log-latency-for-ai-gateway-enabled-inference/m-p/129173#M4243</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/112959"&gt;@ecram&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Thank you for contacting Databricks community.&lt;/P&gt;
&lt;P&gt;As per the doc below, you'll see the latency for 1 hour for log delivery in the inference table.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/aws/en/ai-gateway/inference-tables#:~:text=You%20can%20expect%20logs%20to%20be%20available%20within%201%20hour%20of%20a%20request.%20Reach%20out%20to%20your%20Databricks%20account%20team%20for%20more%20information" target="_blank"&gt;https://docs.databricks.com/aws/en/ai-gateway/inference-tables#:~:text=You%20can%20expect%20logs%20to%20be%20available%20within%201%20hour%20of%20a%20request.%20Reach%20out%20to%20your%20Databricks%20account%20team%20for%20more%20information&lt;/A&gt;.&lt;/P&gt;</description>
    <pubDate>Thu, 21 Aug 2025 19:32:25 GMT</pubDate>
    <dc:creator>Kumaran</dc:creator>
    <dc:date>2025-08-21T19:32:25Z</dc:date>
    <item>
      <title>How to Reduce Log Latency for AI Gateway-Enabled Inference Tables in Model Serving?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-to-reduce-log-latency-for-ai-gateway-enabled-inference/m-p/123365#M4138</link>
      <description>&lt;P&gt;Hi everyone,&lt;/P&gt;&lt;P&gt;I've recently deployed a custom model using Databricks Model Serving with &lt;STRONG&gt;AI Gateway-enabled inference tables&lt;/STRONG&gt;. The model is built with:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;Python 3.11.11&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;LightGBM 4.5.0&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;MLflow 2.13.1&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;I’ve noticed that the &lt;STRONG&gt;inference logs can take up to 1 hour to appear&lt;/STRONG&gt;, as mentioned in the &lt;A class="" href="https://learn.microsoft.com/en-us/azure/databricks/ai-gateway/inference-tables" target="_new" rel="noopener"&gt;Databricks documentation&lt;/A&gt;. This is quite different from a previous setup (Python 3.10.12, LightGBM 3.3.5, MLflow 2.5.0) where logs appeared in ~5 minutes using legacy inference tables.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Question:&lt;/STRONG&gt;&lt;BR /&gt;Is there any way to reduce the latency of inference logs when using AI Gateway-enabled inference tables?&lt;/P&gt;&lt;P&gt;I understand the system is now based on batch delivery, but I’d like to know if:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;P&gt;There are configuration options to speed this up?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;There’s any official roadmap to reduce this latency?&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Any best practices to implement near real-time logging (e.g., logging predictions manually into a Delta table within the model wrapper)?&lt;/P&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Thanks in advance for your help!&lt;BR /&gt;Marcelo&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jun 2025 22:34:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-to-reduce-log-latency-for-ai-gateway-enabled-inference/m-p/123365#M4138</guid>
      <dc:creator>ecram</dc:creator>
      <dc:date>2025-06-30T22:34:09Z</dc:date>
    </item>
    <item>
      <title>Re: How to Reduce Log Latency for AI Gateway-Enabled Inference Tables in Model Serving?</title>
      <link>https://community.databricks.com/t5/machine-learning/how-to-reduce-log-latency-for-ai-gateway-enabled-inference/m-p/129173#M4243</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/112959"&gt;@ecram&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Thank you for contacting Databricks community.&lt;/P&gt;
&lt;P&gt;As per the doc below, you'll see the latency for 1 hour for log delivery in the inference table.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.databricks.com/aws/en/ai-gateway/inference-tables#:~:text=You%20can%20expect%20logs%20to%20be%20available%20within%201%20hour%20of%20a%20request.%20Reach%20out%20to%20your%20Databricks%20account%20team%20for%20more%20information" target="_blank"&gt;https://docs.databricks.com/aws/en/ai-gateway/inference-tables#:~:text=You%20can%20expect%20logs%20to%20be%20available%20within%201%20hour%20of%20a%20request.%20Reach%20out%20to%20your%20Databricks%20account%20team%20for%20more%20information&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Aug 2025 19:32:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/how-to-reduce-log-latency-for-ai-gateway-enabled-inference/m-p/129173#M4243</guid>
      <dc:creator>Kumaran</dc:creator>
      <dc:date>2025-08-21T19:32:25Z</dc:date>
    </item>
  </channel>
</rss>

