<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Feature Store Benchmarks in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/feature-store-benchmarks/m-p/128659#M4224</link>
    <description>&lt;P&gt;We’re also exploring this internally and found very limited public benchmarks comparing Databricks Feature Store to directly using Delta Tables. That said, the open-source project featurestore-benchmarks provides a framework to evaluate offline and online feature store performance across platforms, which could be adapted for Databricks:&lt;BR /&gt;&lt;A href="https://github.com/featurestoreorg/featurestore-benchmarks" target="_blank"&gt;https://github.com/featurestoreorg/featurestore-benchmarks&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Additionally, Hopsworks published some academic benchmarks comparing their feature store to Databricks, SageMaker, and Vertex AI. While results may not generalize fully, they provide useful performance reference points:&lt;BR /&gt;&lt;A href="https://www.hopsworks.ai/news/redefining-feature-stores-with-class-leading-performance" target="_blank"&gt;https://www.hopsworks.ai/news/redefining-feature-stores-with-class-leading-performance&lt;/A&gt;&lt;/P&gt;&lt;P&gt;From community discussions, Delta Tables may be sufficient for batch inference, but Feature Store provides added value for point-in-time joins, versioning, and online inference:&lt;BR /&gt;&lt;A href="https://www.reddit.com/r/mlops/comments/14fj1o7" target="_blank"&gt;https://www.reddit.com/r/mlops/comments/14fj1o7&lt;/A&gt;&lt;BR /&gt;&lt;A href="https://www.reddit.com/r/mlops/comments/17p0w7h" target="_blank"&gt;https://www.reddit.com/r/mlops/comments/17p0w7h&lt;/A&gt;&lt;/P&gt;&lt;P&gt;We’re considering setting up our own benchmarks using these tools. Would be great to hear if others have done similar testing on Databricks.&lt;/P&gt;</description>
    <pubDate>Sun, 17 Aug 2025 13:55:10 GMT</pubDate>
    <dc:creator>WiliamRosa</dc:creator>
    <dc:date>2025-08-17T13:55:10Z</dc:date>
    <item>
      <title>Feature Store Benchmarks</title>
      <link>https://community.databricks.com/t5/machine-learning/feature-store-benchmarks/m-p/124904#M4155</link>
      <description>&lt;P&gt;We are currently planning to create feature tables to serve machine learning models in our organization.&lt;/P&gt;&lt;P&gt;I am struggling to find interesting benchmarks on Databricks Feature Store performances vs using directly Delta Tables. It would also be interesting the different results for batch inference and usage of online feature store for real time inference.&lt;/P&gt;&lt;P&gt;Throwing the topic in the community to see if someone in Databricks or other clients have tried running some tests.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 11 Jul 2025 12:27:01 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/feature-store-benchmarks/m-p/124904#M4155</guid>
      <dc:creator>FedeRaimondi</dc:creator>
      <dc:date>2025-07-11T12:27:01Z</dc:date>
    </item>
    <item>
      <title>Re: Feature Store Benchmarks</title>
      <link>https://community.databricks.com/t5/machine-learning/feature-store-benchmarks/m-p/128659#M4224</link>
      <description>&lt;P&gt;We’re also exploring this internally and found very limited public benchmarks comparing Databricks Feature Store to directly using Delta Tables. That said, the open-source project featurestore-benchmarks provides a framework to evaluate offline and online feature store performance across platforms, which could be adapted for Databricks:&lt;BR /&gt;&lt;A href="https://github.com/featurestoreorg/featurestore-benchmarks" target="_blank"&gt;https://github.com/featurestoreorg/featurestore-benchmarks&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Additionally, Hopsworks published some academic benchmarks comparing their feature store to Databricks, SageMaker, and Vertex AI. While results may not generalize fully, they provide useful performance reference points:&lt;BR /&gt;&lt;A href="https://www.hopsworks.ai/news/redefining-feature-stores-with-class-leading-performance" target="_blank"&gt;https://www.hopsworks.ai/news/redefining-feature-stores-with-class-leading-performance&lt;/A&gt;&lt;/P&gt;&lt;P&gt;From community discussions, Delta Tables may be sufficient for batch inference, but Feature Store provides added value for point-in-time joins, versioning, and online inference:&lt;BR /&gt;&lt;A href="https://www.reddit.com/r/mlops/comments/14fj1o7" target="_blank"&gt;https://www.reddit.com/r/mlops/comments/14fj1o7&lt;/A&gt;&lt;BR /&gt;&lt;A href="https://www.reddit.com/r/mlops/comments/17p0w7h" target="_blank"&gt;https://www.reddit.com/r/mlops/comments/17p0w7h&lt;/A&gt;&lt;/P&gt;&lt;P&gt;We’re considering setting up our own benchmarks using these tools. Would be great to hear if others have done similar testing on Databricks.&lt;/P&gt;</description>
      <pubDate>Sun, 17 Aug 2025 13:55:10 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/feature-store-benchmarks/m-p/128659#M4224</guid>
      <dc:creator>WiliamRosa</dc:creator>
      <dc:date>2025-08-17T13:55:10Z</dc:date>
    </item>
  </channel>
</rss>

