<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Serving pay-per-token Chat LLM Model in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/serving-pay-per-token-chat-llm-model/m-p/86147#M8209</link>
    <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/85354"&gt;@Henrik&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;The documentation clearly states that it should be available in west europe, but i'm also unable to see DBRX ppt endpoint.&amp;nbsp;&lt;BR /&gt;I think that it would be best to raise an Azure Support ticket - they should either somehow enable it on your workspace or modify the documentation.&lt;/P&gt;</description>
    <pubDate>Thu, 29 Aug 2024 05:26:43 GMT</pubDate>
    <dc:creator>daniel_sahal</dc:creator>
    <dc:date>2024-08-29T05:26:43Z</dc:date>
    <item>
      <title>Serving pay-per-token Chat LLM Model</title>
      <link>https://community.databricks.com/t5/get-started-discussions/serving-pay-per-token-chat-llm-model/m-p/85381#M8208</link>
      <description>&lt;P&gt;We have build a chat solution on LLM RAG chat model, but we face an issue when we spin up a service endpoint to host the model.&lt;/P&gt;&lt;P&gt;According to the documentation, there should be sevral LLM models available as pay-per-token endpoints, for instance the DBRX Instruct.&lt;/P&gt;&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/machine-learning/foundation-models/supported-models" target="_blank" rel="noopener"&gt;https://learn.microsoft.com/en-us/azure/databricks/machine-learning/foundation-models/supported-models&lt;/A&gt;&lt;/P&gt;&lt;P&gt;However, in our workspace we only se two available pay-per-token endpoints (se attachment "serving endpoints.png").&lt;/P&gt;&lt;P&gt;When we "create a new service endpoint", it seems like we can only spin up "provisioned throughtput models, which are currently too expensive to run for our setup (se attachment "issue.png").&lt;/P&gt;&lt;P&gt;Our Databricks environment is in azure west europe.&lt;/P&gt;&lt;P&gt;Any suggestions?&lt;/P&gt;</description>
      <pubDate>Wed, 28 Aug 2024 06:17:18 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/serving-pay-per-token-chat-llm-model/m-p/85381#M8208</guid>
      <dc:creator>Henrik</dc:creator>
      <dc:date>2024-08-28T06:17:18Z</dc:date>
    </item>
    <item>
      <title>Re: Serving pay-per-token Chat LLM Model</title>
      <link>https://community.databricks.com/t5/get-started-discussions/serving-pay-per-token-chat-llm-model/m-p/86147#M8209</link>
      <description>&lt;P&gt;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/85354"&gt;@Henrik&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;The documentation clearly states that it should be available in west europe, but i'm also unable to see DBRX ppt endpoint.&amp;nbsp;&lt;BR /&gt;I think that it would be best to raise an Azure Support ticket - they should either somehow enable it on your workspace or modify the documentation.&lt;/P&gt;</description>
      <pubDate>Thu, 29 Aug 2024 05:26:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/serving-pay-per-token-chat-llm-model/m-p/86147#M8209</guid>
      <dc:creator>daniel_sahal</dc:creator>
      <dc:date>2024-08-29T05:26:43Z</dc:date>
    </item>
  </channel>
</rss>

