cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Serving pay-per-token Chat LLM Model

Henrik
New Contributor III

We have build a chat solution on LLM RAG chat model, but we face an issue when we spin up a service endpoint to host the model.

According to the documentation, there should be sevral LLM models available as pay-per-token endpoints, for instance the DBRX Instruct.

https://learn.microsoft.com/en-us/azure/databricks/machine-learning/foundation-models/supported-mode...

However, in our workspace we only se two available pay-per-token endpoints (se attachment "serving endpoints.png").

When we "create a new service endpoint", it seems like we can only spin up "provisioned throughtput models, which are currently too expensive to run for our setup (se attachment "issue.png").

Our Databricks environment is in azure west europe.

Any suggestions?

1 ACCEPTED SOLUTION

Accepted Solutions

daniel_sahal
Esteemed Contributor

@Henrik 
The documentation clearly states that it should be available in west europe, but i'm also unable to see DBRX ppt endpoint. 
I think that it would be best to raise an Azure Support ticket - they should either somehow enable it on your workspace or modify the documentation.

View solution in original post

1 REPLY 1

daniel_sahal
Esteemed Contributor

@Henrik 
The documentation clearly states that it should be available in west europe, but i'm also unable to see DBRX ppt endpoint. 
I think that it would be best to raise an Azure Support ticket - they should either somehow enable it on your workspace or modify the documentation.

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now