cancel
Showing results for 
Search instead for 
Did you mean: 
Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.
cancel
Showing results for 
Search instead for 
Did you mean: 

Azure Databricks: Where are foundation models hosted

Rjdudley
Honored Contributor

In the updated Supported models for Databricks Foundation Models APIs - Azure Databricks | Microsoft Learn document, it's stated that Claude 3.7 and 4 Opus are " hosted by Databricks Inc. in AWS".  Is that accurate for Azure Databricks, or is there an Azure endpoint also?  If AWS is true, that's a problem for us.

There is no discussion of where the Meta Llama models are hosted.  Are these also in AWS?

1 ACCEPTED SOLUTION

Accepted Solutions

qlmahga2
New Contributor III

Claude is not hosted on Azure. Only Amazon Bedrock and Google Cloud's Vertex AI. Databricks hosts it through AWS within their security perimeter; i.e., requests go through Amazon Bedrock (Fig. 1), as indicated by the model name in AWS format and the response ID starting with the characteristic Bedrock models prefix `msg_bdrk_`.



There is no discussion of where the Meta Llama models are hosted.  Are these also in AWS?


Based on the reference to Applicable model developer licenses and terms, I can assume they're hosted directly in Databricks. At least the request IDs have a different format from AWS (`chatcmpl_xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx`

View solution in original post

1 REPLY 1

qlmahga2
New Contributor III

Claude is not hosted on Azure. Only Amazon Bedrock and Google Cloud's Vertex AI. Databricks hosts it through AWS within their security perimeter; i.e., requests go through Amazon Bedrock (Fig. 1), as indicated by the model name in AWS format and the response ID starting with the characteristic Bedrock models prefix `msg_bdrk_`.



There is no discussion of where the Meta Llama models are hosted.  Are these also in AWS?


Based on the reference to Applicable model developer licenses and terms, I can assume they're hosted directly in Databricks. At least the request IDs have a different format from AWS (`chatcmpl_xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx`