cancel
Showing results for 
Search instead for 
Did you mean: 
Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.
cancel
Showing results for 
Search instead for 
Did you mean: 

Provisioned throughput is not enabled for this workspace

andr11v
Visitor

 

 

I am seeing two related issues with Databricks-hosted Claude models in my workspace.

Workspace details:
- Cloud: AWS
- Region: us-west-2
- Tier: Premium PAYG
- Serverless compute: enabled
- Unity Catalog/metastore: configured
- Compliance security profile: disabled
- "Enforce data processing within workspace Geography for Designated Services": disabled

Issue 1: Claude pay-per-token endpoint is missing

According to the supported foundation models table, us-west-2 supports Databricks-hosted Claude models such as databricks-claude-haiku-4-5 and databricks-claude-opus-4-7 for Foundation Model APIs pay-per-
token.

However, when I call the chat completions endpoint with a Claude model, I get ENDPOINT_NOT_FOUND.

For example:
model="databricks-claude-haiku-4-5" -> ENDPOINT_NOT_FOUND

Pay-per-token inference works for databricks-gpt-oss-20b, so Model Serving itself appears to be working.

Issue 2: Provisioned Throughput is not enabled

Separately, when I try to create a model serving endpoint with provisioned throughput for a Databricks-hosted Claude model from system.ai in Unity Catalog, endpoint creation fails with:

"Provisioned throughput is not enabled for this workspace."

My questions:
1. Why are Databricks-hosted Claude pay-per-token endpoints missing from my workspace even though us-west-2 is listed as supported?
2. Is Provisioned Throughput for Claude models expected to be available in this workspace, or does it require a separate workspace/account-level entitlement?

 

andr11v_0-1779098540132.pngandr11v_1-1779098542405.pngandr11v_2-1779098546886.png

 

0 REPLIES 0