cancel
Showing results for 
Search instead for 
Did you mean: 
Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.
cancel
Showing results for 
Search instead for 
Did you mean: 

Provisioned throughput is not enabled for this workspace

andr11v
New Contributor

 

 

I am seeing two related issues with Databricks-hosted Claude models in my workspace.

Workspace details:
- Cloud: AWS
- Region: us-west-2
- Tier: Premium PAYG
- Serverless compute: enabled
- Unity Catalog/metastore: configured
- Compliance security profile: disabled
- "Enforce data processing within workspace Geography for Designated Services": disabled

Issue 1: Claude pay-per-token endpoint is missing

According to the supported foundation models table, us-west-2 supports Databricks-hosted Claude models such as databricks-claude-haiku-4-5 and databricks-claude-opus-4-7 for Foundation Model APIs pay-per-
token.

However, when I call the chat completions endpoint with a Claude model, I get ENDPOINT_NOT_FOUND.

For example:
model="databricks-claude-haiku-4-5" -> ENDPOINT_NOT_FOUND

Pay-per-token inference works for databricks-gpt-oss-20b, so Model Serving itself appears to be working.

Issue 2: Provisioned Throughput is not enabled

Separately, when I try to create a model serving endpoint with provisioned throughput for a Databricks-hosted Claude model from system.ai in Unity Catalog, endpoint creation fails with:

"Provisioned throughput is not enabled for this workspace."

My questions:
1. Why are Databricks-hosted Claude pay-per-token endpoints missing from my workspace even though us-west-2 is listed as supported?
2. Is Provisioned Throughput for Claude models expected to be available in this workspace, or does it require a separate workspace/account-level entitlement?

 

andr11v_0-1779098540132.pngandr11v_1-1779098542405.pngandr11v_2-1779098546886.png

 

1 REPLY 1

Ashwin_DSA
Databricks Employee
Databricks Employee

Hi @andr11v,

What you are seeing appears to be two separate issues rather than a single root cause.

For the pay-per-token issue, the regional support table shows that Claude models are supported in AWS us-west-2, and the docs also say those models are available through pay-per-token endpoints in your workspace. In practice, though, regional support does not always mean every workspace in that region already has every Databricks-hosted Claude endpoint exposed. The fact that databricks-gpt-oss-20b works but databricks-claude-haiku-4-5 returns ENDPOINT_NOT_FOUND strongly suggests this is a workspace-level availability or enablement issue for the Claude endpoints, not a general Model Serving issue.

For Provisioned Throughput, that is a separate path from pay-per-token. The error message Provisioned throughput is not enabled for this workspace usually means the workspace or account does not have the required Databricks-side entitlement enabled, and that is not a self-service workspace toggle. Separately, for Databricks-hosted Claude models from system.ai, Claude is not currently supported for Provisioned Throughput, so that specific deployment path is not expected to work today.

From what I can gather, 

  • Missing Claude pay-per-token endpoints is likely a workspace-level enablement or availability issue for Databricks-hosted Claude endpoints.
  • Provisioned Throughput for Claude is not currently expected for Databricks-hosted Claude models from system.ai.
  • The PT entitlement error itself is likely a separate workspace/account-level gating issue that Databricks Support or the account team would need to verify.

I would recommend opening a support case and asking Databricks to verify or enable Databricks-hosted Claude pay-per-token availability for this workspace, and also confirm whether the workspace has Provisioned Throughput entitlement enabled.

If this answer resolves your question, could you mark it as “Accept as Solution”? That helps other users quickly find the correct fix.

Regards,
Ashwin | Delivery Solution Architect @ Databricks
Helping you build and scale the Data Intelligence Platform.
***Opinions are my own***