- 862 Views
- 1 replies
- 0 kudos
Model Serving Endpoints Scaling From Zero Forever
Hello,When deploying agents via a Serving Endpoint with "Scale To Zero" enabled, we are encountering that at seemingly random times, when we hit the endpoint to wake it up, it will begin to "Scale From Zero", but will never become fully available. Th...
- 862 Views
- 1 replies
- 0 kudos
- 0 kudos
Hi @jAAmes_bentley, There are multiple possible root causes, so let me walk through them so you can diagnose which applies to your situation. WHAT HAPPENS DURING SCALE-FROM-ZERO When an endpoint scales from zero, Databricks must: 1. Acquire comp...
- 0 kudos
- 1720 Views
- 5 replies
- 2 kudos
Resolved! Documentation on all ways to access agent serving endpoint from outside databricks
Struggling to find clear documentation which can help me with the subject. Need to know all the ways (production best practices) along with API method. As far as I know, using PAT is not a production best practice
- 1720 Views
- 5 replies
- 2 kudos
- 2 kudos
To access an Agent serving endpoint without a Personal Access Token (PAT), you must use OAuth 2.0 Machine-to-Machine (M2M) authentication. This is the industry-standard approach for production applications.1. OAuth M2M Authentication WorkflowInstead ...
- 2 kudos
- 825 Views
- 1 replies
- 1 kudos
Inference table not working for Gemma 3 12b
Hi, I have a problem regarding the inference table for Gemma 3 12b. If I create a serving endpoint for the model (from system.ai.gemma-3-12b-it) with an inference table, the inference table is being created, but it always stays empty. No matter how m...
- 825 Views
- 1 replies
- 1 kudos
- 1 kudos
The endpoint is working since you’re receiving responses, but the inference table isn’t capturing any records. This points to a logging/compatibility issue rather than request handling. Please confirm inference logging is enabled, verify permissions,...
- 1 kudos
-
agent
2 -
agent bricks
2 -
Agent Skills
1 -
agents
2 -
AI
2 -
AI Agents
10 -
ai gateway
2 -
Anthropic
1 -
API Documentation
1 -
App
3 -
Application
1 -
Asset Bundles
1 -
Authentication
1 -
Autologging
1 -
automoation
1 -
Aws databricks
2 -
ChatDatabricks
1 -
claude
5 -
Cluster
1 -
Credentials
1 -
crewai
1 -
cursor
1 -
Databricks App
3 -
Databricks Course
1 -
Databricks Delta Table
1 -
Databricks Mlflow
1 -
Databricks Notebooks
1 -
Databricks SQL
1 -
Databricks Table Usage
1 -
Databricks-connect
1 -
databricksapps
1 -
delta sync
1 -
Delta Tables
1 -
Developer Experience
1 -
DLT Pipeline
1 -
documentation
1 -
Ethical Data Governance
1 -
Foundation Model
4 -
gemini
1 -
gemma
1 -
GenAI
11 -
GenAI agent
2 -
GenAI and LLMs
4 -
GenAI Generation AI
1 -
GenAIGeneration AI
38 -
Generation AI
2 -
Generative AI
5 -
Genie
18 -
Genie - Notebook Access
2 -
GenieAPI
4 -
Google
1 -
GPT
1 -
healthcare
1 -
Index
1 -
inference table
1 -
Information Extraction
1 -
Langchain
4 -
LangGraph
1 -
Llama
1 -
Llama 3.3
1 -
LLM
2 -
machine-learning
1 -
mcp
2 -
MlFlow
4 -
Mlflow registry
1 -
MLModels
1 -
Model Serving
3 -
modelserving
1 -
mosic ai search
1 -
Multiagent
2 -
NPM error
1 -
OpenAI
1 -
Pandas udf
1 -
Playground
1 -
productivity
1 -
Pyspark
1 -
Pyspark Dataframes
1 -
RAG
3 -
ro
1 -
Scheduling
1 -
Server
1 -
serving endpoint
3 -
streaming
2 -
Tasks
1 -
Vector
1 -
vector index
1 -
Vector Search
2 -
Vector search index
6