Error Calling Llama Guard Model from Databricks Marketplace after deploying the model
01-01-2025 11:30 PM - edited 01-01-2025 11:32 PM
I am facing issues calling the Llama Guard model after deploying it to Model Serving on Databricks. I requested instant access to the model from Databricks Marketplace and then deployed it to a Model Serving endpoint.
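For reference, this is how I confirm the endpoint is ready before querying it (a minimal sketch; "llama-guard" is the endpoint name I used when deploying, matching the URL in the error below):

import mlflow.deployments

client = mlflow.deployments.get_deploy_client("databricks")

# Look up the serving endpoint created from the Marketplace model
# ("llama-guard" is the name given to the endpoint at deploy time)
endpoint = client.get_endpoint(endpoint="llama-guard")
print(endpoint)  # shows state, served entities, etc.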
I am following the code examples here: https://github.com/databricks/databricks-ml-examples/blob/master/llm-models/safeguard/llamaguard/Lla...
This is my function:
import mlflow
import mlflow.deployments
import re

def query_llamaguard(chat, unsafe_categories=DEFAULT_UNSAFE_CATEGORY):
    """
    Queries the LlamaGuard model to check if the chat contains unsafe content based on predefined categories.
    """
    try:
        client = mlflow.deployments.get_deploy_client("databricks")

        # Constructing the payload for the LlamaGuard API
        query_payload = [
            f"""[INST] Task: Check for unsafe content in 'Agent' messages in conversations according to our safety policy.
<BEGIN UNSAFE CONTENT CATEGORIES>
{unsafe_categories}
<END UNSAFE CONTENT CATEGORIES>
<BEGIN CONVERSATION>
{chat}
<END CONVERSATION>
Provide your safety assessment for Agent in the above conversation:
- First line must read 'safe' or 'unsafe'.
- If unsafe, a second line must include a comma-separated list of violated categories. [/INST]"""
        ]

        response = client.predict(
            endpoint=LLAMAGUARD_ENDPOINT_NAME,
            inputs={
                "dataframe_split": {
                    "index": [0],
                    "columns": ["prompt"],
                    "data": [query_payload]
                }
            }
        )

        # Extract the desired information from the response object
        prediction = response.predictions[0]["candidates"][0]["text"].strip()
        is_safe = None if len(prediction.split("\n")) == 1 else prediction.split("\n")[1].strip()

        return prediction.split("\n")[0].lower() == 'safe', is_safe
    except Exception as e:
        raise Exception(f"Error in querying LlamaGuard model: {str(e)}")
Thereafter I call the Llama Guard model:
safe_user_chat = [
    {
        "role": "user",
        "content": "I want to love."
    }
]

query_llamaguard(safe_user_chat)
This is the error I get:
Error in querying LlamaGuard model: 400 Client Error: Bad Request for url: https://<workspace>/serving-endpoints/llama-guard/invocations. Response text: Bad request: json: unknown field "dataframe_split"
01-02-2025 04:53 AM
The error you are encountering, "400 Client Error: Bad Request for url: https://westus.azuredatabricks.net/serving-endpoints/llama-guard/invocations. Response text: Bad request: json: unknown field 'dataframe_split'", indicates that the payload structure you are using is not recognized by the Llama Guard Model Serving endpoint.
Based on the context provided, the correct payload for querying the Llama Guard endpoint does not include the "dataframe_split" field; instead, you should pass a simpler structure with a "prompt" field in the inputs.
import mlflow
import mlflow.deployments
import re

def query_llamaguard(chat, unsafe_categories=DEFAULT_UNSAFE_CATEGORY):
    """
    Queries the LlamaGuard model to check if the chat contains unsafe content based on predefined categories.
    """
    try:
        client = mlflow.deployments.get_deploy_client("databricks")

        # Constructing the payload for the LlamaGuard API
        query_payload = [
            f"""[INST] Task: Check for unsafe content in 'Agent' messages in conversations according to our safety policy.
<BEGIN UNSAFE CONTENT CATEGORIES>
{unsafe_categories}
<END UNSAFE CONTENT CATEGORIES>
<BEGIN CONVERSATION>
{chat}
<END CONVERSATION>
Provide your safety assessment for Agent in the above conversation:
- First line must read 'safe' or 'unsafe'.
- If unsafe, a second line must include a comma-separated list of violated categories. [/INST]"""
        ]

        response = client.predict(
            endpoint=LLAMAGUARD_ENDPOINT_NAME,
            inputs={"prompt": query_payload}
        )

        # Extract the desired information from the response object
        prediction = response.predictions[0]["candidates"][0]["text"].strip()
        is_safe = None if len(prediction.split("\n")) == 1 else prediction.split("\n")[1].strip()

        return prediction.split("\n")[0].lower() == 'safe', is_safe
    except Exception as e:
        raise Exception(f"Error in querying LlamaGuard model: {str(e)}")
# Example usage
safe_user_chat = [
    {
        "role": "user",
        "content": "I want to love."
    }
]

query_llamaguard(safe_user_chat)
In this updated function, the inputs payload is simplified to a single "prompt" field containing the constructed query.
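If you want to sanity-check the payload outside of MLflow, an equivalent request can be sent directly to the endpoint's REST API. This is only a sketch: the workspace URL, the DATABRICKS_TOKEN environment variable, and the shortened prompt are placeholders for your own values.

import os
import requests

# Placeholder workspace URL and token; replace with your own values
workspace_url = "https://<workspace>"
token = os.environ["DATABRICKS_TOKEN"]

response = requests.post(
    f"{workspace_url}/serving-endpoints/llama-guard/invocations",
    headers={"Authorization": f"Bearer {token}"},
    # Completions-style payload: a "prompt" field instead of "dataframe_split"
    json={"prompt": ["[INST] Task: Check for unsafe content ... [/INST]"]},
)
print(response.json())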

