I built a RAG by cloning the databricks example (create a vector search index of databricks documentation and then a RAG).
When I try and call the RAG via the serving endpoint from a notebook, I get the following error:
dictionary update sequence element #0 has length 309; 2 is required
I can't figure out what I've done wrong. When I use the Serving interface in databricks and ask it a question, it works fine. Here's my code:
chat_model = ChatDatabricks(
endpoint="my_rag",
temperature=0.1,
max_tokens=256,
messages = [
("user", "What questions can I ask?"),
]
chat_model.invoke(messages)
Any tips or hints on what my problem is I would be grateful! Thanks.