I'm currently experimenting with vector search using Databricks. Everything runs smoothly when I load the model deployed in Unity Catalog into a notebook session and ask questions using Python. However, when I attempt to serve it, I encounter a generic error.
The container builds successfully. However, upon running the code, I encounter an error. Debugging this serve endpoint is challenging because the machine is not available for an interactive session. I've observed that the error occurs when the code attempts to retrieve the index or endpoint, as shown below:
- vsc.get_endpoint(name=vector_search_endpoint_name) returns An error occurred while loading the model. Expecting value: line 1 column 1 (char 0).
- index = vsc.get_index(vector_search_endpoint_name, vs_index_fullname) returns An error occurred while loading the model. Expecting value: line 1 column 1 (char 0)
** my workspace is fully private
** i'm basing my code in this databricks example https://notebooks.databricks.com/demos/llm-rag-chatbot/index.html#