We are facing an internal 500 error accessing the vector search endpoint through streamlit application, if I refresh the application sometimes the error goes away, it has now started to become an usual occurrence. If I try to query the endpoint using the console, I am able to fetch data, but through the streamlit app hosted on databricks it throws the internal 500 error
The embedding model is openAI text-embedding-3-large.
We’re running the embedding process as a separate pipeline and storing the output. Then we’re using a truncated and normalized vector (768 dimensions) for the index creation.
The vector search process is then as follows:
Query string -> Send string to create vector embeddings -> truncate/normalise to get query vector -> send vector to similarity_search function
Here’s some code snippets of how it all works (it’s in a streamlit app):
The vector client authentication using service principal client id/secret

Running the similarity search

The embedding is handled through a separate function which is called and stored before sending to the similarity search. It runs on the serving endpoint within Databricks rather than directly to the underlying OpenAI resource. This part appears to be working ok.

And the Workspace client authentication, again using service principal client id/secret.

This is the stack trace of the internal error:
Exception: Response content b'{"error_code":"INTERNAL_ERROR","message":"Something went wrong, please try again later","details":[{"@type":"type.googleapis.com/google.rpc.RequestInfo","request_id":"xxxxxxx","serving_data":""}]}', status_code 500
