jnkthms
New Contributor III

The issue for us was most likely that we used CPU compute for the deployed embedding model, switching to GPU (small) solved the issue.