โ12-17-2025 07:49 AM
Hello,
I am creating a vector search index and selected Compute embeddings for a delta table with 19M records. Delta table has only two columns: ID (selected as index) and Name (selected for embedding). Embedding model is databricks-gte-large-en.
Index initialization estimated time to complete is 15 days!
I'm looking for advice to speed up initialization.
Thank you,
Rodrigo Escamilla
โ12-17-2025 07:55 AM - edited โ12-17-2025 07:55 AM
Hi Rodrigo,
The issue that you are seeing is because these embeddings are computed on the Databricks-GTE-Large-EN endpoint, which is a Pay-Per-Token endpoint. These have very high latency when used. So if speed is a concern, we suggest you use the models present in system.ai schema and create a Provisioned Throughput endpoint with a larger number of Model Units to have higher throughput and faster computations of embeddings. Then use that endpiont for computing the embeddings.
โ12-17-2025 07:58 AM
Thank you very much for the fast response. I am new to databricks (and vector search). How do I go about "use the models present in system.ai schema and create a Provisioned Throughput endpoint with a larger number of Model Units"
Thank you,
Rodrigo Escamilla
โ12-17-2025 07:55 AM - edited โ12-17-2025 07:55 AM
Hi Rodrigo,
The issue that you are seeing is because these embeddings are computed on the Databricks-GTE-Large-EN endpoint, which is a Pay-Per-Token endpoint. These have very high latency when used. So if speed is a concern, we suggest you use the models present in system.ai schema and create a Provisioned Throughput endpoint with a larger number of Model Units to have higher throughput and faster computations of embeddings. Then use that endpiont for computing the embeddings.
โ12-17-2025 07:58 AM
Thank you very much for the fast response. I am new to databricks (and vector search). How do I go about "use the models present in system.ai schema and create a Provisioned Throughput endpoint with a larger number of Model Units"
Thank you,
Rodrigo Escamilla
โ12-17-2025 08:04 AM
@RodrigoE please follow this document - https://docs.databricks.com/aws/en/machine-learning/foundation-model-apis/deploy-prov-throughput-fou...
โ12-18-2025 06:33 AM
Your recommendation addressed the issue. Followed the instructions and index initialization took only 8 hours - thank you!