The easiest and most reliable way to generate embeddings for millions of rows is to let Databricks Vector Search compute them automatically during synchronization from a Delta table.
Vector Search can generate embeddings for you, keep them updated when new records are inserted or updated, and handle batching, scaling, and retries behind the scenes.
You donโt have to manually loop over rows or call a model serving endpoint, Vector Search handles that for you.
https://learn.microsoft.com/en-us/azure/databricks/generative-ai/create-query-vector-search
Handles full backfill (5M+ rows) efficiently
Supports incremental updates automatically via Delta change data
No manual code or loops required
Fully managed and Unity Catalogโgoverned