cancel
Showing results for 
Search instead for 
Did you mean: 
Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.
cancel
Showing results for 
Search instead for 
Did you mean: 

Vector search index creation is incredibly slow

epistoteles
New Contributor II

I am trying to create a vector search index for a Delta Table using Azure OpenAI embeddings (text-embedding-3-large). The table contains 5000 chunks with approx. 1000 tokens each. The OpenAI embeddings are generated through a Databricks model serving endpoint which forwards the embedding requests to our Azure deployment.

The latency of the index creation is incredibly high. To embed just 5000 chunks the initial sync takes over an hour. If I extrapolate this to 5 Mio. chunks, embedding them would take over a month.

The deployment in Azure can process 350K tokens/min and should not be the limiting factor.


I am currently creating the index like this:

 

vsc.create_delta_sync_index(
    endpoint_name=VECTOR_SEARCH_ENDPOINT_NAME,
    index_name=vs_index_fullname,
    source_table_name=source_table_fullname,
    pipeline_type="TRIGGERED",
    primary_key="id",
    embedding_source_column='content'
    embedding_model_endpoint_name='text-embedding-3-large'
  )
 
My assumption for the slow speed is that the index calculates the embeddings row by row, without async Azure API calls, and only starts with the next row once the previous embedding is created.

Which options do I have for speeding up the index creation?
 

 

2 REPLIES 2

amitkumarvish
New Contributor II

@epistoteles I am also facing the same issue while delta table sync with Index. Could you please share if you have any work around or used some different appoarch to decrease latency.

Thanks in advance.

@amitkumarvish My only solution so far has been to increase embedding model capacity on Azure ... still comparably slow.

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now