<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Dbdemo: LLM Chatbot With Retrieval Augmented Generation (RAG) in Generative AI</title>
    <link>https://community.databricks.com/t5/generative-ai/dbdemo-llm-chatbot-with-retrieval-augmented-generation-rag/m-p/64661#M50</link>
    <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;, thanks a lot for the information you provided. Anyhow, I have managed a workaround, by pre-computing the embeddings for each chunk.&amp;nbsp; I have created an embedding column on the source table and used this column as input to the&amp;nbsp;&lt;FONT face="courier new,courier"&gt;create_delta_sync_index&lt;/FONT&gt;&lt;SPAN&gt; method.&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;That is: substitute parameter&amp;nbsp;&amp;nbsp;&lt;FONT face="courier new,courier"&gt;embedding_source_column='content'&lt;/FONT&gt; for:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier"&gt;&lt;SPAN&gt;embedding_dimension&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;1024&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier"&gt;&lt;SPAN&gt;embedding_vector_column&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"embedding"&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;and the syncronization of the index with the source table worked just fine.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
    <pubDate>Tue, 26 Mar 2024 13:33:22 GMT</pubDate>
    <dc:creator>cmunteanu</dc:creator>
    <dc:date>2024-03-26T13:33:22Z</dc:date>
    <item>
      <title>Dbdemo: LLM Chatbot With Retrieval Augmented Generation (RAG)</title>
      <link>https://community.databricks.com/t5/generative-ai/dbdemo-llm-chatbot-with-retrieval-augmented-generation-rag/m-p/64393#M48</link>
      <description>&lt;P&gt;Hello All,&lt;/P&gt;&lt;P&gt;I am trying to follow the dbdemo called '&lt;SPAN&gt;llm-rag-chatbot&lt;/SPAN&gt;' available at the following &lt;A href="https://notebooks.databricks.com/demos/llm-rag-chatbot/index.html#" target="_self"&gt;link&lt;/A&gt;. The setup works Ok, and I have imported from the Databricks Marketplace an embedding model that is:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;bge_large_en_v1_5&lt;/LI&gt;&lt;/UL&gt;&lt;P data-unlink="true"&gt;Running the notebook called:&amp;nbsp;&lt;STRONG&gt;01-Data-Preparation-and-Inde&lt;/STRONG&gt;x&amp;nbsp;&amp;nbsp;I am stuck with an error when trying to create a&amp;nbsp;Vector Search Index with Managed Embeddings and the BGE model that I have setup as a serving endpoint, previously. More specifically, the Vector Search endpoint provisions succesfully, but when executing the index creation and syncronization method:&amp;nbsp;&lt;SPAN&gt;&lt;FONT face="courier new,courier"&gt;create_delta_sync_index&lt;/FONT&gt;,&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;I get the following error:&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;----&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;FONT size="2"&gt;Exception: Response content b'{"error_code":"INVALID_PARAMETER_VALUE","message":"Model serving endpoint bge-large-en configured with improper input: {\\"error_code\\": \\"BAD_REQUEST\\", \\"message\\": \\"Failed to enforce schema of data \' 0\\\\n0 Welcome to databricks vector search\' with schema \'[\'input\': string (required)]\'. Error: Model is missing inputs [\'input\']. Note that there were extra inputs: [0]\\"}"}', status_code 400&lt;/FONT&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;----&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;My code that calls this method is:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;if&lt;/SPAN&gt; &lt;SPAN&gt;not&lt;/SPAN&gt; &lt;SPAN&gt;index_exists&lt;/SPAN&gt;&lt;SPAN&gt;(vsc, VECTOR_SEARCH_ENDPOINT_NAME, vs_index_fullname):&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;print&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"Creating index &lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;vs_index_fullname&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt; on endpoint &lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;VECTOR_SEARCH_ENDPOINT_NAME&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;..."&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; vsc.&lt;/SPAN&gt;&lt;SPAN&gt;create_delta_sync_index&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;endpoint_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;VECTOR_SEARCH_ENDPOINT_NAME,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;index_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;vs_index_fullname,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;source_table_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;source_table_fullname,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;pipeline_type&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"TRIGGERED"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;primary_key&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"id"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;embedding_source_column&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;'content'&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;#The column containing our text&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;embedding_model_endpoint_name='bge-large-en'&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; #&lt;/SPAN&gt;&lt;SPAN&gt;embedding_model_endpoint_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;'gte_large'&lt;/SPAN&gt;&amp;nbsp;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;SPAN&gt;&amp;nbsp; )&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;I have tried changing to a different embedding model (GTE_LARGE), but still getting the above error.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;I guess there is a incompatibilty between the input schema of the embedding model and the schema expected by the vector search endpoint.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Has any of you encountered this problem?&amp;nbsp; I would appreciate if you could give me a hint on how to solve it using an embedded model from Databricks Marketplace.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;Thanks !&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 22 Mar 2024 11:35:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/dbdemo-llm-chatbot-with-retrieval-augmented-generation-rag/m-p/64393#M48</guid>
      <dc:creator>cmunteanu</dc:creator>
      <dc:date>2024-03-22T11:35:03Z</dc:date>
    </item>
    <item>
      <title>Re: Dbdemo: LLM Chatbot With Retrieval Augmented Generation (RAG)</title>
      <link>https://community.databricks.com/t5/generative-ai/dbdemo-llm-chatbot-with-retrieval-augmented-generation-rag/m-p/64661#M50</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;, thanks a lot for the information you provided. Anyhow, I have managed a workaround, by pre-computing the embeddings for each chunk.&amp;nbsp; I have created an embedding column on the source table and used this column as input to the&amp;nbsp;&lt;FONT face="courier new,courier"&gt;create_delta_sync_index&lt;/FONT&gt;&lt;SPAN&gt; method.&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;That is: substitute parameter&amp;nbsp;&amp;nbsp;&lt;FONT face="courier new,courier"&gt;embedding_source_column='content'&lt;/FONT&gt; for:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier"&gt;&lt;SPAN&gt;embedding_dimension&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;1024&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT face="courier new,courier"&gt;&lt;SPAN&gt;embedding_vector_column&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"embedding"&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;and the syncronization of the index with the source table worked just fine.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Tue, 26 Mar 2024 13:33:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/dbdemo-llm-chatbot-with-retrieval-augmented-generation-rag/m-p/64661#M50</guid>
      <dc:creator>cmunteanu</dc:creator>
      <dc:date>2024-03-26T13:33:22Z</dc:date>
    </item>
    <item>
      <title>Re: Dbdemo: LLM Chatbot With Retrieval Augmented Generation (RAG)</title>
      <link>https://community.databricks.com/t5/generative-ai/dbdemo-llm-chatbot-with-retrieval-augmented-generation-rag/m-p/70567#M120</link>
      <description>&lt;DIV&gt;Hi&amp;nbsp; &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;&amp;nbsp;and&amp;nbsp; &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/97558"&gt;@cmunteanu&lt;/a&gt;&amp;nbsp;&amp;nbsp;, I am having exactly the same problem to create the vector index and it seems that there could be a bug in the demo. What confuses me is that and even when using the Databricks UI, I can not manage to create the vector index.&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Well, when running the demo, it stays for a long time repeating:&lt;/DIV&gt;&lt;DIV&gt;============&lt;/DIV&gt;&lt;DIV&gt;Waiting for index to be ready, this can take a few min... {'detailed_state': 'PROVISIONING_INITIAL_SNAPSHOT', 'message': 'Index is currently is in the process of syncing initial data. Check latest status:&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;A class="" href="https://adb-393322312342205.5.azuredatabricks.net/explore/data/dev_talk_desk/llm_rag/databricks_documentation_vs_index" target="_blank" rel="noopener noreferrer"&gt;https://adb-393322312342211.5.azuredatabricks.net/explore/data/dev_talk/llm_rag/databricks_documentation_vs_index&lt;/A&gt;', 'indexed_row_count': 0, 'provisioning_status': {'initial_pipeline_sync_progress': {'latest_version_currently_processing': 1, 'num_synced_rows': 0, 'total_rows_to_sync': 14129, 'sync_progress_completion': 0.0, 'pipeline_metrics': {'total_sync_time_per_row_ms': 0.0, 'ingestion_metrics': {'ingestion_time_per_row_ms': 0.0, 'ingestion_batch_size': 300}, 'embedding_metrics': {'embedding_generation_time_per_row_ms': 0.0, 'embedding_generation_batch_size': 0}}}}, 'ready': False, 'index_url': 'adb-393322312342211.5.azuredatabricks.net/api/2.0/vector-search/endpoints/dbdemos_vs_endpoint/indexes/dev_talk.llm_rag.databricks_documentation_vs_index'} - pipeline url:adb-393322312342211.5.azuredatabricks.net/api/2.0/vector-search/endpoints/dbdemos_vs_endpoint/indexes/dev_talk.llm_rag.databricks_documentation_vs_index&lt;/DIV&gt;&lt;DIV&gt;&lt;BR /&gt;Then after a long time the Cell stops with the following error message:&lt;/DIV&gt;&lt;DIV&gt;&lt;I&gt;&amp;nbsp;&lt;/I&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;I&gt;&amp;nbsp;&lt;/I&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;I&gt;"HTTPError: 400 Client Error: Bad Request for url: &lt;A class="" href="https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/dev_talk_desk.llm_rag.databricks_documentation_vs_index/indexes/dbdemos_vs_endpoint" target="_blank" rel="noopener noreferrer"&gt;https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/dev_talk_desk.llm_rag.databricks_documentation_vs_index/indexes/dbdemos_vs_endpoint&lt;/A&gt;"&lt;/I&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;I&gt;&amp;nbsp;&lt;/I&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;I&gt;&amp;nbsp;&lt;/I&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;I&gt;It seems that the url is wrong (this is the bug I was referring), it has the endpoint and the vector index path interchanged. It should be:&lt;/I&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;I&gt;&amp;nbsp;&lt;/I&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;I&gt;"&lt;A class="" href="https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/dev_talk_desk.llm_rag.databricks_documentation_vs_index/indexes/dbdemos_vs_endpoint" target="_blank" rel="noopener noreferrer"&gt;https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/&lt;/A&gt;&lt;A class="" href="https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/dev_talk_desk.llm_rag.databricks_documentation_vs_index/indexes/dbdemos_vs_endpoint" target="_blank" rel="noopener noreferrer"&gt;dbdemos_vs_endpoint&lt;/A&gt;&lt;A class="" href="https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/dev_talk_desk.llm_rag.databricks_documentation_vs_index/indexes/dbdemos_vs_endpoint" target="_blank" rel="noopener noreferrer"&gt;/&lt;/A&gt;&lt;/I&gt;&lt;/SPAN&gt;&lt;I&gt;indexes/&lt;/I&gt;&lt;SPAN&gt;&lt;I&gt;&lt;A class="" href="https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/dev_talk_desk.llm_rag.databricks_documentation_vs_index/indexes/dbdemos_vs_endpoint" target="_blank" rel="noopener noreferrer"&gt;dev_talk_desk.llm_rag.databricks_documentation_vs_index&lt;/A&gt;&lt;/I&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;I&gt;&lt;A class="" href="https://australiaeast.azuredatabricks.net/api/2.0/vector-search/endpoints/dev_talk_desk.llm_rag.databricks_documentation_vs_index/indexes/dbdemos_vs_endpoint" target="_blank" rel="noopener noreferrer"&gt;dbdemos_vs_endpoint&lt;/A&gt;"&lt;/I&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Just like in the output of the Cell that is showing above. There, the URL is showed correctly,&lt;/DIV&gt;&lt;DIV&gt;================&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9"&gt;@Retired_mod&lt;/a&gt;&amp;nbsp;If any specific configuration is required regarding the embedding model, it would be good to have it specified. In your reply you said:&lt;/DIV&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;&lt;I&gt;When creating the Vector Search Index, ensure that you specify the correct parameters:&lt;/I&gt;&lt;/SPAN&gt;&lt;/LI&gt;&lt;UL&gt;&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;&lt;I&gt;embedding_source_column&lt;/I&gt;&lt;/STRONG&gt;&lt;I&gt;: This should match the column name containing your text data (e.g.,&amp;nbsp;&lt;/I&gt;&lt;STRONG&gt;&lt;I&gt;‘content’&lt;/I&gt;&lt;/STRONG&gt;&lt;I&gt;).&lt;/I&gt;&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;&lt;I&gt;embedding_model_endpoint_name&lt;/I&gt;&lt;/STRONG&gt;&lt;I&gt;: Use&amp;nbsp;&lt;/I&gt;&lt;STRONG&gt;&lt;I&gt;‘bge-large-en’&lt;/I&gt;&lt;/STRONG&gt;&lt;I&gt;&amp;nbsp;as you’ve set up this model as a serving endpoint.&lt;/I&gt;&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;/UL&gt;&lt;DIV&gt;All these specifications are correctly configured in the demo notebook. So, I am confused on what is left for us to configure.&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/97558"&gt;@cmunteanu&lt;/a&gt;&amp;nbsp; I have followed your suggestion of using a self managed embedding to create the vector index. It does work, in the sense that I created the vector index. But, I can not use (easily) the nice features of Databricks vector_search client that converts internally text to vectors&amp;nbsp; and vice-versa.&amp;nbsp; Which make things easier for the RAG - chatbot. Have you got around that?&lt;/DIV&gt;</description>
      <pubDate>Fri, 24 May 2024 07:13:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/dbdemo-llm-chatbot-with-retrieval-augmented-generation-rag/m-p/70567#M120</guid>
      <dc:creator>jbellidocaceres</dc:creator>
      <dc:date>2024-05-24T07:13:00Z</dc:date>
    </item>
  </channel>
</rss>

