<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Getting &amp;quot;The given endpoint does not exist&amp;quot; for LLM endpoint though I've access to it in Generative AI</title>
    <link>https://community.databricks.com/t5/generative-ai/getting-quot-the-given-endpoint-does-not-exist-quot-for-llm/m-p/114959#M833</link>
    <description>&lt;P&gt;When trying out this &lt;A href="https://notebooks.databricks.com/demos/llm-rag-chatbot/index.html#" target="_self"&gt;LLM RAG&lt;/A&gt; notebook steps I'm getting this error:&lt;/P&gt;&lt;P&gt;&lt;EM&gt;"The given endpoint does not exist, please retry after checking the specified model and version deployment exists"&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;This notebook originally referred to&amp;nbsp;&lt;FONT color="#FF0000"&gt;databricks-meta-llama-3-1-70b-instruct&lt;/FONT&gt;. After this error, I verified Models tab and found it is not available anymore. So I replaced that with&lt;FONT color="#0000FF"&gt;&amp;nbsp;meta_llama_v3_1_70b,&amp;nbsp;meta_llama_v3_1_70b_instruct and&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;FONT color="#0000FF"&gt;system.ai.meta_llama_v3_1_70b&lt;/FONT&gt; in the notebook but non of them work, I keep getting the same error. Also, I gave explicit apply, execute, manage permissions to my Principal user. Any pointers are greatly appreciated.&amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;# For this first basic demo, we'll keep the configuration as a minimum. In real app, you can make all your RAG as a param (such as your prompt template to easily test different prompts!)&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;#"llm_model_serving_endpoint_name": "databricks-meta-llama-3-1-70b-instruct",&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;chain_config &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; {&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;"llm_model_serving_endpoint_name": "system.ai.meta_llama_v3_1_70b"&lt;/STRONG&gt;&lt;SPAN&gt;&lt;STRONG&gt;,&lt;/STRONG&gt; &lt;/SPAN&gt;&lt;SPAN&gt;# the foundation model we want to use&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;"vector_search_endpoint_name"&lt;/SPAN&gt;&lt;SPAN&gt;: VECTOR_SEARCH_ENDPOINT_NAME, &lt;/SPAN&gt;&lt;SPAN&gt;# the endoint we want to use for vector search&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;"vector_search_index"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;catalog&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;db&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;.databricks_documentation_vs_index"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;"llm_prompt_template"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"""You are an assistant that answers questions. Use the following pieces of retrieved context to answer the question. Some pieces of context may be irrelevant, in which case you should not use them to form the answer.&lt;/SPAN&gt;&lt;SPAN&gt;\n\n&lt;/SPAN&gt;&lt;SPAN&gt;Context: &lt;/SPAN&gt;&lt;SPAN&gt;{context}&lt;/SPAN&gt;&lt;SPAN&gt;"""&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 09 Apr 2025 12:01:20 GMT</pubDate>
    <dc:creator>Iris12</dc:creator>
    <dc:date>2025-04-09T12:01:20Z</dc:date>
    <item>
      <title>Getting "The given endpoint does not exist" for LLM endpoint though I've access to it</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-quot-the-given-endpoint-does-not-exist-quot-for-llm/m-p/114959#M833</link>
      <description>&lt;P&gt;When trying out this &lt;A href="https://notebooks.databricks.com/demos/llm-rag-chatbot/index.html#" target="_self"&gt;LLM RAG&lt;/A&gt; notebook steps I'm getting this error:&lt;/P&gt;&lt;P&gt;&lt;EM&gt;"The given endpoint does not exist, please retry after checking the specified model and version deployment exists"&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;This notebook originally referred to&amp;nbsp;&lt;FONT color="#FF0000"&gt;databricks-meta-llama-3-1-70b-instruct&lt;/FONT&gt;. After this error, I verified Models tab and found it is not available anymore. So I replaced that with&lt;FONT color="#0000FF"&gt;&amp;nbsp;meta_llama_v3_1_70b,&amp;nbsp;meta_llama_v3_1_70b_instruct and&amp;nbsp;&lt;/FONT&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&lt;FONT color="#0000FF"&gt;system.ai.meta_llama_v3_1_70b&lt;/FONT&gt; in the notebook but non of them work, I keep getting the same error. Also, I gave explicit apply, execute, manage permissions to my Principal user. Any pointers are greatly appreciated.&amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;# For this first basic demo, we'll keep the configuration as a minimum. In real app, you can make all your RAG as a param (such as your prompt template to easily test different prompts!)&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;#"llm_model_serving_endpoint_name": "databricks-meta-llama-3-1-70b-instruct",&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;chain_config &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; {&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;STRONG&gt;"llm_model_serving_endpoint_name": "system.ai.meta_llama_v3_1_70b"&lt;/STRONG&gt;&lt;SPAN&gt;&lt;STRONG&gt;,&lt;/STRONG&gt; &lt;/SPAN&gt;&lt;SPAN&gt;# the foundation model we want to use&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;"vector_search_endpoint_name"&lt;/SPAN&gt;&lt;SPAN&gt;: VECTOR_SEARCH_ENDPOINT_NAME, &lt;/SPAN&gt;&lt;SPAN&gt;# the endoint we want to use for vector search&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;"vector_search_index"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;f&lt;/SPAN&gt;&lt;SPAN&gt;"&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;catalog&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;db&lt;/SPAN&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;SPAN&gt;.databricks_documentation_vs_index"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;"llm_prompt_template"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"""You are an assistant that answers questions. Use the following pieces of retrieved context to answer the question. Some pieces of context may be irrelevant, in which case you should not use them to form the answer.&lt;/SPAN&gt;&lt;SPAN&gt;\n\n&lt;/SPAN&gt;&lt;SPAN&gt;Context: &lt;/SPAN&gt;&lt;SPAN&gt;{context}&lt;/SPAN&gt;&lt;SPAN&gt;"""&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;FONT color="#0000FF"&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/FONT&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Apr 2025 12:01:20 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-quot-the-given-endpoint-does-not-exist-quot-for-llm/m-p/114959#M833</guid>
      <dc:creator>Iris12</dc:creator>
      <dc:date>2025-04-09T12:01:20Z</dc:date>
    </item>
    <item>
      <title>Re: Getting "The given endpoint does not exist" for LLM endpoint though I've access to it</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-quot-the-given-endpoint-does-not-exist-quot-for-llm/m-p/115360#M841</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/157397"&gt;@Iris12&lt;/a&gt;&amp;nbsp;,&lt;BR /&gt;&lt;BR /&gt;Not sure if this will fix your issue, but I’ve seen similar errors caused by &lt;SPAN class=""&gt;permission mismatches&lt;/SPAN&gt;.&lt;BR /&gt;&lt;SPAN class=""&gt;Make sure that the &lt;/SPAN&gt;user or service principal you’re using has the correct permissions&lt;SPAN class=""&gt; both in the &lt;/SPAN&gt;&lt;STRONG&gt;Models&lt;/STRONG&gt;&lt;SPAN class=""&gt; and &lt;/SPAN&gt;&lt;STRONG&gt;Serving&lt;/STRONG&gt;&lt;SPAN class=""&gt; tab.&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;Also, double-check that the &lt;/SPAN&gt;endpoint name you’re referencing in the notebook matches exactly&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class=""&gt;Hope that helps, &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;Isi&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 13 Apr 2025 14:12:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-quot-the-given-endpoint-does-not-exist-quot-for-llm/m-p/115360#M841</guid>
      <dc:creator>Isi</dc:creator>
      <dc:date>2025-04-13T14:12:56Z</dc:date>
    </item>
  </channel>
</rss>

