<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: getting 500 on embedding model invocation call in Generative AI</title>
    <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134589#M1211</link>
    <description>&lt;P&gt;I don't see that option on my serving endpoint.&lt;BR /&gt;Also, if there's a rate limit, I'd expect to receive a 429 with relevant message and not 500.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="tefrati_3-1760133664864.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20677i7228548E9CB437C6/image-size/medium?v=v2&amp;amp;px=400" role="button" title="tefrati_3-1760133664864.png" alt="tefrati_3-1760133664864.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 10 Oct 2025 22:01:56 GMT</pubDate>
    <dc:creator>tefrati</dc:creator>
    <dc:date>2025-10-10T22:01:56Z</dc:date>
    <item>
      <title>getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134576#M1209</link>
      <description>&lt;P&gt;I'm getting the following error message "&lt;SPAN&gt;{"error_code": "INTERNAL_ERROR", "message": "The server received an invalid response from an upstream server."}" when making a call to&amp;nbsp;bge-large-en embedding model.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 19:43:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134576#M1209</guid>
      <dc:creator>tefrati</dc:creator>
      <dc:date>2025-10-10T19:43:02Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134579#M1210</link>
      <description>&lt;P&gt;Seems to me like a rate limit issue. Can you please confirm if the rate limit is not zero.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="nayan_wylde_0-1760126703954.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20672iC1CF6DDF90C96D2A/image-size/medium?v=v2&amp;amp;px=400" role="button" title="nayan_wylde_0-1760126703954.png" alt="nayan_wylde_0-1760126703954.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 20:18:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134579#M1210</guid>
      <dc:creator>nayan_wylde</dc:creator>
      <dc:date>2025-10-10T20:18:09Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134589#M1211</link>
      <description>&lt;P&gt;I don't see that option on my serving endpoint.&lt;BR /&gt;Also, if there's a rate limit, I'd expect to receive a 429 with relevant message and not 500.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="tefrati_3-1760133664864.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20677i7228548E9CB437C6/image-size/medium?v=v2&amp;amp;px=400" role="button" title="tefrati_3-1760133664864.png" alt="tefrati_3-1760133664864.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 22:01:56 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134589#M1211</guid>
      <dc:creator>tefrati</dc:creator>
      <dc:date>2025-10-10T22:01:56Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134592#M1213</link>
      <description>&lt;P&gt;Can you please click on the edit AI gateway it will take you show you the rate limit and share the screenshot&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 22:25:05 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134592#M1213</guid>
      <dc:creator>nayan_wylde</dc:creator>
      <dc:date>2025-10-10T22:25:05Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134595#M1215</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="tefrati_0-1760135381782.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20678i493C7CFBC0552E9B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="tefrati_0-1760135381782.png" alt="tefrati_0-1760135381782.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;no rate limitation is enabled&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 22:30:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134595#M1215</guid>
      <dc:creator>tefrati</dc:creator>
      <dc:date>2025-10-10T22:30:07Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134596#M1216</link>
      <description>&lt;P&gt;Yeah your rate limit seems to be good. Can you also check the following points.&lt;/P&gt;&lt;P&gt;1. Use the Databricks-specific name (e.g., databricks-bge-large-en), not the Hugging Face model name. Check in Serving → Endpoints.&lt;BR /&gt;2. Validate Payload Format&lt;BR /&gt;{ "input": "text to embed" }&lt;BR /&gt;3. Test via Databricks UIUse Query endpoint in the Serving page. If that works, issue is client config.&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 23:03:08 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134596#M1216</guid>
      <dc:creator>nayan_wylde</dc:creator>
      <dc:date>2025-10-10T23:03:08Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134603#M1217</link>
      <description>&lt;P&gt;I have been using&amp;nbsp;&lt;SPAN&gt;databricks-bge-large-en.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Not sure what is meant by client config. I've been using this model for 2 years now. Most of the time it works but today and two days ago is stopped working intermittedly.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 11 Oct 2025 04:02:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/134603#M1217</guid>
      <dc:creator>tefrati</dc:creator>
      <dc:date>2025-10-11T04:02:40Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/135334#M1250</link>
      <description>&lt;P&gt;Problem was not resolved.. Any thought as to what else could have happened?&lt;/P&gt;</description>
      <pubDate>Sat, 18 Oct 2025 14:41:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/135334#M1250</guid>
      <dc:creator>tefrati</dc:creator>
      <dc:date>2025-10-18T14:41:04Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/135343#M1251</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/115381"&gt;@tefrati&lt;/a&gt;&amp;nbsp;- Can you please go to the model serving endpoint and click on the use drop down (as shown in the picture below) and try the simple python or sql and see if that works for you.&lt;/P&gt;
&lt;LI-CODE lang="python"&gt;from openai import OpenAI
import os

# How to get your Databricks token: https://docs.databricks.com/en/dev-tools/auth/pat.html
DATABRICKS_TOKEN = os.environ.get('DATABRICKS_TOKEN')
# Alternatively in a Databricks notebook you can use this:
# DATABRICKS_TOKEN = dbutils.notebook.entry_point.getDbutils().notebook().getContext().apiToken().get()

client = OpenAI(
  api_key=DATABRICKS_TOKEN,
  base_url="https://e2-demo-field-eng.cloud.databricks.com/serving-endpoints"
)

embeddings = client.embeddings.create(
  input='Your string for the embedding model goes here',
  model="databricks-bge-large-en"
)

print(embeddings.data[0].embedding)&lt;/LI-CODE&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dkushari_0-1760806801038.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20841i35EF321F908B01C8/image-size/medium?v=v2&amp;amp;px=400" role="button" title="dkushari_0-1760806801038.png" alt="dkushari_0-1760806801038.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;SELECT ai_query('databricks-bge-large-en',
    request =&amp;gt; "&amp;lt;Please provide your input string here!&amp;gt;")&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 18 Oct 2025 17:03:12 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/135343#M1251</guid>
      <dc:creator>dkushari</dc:creator>
      <dc:date>2025-10-18T17:03:12Z</dc:date>
    </item>
    <item>
      <title>Re: getting 500 on embedding model invocation call</title>
      <link>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/135915#M1270</link>
      <description>&lt;P&gt;Hi there,&lt;BR /&gt;&lt;BR /&gt;I've ran the python script and got the vector back. Also, I've been using the model and the endpoint for 2 years now so no reason it shouldn't work.&lt;BR /&gt;However, I'm getting the same error again now. This time, Databricks select query:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;SELECT&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;*&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;FROM&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;&amp;lt;my company&amp;gt;&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;default&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;`databricks-bge-large-en_payload`&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;where&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; status_code &lt;/SPAN&gt;&lt;SPAN&gt;!=&lt;/SPAN&gt; &lt;SPAN&gt;200&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;ORDER BY&lt;/SPAN&gt;&lt;SPAN&gt; request_date &lt;/SPAN&gt;&lt;SPAN&gt;DESC&lt;BR /&gt;&lt;BR /&gt;returns numerous 502 status code (after 5 seconds execution time each time). Response is:&amp;nbsp;{"error_code":"INTERNAL_ERROR","message":"The server received an invalid response from an upstream server."}&lt;BR /&gt;&lt;BR /&gt;Can I finally get an answer as to why we're keep getting these unpredictable errors?&lt;BR /&gt;&lt;BR /&gt;Thank you&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="tefrati_0-1761277197141.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/21009iC777107A9A1EE397/image-size/medium?v=v2&amp;amp;px=400" role="button" title="tefrati_0-1761277197141.png" alt="tefrati_0-1761277197141.png" /&gt;&lt;/span&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 24 Oct 2025 03:40:30 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/getting-500-on-embedding-model-invocation-call/m-p/135915#M1270</guid>
      <dc:creator>tefrati</dc:creator>
      <dc:date>2025-10-24T03:40:30Z</dc:date>
    </item>
  </channel>
</rss>

