<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Initializing Vector Search index Sync failes with Failed to resolve flow: '__online_index_view' in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/81006#M3529</link>
    <description>&lt;P&gt;&lt;BR /&gt;I would double check what specific values are being sent to the model in the workflow. Possibly transitioning to environments changed a value's type or possibly data isn't defined correctly leaving certain parameters empty?&lt;BR /&gt;The "INVALID_PARAMETER_VALUE" from the embedding output makes me believe something isn't being set correctly in the workflow when accessing the endpoint programmatically.&lt;/P&gt;</description>
    <pubDate>Mon, 29 Jul 2024 18:15:59 GMT</pubDate>
    <dc:creator>tyler-xorbix</dc:creator>
    <dc:date>2024-07-29T18:15:59Z</dc:date>
    <item>
      <title>Initializing Vector Search index Sync failes with Failed to resolve flow: '__online_index_view'</title>
      <link>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/80962#M3528</link>
      <description>&lt;P&gt;When setting up a vector search in databricks using the bge_m3 (Version 1) embedding model available in system.ai schema, the setup runs for 20 minutes or so and then fails. Querying the served embedding models from the browser works perfectly fine.&amp;nbsp;&lt;/P&gt;&lt;P&gt;The exact same data worked in the past (although in a different workspace), I've retried several times, over a longer period of time, so this does not seem to be a temporary issue.&amp;nbsp;&lt;/P&gt;&lt;P&gt;The flow_progress step in the pipeline creating fails with&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="python"&gt;Failed to resolve flow: '__online_index_view'&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;and error details:&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="java"&gt;java.lang.Exception: Error: Response Code: 400, Response: {"error_code":"INVALID_PARAMETER_VALUE","message":"Failed to call Model Serving endpoint: bge_m3_embedding."}
at com.databricks.pipelines.execution.extensions.brickindex.DatabricksHttpClient.$anonfun$sendRequestWithRetries$5(DatabricksHttpClient.scala:129)
at com.databricks.pipelines.execution.extensions.brickindex.DatabricksHttpClient.$anonfun$sendRequestWithRetries$5$adapted(DatabricksHttpClient.scala:121)
at scala.util.Using$.resource(Using.scala:269)
at com.databricks.pipelines.execution.extensions.brickindex.DatabricksHttpClient.$anonfun$sendRequestWithRetries$4(DatabricksHttpClient.scala:121)
at com.databricks.backend.common.util.TimeUtils$.$anonfun$retryWithExponentialBackoff0$1(TimeUtils.scala:191)
at scala.runtime.java8.JFunction0$mcZ$sp.apply(JFunction0$mcZ$sp.java:23)
at scala.util.Try$.apply(Try.scala:213)
at com.databricks.backend.common.util.TimeUtils$.retryWithExponentialBackoff0(TimeUtils.scala:191)
at com.databricks.backend.common.util.TimeUtils$.retryWithExponentialBackoff(TimeUtils.scala:145)
at com.databricks.pipelines.execution.extensions.brickindex.DatabricksHttpClient.sendRequestWithRetries(DatabricksHttpClient.scala:120)
at com.databricks.pipelines.execution.extensions.brickindex.DatabricksHttpClient.post(DatabricksHttpClient.scala:209)
at com.databricks.pipelines.execution.extensions.brickindex.BrickIndexGatewayClient.$anonfun$makePredictions$2(GatewayClient.scala:335)
at com.databricks.pipelines.execution.extensions.brickindex.BrickIndexGatewayClient.withCredentials(GatewayClient.scala:157)
at com.databricks.pipelines.execution.extensions.brickindex.BrickIndexGatewayClient.makePredictions(GatewayClient.scala:332)
at com.databricks.pipelines.execution.extensions.brickindex.ModelServingBatchProcessor.processViaGateway(ModelServingBatchProcessor.scala:96)
at com.databricks.pipelines.execution.extensions.brickindex.ModelServingBatchProcessor.process(ModelServingBatchProcessor.scala:75)
at com.databricks.pipelines.execution.extensions.brickindex.VectorSearchIngestionProcessor.$anonfun$processIngestionWithConcurrency$6(VectorSearchIngestionProcessor.scala:125)
at com.databricks.pipelines.execution.extensions.brickindex.VectorSearchIngestionProcessor.$anonfun$processIngestionWithConcurrency$6$adapted(VectorSearchIngestionProcessor.scala:125)
at com.databricks.pipelines.execution.extensions.brickindex.VectorSearchIngestionProcessor.$anonfun$processIngestionBatchFuture$1(VectorSearchIngestionProcessor.scala:216)
at scala.concurrent.Future$.$anonfun$apply$1(Future.scala:659)
at scala.util.Success.$anonfun$map$1(Try.scala:255)
at scala.util.Success.map(Try.scala:213)
at scala.concurrent.Future.$anonfun$map$1(Future.scala:292)
at scala.concurrent.impl.Promise.liftedTree1$1(Promise.scala:46)
at scala.concurrent.impl.Promise.$anonfun$transform$1(Promise.scala:46)
at scala.concurrent.impl.CallbackRunnable.run(Promise.scala:77)
at com.databricks.threading.DatabricksExecutionContext$InstrumentedRunnable.run(DatabricksExecutionContext.scala:36)
at com.databricks.threading.NamedExecutor$$anon$2.$anonfun$run$1(NamedExecutor.scala:367)
at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)
at com.databricks.logging.UsageLogging.$anonfun$withAttributionContext$1(UsageLogging.scala:426)
at scala.util.DynamicVariable.withValue(DynamicVariable.scala:62)
at com.databricks.logging.AttributionContext$.withValue(AttributionContext.scala:216)
at com.databricks.logging.UsageLogging.withAttributionContext(UsageLogging.scala:424)
at com.databricks.logging.UsageLogging.withAttributionContext$(UsageLogging.scala:418)
at com.databricks.threading.NamedExecutor.withAttributionContext(NamedExecutor.scala:294)
at com.databricks.threading.NamedExecutor$$anon$2.run(NamedExecutor.scala:365)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:750)&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Any ideas what the problem might be?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jul 2024 10:48:35 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/80962#M3528</guid>
      <dc:creator>jnkthms</dc:creator>
      <dc:date>2024-07-29T10:48:35Z</dc:date>
    </item>
    <item>
      <title>Re: Initializing Vector Search index Sync failes with Failed to resolve flow: '__online_index_view'</title>
      <link>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/81006#M3529</link>
      <description>&lt;P&gt;&lt;BR /&gt;I would double check what specific values are being sent to the model in the workflow. Possibly transitioning to environments changed a value's type or possibly data isn't defined correctly leaving certain parameters empty?&lt;BR /&gt;The "INVALID_PARAMETER_VALUE" from the embedding output makes me believe something isn't being set correctly in the workflow when accessing the endpoint programmatically.&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jul 2024 18:15:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/81006#M3529</guid>
      <dc:creator>tyler-xorbix</dc:creator>
      <dc:date>2024-07-29T18:15:59Z</dc:date>
    </item>
    <item>
      <title>Re: Initializing Vector Search index Sync failes with Failed to resolve flow: '__online_index_view'</title>
      <link>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/81050#M3531</link>
      <description>&lt;P&gt;Hi taylor-xorbix,&lt;/P&gt;&lt;P&gt;I'm not defining a workflow manually or setting any environment variables. I'm using the databricks UI (so from the unity catalog I'm using the create/vector search index dropdown. Having a running (and working) bge_m3 endpoint.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="jnkthms_0-1722325591075.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/9975i3A011EA3898C13B3/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="jnkthms_0-1722325591075.png" alt="jnkthms_0-1722325591075.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Looking at the example from the UI for the served embedding model, it seems that the API now specifies "inputs". If I remember correctly previously is was called "message" at some point in the past, which would explain the error message above.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="jnkthms_1-1722325701141.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/9976iDDDB5F4082E24DAF/image-size/medium/is-moderation-mode/true?v=v2&amp;amp;px=400" role="button" title="jnkthms_1-1722325701141.png" alt="jnkthms_1-1722325701141.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;The thing is that I'm not installing anything manually and just using the databricks UI functionality, so this should all work together&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jul 2024 07:50:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/81050#M3531</guid>
      <dc:creator>jnkthms</dc:creator>
      <dc:date>2024-07-30T07:50:59Z</dc:date>
    </item>
    <item>
      <title>Re: Initializing Vector Search index Sync failes with Failed to resolve flow: '__online_index_view'</title>
      <link>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/81269#M3537</link>
      <description>&lt;P&gt;The issue was most likely to use a CPU compute for the deployed model, switching to GPU (small) solved the issue.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 31 Jul 2024 08:50:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/initializing-vector-search-index-sync-failes-with-failed-to/m-p/81269#M3537</guid>
      <dc:creator>jnkthms</dc:creator>
      <dc:date>2024-07-31T08:50:04Z</dc:date>
    </item>
  </channel>
</rss>

