<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Problem serving a langchain model on Databricks in Machine Learning</title>
    <link>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/98181#M3774</link>
    <description>&lt;P&gt;Hi, I've encountered a problem of serving a langchain model I just created successfully on Databricks.&lt;/P&gt;&lt;P&gt;I was using the following code to set up a model in unity catalog:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;from&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.models &lt;/SPAN&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; infer_signature&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; mlflow&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; langchain&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;mlflow.&lt;/SPAN&gt;&lt;SPAN&gt;set_registry_uri&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;"databricks-uc"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;model_name &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;"model1"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;with&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.&lt;/SPAN&gt;&lt;SPAN&gt;start_run&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;run_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"clippy_rag"&lt;/SPAN&gt;&lt;SPAN&gt;) &lt;/SPAN&gt;&lt;SPAN&gt;as&lt;/SPAN&gt;&lt;SPAN&gt; run:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; signature &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;infer_signature&lt;/SPAN&gt;&lt;SPAN&gt;(question, answer)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; model_info &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.langchain.&lt;/SPAN&gt;&lt;SPAN&gt;log_model&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; chain, &lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;loader_fn&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;get_retriver,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;artifact_path&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"chain"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;registered_model_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;model_name,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;pip_requirements&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;[&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"mlflow=="&lt;/SPAN&gt; &lt;SPAN&gt;+&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.&lt;/SPAN&gt;&lt;SPAN&gt;__version__&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"langchain=="&lt;/SPAN&gt; &lt;SPAN&gt;+&lt;/SPAN&gt;&lt;SPAN&gt; langchain.&lt;/SPAN&gt;&lt;SPAN&gt;__version__&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"databricks-vectorsearch"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ],&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;signature&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;signature,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; )&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;The UI shows that the model is ready but when I severed this model it showed Model with name 'model1' and version '1' is not successfully registered. Ensure model version has finished registration before use in model serving. Do you know what's the issue here?&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
    <pubDate>Fri, 08 Nov 2024 11:53:49 GMT</pubDate>
    <dc:creator>hawa</dc:creator>
    <dc:date>2024-11-08T11:53:49Z</dc:date>
    <item>
      <title>Problem serving a langchain model on Databricks</title>
      <link>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/98181#M3774</link>
      <description>&lt;P&gt;Hi, I've encountered a problem of serving a langchain model I just created successfully on Databricks.&lt;/P&gt;&lt;P&gt;I was using the following code to set up a model in unity catalog:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;from&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.models &lt;/SPAN&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; infer_signature&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; mlflow&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;import&lt;/SPAN&gt;&lt;SPAN&gt; langchain&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;mlflow.&lt;/SPAN&gt;&lt;SPAN&gt;set_registry_uri&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;"databricks-uc"&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;model_name &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;"model1"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;with&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.&lt;/SPAN&gt;&lt;SPAN&gt;start_run&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;run_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"clippy_rag"&lt;/SPAN&gt;&lt;SPAN&gt;) &lt;/SPAN&gt;&lt;SPAN&gt;as&lt;/SPAN&gt;&lt;SPAN&gt; run:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; signature &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt; &lt;SPAN&gt;infer_signature&lt;/SPAN&gt;&lt;SPAN&gt;(question, answer)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; model_info &lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.langchain.&lt;/SPAN&gt;&lt;SPAN&gt;log_model&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; chain, &lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;loader_fn&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;get_retriver,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;artifact_path&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"chain"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;registered_model_name&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;model_name,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;pip_requirements&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;[&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"mlflow=="&lt;/SPAN&gt; &lt;SPAN&gt;+&lt;/SPAN&gt;&lt;SPAN&gt; mlflow.&lt;/SPAN&gt;&lt;SPAN&gt;__version__&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"langchain=="&lt;/SPAN&gt; &lt;SPAN&gt;+&lt;/SPAN&gt;&lt;SPAN&gt; langchain.&lt;/SPAN&gt;&lt;SPAN&gt;__version__&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;"databricks-vectorsearch"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; ],&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/SPAN&gt;&lt;SPAN&gt;signature&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;signature,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;&amp;nbsp; &amp;nbsp; )&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;The UI shows that the model is ready but when I severed this model it showed Model with name 'model1' and version '1' is not successfully registered. Ensure model version has finished registration before use in model serving. Do you know what's the issue here?&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 08 Nov 2024 11:53:49 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/98181#M3774</guid>
      <dc:creator>hawa</dc:creator>
      <dc:date>2024-11-08T11:53:49Z</dc:date>
    </item>
    <item>
      <title>Re: Problem serving a langchain model on Databricks</title>
      <link>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/98185#M3775</link>
      <description>&lt;P&gt;I suspected the issue is coming from this small error I got:&amp;nbsp;&lt;SPAN&gt;Got error: Must specify a chain Type in config. I used the&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;chain_type&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"stuff" when building the langchain but I'm not sure how to fix it.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 08 Nov 2024 12:24:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/98185#M3775</guid>
      <dc:creator>hawa</dc:creator>
      <dc:date>2024-11-08T12:24:43Z</dc:date>
    </item>
    <item>
      <title>Re: Problem serving a langchain model on Databricks</title>
      <link>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/112531#M3992</link>
      <description>&lt;P&gt;Hi, were you able to solve it? I'm having the same issue&lt;/P&gt;</description>
      <pubDate>Fri, 14 Mar 2025 00:33:40 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/112531#M3992</guid>
      <dc:creator>rkmee</dc:creator>
      <dc:date>2025-03-14T00:33:40Z</dc:date>
    </item>
    <item>
      <title>Re: Problem serving a langchain model on Databricks</title>
      <link>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/114721#M4015</link>
      <description>&lt;P&gt;Hi! Are there any news about this? I'm getting the same error &lt;span class="lia-unicode-emoji" title=":confused_face:"&gt;😕&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Apr 2025 13:44:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/114721#M4015</guid>
      <dc:creator>constanmrtnz</dc:creator>
      <dc:date>2025-04-07T13:44:55Z</dc:date>
    </item>
    <item>
      <title>Re: Problem serving a langchain model on Databricks</title>
      <link>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/116383#M4039</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;The warnings/errors in the logs of the langchain model log process can give you a good hint, although it may be not that evident at first sight.&lt;BR /&gt;&lt;BR /&gt;It happened something similar to me - same error message, and the cause was having used an OpenAI model that I mistakenly have passed to the langchain model as being a Databricks one.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Apr 2025 17:27:06 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/116383#M4039</guid>
      <dc:creator>Octavian1</dc:creator>
      <dc:date>2025-04-23T17:27:06Z</dc:date>
    </item>
    <item>
      <title>Re: Problem serving a langchain model on Databricks</title>
      <link>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/138022#M4414</link>
      <description>&lt;P&gt;Greetings&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/131623"&gt;@hawa&lt;/a&gt;&amp;nbsp;,&amp;nbsp; Thanks for sharing the details—this looks like a combination of registration and configuration issues that commonly surface with the MLflow LangChain flavor on Databricks.&lt;/P&gt;
&lt;H3 class="paragraph"&gt;What’s going wrong&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;The &lt;STRONG&gt;registered model name&lt;/STRONG&gt; should be a full three-level Unity Catalog path like &lt;CODE&gt;&amp;lt;catalog&amp;gt;.&amp;lt;schema&amp;gt;.&amp;lt;model&amp;gt;&lt;/CODE&gt;. Using just &lt;CODE&gt;"model1"&lt;/CODE&gt; causes registration/serving mismatches and can lead to “not successfully registered” errors when serving from UC.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;The &lt;STRONG&gt;LangChain flavor needs chain type info in the logged model’s config&lt;/STRONG&gt; so it can reconstruct the chain at load/serve time. Without it, you get “Must specify a chain Type in config.” The fix is to pass &lt;CODE&gt;model_config={"chain_type": "stuff"}&lt;/CODE&gt; (or whatever you used) when calling &lt;CODE&gt;mlflow.langchain.log_model(...)&lt;/CODE&gt; so the MLflow artifact contains the chain’s type for serving.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;It’s best to &lt;STRONG&gt;validate the model before serving&lt;/STRONG&gt; by loading the model back and invoking it (or using &lt;CODE&gt;mlflow.models.predict&lt;/CODE&gt;) to ensure the runtime and signature behave as expected.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3 class="paragraph"&gt;&lt;SPAN&gt;Fix:&lt;/SPAN&gt;&lt;/H3&gt;
&lt;DIV class="paragraph"&gt;&lt;SPAN&gt;log and register correctly, then validate Below is a minimal pattern that addresses all three points.&lt;/SPAN&gt;&lt;/DIV&gt;
&lt;PRE&gt;&lt;CODE class="markdown-code-python"&gt;from mlflow.models import infer_signature
import mlflow
import langchain

# 1) Use a full UC name
CATALOG = "prod"
SCHEMA = "ai_apps"
MODEL_BASENAME = "model1"
REGISTERED_MODEL_NAME = f"{CATALOG}.{SCHEMA}.{MODEL_BASENAME}"

mlflow.set_registry_uri("databricks-uc")

# Assume you already built `chain` (with your chain_type="stuff") and have a loader_fn (e.g., get_retriver)
question = {"query": "Hello"}  # keep your input schema consistent with how the chain expects inputs
answer = chain.invoke(question)
signature = infer_signature(question, answer)

with mlflow.start_run(run_name="clippy_rag") as run:
    model_info = mlflow.langchain.log_model(
        chain,
        loader_fn=get_retriver,                     # your retriever factory
        artifact_path="chain",
        registered_model_name=REGISTERED_MODEL_NAME,
        # 2) Persist chain type so serving can reconstruct it
        model_config={"chain_type": "stuff"},
        # Pin requirements needed at serve time
        pip_requirements=[
            f"mlflow=={mlflow.__version__}",
            f"langchain=={langchain.__version__}",
            "databricks-vectorsearch",
        ],
        # 3) Keep non-DataFrame example intact for proper signature inference
        input_example=question,
        example_no_conversion=True,
        signature=signature,
    )

# Optional: quick pre-deployment validation
loaded = mlflow.langchain.load_model(model_info.model_uri)
_ = loaded.invoke(question)  # should run without errors&lt;/CODE&gt;&lt;/PRE&gt;
&lt;H3 class="paragraph"&gt;Why this works&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;The full &lt;STRONG&gt;Unity Catalog path&lt;/STRONG&gt; ensures the version is created under UC and can be targeted by Model Serving without cross-registry confusion.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Providing &lt;STRONG&gt;&lt;CODE&gt;model_config={"chain_type": "stuff"}&lt;/CODE&gt;&lt;/STRONG&gt; writes the chain type into the MLflow LangChain flavor’s config (steps YAML), satisfying LangChain’s loader which otherwise throws “Must specify a chain Type in config.”&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;Doing a quick &lt;STRONG&gt;self-load/invoke&lt;/STRONG&gt; avoids surprises at serving time and aligns with Databricks’ guidance to validate models pre-deployment.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;H3 class="paragraph"&gt;Then serve it&lt;/H3&gt;
&lt;DIV class="paragraph"&gt;You can now create a custom model serving endpoint from the UI (Serving &amp;gt; Create endpoint), selecting your UC model by its full name and version. The endpoint should transition to READY once the container image is built and the model is loaded.&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;H3 class="paragraph"&gt;Extra tips&lt;/H3&gt;
&lt;UL&gt;
&lt;LI class="paragraph"&gt;If your endpoint shows “Not Ready” for an extended period, confirm the &lt;STRONG&gt;model version status&lt;/STRONG&gt; in UC (READY vs. PENDING) and that the endpoint creator’s identity has UC access to the catalog/schema/model. If permissions are wrong for the creator, delete and recreate under a principal with correct UC privileges.&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;When logging nonstandard dependencies (private wheels or pinned versions), prefer &lt;STRONG&gt;logging them with the model&lt;/STRONG&gt; (via &lt;CODE&gt;pip_requirements&lt;/CODE&gt;, &lt;CODE&gt;extra_pip_requirements&lt;/CODE&gt;, or &lt;CODE&gt;conda_env&lt;/CODE&gt;) to ensure the serving container matches your training env.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;DIV class="paragraph"&gt;If you want Databricks-managed authentication to resources (Vector Search, foundation model endpoints), consider the &lt;STRONG&gt;resources&lt;/STRONG&gt; mechanism described in the agent logging docs; for simple retrievers your &lt;CODE&gt;loader_fn&lt;/CODE&gt; is fine, but resources help with auth passthrough in production.&lt;/DIV&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;DIV class="paragraph"&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV class="paragraph"&gt;Cheers, Louis.&lt;/DIV&gt;</description>
      <pubDate>Thu, 06 Nov 2025 18:53:52 GMT</pubDate>
      <guid>https://community.databricks.com/t5/machine-learning/problem-serving-a-langchain-model-on-databricks/m-p/138022#M4414</guid>
      <dc:creator>Louis_Frolio</dc:creator>
      <dc:date>2025-11-06T18:53:52Z</dc:date>
    </item>
  </channel>
</rss>

