- 902 Views
- 1 replies
- 1 kudos
Resolved! streaming llm response
I am deploying an agent that works good withouth streaming:it is using the following packages: "mlflow==2.22.1", "langgraph", "langchain", "pydantic==2.8.2", "langgraph-checkpoint-sqlite", "databricks-langchain", "p...
- 902 Views
- 1 replies
- 1 kudos
- 1 kudos
To implement streaming output for your agent in Databricks and resolve the error "This model does not support predict_stream method.", the key requirement is that your underlying MLflow model must support the predict_stream method. Most l...
- 1 kudos
- 6174 Views
- 4 replies
- 0 kudos
Resolved! LangGraph MemorySaver checkpointer usage with MLflow
Hi everyone.I am working on a graph that utilizes the MemorySaver class to incorporate short-term memory. This will enable me to maintain a multi-turn conversation with the user by storing the chat history.I am using the MLflow "models from code" fea...
- 6174 Views
- 4 replies
- 0 kudos
- 0 kudos
Hi @moemedina. No, I didn't.I'm considering using ChatModel/ChatAgent class to wrap the graph and be able to move on. However, the MLflow documentation is still referring to ChatModel where Chat Agent is the latest recommendation:MLflow ChatModel Doc...
- 0 kudos
- 3364 Views
- 1 replies
- 1 kudos
I am struggling on complicated RAG code for Mosaic evaluation deployment
I want to know how I can apply some complicated RAG designs so I can directly deploy on the Mosaic AI evaluation UI. There are two cases:1. two or more index retrieval as independent tools (parallel retrievals)2.two index retrieval processes that dep...
- 3364 Views
- 1 replies
- 1 kudos
- 2355 Views
- 2 replies
- 0 kudos
Retrieved Docs in Message History
Hi,I've deployed a Langchain model via MLFlow log model and a deployment agent (following this https://notebooks.databricks.com/demos/llm-rag-chatbot/index.html#)How do I add additional content to the message history for example, I'd like to add retr...
- 2355 Views
- 2 replies
- 0 kudos
- 0 kudos
Hi @davidhuang thanks for pointing to that demo! That implementation of chat history did work in my experience, but it's a little simplistic and could be token-intensive if the history needs to be maintained as a list like that. Has Databricks done a...
- 0 kudos
-
agent
1 -
agents
2 -
AI
1 -
AI Agents
4 -
ai gateway
1 -
API Documentation
1 -
App
1 -
Application
1 -
automoation
1 -
Aws databricks
2 -
ChatDatabricks
1 -
claude
2 -
Cluster
1 -
Credentials
1 -
cursor
1 -
Databricks App
1 -
Databricks Course
1 -
Databricks Delta Table
1 -
Databricks Notebooks
1 -
Databricks SQL
1 -
Databricks-connect
1 -
delta sync
1 -
Delta Tables
1 -
Developer Experience
1 -
DLT Pipeline
1 -
Foundation Model
3 -
gemma
1 -
GenAI
7 -
GenAI agent
2 -
GenAI and LLMs
3 -
GenAI Generation AI
1 -
GenAIGeneration AI
13 -
Generation AI
2 -
Generative AI
4 -
Genie
13 -
Genie - Notebook Access
2 -
GenieAPI
2 -
Index
1 -
inference table
1 -
Langchain
4 -
LangGraph
1 -
Llama
1 -
Llama 3.3
1 -
LLM
2 -
machine-learning
1 -
MlFlow
4 -
Mlflow registry
1 -
MLModels
1 -
Model Serving
1 -
mosic ai search
1 -
Multiagent
1 -
NPM error
1 -
Pandas udf
1 -
RAG
2 -
ro
1 -
Scheduling
1 -
Server
1 -
serving endpoint
1 -
streaming
1 -
Tasks
1 -
Vector
1 -
vector index
1 -
Vector Search
2 -
Vector search index
6