cancel
Showing results for 
Search instead for 
Did you mean: 
Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

dk_g
by New Contributor
  • 350 Views
  • 0 replies
  • 0 kudos

How to utilize clustered gpu for large hf models

Hi,I am using clustered GPU(driver -1GPU and Worker-3GPU), and caching model data into unity catalog but while loading model checkpoint shards its always use driver memory and failed due insufficient memory.How to use complete cluster GPU while loadi...

  • 350 Views
  • 0 replies
  • 0 kudos
Karthik_Karanm
by New Contributor III
  • 5002 Views
  • 10 replies
  • 7 kudos

Resolved! Insufficient Permission Error When Serving RAG Model with Multiple Vector Search Indexes

Hi Community,I’m currently working on a Retrieval-Augmented Generation (RAG) use case in Databricks. I’ve successfully implemented and served a model that uses a single Vector Search index, and everything works as expected.However, when I try to serv...

  • 5002 Views
  • 10 replies
  • 7 kudos
Latest Reply
lingareddy_Alva
Honored Contributor III
  • 7 kudos

Thank you

  • 7 kudos
9 More Replies
sebascardonal
by New Contributor III
  • 5772 Views
  • 4 replies
  • 0 kudos

Resolved! LangGraph MemorySaver checkpointer usage with MLflow

Hi everyone.I am working on a graph that utilizes the MemorySaver class to incorporate short-term memory. This will enable me to maintain a multi-turn conversation with the user by storing the chat history.I am using the MLflow "models from code" fea...

  • 5772 Views
  • 4 replies
  • 0 kudos
Latest Reply
sebascardonal
New Contributor III
  • 0 kudos

Hi @moemedina. No, I didn't.I'm considering using ChatModel/ChatAgent class to wrap the graph and be able to move on. However, the MLflow documentation is still referring to ChatModel where Chat Agent is the latest recommendation:MLflow ChatModel Doc...

  • 0 kudos
3 More Replies