New Contributor III
since 02-15-2024

User Stats

  • 5 Posts
  • 0 Solutions
  • 2 Kudos given
  • 3 Kudos received

User Activity

Hello, I've been trying to serve registered MLflow models on a GPU Model Serving endpoint, which works except for models that use the bitsandbytes library. The library is used to quantise LLMs to 4-bit/8-bit (e.g. Mistral-7B); however, it runs...