- 30413 Views
- 18 replies
- 33 kudos
Databricks Announces the Industry’s First Generative AI Engineer Learning Pathway and Certification
Today, we are announcing the industry's first Generative AI Engineer learning pathway and certification to help ensure that data and AI practitioners have the resources to be successful with generative AI. At Databricks, we recognize that generative ...
- 30413 Views
- 18 replies
- 33 kudos
- 4003 Views
- 1 replies
- 0 kudos
Issue in creating endpoint for quantized gguf model with llama-cpp-python
Hello, Databricks Community,I am experiencing an issue while trying to serve a quantized model in gguf format using Databricks serving with the llama-cpp-python library.The model is registered using MLflow and pyfunc on Unity. The model loads without...
- 4003 Views
- 1 replies
- 0 kudos
- 0 kudos
The error code 132 typically means an illegal instruction was encountered, often caused by a CPU incompatibility with the code being executed—especially with libraries that use SIMD or hardware acceleration (e.g., llama-cpp-python, which is often com...
- 0 kudos
- 4039 Views
- 1 replies
- 0 kudos
Not able to invoke model external model
I have followed below steps1) Created serving end point by for external model gpt-4-turbo and providing azure AI endpoint and key2) Now using langchain, i am trying to connect and invoke message from model in notebook model = ChatDatabricks(target_u...
- 4039 Views
- 1 replies
- 0 kudos
- 0 kudos
Based on your description, you are encountering a 500 Server Error when trying to use the Langchain ChatDatabricks integration with a Databricks Serving Endpoint connected to an external OpenAI GPT-4 Turbo model on Azure. This error usually indicates...
- 0 kudos
- 4420 Views
- 1 replies
- 0 kudos
Mlflow.evaluation fails to generate score
The execution of code stucks when evaluation of data start. eval_df = pd.DataFrame( { "inputs": [ "What is MLflow?", "What is Spark?", ], "ground_truth": [ "MLflow is an open-source platform f...
- 4420 Views
- 1 replies
- 0 kudos
- 0 kudos
The issue described—a Databricks notebook getting "stuck" during the evaluation phase using mlflow.evaluate—is most likely related to environment setup, model compatibility, or limitations with the mlflow.pyfunc.log_model and the evaluation utilities...
- 0 kudos
- 4872 Views
- 1 replies
- 0 kudos
I need a sample code or process which will help us to dynamically select the prompt template
We need a sample code or process which will help us to dynamically select the prompt template based on the prompt given as an input through the model legacy serving endpoint
- 4872 Views
- 1 replies
- 0 kudos
- 0 kudos
To dynamically select a prompt template in Databricks based on the input prompt received through a legacy model serving endpoint, you can implement a Python function that maps incoming prompts to specific templates. This often involves using conditio...
- 0 kudos
- 5258 Views
- 1 replies
- 0 kudos
How to perform combined search on structured and unstructured data in databrick using RAG or other
I created a RAG application in databricks which performs the following steps:1. Extract text from PDF files2. Prepare embeddings on extracted text and create vector search index3. Create a LLM model and served the model which can answer question base...
- 5258 Views
- 1 replies
- 0 kudos
- 0 kudos
You can achieve combined retrieval across both PDF-extracted unstructured data and multiple columns from structured Delta tables in Databricks, but there are important considerations and available patterns to optimize this workflow for your RAG appli...
- 0 kudos
- 3832 Views
- 1 replies
- 0 kudos
Issue with Multi agent supervisor based agentic framework
I have three agents in my multi agent framework (code attached) Supervisor : This is the main controller "Genie": #this is a genai agent "Coder": #this is a re-act agent created on the fly "Weather": #this is an exis...
- 3832 Views
- 1 replies
- 0 kudos
- 0 kudos
Your multi-agent framework has several issues affecting agent iteration control and error handling. Here’s a lined explanation and practical suggestions for each problem: Agents not stopping at one iteration Agents consistently reaching the max iter...
- 0 kudos
- 4374 Views
- 1 replies
- 0 kudos
Behavior of Vector Index Sync with Delta Tables When Using OVERWRITE vs MERGE in Databricks
I'm working with vector search in Databricks using vector index sync with Delta tables, and I'm a bit unclear on how updates to the source table affect the vector index, specifically when using different write operations.If I overwrite the source Del...
- 4374 Views
- 1 replies
- 0 kudos
- 0 kudos
Overwriting a Delta table versus using a MERGE operation has different impacts on Databricks vector index sync, especially when Change Data Feed (CDF) is enabled and your embeddings are generated via Azure OpenAI models. Overwrite Mode When you overw...
- 0 kudos
- 4090 Views
- 1 replies
- 0 kudos
AI/BI Genie - Components
I am almost certain that AI/BI Genie is using Azure OpenAi under the hood. Does anyone knows if Langchain has been used too ?
- 4090 Views
- 1 replies
- 0 kudos
- 0 kudos
AI/BI Genie is confirmed to use Azure OpenAI under the hood, as it leverages generative AI models for translating natural language into analytical queries and producing business intelligence insights. Additionally, there is strong evidence that Langc...
- 0 kudos
- 3916 Views
- 1 replies
- 1 kudos
Custom sentence transformer for indexing
Hi! i would like to use my own sentence transformer to create a vector index. It is not a problem using mlflow sentence-transformer flavour, it works fine with: mlflow.sentence_transformers.log_model( model, artifact_path="model", signatu...
- 3916 Views
- 1 replies
- 1 kudos
- 1 kudos
To use a custom MLflow pyfunc model for sentence-transformers with preprocessing, you need to comply with the expected interface of mlflow.pyfunc.PythonModel, especially the predict method. The method signature, data handling, and serialization are k...
- 1 kudos
- 208 Views
- 1 replies
- 0 kudos
None of the Connect to the custom MCP server examples work
hi! I'm following through this documentation >> https://docs.databricks.com/aws/en/generative-ai/mcp/custom-mcp?language=Agent+code+%28service+principal%29 I have successfully deployed the app in Databricks and can use it from the playground and from...
- 208 Views
- 1 replies
- 0 kudos
- 0 kudos
Greetings @smferro54epam , the errors point to two distinct issues: your custom MCP app expects OAuth-based Databricks credentials (not a raw bearer token), and the URL you pass to the HTTP transport must be a fully qualified https URL to the app’s /...
- 0 kudos
- 4529 Views
- 1 replies
- 0 kudos
Best Practices for Multilingual Model Training: Single vs. Multi-Model for Translation
Hello everyone,I’m working on a translation project involving documents up to 100 pages long, in 17 different languages, and I'm looking for the best approach to achieve high-quality translations in this multilingual context.Single model vs. multi-mo...
- 4529 Views
- 1 replies
- 0 kudos
- 0 kudos
Greetings @Maylin , A single many-to-many multilingual model, fine-tuned jointly across your 17 languages, is usually the best trade-off between quality, scalability, and operational simplicity; combine it with lightweight adapters to preserve quali...
- 0 kudos
- 4212 Views
- 2 replies
- 1 kudos
How to serve a RAG chain endpoint that supports streaming
Hello everyone,I am trying to serve a sample RAG chain model that should support streaming output. But I could not find any documantation on how to enable streaming for a serving endpoint for a langchain model. Could you provide some hints on how to ...
- 4212 Views
- 2 replies
- 1 kudos
- 1 kudos
Greetings @MarsWalker , here’s how to get true streaming from a served RAG chain on Databricks Model Serving. What’s going on The served endpoint error (“This endpoint does not support streaming”) is expected when the deployed entity doesn’t expos...
- 1 kudos
- 264 Views
- 1 replies
- 0 kudos
Provisioned Throughput is not supported for llama...
Hi,This question is tightly correlated with another discussion: Model deprecation issue while serving on Databrick... - Databricks Community - 131968In a nutshell, I'm trying to serve the model which is based on llama architecture (deployed through m...
- 264 Views
- 1 replies
- 0 kudos
- 0 kudos
Greetings @c4ndy , Thanks for sharing the context and links. I couldn’t read the Databricks Community thread from the URL you provided; please verify the URL and access settings, as it may not yet be available to Glean or require additional permissi...
- 0 kudos
- 3963 Views
- 1 replies
- 1 kudos
Request to share Genie space URL with restricted access
Dear Databricks Team,I hope this email finds you well.I would like to share a specific Genie Space URL with a user, but I want to ensure that the user only has access to view this particular Genie Space ask questions in that space and no other featur...
- 3963 Views
- 1 replies
- 1 kudos
- 1 kudos
Greetings @avadhut22111997 , sorry this fell through the cracks. I am Happy to help you lock down access to a single Genie Space so the user can only view it and ask questions there. What “view and ask only” means in Genie Give the user the Genie S...
- 1 kudos
- 4680 Views
- 1 replies
- 2 kudos
Data security when using Databricks Foundation Model endpoints
Hi Databricks,My client is considering using Databricks as the solution to their RAG chatbot framework. We are testing on the Databrick's demo tutorial on its own documentation data (pdf files). The tutorial is provided at: 00-RAG-chatbot-Introductio...
- 4680 Views
- 1 replies
- 2 kudos
- 2 kudos
Hey @tholers , here’s how Databricks handles data for the embedding model endpoints and what that means for tenant and data residency requirements. Short answer If you use Databricks-hosted embedding endpoints (for example, GTE-Large or BGE-Large), y...
- 2 kudos
Join Us as a Local Community Builder!
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now-
agent
1 -
agent bricks
1 -
agents
2 -
AI
1 -
AI Agents
5 -
ai gateway
1 -
Anthropic
1 -
API Documentation
1 -
App
1 -
Application
1 -
Authentication
1 -
automoation
1 -
Aws databricks
2 -
ChatDatabricks
1 -
claude
4 -
Cluster
1 -
Credentials
1 -
cursor
1 -
Databricks App
1 -
Databricks Course
1 -
Databricks Delta Table
1 -
Databricks Notebooks
1 -
Databricks SQL
1 -
Databricks-connect
1 -
delta sync
1 -
Delta Tables
1 -
Developer Experience
1 -
DLT Pipeline
1 -
Foundation Model
3 -
gemini
1 -
gemma
1 -
GenAI
8 -
GenAI agent
2 -
GenAI and LLMs
4 -
GenAI Generation AI
1 -
GenAIGeneration AI
26 -
Generation AI
2 -
Generative AI
4 -
Genie
15 -
Genie - Notebook Access
2 -
GenieAPI
3 -
Google
1 -
GPT
1 -
healthcare
1 -
Index
1 -
inference table
1 -
Information Extraction
1 -
Langchain
4 -
LangGraph
1 -
Llama
1 -
Llama 3.3
1 -
LLM
2 -
machine-learning
1 -
mcp
1 -
MlFlow
4 -
Mlflow registry
1 -
MLModels
1 -
Model Serving
1 -
mosic ai search
1 -
Multiagent
1 -
NPM error
1 -
OpenAI
1 -
Pandas udf
1 -
Playground
1 -
RAG
3 -
ro
1 -
Scheduling
1 -
Server
1 -
serving endpoint
1 -
streaming
1 -
Tasks
1 -
Vector
1 -
vector index
1 -
Vector Search
2 -
Vector search index
6
- « Previous
- Next »
| User | Count |
|---|---|
| 34 | |
| 21 | |
| 15 | |
| 9 | |
| 8 |