- 30382 Views
- 18 replies
- 33 kudos
Databricks Announces the Industry’s First Generative AI Engineer Learning Pathway and Certification
Today, we are announcing the industry's first Generative AI Engineer learning pathway and certification to help ensure that data and AI practitioners have the resources to be successful with generative AI. At Databricks, we recognize that generative ...
- 30382 Views
- 18 replies
- 33 kudos
- 1354 Views
- 1 replies
- 1 kudos
Question about response time by Llama 3.3 70B
Hey everyone !So I'm new into Databricks and I'm learning about the possibilities offered by Mosaic AI Foundation Model Serving. I'm mostly following the Azure's documentation to learn about it.In my testing, I've created 4 unity catalog functions vi...
- 1354 Views
- 1 replies
- 1 kudos
- 1 kudos
Llama 3.3 normally offers faster inference speeds compared to earlier versions. It provides approximately 40% faster responses and reduced batch processing time However, the usual performance for Mosaic AI Model Serving are also influenced by configu...
- 1 kudos
- 6106 Views
- 10 replies
- 7 kudos
Resolved! Insufficient Permission Error When Serving RAG Model with Multiple Vector Search Indexes
Hi Community,I’m currently working on a Retrieval-Augmented Generation (RAG) use case in Databricks. I’ve successfully implemented and served a model that uses a single Vector Search index, and everything works as expected.However, when I try to serv...
- 6106 Views
- 10 replies
- 7 kudos
- 2203 Views
- 3 replies
- 0 kudos
Vector Index format.
On of the key benefits of a delta table format is that it's open. Is this also the case for vector indexes as well? If so where could I find it's specification?In Databricks today, we see that we can create and manage a vector index using API's. (ht...
- 2203 Views
- 3 replies
- 0 kudos
- 0 kudos
HI Team, I am trying to update an existing vector search index with new data which is in another delta table. But i have no luck figuring out how to do it using Python SDK. Can you please hekp point to the right resources?
- 0 kudos
- 4388 Views
- 1 replies
- 0 kudos
Is It Possible to use genie as tool in agents?
Hello, community,I am currently exploring the agentic workflow surrounding Genie and would like to know if there are ways to effectively integrate or incorporate Genie as a tool within my existing agentic workflow. I am open to utilizing frameworks s...
- 4388 Views
- 1 replies
- 0 kudos
- 0 kudos
You may find this useful for your use case - https://gist.github.com/prithvikannan/82e789730c2fceec11932816bda50e59
- 0 kudos
- 1863 Views
- 1 replies
- 1 kudos
Resolved! AI/BI Genie Space - 20 QPM
Hi Databricks Community,I'm working with Genie Spaces and came across the documented limit of 20 questions per minute per workspace. Could someone please confirm whether this is a soft limit, and if it can be relaxed or increased upon request through...
- 1863 Views
- 1 replies
- 1 kudos
- 1 kudos
Hello @Rumesh! The 20 questions per minute per workspace limit is fixed. To double-check for enterprise or production use cases, I recommend contacting your Account Executive or Databricks Support to confirm or explore possible options.
- 1 kudos
- 4276 Views
- 2 replies
- 1 kudos
Databricks Serving Endpoint 400 Error: Model Response Format Issue while langgraph tool calling
Dear Databricks Community,I am seeking assistance with an issue I've encountered while deploying a model on Databricks. When invoking the serving endpoint, I receive the following error message sometimes:400 Client Error: Error: Model response did no...
- 4276 Views
- 2 replies
- 1 kudos
- 1 kudos
Same issue here... developing a langraph tool-calling agent, and for certain (but not all) questions, I get the same error.Any luck resolving?
- 1 kudos
- 590 Views
- 1 replies
- 0 kudos
Compute Claster Creation Failing
Everytime we try to roll out compute cluster it fails.
- 590 Views
- 1 replies
- 0 kudos
- 0 kudos
Hey @SomLTIM , Can you provide any additional information here to help troubleshoot? It would be helpful to know the region, cloud, error message, and any configurations you can provide on the cluster.
- 0 kudos
- 7946 Views
- 10 replies
- 0 kudos
Gen AI course material
Databricks updated the Generative AI course https://partner-academy.databricks.com/learn/lp/315/generative-ai-engineering-pathway but the course material is missing in the partner academy. Does anybody know where to download the course material?
- 7946 Views
- 10 replies
- 0 kudos
- 0 kudos
This course is available in the Academy. The link may be different if you are in Customer, Partner, or general Academy. You should be able to find it by searching the course catalog for "Databricks Generative AI Fundamentals Learning Plan"
- 0 kudos
- 1509 Views
- 1 replies
- 0 kudos
Unable to read message from Genie Conversation API
Hi Team,I am facing issue in getting the message details from the genie conversation API. I have created the conversation, which returns the message ID and conversation ID. But I am unable to see the message. I am getting error, when hitting the Get ...
- 1509 Views
- 1 replies
- 0 kudos
- 0 kudos
@RitikaKulshrest I can see what's happening with your Genie conversation API issue. The problem is not with your API call but with the Genie service's attempt to execute your query.The response indicates that your message was received but failed to p...
- 0 kudos
- 1531 Views
- 3 replies
- 0 kudos
Genie Agent integration issue
Hi, anyone from development team for Genie Agent integration? i had an issue of using sample code of Genie Agent integration. The issue is that underlying code (databricks_ai_bridge/genie.py) cannot connect to url openaipublic.blob.core.windows.net ...
- 1531 Views
- 3 replies
- 0 kudos
- 0 kudos
You're absolutely right. The implementation should definitely take restricted network scenarios into consideration, as many enterprise Databricks deployments operate in air-gapped or network-restricted environments.Commenting out the token counting l...
- 0 kudos
- 4630 Views
- 2 replies
- 2 kudos
Resolved! LLM with the largest context window
A Generative AI Engineer is tasked with developing an application that is based on an open-source large language model (LLM). They need a foundation LLM with a large context window. Which model fits this need?DBRX,Llama2-70B,DistilBert MPT-30B.DBRX h...
- 4630 Views
- 2 replies
- 2 kudos
- 2 kudos
@royinblr11 You're absolutely right to question the answer — the correct model for an application needing a foundation LLM with a large context window is: DBRXWhy DBRX is the Best Fit:It is a foundation model, designed for generation tasks.It support...
- 2 kudos
- 979 Views
- 3 replies
- 0 kudos
Are serverless endpoints possible in this Technical Blog post by qianyu?
Hi, are serverless endpoints possible for Whisper and Llama in this Technical Blog post by qianyu?https://community.databricks.com/t5/technical-blog/streamline-customer-call-center-transcripts-analytics-with/ba-p/101689Thanks!
- 979 Views
- 3 replies
- 0 kudos
- 0 kudos
https://www.databricks.com/product/pricing/foundation-model-servingWe don´t serve models on databricks, but as far as i can see you pay per input/output tokens (for foundation models).For classic models:https://www.databricks.com/product/pricing/mode...
- 0 kudos
- 3012 Views
- 4 replies
- 2 kudos
Resolved! MLFlow Authentication from Databricks App for GenAI Tracing
I am working on a Dash-based app that includes a call to a Databricks-hosted LLM endpoint. I am trying to track those calls with MLFlow. My code is (roughly) like this:from openai import OpenAIimport mlflowmlflow.set_tracking_uri("databricks")mlflow....
- 3012 Views
- 4 replies
- 2 kudos
- 2 kudos
Thank you for the push in the right direction! I was able to solve the issue with this codeos.environ["DATABRICKS_CLIENT_ID"] = "" os.environ["DATABRICKS_CLIENT_SECRET"] = "" os.environ["DATABRICKS_TOKEN"] = os.environ.get("VAR_CONFIGURED_WITH_DATABR...
- 2 kudos
- 9585 Views
- 2 replies
- 1 kudos
AI/BI Genie API
Hi everyone,I'm currently working on a project that implements a custom front end for Genie using the Genie API: https://docs.databricks.com/api/workspace/genie. So far, I’ve successfully built a working model that can send and receive messages.Howev...
- 9585 Views
- 2 replies
- 1 kudos
- 1 kudos
Is there any possibility that workspace admin could have removed this permission to you? If you check the permissions on your side do you see yourself with at least Can View?
- 1 kudos
- 3280 Views
- 1 replies
- 0 kudos
Resolved! Roadmap for vector_search function
I was wondering if there was a roadmap for the development of the vector_search function: vector_search function | Databricks DocumentationSpecifically, I was wondering if / when the following limitations may be lifted:Querying DIRECT_ACCESS index ty...
- 3280 Views
- 1 replies
- 0 kudos
- 0 kudos
Hello @jAAmes_bentley, DIRECT_ACCESS & filters_json are not currently supported with vector_search sql function. These are on our roadmap, but we don’t have concrete ETAs to share at the moment as we’re focusing on other high-priority tasks. Hybrid s...
- 0 kudos
Join Us as a Local Community Builder!
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now-
agent
1 -
agent bricks
1 -
agents
2 -
AI
1 -
AI Agents
5 -
ai gateway
1 -
Anthropic
1 -
API Documentation
1 -
App
1 -
Application
1 -
Authentication
1 -
automoation
1 -
Aws databricks
2 -
ChatDatabricks
1 -
claude
4 -
Cluster
1 -
Credentials
1 -
cursor
1 -
Databricks App
1 -
Databricks Course
1 -
Databricks Delta Table
1 -
Databricks Notebooks
1 -
Databricks SQL
1 -
Databricks-connect
1 -
delta sync
1 -
Delta Tables
1 -
Developer Experience
1 -
DLT Pipeline
1 -
Foundation Model
3 -
gemini
1 -
gemma
1 -
GenAI
8 -
GenAI agent
2 -
GenAI and LLMs
4 -
GenAI Generation AI
1 -
GenAIGeneration AI
26 -
Generation AI
2 -
Generative AI
4 -
Genie
15 -
Genie - Notebook Access
2 -
GenieAPI
3 -
Google
1 -
GPT
1 -
healthcare
1 -
Index
1 -
inference table
1 -
Information Extraction
1 -
Langchain
4 -
LangGraph
1 -
Llama
1 -
Llama 3.3
1 -
LLM
2 -
machine-learning
1 -
mcp
1 -
MlFlow
4 -
Mlflow registry
1 -
MLModels
1 -
Model Serving
1 -
mosic ai search
1 -
Multiagent
1 -
NPM error
1 -
OpenAI
1 -
Pandas udf
1 -
Playground
1 -
RAG
3 -
ro
1 -
Scheduling
1 -
Server
1 -
serving endpoint
1 -
streaming
1 -
Tasks
1 -
Vector
1 -
vector index
1 -
Vector Search
2 -
Vector search index
6
- « Previous
- Next »
| User | Count |
|---|---|
| 34 | |
| 21 | |
| 15 | |
| 9 | |
| 8 |