Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.

Forum Posts

Sujitha
by Databricks Employee
  • 30382 Views
  • 18 replies
  • 33 kudos

Databricks Announces the Industry’s First Generative AI Engineer Learning Pathway and Certification

Today, we are announcing the industry's first Generative AI Engineer learning pathway and certification to help ensure that data and AI practitioners have the resources to be successful with generative AI. At Databricks, we recognize that generative ...

Latest Reply
AIChief
New Contributor II
  • 33 kudos

thanks for sharing

  • 33 kudos
17 More Replies
brahaman
by New Contributor II
  • 1354 Views
  • 1 reply
  • 1 kudos

Question about response time by Llama 3.3 70B

Hey everyone! So I'm new to Databricks and I'm learning about the possibilities offered by Mosaic AI Foundation Model Serving. I'm mostly following the Azure documentation to learn about it. In my testing, I've created 4 Unity Catalog functions vi...

Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

Llama 3.3 normally offers faster inference speeds compared to earlier versions. It provides approximately 40% faster responses and reduced batch processing time. However, the usual performance of Mosaic AI Model Serving is also influenced by configu...

  • 1 kudos
Karthik_Karanm
by New Contributor III
  • 6106 Views
  • 10 replies
  • 7 kudos

Resolved! Insufficient Permission Error When Serving RAG Model with Multiple Vector Search Indexes

Hi Community, I’m currently working on a Retrieval-Augmented Generation (RAG) use case in Databricks. I’ve successfully implemented and served a model that uses a single Vector Search index, and everything works as expected. However, when I try to serv...

Latest Reply
lingareddy_Alva
Honored Contributor III
  • 7 kudos

Thank you

  • 7 kudos
9 More Replies
skrishnaprasad
by New Contributor III
  • 2203 Views
  • 3 replies
  • 0 kudos

Vector Index format.

One of the key benefits of the Delta table format is that it's open. Is this also the case for vector indexes? If so, where could I find its specification? In Databricks today, we see that we can create and manage a vector index using APIs. (ht...

Latest Reply
TejaJuttu
New Contributor II
  • 0 kudos

Hi team, I am trying to update an existing vector search index with new data that is in another Delta table, but I have had no luck figuring out how to do it using the Python SDK. Can you please help point me to the right resources?

  • 0 kudos
2 More Replies
mayanksharma
by New Contributor
  • 4388 Views
  • 1 reply
  • 0 kudos

Is it possible to use Genie as a tool in agents?

Hello, community, I am currently exploring the agentic workflow surrounding Genie and would like to know if there are ways to effectively integrate or incorporate Genie as a tool within my existing agentic workflow. I am open to utilizing frameworks s...

Latest Reply
kamal_ch
Databricks Employee
  • 0 kudos

You may find this useful for your use case - https://gist.github.com/prithvikannan/82e789730c2fceec11932816bda50e59

  • 0 kudos
Rumesh
by New Contributor
  • 1863 Views
  • 1 reply
  • 1 kudos

Resolved! AI/BI Genie Space - 20 QPM

Hi Databricks Community, I'm working with Genie Spaces and came across the documented limit of 20 questions per minute per workspace. Could someone please confirm whether this is a soft limit, and if it can be relaxed or increased upon request through...

Latest Reply
Advika
Databricks Employee
  • 1 kudos

Hello @Rumesh! The 20 questions per minute per workspace limit is fixed. To double-check for enterprise or production use cases, I recommend contacting your Account Executive or Databricks Support to confirm or explore possible options.

  • 1 kudos
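
Since the reply above describes the 20 questions per minute limit as fixed, a client that sends questions in bursts may want to pace itself. What follows is a minimal sketch of a sliding-window limiter, not anything provided by Databricks: the class name `SlidingWindowLimiter` and its parameters are hypothetical, and because the limit is per workspace, a limiter inside one process only helps if that process is the main source of traffic.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_calls` calls in any rolling `period` seconds."""

    def __init__(self, max_calls=20, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock    # injectable so the limiter can be tested
        self._sleep = sleep
        self._calls = deque()  # timestamps of recent calls, oldest first

    def acquire(self):
        """Block until a call is allowed, record it, return seconds waited."""
        waited = 0.0
        while True:
            now = self._clock()
            # Drop timestamps that have left the rolling window.
            while self._calls and now - self._calls[0] >= self.period:
                self._calls.popleft()
            if len(self._calls) < self.max_calls:
                self._calls.append(now)
                return waited
            # Window is full: wait until the oldest call expires.
            delay = self.period - (now - self._calls[0])
            self._sleep(delay)
            waited += delay
```

Calling `limiter.acquire()` immediately before each question sent to Genie keeps a single client at or below the configured rate.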
kirti-11
by New Contributor II
  • 4276 Views
  • 2 replies
  • 1 kudos

Databricks Serving Endpoint 400 Error: Model Response Format Issue During LangGraph Tool Calling

Dear Databricks Community, I am seeking assistance with an issue I've encountered while deploying a model on Databricks. When invoking the serving endpoint, I sometimes receive the following error message: 400 Client Error: Error: Model response did no...

Latest Reply
jericksoncea
New Contributor III
  • 1 kudos

Same issue here... I'm developing a LangGraph tool-calling agent, and for certain (but not all) questions, I get the same error. Any luck resolving it?

  • 1 kudos
1 More Replies
SomLTIM
by New Contributor
  • 590 Views
  • 1 reply
  • 0 kudos

Compute Cluster Creation Failing

Every time we try to roll out a compute cluster, it fails.

Latest Reply
Shua42
Databricks Employee
  • 0 kudos

Hey @SomLTIM, can you provide any additional information here to help troubleshoot? It would be helpful to know the region, cloud, error message, and any cluster configuration you can share.

  • 0 kudos
kumarsuresh
by New Contributor III
  • 7946 Views
  • 10 replies
  • 0 kudos

Gen AI course material

Databricks updated the Generative AI course https://partner-academy.databricks.com/learn/lp/315/generative-ai-engineering-pathway but the course material is missing in the partner academy. Does anybody know where to download the course material? 

Latest Reply
ScottSmithDB
Databricks Employee
  • 0 kudos

This course is available in the Academy.  The link may be different if you are in Customer, Partner, or general Academy.  You should be able to find it by searching the course catalog for "Databricks Generative AI Fundamentals Learning Plan"

  • 0 kudos
9 More Replies
RitikaKulshrest
by New Contributor
  • 1509 Views
  • 1 reply
  • 0 kudos

Unable to read message from Genie Conversation API

Hi team, I am having trouble getting the message details from the Genie Conversation API. I have created the conversation, which returns the message ID and conversation ID, but I am unable to see the message. I am getting an error when hitting the Get ...

Latest Reply
lingareddy_Alva
Honored Contributor III
  • 0 kudos

@RitikaKulshrest I can see what's happening with your Genie conversation API issue. The problem is not with your API call but with the Genie service's attempt to execute your query. The response indicates that your message was received but failed to p...

  • 0 kudos
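
A message created through the Genie Conversation API is processed asynchronously, so a client typically polls the message until it reaches a final state and then inspects the result or the error. The sketch below shows only that polling loop. It is not taken from the thread: `get_message` is a hypothetical callable wrapping the Get message request, and the `status` field and the set of terminal values are assumptions to verify against the API reference.

```python
import time

# Assumed terminal states; the API reference has the authoritative list.
TERMINAL_STATUSES = {"COMPLETED", "FAILED", "CANCELLED"}


def wait_for_message(get_message, timeout=120.0, interval=2.0,
                     clock=time.monotonic, sleep=time.sleep):
    """Poll `get_message()` until the message reaches a terminal status.

    `get_message` is a hypothetical zero-argument callable that returns the
    message as a dict (for example, the parsed JSON of a Get message call).
    """
    deadline = clock() + timeout
    while True:
        message = get_message()
        status = message.get("status")  # field name is an assumption
        if status in TERMINAL_STATUSES:
            return message
        if clock() >= deadline:
            raise TimeoutError(
                f"message still in status {status!r} after {timeout}s"
            )
        sleep(interval)
```

A caller would then branch on the returned status, for example logging the message's error details when it ends in a failed state instead of trying to read a query result.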
yj940525
by New Contributor III
  • 1531 Views
  • 3 replies
  • 0 kudos

Genie Agent integration issue

Hi, is anyone from the development team for the Genie Agent integration here? I had an issue using the sample code for the Genie Agent integration. The issue is that the underlying code (databricks_ai_bridge/genie.py) cannot connect to the URL openaipublic.blob.core.windows.net ...

Latest Reply
lingareddy_Alva
Honored Contributor III
  • 0 kudos

You're absolutely right. The implementation should definitely take restricted network scenarios into consideration, as many enterprise Databricks deployments operate in air-gapped or network-restricted environments. Commenting out the token counting l...

  • 0 kudos
2 More Replies
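
One way to handle token counting in a network-restricted environment, without editing library code, is to fall back to a rough estimate when the tokenizer cannot be loaded. This is a sketch of that general idea only; it is not the change discussed in the thread, and `make_token_counter`, `load_encoder`, and the four-characters-per-token heuristic are all assumptions made for illustration.

```python
def make_token_counter(load_encoder):
    """Return a `count(text)` function that survives a failed encoder load.

    `load_encoder` is a hypothetical zero-argument callable returning an
    object with an `encode(text)` method, such as a tokenizer that downloads
    its vocabulary on first use.
    """
    try:
        encoder = load_encoder()
    except Exception:
        encoder = None  # e.g. the download was blocked by network policy

    def count(text):
        if encoder is not None:
            return len(encoder.encode(text))
        if not text:
            return 0
        # Rough heuristic (assumption): about 4 characters per token.
        return max(1, len(text) // 4)

    return count
```

If the tokenizer in question is tiktoken (an assumption here), the loader could be `lambda: tiktoken.get_encoding("cl100k_base")`. The estimate is only suitable for soft budgeting such as truncation, not for exact billing figures.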
royinblr11
by New Contributor II
  • 4630 Views
  • 2 replies
  • 2 kudos

Resolved! LLM with the largest context window

A Generative AI Engineer is tasked with developing an application that is based on an open-source large language model (LLM). They need a foundation LLM with a large context window. Which model fits this need? DBRX, Llama2-70B, DistilBERT, MPT-30B. DBRX h...

Latest Reply
lingareddy_Alva
Honored Contributor III
  • 2 kudos

@royinblr11 You're absolutely right to question the answer. The correct model for an application needing a foundation LLM with a large context window is DBRX. Why DBRX is the best fit: It is a foundation model, designed for generation tasks. It support...

  • 2 kudos
1 More Replies
brandt6264
by New Contributor II
  • 979 Views
  • 3 replies
  • 0 kudos

Are serverless endpoints possible in this Technical Blog post by qianyu?

Hi, are serverless endpoints possible for Whisper and Llama in this Technical Blog post by qianyu? https://community.databricks.com/t5/technical-blog/streamline-customer-call-center-transcripts-analytics-with/ba-p/101689 Thanks!

Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

https://www.databricks.com/product/pricing/foundation-model-serving We don't serve models on Databricks, but as far as I can see, you pay per input/output token (for foundation models). For classic models: https://www.databricks.com/product/pricing/mode...

  • 0 kudos
2 More Replies
pemidexx
by New Contributor III
  • 3012 Views
  • 4 replies
  • 2 kudos

Resolved! MLflow Authentication from Databricks App for GenAI Tracing

I am working on a Dash-based app that includes a call to a Databricks-hosted LLM endpoint. I am trying to track those calls with MLflow. My code is (roughly) like this:
from openai import OpenAI
import mlflow
mlflow.set_tracking_uri("databricks")
mlflow....

Latest Reply
pemidexx
New Contributor III
  • 2 kudos

Thank you for the push in the right direction! I was able to solve the issue with this code:
os.environ["DATABRICKS_CLIENT_ID"] = ""
os.environ["DATABRICKS_CLIENT_SECRET"] = ""
os.environ["DATABRICKS_TOKEN"] = os.environ.get("VAR_CONFIGURED_WITH_DATABR...

  • 2 kudos
3 More Replies
GabrielS
by New Contributor II
  • 9585 Views
  • 2 replies
  • 1 kudos

AI/BI Genie API

Hi everyone, I'm currently working on a project that implements a custom front end for Genie using the Genie API: https://docs.databricks.com/api/workspace/genie. So far, I’ve successfully built a working model that can send and receive messages. Howev...

Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

Is there any possibility that a workspace admin removed this permission from you? If you check the permissions on your side, do you see yourself with at least Can View?

  • 1 kudos
1 More Replies
jAAmes_bentley
by Contributor
  • 3280 Views
  • 1 reply
  • 0 kudos

Resolved! Roadmap for vector_search function

I was wondering if there is a roadmap for the development of the vector_search function: vector_search function | Databricks Documentation. Specifically, I was wondering if or when the following limitations may be lifted: Querying DIRECT_ACCESS index ty...

Latest Reply
Vinay_M_R
Databricks Employee
  • 0 kudos

Hello @jAAmes_bentley, DIRECT_ACCESS and filters_json are not currently supported with the vector_search SQL function. These are on our roadmap, but we don’t have concrete ETAs to share at the moment as we’re focusing on other high-priority tasks. Hybrid s...

  • 0 kudos
