<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Mosaic Vector Search in Generative AI</title>
    <link>https://community.databricks.com/t5/generative-ai/mosaic-vector-search/m-p/93634#M595</link>
    <description>&lt;P&gt;I built a RAG application over a corpus of PDFs that I have on ADLS. Once I parse all the PDFs, where will the chunked text and vector embeddings be stored so that I can start retrieval?&lt;/P&gt;</description>
    <pubDate>Fri, 11 Oct 2024 14:28:48 GMT</pubDate>
    <dc:creator>nagND</dc:creator>
    <dc:date>2024-10-11T14:28:48Z</dc:date>
    <item>
      <title>Mosaic Vector Search</title>
      <link>https://community.databricks.com/t5/generative-ai/mosaic-vector-search/m-p/93634#M595</link>
      <description>&lt;P&gt;I built a RAG application over a corpus of PDFs that I have on ADLS. Once I parse all the PDFs, where will the chunked text and vector embeddings be stored so that I can start retrieval?&lt;/P&gt;</description>
      <pubDate>Fri, 11 Oct 2024 14:28:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/mosaic-vector-search/m-p/93634#M595</guid>
      <dc:creator>nagND</dc:creator>
      <dc:date>2024-10-11T14:28:48Z</dc:date>
    </item>
    <item>
      <title>Re: Mosaic Vector Search</title>
      <link>https://community.databricks.com/t5/generative-ai/mosaic-vector-search/m-p/94770#M612</link>
      <description>&lt;P&gt;&lt;STRONG&gt;Option 1: Delta Sync Index with embeddings computed by Databricks&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;You provide a source Delta table that contains data in text format. Databricks calculates the embeddings, using a model that you specify, and optionally saves the embeddings to a table in Unity Catalog. As the Delta table is updated, the index stays synced with the Delta table.&lt;/P&gt;&lt;P&gt;So with this option, your chunked text lives in the source Delta table, and the embeddings are stored in the vector search index and, optionally, in a Unity Catalog table.&lt;/P&gt;&lt;P&gt;The following diagram illustrates the process:&lt;/P&gt;&lt;OL class=""&gt;&lt;LI&gt;&lt;P&gt;Calculate query embeddings. The query can include metadata filters.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Perform a similarity search to identify the most relevant documents.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Return the most relevant documents and append them to the query.&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;DIV class=""&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="LauJohansson_3-1729240335633.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/12060i43A9474A2D888EAC/image-size/medium?v=v2&amp;amp;px=400" role="button" title="LauJohansson_3-1729240335633.png" alt="LauJohansson_3-1729240335633.png" /&gt;&lt;/span&gt;&lt;/DIV&gt;&lt;P&gt;From the docs:&lt;/P&gt;&lt;P&gt;&lt;EM&gt;"Mosaic AI Vector Search is a vector database that is built into the Databricks Data Intelligence Platform and integrated with its governance and productivity tools. A vector database is a database that is optimized to store and retrieve embeddings. Embeddings are mathematical representations of the semantic content of data, typically text or image data. Embeddings are generated by a large language model and are a key component of many GenAI applications that depend on finding documents or images that are similar to each other. Examples are RAG systems, recommender systems, and image and video recognition.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;With Mosaic AI Vector Search, you create a vector search index from a Delta table. The index includes embedded data with metadata. You can then query the index using a REST API to identify the most similar vectors and return the associated documents. You can structure the index to automatically sync when the underlying Delta table is updated."&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;Have you read these docs?&lt;BR /&gt;&lt;A href="https://docs.databricks.com/en/generative-ai/vector-search.html" target="_blank" rel="noopener"&gt;https://docs.databricks.com/en/generative-ai/vector-search.html&lt;/A&gt;&lt;BR /&gt;&lt;A href="https://docs.databricks.com/en/generative-ai/create-query-vector-search.html" target="_blank" rel="noopener"&gt;https://docs.databricks.com/en/generative-ai/create-query-vector-search.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 18 Oct 2024 08:32:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/mosaic-vector-search/m-p/94770#M612</guid>
      <dc:creator>LauJohansson</dc:creator>
      <dc:date>2024-10-18T08:32:27Z</dc:date>
    </item>
  </channel>
</rss>

