<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Agent Bricks Information Extraction in Generative AI</title>
    <link>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134591#M1212</link>
    <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/190672"&gt;@Yash01Kumar12&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1. For the Index creation failure: Invalid column type - variant is used for 'raw_parsed'.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The indexing mechanism, does not support the &lt;CODE&gt;VARIANT&lt;/CODE&gt; data type for columns that need to be indexed. It only supports specific types like various numeric types, strings, timestamps, and arrays of numeric types (for vector embeddings).&amp;nbsp;&lt;/P&gt;
&lt;P data-unlink="true"&gt;While Variant data type is &lt;A href="https://docs.databricks.com/aws/en/sql/language-manual/data-types/variant-type" target="_self"&gt;Public Preview&lt;/A&gt;, I see an internal feature request "&lt;SPAN class="record-reference-pill__identifier"&gt;DB-I-14338&lt;/SPAN&gt;" to support&amp;nbsp;&lt;SPAN&gt;Variant data type for Vector search. So, I suspect you would have to change the data type to proceed right now. I have added your usecase as well and a proxy vote.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P data-unlink="true"&gt;&lt;SPAN&gt;In &lt;A href="http://&amp;nbsp;https://docs.databricks.com/aws/en/delta/variant%20" target="_self"&gt;delta table&lt;/A&gt;, y&lt;/SPAN&gt;&lt;SPAN&gt;ou would have to use the below code to support it. But I do not think this would still help as it has to be supported in Vector Search.&lt;/SPAN&gt;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;ALTER TABLE table_name SET TBLPROPERTIES('delta.feature.variantType-preview' = 'supported')&lt;/LI-CODE&gt;
&lt;P&gt;So, for now, can you use a different data type for the column which is supported today?&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Nandini&lt;/P&gt;</description>
    <pubDate>Fri, 10 Oct 2025 22:21:02 GMT</pubDate>
    <dc:creator>NandiniN</dc:creator>
    <dc:date>2025-10-10T22:21:02Z</dc:date>
    <item>
      <title>Agent Bricks Information Extraction</title>
      <link>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134567#M1208</link>
      <description>&lt;P&gt;I am facing some problem in Information extraction using PDF. I have done all the necessary steps.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;1) I loaded the data in Volume.&lt;BR /&gt;2) I ran the Use PDF's functionality to create a structure table of the PDFs&lt;/P&gt;&lt;P&gt;3) I now have the table with the column names:&amp;nbsp;&lt;/P&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;path -&amp;gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN&gt;string&lt;BR /&gt;&lt;/SPAN&gt;&lt;SPAN&gt;raw_parsed -&amp;gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;variant&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;DIV class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;text&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;string -&amp;gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;error_status -&amp;gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;string&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;SPAN&gt;I am experiencing a problem while directly creating a Vector Search Index from this:&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV class=""&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Yash01Kumar12_0-1760117409670.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20667i020A6A40A606A97E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Yash01Kumar12_0-1760117409670.png" alt="Yash01Kumar12_0-1760117409670.png" /&gt;&lt;/span&gt;&lt;P&gt;Any reason why this is happening.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Second, facing problems in creating Information Extraction Agent, the error I am facing is&lt;/P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Yash01Kumar12_1-1760117530018.png" style="width: 400px;"&gt;&lt;img src="https://community.databricks.com/t5/image/serverpage/image-id/20668i8B0E70744DFB793E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Yash01Kumar12_1-1760117530018.png" alt="Yash01Kumar12_1-1760117530018.png" /&gt;&lt;/span&gt;&lt;P&gt;Requesting you to help me in understand what wrong I am doing throughout this process.&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;1) Databricks workspace region: East US 2 (Azure) it is also correct s per documentation&lt;/P&gt;&lt;P&gt;2) Able to use ai_query()&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Fri, 10 Oct 2025 17:34:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134567#M1208</guid>
      <dc:creator>Yash01Kumar12</dc:creator>
      <dc:date>2025-10-10T17:34:59Z</dc:date>
    </item>
    <item>
      <title>Re: Agent Bricks Information Extraction</title>
      <link>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134591#M1212</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/190672"&gt;@Yash01Kumar12&lt;/a&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1. For the Index creation failure: Invalid column type - variant is used for 'raw_parsed'.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;The indexing mechanism, does not support the &lt;CODE&gt;VARIANT&lt;/CODE&gt; data type for columns that need to be indexed. It only supports specific types like various numeric types, strings, timestamps, and arrays of numeric types (for vector embeddings).&amp;nbsp;&lt;/P&gt;
&lt;P data-unlink="true"&gt;While Variant data type is &lt;A href="https://docs.databricks.com/aws/en/sql/language-manual/data-types/variant-type" target="_self"&gt;Public Preview&lt;/A&gt;, I see an internal feature request "&lt;SPAN class="record-reference-pill__identifier"&gt;DB-I-14338&lt;/SPAN&gt;" to support&amp;nbsp;&lt;SPAN&gt;Variant data type for Vector search. So, I suspect you would have to change the data type to proceed right now. I have added your usecase as well and a proxy vote.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P data-unlink="true"&gt;&lt;SPAN&gt;In &lt;A href="http://&amp;nbsp;https://docs.databricks.com/aws/en/delta/variant%20" target="_self"&gt;delta table&lt;/A&gt;, y&lt;/SPAN&gt;&lt;SPAN&gt;ou would have to use the below code to support it. But I do not think this would still help as it has to be supported in Vector Search.&lt;/SPAN&gt;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;ALTER TABLE table_name SET TBLPROPERTIES('delta.feature.variantType-preview' = 'supported')&lt;/LI-CODE&gt;
&lt;P&gt;So, for now, can you use a different data type for the column which is supported today?&lt;/P&gt;
&lt;P&gt;Thanks &amp;amp; Regards,&lt;/P&gt;
&lt;P&gt;Nandini&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 22:21:02 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134591#M1212</guid>
      <dc:creator>NandiniN</dc:creator>
      <dc:date>2025-10-10T22:21:02Z</dc:date>
    </item>
    <item>
      <title>Re: Agent Bricks Information Extraction</title>
      <link>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134593#M1214</link>
      <description>&lt;P&gt;For the issue 2:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;INVALID_PARAMETER_VALUE: Couldn't find enough valid rows in the selected table. Found 0 rows, minimum required is 1 for agent creation.
&lt;/LI-CODE&gt;
&lt;P&gt;Since the previous step was failing due to a data type issue (&lt;CODE&gt;VARIANT&lt;/CODE&gt;), it's highly likely that the &lt;I&gt;entire table&lt;/I&gt; is being marked as invalid or unusable by the Agent framework's sampling mechanism.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 10 Oct 2025 22:26:22 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134593#M1214</guid>
      <dc:creator>NandiniN</dc:creator>
      <dc:date>2025-10-10T22:26:22Z</dc:date>
    </item>
    <item>
      <title>Re: Agent Bricks Information Extraction</title>
      <link>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134665#M1221</link>
      <description>&lt;P&gt;Thank You&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/23233"&gt;@NandiniN&lt;/a&gt; for responding. For now, I have changed the type of "raw_parsed" to string and created a vector index after that.&lt;BR /&gt;&lt;BR /&gt;But can you help me with one more thing. I am creating a Multi-Agent-Supervisor, can you explain me the need of OBO User Authentication please and where I can find this and enable, in the documentation I understood that it will use the resource like Vector Search and will authenticate using this, but why is the case when I am doing all the stuff in my Databricks UI?&lt;/P&gt;</description>
      <pubDate>Sun, 12 Oct 2025 16:04:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/generative-ai/agent-bricks-information-extraction/m-p/134665#M1221</guid>
      <dc:creator>Yash01Kumar12</dc:creator>
      <dc:date>2025-10-12T16:04:54Z</dc:date>
    </item>
  </channel>
</rss>

