<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Entity Matching on struct column in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/entity-matching-on-struct-column/m-p/148521#M52905</link>
    <description>&lt;P&gt;Databricks Genie allows &lt;A href="https://docs.databricks.com/aws/en/genie/knowledge-store#-prompt-matching-components" target="_self"&gt;configurable prompt matching&lt;/A&gt;, for example entity matching:&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;&lt;EM&gt;Entity matching provides curated lists of distinct values for up to 120 columns where users are likely to reference specific entries, such as states and product categories. This helps Genie match user terminology to actual data values.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;What if we're using a column that's a struct composed of a categorical event value and a timestamp? This is common in status tables where we've got the latest event along with when that event occurred.&lt;/P&gt;&lt;P&gt;It would be ideal if entity matching looked into such structs and treated each field separately. Does that already happen?&lt;/P&gt;</description>
    <pubDate>Mon, 16 Feb 2026 13:54:07 GMT</pubDate>
    <dc:creator>Malthe</dc:creator>
    <dc:date>2026-02-16T13:54:07Z</dc:date>
    <item>
      <title>Entity Matching on struct column</title>
      <link>https://community.databricks.com/t5/data-engineering/entity-matching-on-struct-column/m-p/148521#M52905</link>
      <description>&lt;P&gt;Databricks Genie allows &lt;A href="https://docs.databricks.com/aws/en/genie/knowledge-store#-prompt-matching-components" target="_self"&gt;configurable prompt matching&lt;/A&gt;, for example entity matching:&lt;/P&gt;&lt;P class="lia-indent-padding-left-30px"&gt;&lt;EM&gt;Entity matching provides curated lists of distinct values for up to 120 columns where users are likely to reference specific entries, such as states and product categories. This helps Genie match user terminology to actual data values.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;What if we're using a column that's a struct composed of a categorical event value and a timestamp? This is common in status tables where we've got the latest event along with when that event occurred.&lt;/P&gt;&lt;P&gt;It would be ideal if entity matching looked into such structs and treated each field separately. Does that already happen?&lt;/P&gt;</description>
      <pubDate>Mon, 16 Feb 2026 13:54:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/entity-matching-on-struct-column/m-p/148521#M52905</guid>
      <dc:creator>Malthe</dc:creator>
      <dc:date>2026-02-16T13:54:07Z</dc:date>
    </item>
    <item>
      <title>Re: Entity Matching on struct column</title>
      <link>https://community.databricks.com/t5/data-engineering/entity-matching-on-struct-column/m-p/148619#M52933</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/9268"&gt;@Malthe&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;Entity matching supports simple data types only (no structs). The recommended approach is to flatten the table structure before passing it to Genie.&lt;/P&gt;
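&lt;P&gt;A minimal sketch of the flattening step, assuming a hypothetical table device_status with a key column device_id and a struct column latest_status that has the fields event and occurred_at (none of these names come from the original question). The helper flatten_struct_view_sql is also hypothetical; it only builds the text of a CREATE OR REPLACE VIEW statement that reads each struct field with dot notation and exposes it as a top-level column:&lt;/P&gt;

```python
def flatten_struct_view_sql(source_table, view_name, key_columns, struct_column, fields):
    """Build a CREATE VIEW statement that exposes struct fields as top-level columns."""
    # Pass the key columns through unchanged.
    select_items = list(key_columns)
    # Dot notation reads one field out of the struct; the alias gives it a flat name.
    for field in fields:
        select_items.append(f"{struct_column}.{field} AS {struct_column}_{field}")
    column_list = ",\n  ".join(select_items)
    return (
        f"CREATE OR REPLACE VIEW {view_name} AS\n"
        f"SELECT\n  {column_list}\n"
        f"FROM {source_table}"
    )


# Hypothetical table and column names, for illustration only.
sql = flatten_struct_view_sql(
    source_table="device_status",
    view_name="device_status_flat",
    key_columns=["device_id"],
    struct_column="latest_status",
    fields=["event", "occurred_at"],
)
print(sql)
```

&lt;P&gt;The generated statement can be run in a SQL editor or notebook. The resulting view, rather than the original table, would then be the one added to the Genie space, so that the categorical field appears as a plain column.&lt;/P&gt;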
&lt;P&gt;Hope it helps.&lt;/P&gt;
&lt;P&gt;Best regards,&lt;/P&gt;</description>
      <pubDate>Tue, 17 Feb 2026 16:00:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/entity-matching-on-struct-column/m-p/148619#M52933</guid>
      <dc:creator>aleksandra_ch</dc:creator>
      <dc:date>2026-02-17T16:00:11Z</dc:date>
    </item>
  </channel>
</rss>

