<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Data in Unity Catalog that can't be previewed in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/data-in-unity-catalog-that-can-t-be-previewed/m-p/155360#M54233</link>
    <description>&lt;P&gt;This is a small deficiency, but a fix would be nice to have.&lt;/P&gt;&lt;P&gt;For a long time now, the Sample Data previewer in the Unity Catalog explorer has been unable to show tables that contain a certain kind of column.&amp;nbsp; Instead of showing sample rows of the table, it shows:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Error getting sample data&lt;BR /&gt;Unexpected token '(', "(8000,[0,2"... is not valid JSON&lt;/P&gt;&lt;P&gt;That little data snippet mentioning "8000" is enough to point me to a specific column causing the trouble, whose type can be described in several different ways.&amp;nbsp;&lt;/P&gt;&lt;P&gt;1)&amp;nbsp; The Overview tab shows it as a struct that breaks out as:&lt;BR /&gt;{"type": "tinyint", "size": "int", "indices": {"items": "int"}, "values": {"items": "double"}}&lt;/P&gt;&lt;P&gt;2)&amp;nbsp; A SQL DESCRIBE shows the column as having datatype "vector".&lt;/P&gt;&lt;P&gt;3)&amp;nbsp; The column was created via a UDF whose key operation is:&amp;nbsp;&lt;/P&gt;&lt;P&gt;from pyspark.ml.linalg import Vectors&lt;BR /&gt;&amp;nbsp;...&lt;BR /&gt;[output] = Vectors.sparse(inputs)&lt;/P&gt;&lt;P&gt;Any chance of getting this fixed so that the table containing this data type can be previewed?&lt;/P&gt;</description>
    <pubDate>Thu, 23 Apr 2026 16:33:51 GMT</pubDate>
    <dc:creator>DavidKxx</dc:creator>
    <dc:date>2026-04-23T16:33:51Z</dc:date>
    <item>
      <title>Data in Unity Catalog that can't be previewed</title>
      <link>https://community.databricks.com/t5/data-engineering/data-in-unity-catalog-that-can-t-be-previewed/m-p/155360#M54233</link>
      <description>&lt;P&gt;This is a small deficiency, but a fix would be nice to have.&lt;/P&gt;&lt;P&gt;For a long time now, the Sample Data previewer in the Unity Catalog explorer has been unable to show tables that contain a certain kind of column.&amp;nbsp; Instead of showing sample rows of the table, it shows:&lt;/P&gt;&lt;P&gt;&lt;BR /&gt;Error getting sample data&lt;BR /&gt;Unexpected token '(', "(8000,[0,2"... is not valid JSON&lt;/P&gt;&lt;P&gt;That little data snippet mentioning "8000" is enough to point me to a specific column causing the trouble, whose type can be described in several different ways.&amp;nbsp;&lt;/P&gt;&lt;P&gt;1)&amp;nbsp; The Overview tab shows it as a struct that breaks out as:&lt;BR /&gt;{"type": "tinyint", "size": "int", "indices": {"items": "int"}, "values": {"items": "double"}}&lt;/P&gt;&lt;P&gt;2)&amp;nbsp; A SQL DESCRIBE shows the column as having datatype "vector".&lt;/P&gt;&lt;P&gt;3)&amp;nbsp; The column was created via a UDF whose key operation is:&amp;nbsp;&lt;/P&gt;&lt;P&gt;from pyspark.ml.linalg import Vectors&lt;BR /&gt;&amp;nbsp;...&lt;BR /&gt;[output] = Vectors.sparse(inputs)&lt;/P&gt;&lt;P&gt;Any chance of getting this fixed so that the table containing this data type can be previewed?&lt;/P&gt;</description>
      <pubDate>Thu, 23 Apr 2026 16:33:51 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-in-unity-catalog-that-can-t-be-previewed/m-p/155360#M54233</guid>
      <dc:creator>DavidKxx</dc:creator>
      <dc:date>2026-04-23T16:33:51Z</dc:date>
    </item>
    <item>
      <title>Re: Data in Unity Catalog that can't be previewed</title>
      <link>https://community.databricks.com/t5/data-engineering/data-in-unity-catalog-that-can-t-be-previewed/m-p/155453#M54249</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/94566"&gt;@DavidKxx&lt;/a&gt;,&lt;/P&gt;
&lt;P&gt;Thanks for flagging this. You're right: the Sample Data previewer in Catalog Explorer is choking because your column is a Spark ML vector type (pyspark.ml.linalg.VectorUDT, the column type for what Vectors.sparse(...) returns). The previewer is trying to JSON-parse the stringified vector, (8000,[0,2,...],[...]), which isn't valid JSON, and that's why the whole tab fails rather than just the one column. UC's Overview tab and DESCRIBE surface the same column differently (as a struct and as vector), which is consistent with this being a rendering issue rather than a problem with the data itself.&lt;/P&gt;
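&lt;P&gt;To make the failure mode concrete, here is a minimal plain-Python sketch. It is not the previewer's actual code, which isn't shown in this thread. The vector text follows the "(size,[indices],[values])" shape from the error message, the numbers are made up, and is_valid_json and sparse_to_json are hypothetical helper names.&lt;/P&gt;

```python
import json

# Hypothetical stringified sparse vector in the "(size,[indices],[values])"
# shape quoted in the error message; the numbers are made up.
vector_text = "(8000,[0,2],[1.0,3.5])"


def is_valid_json(text):
    """Return True if text parses as JSON, False otherwise."""
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


def sparse_to_json(size, indices, values):
    """Serialize the three parts of a sparse vector as a JSON object."""
    return json.dumps({"size": size, "indices": indices, "values": values})


# The parenthesized form is rejected by a JSON parser.
print(is_valid_json(vector_text))  # False

# The same information expressed as a JSON object parses fine.
print(is_valid_json(sparse_to_json(8000, [0, 2], [1.0, 3.5])))  # True
```

&lt;P&gt;The leading "(" is what a JSON parser rejects first, which matches the "Unexpected token '('" in the error.&lt;/P&gt;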
&lt;P&gt;While we get this fixed on the product side, here are a couple of workarounds for previewing. Run a quick SELECT in SQL Editor or a notebook with the vector cast to a string, or break it out into a regular array column via vector_to_array(col) (from pyspark.ml.functions), and those will render fine. Creating a view that exposes the vector as an array column is a nice way to keep the "quick preview" experience without changing your base table.&lt;/P&gt;
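&lt;P&gt;As a rough illustration of what the array breakout produces for a sparse vector, here is a plain-Python sketch. It is not Spark's implementation, and sparse_to_dense is a hypothetical helper name; in a notebook, pyspark.ml.functions.vector_to_array performs the equivalent conversion to a dense array for each row.&lt;/P&gt;

```python
def sparse_to_dense(size, indices, values):
    """Expand (size, indices, values) into a dense list of floats.

    Positions not listed in indices are filled with 0.0.
    """
    dense = [0.0] * size
    for index, value in zip(indices, values):
        dense[index] = float(value)
    return dense


# Made-up example: a size-6 vector with two non-zero entries.
print(sparse_to_dense(6, [0, 2], [1.0, 3.5]))  # [1.0, 0.0, 3.5, 0.0, 0.0, 0.0]
```

&lt;P&gt;Keep in mind that with dimensions in the thousands, each dense array carries one entry per dimension, so it is worth limiting the number of rows you select for a preview.&lt;/P&gt;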
&lt;P&gt;I've logged this internally so the Catalog Explorer team can pick it up and prioritise a fix. One thing that would help when they triage: could you confirm roughly how large the vectors in your table are (the 8000 in the error suggests a dimension of 8000), and whether you're seeing this only on sparse vectors or also on dense ones? No need for the actual data, just a rough shape.&lt;/P&gt;
&lt;P class="p1"&gt;&lt;FONT size="2" color="#FF6600"&gt;&lt;STRONG&gt;&lt;I&gt;If this answer resolves your question, could you mark it as “Accept as Solution”? That helps other users quickly find the correct fix.&lt;/I&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;I&gt;&lt;/I&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 24 Apr 2026 13:40:11 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-in-unity-catalog-that-can-t-be-previewed/m-p/155453#M54249</guid>
      <dc:creator>Ashwin_DSA</dc:creator>
      <dc:date>2026-04-24T13:40:11Z</dc:date>
    </item>
    <item>
      <title>Re: Data in Unity Catalog that can't be previewed</title>
      <link>https://community.databricks.com/t5/data-engineering/data-in-unity-catalog-that-can-t-be-previewed/m-p/155456#M54251</link>
      <description>&lt;P&gt;Yes, my vector space is commonly of dimension 4000 or 8000.&lt;/P&gt;&lt;P&gt;I don't write any dense vectors to tables, so I can't speak to what happens when previewing that type.&lt;/P&gt;&lt;P&gt;Thanks for taking up the issue!&lt;/P&gt;</description>
      <pubDate>Fri, 24 Apr 2026 14:30:23 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-in-unity-catalog-that-can-t-be-previewed/m-p/155456#M54251</guid>
      <dc:creator>DavidKxx</dc:creator>
      <dc:date>2026-04-24T14:30:23Z</dc:date>
    </item>
  </channel>
</rss>

