<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Databricks UUID in Warehousing &amp; Analytics</title>
    <link>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/147797#M2480</link>
    <description>&lt;P&gt;Is there a plan for Databricks to support the UUIDv7 variant that is friendlier to use in databases for keys and partitioning?&lt;BR /&gt;&lt;A href="https://python.plainenglish.io/python-3-14-brings-uuidv6-v7-v8-stop-using-uuid4-like-its-2015-90518fdfce81" target="_blank"&gt;https://python.plainenglish.io/python-3-14-brings-uuidv6-v7-v8-stop-using-uuid4-like-its-2015-90518fdfce81&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Also, related, Python 3.1.4 is the first release that natively supports the v7 UUID variant.&amp;nbsp;&lt;BR /&gt;Would it be possible to say when one could expect Python 3.1.4 to land in a LTS release?&lt;BR /&gt;&lt;BR /&gt;Thank you.&lt;/P&gt;</description>
    <pubDate>Mon, 09 Feb 2026 23:27:59 GMT</pubDate>
    <dc:creator>David_Dabbs</dc:creator>
    <dc:date>2026-02-09T23:27:59Z</dc:date>
    <item>
      <title>Databricks UUID</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/147797#M2480</link>
      <description>&lt;P&gt;Is there a plan for Databricks to support the UUIDv7 variant that is friendlier to use in databases for keys and partitioning?&lt;BR /&gt;&lt;A href="https://python.plainenglish.io/python-3-14-brings-uuidv6-v7-v8-stop-using-uuid4-like-its-2015-90518fdfce81" target="_blank"&gt;https://python.plainenglish.io/python-3-14-brings-uuidv6-v7-v8-stop-using-uuid4-like-its-2015-90518fdfce81&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Also, related, Python 3.1.4 is the first release that natively supports the v7 UUID variant.&amp;nbsp;&lt;BR /&gt;Would it be possible to say when one could expect Python 3.1.4 to land in a LTS release?&lt;BR /&gt;&lt;BR /&gt;Thank you.&lt;/P&gt;</description>
      <pubDate>Mon, 09 Feb 2026 23:27:59 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/147797#M2480</guid>
      <dc:creator>David_Dabbs</dc:creator>
      <dc:date>2026-02-09T23:27:59Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks UUID</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/148206#M2489</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/215238"&gt;@David_Dabbs&lt;/a&gt;&amp;nbsp;- We do support uuidv7 database keys and partitioning in Lakebase PostgreSQL via&amp;nbsp;&lt;A href="https://neon.com/docs/extensions/pg_uuidv7" target="_blank"&gt;pg_uuidv7&lt;/A&gt;&amp;nbsp; and Databricks SQL functions. There are some limitations when it comes to clustering and column generation.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;As for Python 3.1.4 - this is an outdated minor release, and modern Databricks runtimes target versions 3.10-3.12 range. Are you inquiring about "3.11.4"? If so,&amp;nbsp;Python 3.11.4 will not land in a Databricks Runtime LTS. Instead, 15.4 LTS already standardizes on Python 3.11.11, and newer LTS lines use Python 3.12.3.&lt;/P&gt;
&lt;P&gt;I hope this helps!&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Sarah&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Feb 2026 16:19:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/148206#M2489</guid>
      <dc:creator>sarahbhord</dc:creator>
      <dc:date>2026-02-12T16:19:57Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks UUID</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/148219#M2490</link>
      <description>&lt;P&gt;Thank you&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/143567"&gt;@sarahbhord&lt;/a&gt;.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;HR /&gt;We do support uuidv7 database keys and partitioning in Lakebase PostgreSQL via&amp;nbsp;&lt;A href="https://neon.com/docs/extensions/pg_uuidv7" target="_blank" rel="noopener"&gt;pg_uuidv7&lt;/A&gt;&amp;nbsp; and Databricks SQL functions. There are some limitations when it comes to clustering and column generation.&amp;nbsp;&lt;/BLOCKQUOTE&gt;&lt;P&gt;I'm not following the Neon Databricks SQL connection. We would want to use uuidv7 in Databricks, not Neon.&lt;/P&gt;&lt;BLOCKQUOTE&gt;&lt;P&gt;As for Python 3.1.4 - this is an outdated minor release, and modern Databricks runtimes target versions 3.10-3.12 range. Are you inquiring about "3.11.4"? If so,&amp;nbsp;Python 3.11.4 will not land in a Databricks Runtime LTS. Instead, 15.4 LTS already standardizes on Python 3.11.11, and newer LTS lines use Python 3.12.3.&lt;/P&gt;&lt;HR /&gt;&lt;/BLOCKQUOTE&gt;&lt;P&gt;Yes, 3.1.4 was a typo on my part. UUIDv7 support landed in Python with version &lt;STRONG&gt;3.14&lt;/STRONG&gt;.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Feb 2026 16:50:55 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/148219#M2490</guid>
      <dc:creator>David_Dabbs</dc:creator>
      <dc:date>2026-02-12T16:50:55Z</dc:date>
    </item>
    <item>
      <title>Re: Databricks UUID</title>
      <link>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/148224#M2491</link>
      <description>&lt;P&gt;Ah yes apologies - that was confusing. To implement&amp;nbsp;&lt;SPAN&gt;uuidv7&lt;/SPAN&gt; in DATABRICKS (using Databricks SQL) (without relying on Neon/Postgres), you can leverage Databricks' native support for the uuid() function (v4) and standard SQL to construct a v7-compliant identifier... AKA you can create a SQL UDF to generate them.&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;CREATE OR REPLACE FUNCTION generate_uuidv7()
RETURNS STRING
LANGUAGE SQL
AS
  SELECT 
    printf('%012x-%s-%s-%s-%s',
      -- 48-bit timestamp in milliseconds
      CAST(unix_millis(current_timestamp()) AS LONG),
      -- Version 7 and first 12 random bits (hex starts with '7')
      substring(hex(random()), 1, 4),
      -- Variant 1 and next 12 random bits (hex starts with 8, 9, A, or B)
      substring(hex(random()), 5, 4),
      substring(hex(random()), 9, 4),
      substring(hex(random()), 13, 12)
    );&lt;/LI-CODE&gt;
&lt;P&gt;For partitioning - I will mention that using a high-cardinality ID could create the "small file problem". Instead, parition by the date of the time component of the uuidv7. Alternatively, if you are using Delta Lake, use Liquid Clustering as it handles high-cardinality keys much better.&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;CREATE TABLE events (
  id STRING,
  event_date DATE GENERATED ALWAYS AS (
    CAST(from_unixtime(conv(substring(id, 1, 12), 16, 10) / 1000) AS DATE)
  )
)
PARTITIONED BY (event_date);
&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Feb 2026 19:26:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/warehousing-analytics/databricks-uuid/m-p/148224#M2491</guid>
      <dc:creator>sarahbhord</dc:creator>
      <dc:date>2026-02-12T19:26:29Z</dc:date>
    </item>
  </channel>
</rss>

