cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
MVP Articles
This page brings together externally published articles written by our MVPs. Discover expert perspectives, real-world guidance, and community contributions from leaders across the ecosystem.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

๐Ÿš€ Databricks Runtime 18.1 (Beta) โ€” A Big Leap Forward for Data & AI Teams

Abiola-David
Databricks MVP

Databricks has rolled out Runtime 18.1 (Beta), and itโ€™s packed with meaningful enhancements across streaming, Delta Lake, SQL, geospatial, performance, and Spark 4.1.0 improvements. This release builds on 18.0 and introduces new capabilities that make pipelines faster, smarter, and more reliable. Hereโ€™s a breakdown of whatโ€™s new and why it matters.

Key New Features & Improvements

๐Ÿ“ŠAuto Loader Enhancements
Auto Loader now uses file events by default when available, reducing directory listing costs and improving latency. You can still override behaviour using useIncrementalListing, useNotifications, or disable file events with useManagedFileEvents = false.

๐Ÿ“ŠDelta Lake & Unity Catalog Improvements
Optimized Writes for CRTAS
Partitioned Unity Catalog tables created via CREATE OR REPLACE TABLE AS SELECT now automatically use optimized writes for fewer, larger files.

๐Ÿ“ŠSchema Evolution with INSERT
New WITH SCHEMA EVOLUTION clause allows automatic schema evolution during INSERT INTO, INSERT OVERWRITE, and INSERT INTO โ€ฆ REPLACE.
Handles new columns, widened types, and preserves NULL struct values even when field order differs.

๐Ÿ“ŠDelta Sharing
Now supports multiโ€‘statement transactions for shared tables using preโ€‘signed URLs or cloud tokens.

๐Ÿ“ŠSQL & Scripting Enhancements
New SQL Functions
parse_timestamp โ€” photonized for fast multiโ€‘pattern timestamp parsing.
Approximate topโ€‘k sketch functions (approx_top_k_accumulate, approx_top_k_combine, approx_top_k_estimate).
Tuple sketch functions for distinct counting and keyโ€‘summary aggregation.

๐Ÿ“ŠSQL Cursor Support
Compound SQL statements now support DECLARE CURSOR, OPEN, FETCH, and CLOSE for rowโ€‘byโ€‘row processing.

๐Ÿ“ŠBehavioural Changes
FILTER clause now works with MEASURE aggregate functions.
Timestamp partitions now use Spark session timezone instead of JVM timezone.
DESCRIBE FLOW is now a reserved keyword.

๐Ÿ“Š Streaming Improvements
Automatic streaming type widening for Delta tables.
New configs allow stricter control if needed.

๐Ÿ“Š Geospatial Performance Boost
Geospatial Boolean set operations now use a new, faster implementation (with minor precision differences beyond 15 decimal places).

๐Ÿ“Š DataFrame & Compute Enhancements
DataFrame checkpoints now support Unity Catalog volume paths.
.cache() no longer reโ€‘runs SQL commands like SHOW TABLES.

๐Ÿ“Š Cloud & External System Improvements
DATETIMEOFFSET type support for Azure Synapse.
Google BigQuery table descriptions now appear as table comments.

๐Ÿ“Š Apache Spark 4.1.0 Included
Databricks Runtime 18.1 ships with Apache Spark 4.1.0, bringing:
Major performance fixes
Improved pandas interoperability
New geospatial type support
Arrow & Pandas UDF improvements
Streaming enhancements
Stability and errorโ€‘handling improvements

If you're building modern data platforms, experimenting with LLMs, or optimizing production pipelines, this runtime is absolutely worth exploring.

#Databricks #RuntimeLatest #Beta #DatabricksMVPruntime.png

0 REPLIES 0