While data volumes are growing, the time windows to process them are getting stricter. Enterprise customers are generating more data than ever, and the speed is only accelerating with more agentic workflows. At the same time, the SLAs for when it needs to be refreshed leave less and less room, whether it's nightly ETL that must land before business hours or complex BI dashboards that users expect to load in seconds.
Long-running ETL pipelines and resource-intensive analytical queries are pushing against the limits of what a 4XL can deliver. One option is to rearchitect: break pipelines into smaller, parallelized steps or pre-aggregate tables for dashboards. But that adds significant engineering overhead. For many teams, that tradeoff doesn't make sense when all they need is more horsepower.
Today, we're announcing that the 5XL Databricks SQL Warehouse is now in Public Preview, giving customers access to the largest SQL warehouse available on Databricks. Compared to 4XL, it doubles the compute: more cores for parallelism, more memory to keep data in-memory, and more resources to power through the heaviest operations without rearchitecting your pipelines.
In benchmarks, analytical queries run ~1.8x faster. For terabyte-scale ETL inserts, customers have seen 2x or better. 5XL's hourly rate is roughly twice 4XL's, but faster completion means less time billed, so total cost is often comparable or lower.
In this post, we walk through how to evaluate whether 5XL is right for your workloads, set up a comparison, and monitor performance after deployment.
Not every workload benefits from 5XL equally. The biggest gains come from resource-intensive, long-running operations that can take advantage of 5XL's additional parallelism and compute headroom.
Look for these patterns in your current environment:
Workloads that match these patterns are the best candidates for 5XL.
We recommend a controlled comparison before moving production workloads to 5XL.
Tips for setting up a controlled comparison:
Based on customer results and internal performance testing:
5XL is purpose-built for long-running, resource-intensive operations.
The obvious question: does 2x the compute mean 2x the cost?
Not necessarily. 5XL has a higher hourly rate, but if your queries finish significantly faster, the total DBUs consumed can be similar or even lower. Here's the math for an example we observed:
|
Warehouse Size |
4XL |
5XL |
|
Hourly rate |
528 DBU/hr |
1,042 DBU/hr |
|
Batch duration |
121 min |
64 min |
|
Total DBUs |
1,074 |
1,111 |
|
Cost diff |
~3.4% |
|
|
Time saved |
1.9x |
|
For SLA-sensitive complex workloads, the math is straightforward: roughly 2x faster, at comparable total cost.
After deploying 5XL in production:
5XL SQL Warehouses are available now in Public Preview for Serverless and Pro in all cloud regions. Create a warehouse, run your heaviest workloads through a controlled test, and measure the difference.
If your batch windows are tight and your 4XL is hitting its limits, this is built for you. Reach out to your Databricks account team with questions or to share your results. We'd love to hear what 5XL does for your workloads.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.