Databricks Community

User16790091296 · ‎06-04-2021

sean_owen · ‎06-17-2021

Broadly, it's because high-concurrency cluster have to have much more control of user workloads in order to enforce resource sharing constraints. Scala is the lowest-level language you can access in Databricks, as you execute directly in the JVM, and it becomes difficult to enforce anything about those workloads, whereas it's much easier to intercept and control what Python and R (and SQL) interpreters do. It's not impossible I suppose to figure out how to do this in the JVM, just harder, so isn't supported at the moment. Typically those workloads aren't the type you want to put on high-concurrency anyway; you'd just use a (separate) standard cluster.

Databricks Community

Why doesn’t high concurrency cluster support Scala?

Photos

Join Us as a Local Community Builder!

Business Intelligence in the Era of AI

🚀 Monthly Databricks Get Started Days – Accelerate Your Learning Journey! 🚀

Databricks Community Champion - March 2025 - Takuya Omi

Get Started With Lakehouse Architecture | Pass a quiz to earn your certificate completion.

Virtual Learning Festival: 9 April - 30 April