cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Why doesnโ€™t high concurrency cluster support Scala?

User16790091296
Contributor II
 
1 REPLY 1

sean_owen
Honored Contributor II
Honored Contributor II

Broadly, it's because high-concurrency cluster have to have much more control of user workloads in order to enforce resource sharing constraints. Scala is the lowest-level language you can access in Databricks, as you execute directly in the JVM, and it becomes difficult to enforce anything about those workloads, whereas it's much easier to intercept and control what Python and R (and SQL) interpreters do. It's not impossible I suppose to figure out how to do this in the JVM, just harder, so isn't supported at the moment. Typically those workloads aren't the type you want to put on high-concurrency anyway; you'd just use a (separate) standard cluster.

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.