Louis_Frolio
Databricks Employee
Databricks serverless compute runs on shared infrastructure: multiple notebooks and users can use the same serverless cluster while remaining isolated from one another. Isolation is enforced at the client, driver, and executor levels, which keeps workloads secure and prevents users from interfering with each other.
Key characteristics and behaviors include:
  1. Shared Compute for Users and Notebooks: Users and notebooks can securely share the same cluster. Identity management and sandboxing keep their workloads isolated.
  2. Scaling and Concurrency:
    • By default, serverless environments have no concurrency limit, so high-concurrency workloads are supported. Resource caps and autoscaling policies can still be applied to control costs and optimize performance.
    • Horizontal autoscaling adjusts resources to match workload demand: it scales up quickly during peak load and scales down during low usage to minimize idle cost.
  3. Resource Limits and Autoscaling:
    • Serverless compute uses an advanced autoscaler that cannot be disabled. It scales resources based on workload patterns and keeps warm pools of pre-provisioned instances for fast startup. To avoid runaway expenses, there are per-hour limits on how far costs can scale; higher resource caps can be requested if needed.
  4. Resource Isolation: Even in a shared environment, each workload gets isolated resources, so tasks run independently without contending for the same resources. This is particularly valuable for multi-task jobs and streaming pipelines, and it improves both performance and cost-efficiency.
  5. Billing: Users are billed for actual workload activity rather than for idle resources. Databricks manages the underlying infrastructure, including instance type selection and scaling, which simplifies operations for users.
These features together ensure that Databricks Serverless compute is cost-efficient, scalable, and secure for multi-user and multi-notebook usage, while providing robust resource handling through intelligent autoscaling and workload isolation.
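To make the scaling and billing points above a bit more concrete, here is a minimal toy sketch in Python. It is not Databricks' actual autoscaler or pricing model. The class name, the capacity per worker, the rate, and the hourly cap are all made-up values, chosen only to show the general idea of scaling out with demand, scaling to zero when idle, stopping at a per-hour cap, and paying only for active hours.

```python
import math
from dataclasses import dataclass


@dataclass
class ToyAutoscaler:
    """Illustrative only: NOT Databricks' real autoscaler or pricing."""
    tasks_per_worker: int = 4          # hypothetical capacity of one worker
    rate_per_worker_hour: float = 1.0  # hypothetical cost units per worker-hour
    hourly_cap: float = 10.0           # hypothetical per-hour spending limit

    def workers_for(self, pending_tasks: int) -> int:
        """Scale out with demand, scale to zero when idle, never exceed the cap."""
        wanted = math.ceil(pending_tasks / self.tasks_per_worker)
        max_workers = int(self.hourly_cap // self.rate_per_worker_hour)
        return min(wanted, max_workers)

    def cost(self, pending_by_hour: list[int]) -> float:
        """Charge only for the workers that are running in each hour."""
        return sum(
            self.workers_for(pending) * self.rate_per_worker_hour
            for pending in pending_by_hour
        )


scaler = ToyAutoscaler()
load = [0, 3, 20, 100, 0]  # pending tasks in five consecutive hours
workers = [scaler.workers_for(pending) for pending in load]
print(workers)             # [0, 1, 5, 10, 0]
print(scaler.cost(load))   # 16.0
```

In this toy model the idle hours cost nothing, and the 100-task hour is held to 10 workers by the cap instead of growing to 25, which is roughly the behavior that the resource caps and activity-based billing described above are meant to provide.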
 
Hope this helps, Lou.
