cancel
Showing results for 
Search instead for 
Did you mean: 
Warehousing & Analytics
cancel
Showing results for 
Search instead for 
Did you mean: 

What is databricks SQL, spark SQL and how are they different from MS SQL ?

chari
Contributor

Hello Databricks Community,

I have a hard time understanding how is Databricks SQL different from microsoft SQL ? Also, why does databricks provide spark SQL ? 

If you direct me to a well-written webpage or document its of immense help!

Thanks,

1 ACCEPTED SOLUTION

Accepted Solutions

Kaniz
Community Manager
Community Manager

Hi @chari

Certainly! Let’s delve into the differences between Databricks SQL and Microsoft SQL Server, as well as the rationale behind Spark SQL in Databricks.

  1. Databricks SQL vs. Microsoft SQL Server:

    • Databricks SQL is an integral part of the Databricks Lakehouse Platform. Here’s how it compares to Microsoft SQL Server:

      • Primary Database Model:

        • Databricks SQL: It combines elements of data lakes and data warehouses, providing a unified view of structured and unstructured data. It is based on Apache Spark.
        • Microsoft SQL Server: A flagship relational DBMS.
      • Secondary Database Models:

        • Databricks SQL: Supports additional models like document store and graph DBMS.
        • Microsoft SQL Server: Primarily a relational DBMS.
      • APIs and Access Methods:

        • Databricks SQL: Supports JDBC, ODBC, and a RESTful HTTP API.
        • Microsoft SQL Server: Offers a wide range of access methods including ADO.NET, JDBC, ODBC, and OLE DB.
      • Programming Languages:

        • Databricks SQL: Supports Python, R, Scala, C#, C++, Delphi, Go, Java, and JavaScript (Node.js).
        • Microsoft SQL Server: Allows server-side scripts in Transact SQL, .NET languages, R, Python, and (with SQL Server 2019) Java.
      • Consistency and Durability:

        • Both adhere to ACID properties (Atomicity, Consistency, Isolation, Durability).
      • In-Memory Capabilities:

        • Microsoft SQL Server offers in-memory capabilities, while Databricks SQL does not.
    • For more detailed information, you can explore the Databricks documentation and the Microsoft SQL Server documentation.

  2. Why Spark SQL in Databricks?:

    • Spark SQL is an essential component of Databricks for several reasons:
      • Unified Data Processing: Spark SQL seamlessly integrates structured data processing (SQL queries) with unstructured data processing (Spark operations).
      • Performance: It leverages the power of Apache Spark, enabling distributed, parallel processing across large datasets.
      • Data Lakehouse Paradigm: Databricks aims to bridge the gap between data lakes and data warehouses. Spark SQL plays a crucial role in this by providing a unified interface for both types of data.
      • Ecosystem Compatibility: Spark SQL allows users to work with data stored in various formats (Parquet, JSON, Avro, etc.) within the same platform.

    If you’d like to explore further, you can refer to the Databricks documentation.

Remember, both Databricks SQL and Microsoft SQL Server serve different purposes, and your choice depends on your specific use case and requirements. 🚀🔍

 

View solution in original post

1 REPLY 1

Kaniz
Community Manager
Community Manager

Hi @chari

Certainly! Let’s delve into the differences between Databricks SQL and Microsoft SQL Server, as well as the rationale behind Spark SQL in Databricks.

  1. Databricks SQL vs. Microsoft SQL Server:

    • Databricks SQL is an integral part of the Databricks Lakehouse Platform. Here’s how it compares to Microsoft SQL Server:

      • Primary Database Model:

        • Databricks SQL: It combines elements of data lakes and data warehouses, providing a unified view of structured and unstructured data. It is based on Apache Spark.
        • Microsoft SQL Server: A flagship relational DBMS.
      • Secondary Database Models:

        • Databricks SQL: Supports additional models like document store and graph DBMS.
        • Microsoft SQL Server: Primarily a relational DBMS.
      • APIs and Access Methods:

        • Databricks SQL: Supports JDBC, ODBC, and a RESTful HTTP API.
        • Microsoft SQL Server: Offers a wide range of access methods including ADO.NET, JDBC, ODBC, and OLE DB.
      • Programming Languages:

        • Databricks SQL: Supports Python, R, Scala, C#, C++, Delphi, Go, Java, and JavaScript (Node.js).
        • Microsoft SQL Server: Allows server-side scripts in Transact SQL, .NET languages, R, Python, and (with SQL Server 2019) Java.
      • Consistency and Durability:

        • Both adhere to ACID properties (Atomicity, Consistency, Isolation, Durability).
      • In-Memory Capabilities:

        • Microsoft SQL Server offers in-memory capabilities, while Databricks SQL does not.
    • For more detailed information, you can explore the Databricks documentation and the Microsoft SQL Server documentation.

  2. Why Spark SQL in Databricks?:

    • Spark SQL is an essential component of Databricks for several reasons:
      • Unified Data Processing: Spark SQL seamlessly integrates structured data processing (SQL queries) with unstructured data processing (Spark operations).
      • Performance: It leverages the power of Apache Spark, enabling distributed, parallel processing across large datasets.
      • Data Lakehouse Paradigm: Databricks aims to bridge the gap between data lakes and data warehouses. Spark SQL plays a crucial role in this by providing a unified interface for both types of data.
      • Ecosystem Compatibility: Spark SQL allows users to work with data stored in various formats (Parquet, JSON, Avro, etc.) within the same platform.

    If you’d like to explore further, you can refer to the Databricks documentation.

Remember, both Databricks SQL and Microsoft SQL Server serve different purposes, and your choice depends on your specific use case and requirements. 🚀🔍

 
Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.