Databricks

BasavarajAngadi · ‎02-27-2022

Hubert-Dudek · ‎02-27-2022

The Hive metastore is a semantic layer. It acts as a mapping: you have a folder of Delta, JSON, or other files, and the metastore registers that folder as a table so you can query the data using SQL syntax.
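To make the "folder mapped as a table" idea concrete, here is a minimal sketch in Python. The helper name `register_table_ddl`, the table name `events`, and the path `/mnt/datalake/events` are made up for illustration; the statement it builds is ordinary Spark SQL `CREATE TABLE ... USING ... LOCATION ...`. It only assembles the SQL text, so running it against a metastore is left as commented lines that assume a Databricks notebook where `spark` is predefined.

```python
def register_table_ddl(table_name, data_format, location):
    """Build a Spark SQL statement that registers existing files as a table.

    All three arguments are illustrative; nothing here touches a real metastore.
    """
    return (
        f"CREATE TABLE IF NOT EXISTS {table_name} "
        f"USING {data_format} "
        f"LOCATION '{location}'"
    )


# Hypothetical table name and storage path, for illustration only.
ddl = register_table_ddl("events", "DELTA", "/mnt/datalake/events")
print(ddl)

# In a Databricks notebook (assumption: `spark` is the predefined session):
# spark.sql(ddl)                                   # metastore now maps the folder to `events`
# spark.sql("SELECT COUNT(*) FROM events").show()  # query the folder as a table
```

The data files stay where they are; the metastore only records the name, format, and location.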

A SQL endpoint is a server address for incoming SQL queries (over JDBC/ODBC), so you can query tables registered in the Hive metastore from any application (Power BI, QlikView, Looker, or your own code using a JDBC/ODBC driver).
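As a rough sketch of the "own code" case, the snippet below uses the `databricks-sql-connector` Python package rather than a raw JDBC/ODBC driver. The environment variable names and the helper names `endpoint_connection_args` and `query_endpoint` are assumptions chosen for this example; the hostname, HTTP path, and token come from your own endpoint's connection details.

```python
import os


def endpoint_connection_args(env=os.environ):
    """Collect SQL endpoint settings from environment variables.

    The variable names are an assumption for this sketch, not a requirement
    of the connector.
    """
    return {
        "server_hostname": env["DATABRICKS_SERVER_HOSTNAME"],
        "http_path": env["DATABRICKS_HTTP_PATH"],
        "access_token": env["DATABRICKS_TOKEN"],
    }


def query_endpoint(statement, env=os.environ):
    """Run one SQL statement on the endpoint and return all rows."""
    # Imported lazily so the helper above works without the package installed.
    from databricks import sql  # pip install databricks-sql-connector

    with sql.connect(**endpoint_connection_args(env)) as connection:
        with connection.cursor() as cursor:
            cursor.execute(statement)
            return cursor.fetchall()


# Example call (needs a running endpoint and valid credentials):
# rows = query_endpoint("SELECT * FROM events LIMIT 10")
```

BI tools do the same thing through their JDBC/ODBC drivers: they point at the endpoint's address and send SQL against tables the metastore already knows about.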

Anonymous · ‎02-28-2022

What Hubert said is correct. I'd also add that Apache Hive is an older, largely obsolete tool, created at Facebook and later open sourced, that provided a SQL interface for MapReduce. https://hive.apache.org/

It's often helpful to think of Spark SQL as the modern, evolved version of Hive.

A SQL endpoint is more of a cluster to run SQL queries on: https://docs.databricks.com/sql/admin/sql-endpoints.html

BasavarajAngadi · ‎02-28-2022

@Joseph Kambourakis Hi, can we not run Hive queries with Spark as the execution engine? What difference does that make compared with a Databricks SQL endpoint, apart from the Delta engine?

Anonymous · ‎02-28-2022

Yes, in later updates Hive did get a Spark backend engine, but by that time it was largely obsolete.

SQL endpoints have many advantages. They are cloud optimized, and they include a C++ engine (Photon) that is faster than the traditional Spark engine. Endpoints are more of a compute environment, whereas Hive was more of a syntax and query engine. Also, Spark SQL has become more ANSI compatible in recent releases. https://spark.apache.org/docs/latest/sql-ref-ansi-compliance.html