Databricks Community

Krish1 · ‎04-29-2023

Can somebody give me good definition of delta lake vs delta table? What are the use cases of each, similarities and differences? Sorry I’m new to databricks ans trying to learn.

Rishabh-Pandey · ‎05-01-2023

Delta Lake is an open-source storage layer that is designed to bring reliability to data lakes. It is built on top of Apache Spark and provides features such as ACID transactions, schema enforcement, and time travel. Delta Lake is essentially a storage format that provides a set of features for managing data in a data lake environment.

Delta tables, on the other hand, are tables that are created using the Delta Lake storage format. Delta tables are optimized for use in data lake environments and provide features such as ACID transactions, schema enforcement, and time travel. Delta tables are essentially a specific type of table that is built on top of the Delta Lake storage format.

In summary, Delta Lake is a storage layer that provides features for managing data in a data lake environment, while Delta tables are tables that are built on top of the Delta Lake storage format and provide optimized features for working with data in a data lake environment.

Rishabh Pandey

Annapurna_Hiriy · ‎05-01-2023

Delta Lake and Delta table are related concepts in the Apache Delta Lake project. which extends Apache Spark with ACID (Atomicity, Consistency, Isolation, Durability) capabilities for data lakes.

Delta Lake provides a storage layer that enables transactional and scalable data processing on top of cloud storage systems like Hadoop Distributed File System (HDFS)/Amazon S3/ADLS.

Reference: https://docs.delta.io/latest/delta-intro.html

A Delta table is a collection of data organized in a tabular format within Delta Lake. It represents a table structure with schema and associated data stored in a Delta Lake format. There are 2 types of delta tables

Managed table
Unmanaged table

Please refer to the following document for more information about managed and unmanaged delta tables:

https://docs.databricks.com/lakehouse/data-objects.html#managed-table

Key features of Delta Lake and Delta tables are the same and they include:

ACID transactions

Schema enforcement and evolution

Time travel

Data reliability

Metadata management

In summary, Delta Lake is the underlying storage layer that provides transactional and reliability features, while Delta tables represent the tabular structures within Delta Lake, offering ACID properties, schema enforcement, versioning, and other Delta Lake capabilities. Delta tables are the primary means of working with structured data in Delta Lake.

Databricks Community

Deltalkake vs Delta table

Photos

Connect with Databricks Users in Your Area

Data + AI Summit 2025 — registration now open!

Jumpstart Your Data Journey with Databricks Get Started Days!

Databricks DevConnect: Global Community Meetups for Data Engineers

Intelligent Data Warehousing: AI/BI for Self-service Analytics

Introducing SAP Databricks