Warehousing & Analytics
Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.

Are AWS Databricks external tables Delta tables?

Akshay_Petkar
New Contributor III

If I create an external table on AWS Databricks, will it be a Delta table? If not, is there a way to make it a Delta table, or is there no Delta capability for external tables?

Accepted Solutions

Kaniz_Fatma
Community Manager

Hi @Akshay_Petkar, Unity Catalog external tables in Databricks can be Delta tables, but they are not required to be. When you create an external table, you choose whether or not it uses Delta Lake. External tables that do not use Delta Lake do not provide the transactional guarantees or optimized performance of Delta ... [1]. If you want to create a Delta table explicitly, you can follow these steps:

  1. Create an External Table: You can create an external table in Databricks using SQL commands or DataFrame write operations. Ensure that you have the necessary permissions, including the CREATE EXTERNAL TABLE privilege on the ... [2].

  2. Choose the File Format: External tables can use various file formats, including DELTA, CSV, JSON, AVRO, PARQUET, and ORC. To create a Delta table, specify the DELTA format when creating the external table.

  3. Register Data Files: When creating the external table, you can either register an existing directory of data files as a table or provide a path to create new data files. The table registration in Unity Catalog is essentially a pointer to the data files, and Databricks only manages the metadata for external tables without affecting the data files themselves.
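The steps above can be sketched as a small Python helper that builds the SQL statement. This is a minimal sketch, not the only way to do it: the helper name `external_table_ddl`, the table name `main.sales.orders`, and the S3 path are all hypothetical. It assumes the usual Databricks SQL form in which a `LOCATION` clause makes the table external and `USING` selects the file format.

```python
# File formats listed in step 2 for external tables.
SUPPORTED_FORMATS = {"DELTA", "CSV", "JSON", "AVRO", "PARQUET", "ORC"}


def external_table_ddl(table, columns, location, fmt="DELTA"):
    """Build a CREATE TABLE statement for an external table.

    LOCATION points the table at a storage path (making it external);
    USING selects the file format, DELTA by default.
    """
    fmt = fmt.upper()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    cols = ", ".join(f"{name} {dtype}" for name, dtype in columns)
    return (
        f"CREATE TABLE IF NOT EXISTS {table} ({cols}) "
        f"USING {fmt} LOCATION '{location}'"
    )


# Hypothetical catalog, schema, table, and bucket names.
ddl = external_table_ddl(
    "main.sales.orders",
    [("order_id", "BIGINT"), ("amount", "DOUBLE")],
    "s3://example-bucket/tables/orders",
)
print(ddl)
# In a Databricks notebook the statement would typically be run with spark.sql(ddl).
```

Passing `fmt="PARQUET"` instead would produce an external table that is not a Delta table, which is the distinction the question is about.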

Remember that Delta tables offer additional features beyond external tables in other formats. They allow direct modification through inserts, updates, deletes, and merges, and they can be used as batch tables or as streaming sources and sinks. Delta Lake also ensures exactly-once processing across multiple streaming or concurrent batch jobs [3]. If you need these capabilities, create the external table in Delta format rather than in a plain file format such as CSV or Parquet. 😊
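As an illustration of the direct modification mentioned above, here is a hedged sketch of an upsert against a Delta table using `MERGE INTO`. The helper name `merge_sql`, the table names, and the key column are hypothetical, and the statement assumes the source and target share the same columns so that `UPDATE SET *` and `INSERT *` apply.

```python
def merge_sql(target, source, key):
    """Build a MERGE statement that upserts rows from source into target.

    Rows whose key matches are updated; all other source rows are inserted.
    """
    return (
        f"MERGE INTO {target} AS t USING {source} AS s "
        f"ON t.{key} = s.{key} "
        "WHEN MATCHED THEN UPDATE SET * "
        "WHEN NOT MATCHED THEN INSERT *"
    )


# Hypothetical table names and key column.
stmt = merge_sql("main.sales.orders", "main.sales.orders_updates", "order_id")
print(stmt)
# In a Databricks notebook the statement would typically be run with spark.sql(stmt).
```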

 


