Tables in Databricks

A Databricks table resides in a schema and contains the data. The default table type created is a
#Unity Catalog managed table.

Table Types

Databricks provide three primary table types:

  1. Managed Table - manages underlying data files along side the metastore registration.
  2. External Table - references data stored outside Databricks in an external storage system (cloud storage). It decouples management of underlying data files with metastore registration
  3. Foreign Table - represent data stored in external systems connected to Azure Databricks through Lakehouse Federation. Foreign tables are read-only on Databricks

Storage Format

The storage format defines how the data is physically structured and tracked on disk. Databricks supports two primary open table storage format:

Unity Catalog managed table

It is a type of table where the Unity Catalog fully governes and optimizes providing performance, operation advantage also providing an option to restore a table within a timeframe of 7 days after the table is dropped.


#todo #databricks