Delta Lake Integrations

Use the following frameworks, Delta Sharing clients, managed services, and community integrations
with Delta Lake and Delta Sharing.

UniForm

Delta Universal Format (UniForm) allows you to read Delta tables with Iceberg and Hudi clients.
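As a rough illustration of what enabling UniForm involves, the sketch below builds an `ALTER TABLE` statement that sets the relevant table properties. The property names `delta.universalFormat.enabledFormats` and `delta.enableIcebergCompatV2` are taken from the UniForm documentation as best recalled and should be checked against your Delta Lake version; the helper `uniform_alter_statement` and the table name `events` are hypothetical. Other requirements, such as column mapping, are not handled here.

```python
def uniform_alter_statement(table_name, formats=("iceberg",)):
    """Build an ALTER TABLE statement that sets UniForm table properties.

    Assumption: the property names below match your Delta Lake release.
    """
    props = {"delta.universalFormat.enabledFormats": ",".join(formats)}
    if "iceberg" in formats:
        # Assumed extra property required for Iceberg metadata generation.
        props["delta.enableIcebergCompatV2"] = "true"
    rendered = ", ".join(f"'{key}' = '{value}'" for key, value in props.items())
    return f"ALTER TABLE {table_name} SET TBLPROPERTIES ({rendered})"


statement = uniform_alter_statement("events")
print(statement)
```

The resulting string could be passed to whatever SQL engine manages the table, for example `spark.sql(statement)`.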


Frameworks

Use the following frameworks and languages including but not limited to Apache Flink, Apache Spark, Trino, and Rust.

Apache Druid

Druid

This connector allows Apache Druid to read from Delta Lake.

Apache Flink

Flink, standalone

This connector allows Apache Flink to write to Delta Lake.

Apache Hive

Hive, standalone

This connector allows Apache Hive to read from Delta Lake.

Apache Pulsar

Pulsar, community

This connector allows Apache Pulsar to read from and write to Delta Lake.

Apache Spark™

Spark

This connector allows Apache Spark™ to read from and write to Delta Lake.
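A minimal PySpark sketch of a write followed by a read might look like the following. It assumes `pyspark` is installed and that the Delta Lake jars are available to the session (for example through the `delta-spark` package); the function names `build_session` and `write_then_read` are hypothetical helpers, and the path you pass in is your own.

```python
# Session settings that register Delta Lake's SQL extension and catalog with Spark.
DELTA_SPARK_CONF = {
    "spark.sql.extensions": "io.delta.sql.DeltaSparkSessionExtension",
    "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.delta.catalog.DeltaCatalog",
}


def build_session(app_name="delta-demo"):
    """Create a SparkSession with the Delta settings applied."""
    # Imported lazily so this module loads even where pyspark is absent.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder.appName(app_name)
    for key, value in DELTA_SPARK_CONF.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()


def write_then_read(spark, path):
    """Write five rows as a Delta table at `path`, then load them back."""
    spark.range(5).write.format("delta").mode("overwrite").save(path)
    return spark.read.format("delta").load(path)
```

With a working installation, `write_then_read(build_session(), "/tmp/delta-demo")` would return a DataFrame backed by the new Delta table.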

ClickHouse

ClickHouse

ClickHouse is a column-oriented database that allows users to run SQL queries on Delta Lake tables. This provides a read-only integration with existing Delta Lake tables in Amazon S3.

Dagster

Dagster, Python

Use the Delta Lake IO Manager to read from and write to Delta Lake in your Dagster orchestration pipelines.

Delta Rust API

Rust, Python

This library gives Rust (with Python bindings) low-level access to Delta tables and is intended to be used with data processing frameworks such as datafusion, ballista, rust-dataframe, and vega.
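To give a sense of what "low-level access" to a Delta table means, here is a simplified, standard-library-only sketch that replays the JSON commit files in a table's `_delta_log` directory to find the data files that are currently live. It is not how the library itself is implemented: it ignores checkpoints, log cleanup, and every action type other than `add` and `remove`, and the function name `active_files` is hypothetical.

```python
import json
import re
import tempfile
from pathlib import Path

# Commit files are named with a 20-digit, zero-padded version number.
COMMIT_NAME = re.compile(r"\d{20}\.json")


def active_files(table_path):
    """Replay JSON commits in order and return the set of live data file paths."""
    log_dir = Path(table_path) / "_delta_log"
    commits = sorted(p for p in log_dir.iterdir() if COMMIT_NAME.fullmatch(p.name))
    live = set()
    for commit in commits:
        # Each non-empty line of a commit file is one JSON action.
        for line in commit.read_text().splitlines():
            if not line.strip():
                continue
            action = json.loads(line)
            if "add" in action:
                live.add(action["add"]["path"])
            elif "remove" in action:
                live.discard(action["remove"]["path"])
    return live


# Tiny demonstration with a hand-written two-commit log.
with tempfile.TemporaryDirectory() as tmp:
    log = Path(tmp) / "_delta_log"
    log.mkdir()
    (log / f"{0:020d}.json").write_text(
        json.dumps({"add": {"path": "part-0.parquet"}}) + "\n"
        + json.dumps({"add": {"path": "part-1.parquet"}}) + "\n"
    )
    (log / f"{1:020d}.json").write_text(
        json.dumps({"remove": {"path": "part-0.parquet"}}) + "\n"
    )
    files = active_files(tmp)

print(sorted(files))
```

In practice a library handles checkpoints, schema, and protocol versions for you, which is the main reason to prefer it over reading the log by hand.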

Delta Standalone

Scala, Java, standalone

This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.

FINOS Legend

FINOS, Legend

An extension to the FINOS Legend framework for Apache Spark™ / Delta Lake based environments, combining the best of open data standards with open source technologies.

Hopsworks

Hopsworks, Python

Hopsworks Feature Store stores, manages, and serves feature data in Delta Lake.

Kafka Delta Ingest

Kafka, Rust

This project builds a highly efficient daemon for streaming data through Apache Kafka into Delta Lake.

PrestoDB

PrestoDB, standalone

This connector allows PrestoDB to read from Delta Lake.

RisingWave

RisingWave, standalone

This connector allows RisingWave to write to Delta Lake.

SQL Delta Import

SQL, JDBC

This utility is for importing data from a JDBC source into a Delta Lake table.

StarRocks

StarRocks

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. StarRocks can read from Delta Lake.

Trino

Trino

This connector allows Trino to read from and write to Delta Lake.

Sharing

Use the following clients, from C++ to Rust, that integrate with Delta Sharing.

C++

C++, Delta Sharing, community

This connector allows a C++ client to read from a Delta Sharing endpoint.

Excel

Excel, Delta Sharing

This connector allows an Excel client to read from a Delta Sharing endpoint.

Go

Go, Delta Sharing, community

This connector allows a Go client to read from a Delta Sharing endpoint.

Java

Java, Delta Sharing, community

This connector allows a Java client to read from a Delta Sharing endpoint.

Kotosiro Sharing

Kotosiro, Delta Sharing, Rust

A minimalistic Rust implementation of a Delta Sharing server.

MLflow

MLflow, Delta Sharing, community

Perform model exchange via Delta Sharing and MLflow.

node.js

node.js, Delta Sharing, community

This connector allows node.js to read from a Delta Sharing endpoint.

Oracle

Oracle, Delta Sharing

This connector allows for Delta Sharing with Oracle Autonomous Database Data Studio.

Power BI

PowerBI, Delta Sharing

This connector allows Power BI to read from a Delta Sharing endpoint.

R

R, Delta Sharing, community

This connector allows an R client to read from a Delta Sharing endpoint.

Rust

Rust, Delta Sharing, community

This connector allows a Rust client to read from a Delta Sharing endpoint.

Terminal

Terminal, Delta Sharing, community

Terminal application for browsing Delta Sharing metadata.
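Delta Sharing clients are typically pointed at a server through a small JSON profile file, and some connectors (the Python one, for instance) address a table as `<profile-path>#<share>.<schema>.<table>`. The sketch below validates a profile and builds such a table URL using only the standard library. The field names follow the Delta Sharing profile format as best recalled; the endpoint, token placeholder, share, schema, and table names are made up for illustration, and `load_profile` and `table_url` are hypothetical helpers.

```python
import json

# Fields assumed to be required in a Delta Sharing profile file.
REQUIRED_KEYS = ("shareCredentialsVersion", "endpoint", "bearerToken")


def load_profile(text):
    """Parse profile JSON and check that the expected fields are present."""
    profile = json.loads(text)
    missing = [key for key in REQUIRED_KEYS if key not in profile]
    if missing:
        raise ValueError(f"profile is missing: {', '.join(missing)}")
    return profile


def table_url(profile_path, share, schema, table):
    """Build the '<profile-path>#<share>.<schema>.<table>' address form."""
    return f"{profile_path}#{share}.{schema}.{table}"


# Illustrative values only; none of these refer to a real server.
example = json.dumps({
    "shareCredentialsVersion": 1,
    "endpoint": "https://sharing.example.com/delta-sharing/",
    "bearerToken": "<token>",
})
profile = load_profile(example)
url = table_url("config.share", "my_share", "default", "events")
print(url)  # config.share#my_share.default.events
```

Each client listed above has its own way of consuming the profile, so check the connector's documentation for the exact call.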

Services

Use the managed services of your choice that integrate with Delta Lake.

Athena

Athena, AWS

This utility allows Athena to natively read from Delta Lake starting with Athena SQL 3.0.

AWS EMR

AWS, EMR

Starting with Amazon EMR release 6.9.0, you can use Apache Spark 3.x on Amazon EMR clusters with Delta Lake tables.

AWS Glue

AWS, Glue

AWS Glue 3.0 and later supports the Linux Foundation Delta Lake framework.

aws-pandas-sdk

pandas, AWS, awswrangler, aws-pandas-sdk

pandas on AWS: easy integration with AWS services, with Delta Lake support available as an optional dependency.

Azure Stream Analytics

Azure, ASA, Stream Analytics

Azure Stream Analytics provides native write support for Delta Lake.

BigQuery

BigQuery’s native Delta Lake support enables seamless delivery of data for downstream applications.

Databricks

Databricks, Azure, GCP, AWS

Delta Lake is included within Databricks, allowing it to read from and write to Delta Lake.

Microsoft Fabric

To achieve seamless data access across all compute engines in Microsoft Fabric, Delta Lake was chosen as the unified table format.

Power BI

PowerBI, community

This connector allows Power BI to read from Delta Lake.

Redshift

Redshift, AWS, manifest

This utility allows AWS Redshift to read from Delta Lake using a manifest file.

Snowflake (Beta)

Snowflake

This preview allows Snowflake to read from Delta Lake via an external table.

Starburst

Starburst, Azure, GCP, AWS

The Starburst Delta Lake connector is an extended version of the Trino/Delta Lake connector with identical configuration and usage.

StarTree

StarTree, Pinot

StarTree Cloud includes the Apache Pinot / Delta Lake connector.

Community

Try out the following community integrations with Delta Lake.

Apache Beam

Beam, standalone, community

This connector allows Apache Beam to read from Delta Lake.

Athena Query Federation (Beta)

AWS, Athena, standalone, community

This connector allows AWS Athena to read from Delta Lake.

Beam Delta Lake

Beam, standalone, community

With DataLakeIO, Apache Beam pipelines can read from and write to Delta Lake.

Ceph

Ceph, community

This connector allows you to read from and write to Delta tables on Ceph storage.

dlt | SparkR

SparkR, community

This package allows SparkR to read from and write to Delta Lake.

DataHub

DataHub, community

This connector allows DataHub to extract Delta Lake metadata.

Datastream Connector

GCS, Datastream, badal.io, community

As Datastream streams changes to files in Google Cloud Storage, this connector streams these files and writes the changes to Delta Lake.

MinIO

MinIO, community

This connector allows you to read from and write to Delta tables on MinIO storage.