Join the Delta Lake Community
Delta Lake is supported by more than 190 developers from over 70
organizations across multiple repositories.
Chat with fellow Delta Lake users and contributors, ask questions and share
tips.
Scheduled and Latest Videos

Watch Delta Rust code development LIVE on Twitch.tv!
with R. Tyler Croy (agentdero), Delta Lake maintainer,
Director of Platform Engineering, Scribd
Because of the contributions from our amazing community, Delta Lake is now up to 20M monthly downloads! We’re proud of the progress this project is making and invite you to get involved. Get started today.

Contribute
Help us build the simplest, most complete, battle-tested open-source
storage framework ever!
Below are a few great ways to get started to contribute.
Delta
192 contributors | 43 organizations
Delta Lake connectors for Apache Spark™, with Scala/Java and Python APIs, and for Java-based engines like Hive, Flink, PrestoDB, etc.
Delta Rust
50 contributors | 26 organizations
Delta Lake connector written natively in Rust, with Python bindings.
Delta Sharing
19 contributors | 2 organizations
An open protocol for simple and secure data sharing.
kafka-delta-ingest
6 contributors | 2 organizations
Delta Lake connector for streaming data from Kafka into Delta tables.
delta.io website
7 contributors | 3 organizations
Delta Lake website and documentation built on Gatsby.js framework.
Refer to the Delta Lake contribution guide for the latest on communication, coding style, and how to sign your work.
Refer to the Delta Lake Protocol for the specification for the Delta Transaction Protocol, which brings ACID properties to large collections of data, stored as files, in a distributed file system or object store.
Want to help or help out?
Looking for ways to help or to get help? Refer to the Getting Help Guide.
Governance
Delta Lake is an independent open-source project and not controlled by any single company. To emphasize this we joined the Delta Lake Project in 2019, which is a sub-project of the Linux Foundation Projects. Within the project, we make decisions based on these rules.
Delta Lake is supported by a wide set of developers from over 70 organizations across multiple repositories. Since 2019, more than 190 developers have contributed to Delta Lake! The Delta Lake community is growing by leaps and bounds with more than 6200 members in the Delta Users slack.
For more information, please refer to the founding technical charter.
Integrating a Data Warehouse and a Data Lake
FLORIAN VALEYE
STAFF DATA ENGINEER, BACK MARKET
This conversation from the Open Source Summit in Spain provides valuable insights into the significance of Delta Lake, the role of Rust in data engineering, and the collaborative nature of open source communities.
The Delta Lake open source project integrates data lakes and data warehouses, a needed combination in this new age of scale-out data requiring reliability and controls.
A data lakehouse integrates the advanced data analytics and low-cost storage of a data lake with the performance and reliability of a data warehouse, said Florian Valeye, a data engineer with Back Market, in this episode of The New Stack Makers, recorded at the Open Source Summit in Bilbao, Spain, earlier this fall.
The data warehouse is an approach that emerged over the past two decades. It consists of structured data models that allow for better performance. The datasets are small and constrained, Valeye said. In contrast, the data lake consists of unstructured data from multiple sources . The volume size of data lakes reaches petabytes or even exabytes.
Delta Lake, created by Databricks, breaks down the barriers between data warehouses and data lakes by providing more performance and features in the data lake, Valeye said.
ACID transactions are a staple of a data warehouse, Valeye said. When thinking about the relational database or a data warehouse, a focus is put on model representations and the data structure. A data lake is an open way to push data and add a schema. Data lakes magnify data. Through the “wall breaking,” the lake house provides ACID transactions, Read, Process, Interpret (RPI) ingestions, and metadata scalability. The strength comes with a way to attain knowledge for any usage without a barrier between the data analyst side of the house and the data engineering and data scientist teams.
Databricks is now working on providing ways for anyone to contribute their connectors through Delta Lake, allowing gateways that can be used with, for example, different databases.
“And that’s why it’s really nice when you are working on a specific cloud provider; you don’t want to be bundled, locked inside it,” Valerie said. “So that’s why having this kind of standard format, you can switch and move from one to another, and don’t feel stuck with one provider and one format.”
Valeye said Back Market sells refurbished devices. To determine device quality, the company developed an algorithm to determine whether a device is viable for sale. Back Market uses Delta Lake to ingest data from APIs and other data sources.
Delta Lake is a platform to connect data scientists and data engineers, said Valeye, who previously worked as a data engineer. Before using Delta Lake, deploying models could be complicated due to the complexities of the different tools and programming languages used. The Delta Lake infrastructure closes the gaps. It allows everyone to work on the same infrastructure.

Delta Lake: The Definitive Guide
Building modern data lakehouse architectures with Delta Lake with forewords by Michael Armbrust and Dominique Brezinski.
Download
Project Maintainers
Want to dive deeper into Delta Lake, please chat with any of our maintainers!
For more information on how to contribute, please refer to the Delta Lake contribution guide.
Name | Organization | Focal Areas |
---|---|---|
Michael Armbrust | Databricks | delta, connectors, delta-sharing |
R. Tyler Croy | Scribd | delta, delta-rs, kafka-delta-ingest |
Tathagata Das | Databricks | delta, connectors |
QP Hou | Neuralink | delta, delta-rs |
Will Jones | Voltron Data | delta, delta-rs |
Venki Korukanti | Databricks | delta, connectors |
Denny Lee | Databricks | delta, connectors, delta-sharing, delta-rs, website |
Mykhailo Osypov | Scribd | delta-rs, kafka-delta-ingest |
Robert Pack | BASF | delta, delta-rs |
Allison Portis | Databricks | delta, connectors |
Scott Sandre | Databricks | delta, connectors |
Florian Valeye | Backmarket | delta, delta-rs |
Thomas Vollmer | Microsoft | delta-rs |
Christian Williams | Scribd | delta, delta-rs, kafka-delta-ingest |
Ryan Zhu | Databricks | delta, connectors, delta-sharing |
Gerhard Brueckl | Paiqo GmbH | delta, connectors, delta-sharing |
Ion Koutsouris | ASML | delta, delta-rs |

Project Governance
Delta Lake is an independent open-source project and not controlled by any single company. To emphasize this we joined the Delta Lake Project in 2019, which is a sub-project of the Linux Foundation Projects. Within the project, we make decisions based on these rules.