The Linux Foundation Projects
Delta Lake

Delta Lake Blogs

Rivian expands the Delta Lake ecosystem with Delta-Go

By Chelsea Jones, Rahul Madnawat, Jason Shiverick

Real-time data ingestion for high-volume transactions, now available in open source

Pros and cons of Hive-style partitioning

By Matthew Powers, Martin Bode

This post discusses the pros and cons of Hive-style partitioning.

Structured Spark Streaming with Delta Lake: A Comprehensive Guide

By Delta Lake

The webinar demonstrates how to embrace structured streaming seamlessly from data emission to your final Delta table destination.

High-Performance Querying on Massive Delta Lake Tables with Daft

By Clark Zinzow, Jay Chia

This post introduces the distributed, parallel Delta Lake reader in Daft.

Delta Lake - State of the Project - Part 2

By Tathagata Das, Susan Pierce, Carly Akerly

Delta Lake, a project hosted under The Linux Foundation, has been growing by leaps and bounds. To celebrate the achievements of the project, we’re publishing a 2-part series on Delta Lake.

Delta Lake Announces Pandas Enhancement: Real Pandas to Optimize Data Lakehouse Performance

By Carly Akerly

The Delta Lake project is thrilled to announce its latest and most exciting collaboration with the Pandas community!

Delta Lake - State of the Project - Part 1

By Tathagata Das, Susan Pierce, Carly Akerly

Delta Lake, a project hosted under The Linux Foundation, has been growing by leaps and bounds. To celebrate the achievements of the project, we’re publishing a 2-part series on Delta Lake.

Delta Lake 3.1.0

By Carly Akerly

This post describes the exciting features in the Delta Lake 3.1.0 release.

Delta Lake replaceWhere

By Matthew Powers

Selectively overwriting rows or partitions of a Delta Lake table with replaceWhere.

Delta Lake Performance

By Joe Harris

This post explains why Delta Lake is fast and describes improvements to Delta Lake performance over time.

Writing a Kafka Stream to Delta Lake with Spark Structured Streaming

By Bo Gao, Matthew Powers

This blog post explains how to write a Kafka stream to a Delta table with Spark Structured Streaming.

Using Delta Lake with AWS Glue

By Keerthi Josyula, Matthew Powers

This post shows how to register Delta tables in the AWS Glue Data Catalog with the AWS Glue Crawler.