Skip to content

Latest commit

 

History

History
190 lines (122 loc) · 7.86 KB

File metadata and controls

190 lines (122 loc) · 7.86 KB
title description permalink
Awesome Delta Lake & Apache Iceberg Resources
Curated list of high-quality learning materials and tools for Delta Lake and Apache Iceberg practitioners.
/docs/awesome-list/

Awesome Delta Lake & Apache Iceberg Resources

A curated list of articles, blog posts, videos, and resources about Delta Lake and Apache Iceberg, automatically maintained by our community and AI-powered aggregator.

🌟 Featured Resources

Official Documentation

Specifications

Recent Articles

This section is automatically updated by our resource aggregator bot. New articles are added weekly and reviewed by the community.

Discovered: 2024-01-01

Delta Lake 3.0 brings significant improvements including better performance, enhanced schema evolution capabilities, and improved compatibility with Apache Spark 3.5.


Discovered: 2024-01-01

Comprehensive guide covering Iceberg architecture, design decisions, and best practices for production deployments.


📚 Learning Resources

Tutorials

  • [Delta Lake Quickstart]({{ '/docs/tutorials/getting-started/' | relative_url }}) - Get started with Delta Lake
  • [Iceberg Quickstart]({{ '/docs/tutorials/getting-started/' | relative_url }}) - Get started with Apache Iceberg
  • [Migration Guide: Parquet to Delta/Iceberg]({{ '/docs/tutorials/migration-guide/' | relative_url }}) - Convert existing data lakes

Video Content

Books

  • "Delta Lake: The Definitive Guide" by Denny Lee and Tristen Wentling
  • "Building the Data Lakehouse" by Bill Inmon, et al.

🛠️ Tools and Libraries

Delta Lake Ecosystem

Iceberg Ecosystem

Query Engines

AI-Powered Research Tools

  • Google NotebookLM - AI-powered research assistant for analyzing documents, PDFs, and notes
    • Community Notebook: Delta Lake & Iceberg Research Collection - A curated collection of research materials about Delta Lake and Apache Iceberg (Note: Access may require permission from the notebook owner)
    • Note: NotebookLM notebooks are private by default. To access shared content, the notebook owner must grant access or export summaries/insights
    • Access Methods:
      • Request access from the notebook owner
      • For enterprise users: NotebookLM Enterprise API provides programmatic access
      • Manual export: Notebook owners can share generated summaries, study guides, or audio overviews
    • Use Cases: Synthesizing research papers, creating study guides, generating insights from Delta Lake and Iceberg documentation

🏢 Case Studies

Delta Lake

  • Netflix: Processing petabytes of data with Delta Lake
  • Comcast: Real-time streaming analytics
  • Adobe: Marketing analytics at scale
  • Riot Games: Gaming analytics and ML pipelines

Apache Iceberg

  • Netflix: Original creator, uses Iceberg for data warehousing
  • Apple: Large-scale data processing
  • LinkedIn: Data platform modernization
  • Expedia: Travel data analytics

📊 Comparisons and Benchmarks

🎓 Courses and Training

Free Courses

Paid Courses

🔧 Integration Guides

Cloud Platforms

BI Tools

🎤 Community

Slack Channels

Mailing Lists

Meetups and Conferences

  • Data + AI Summit - Annual Databricks conference
  • ApacheCon - Apache Software Foundation conference
  • Local Data Engineering Meetups

🔬 Research Papers

🤝 Contributing

This awesome list is community-maintained. To add a resource:

  1. Check if it's already listed
  2. Ensure it's relevant and high-quality
  3. Submit a PR with your addition
  4. Include a brief description

Our AI-powered aggregator also discovers new content weekly and creates PRs for review.

See our Contributing Guide for details.

📜 License

This awesome list is part of the Delta Lake & Apache Iceberg Knowledge Hub, licensed under Apache 2.0.


Last Updated: 2025-11-14
Maintained By: Community + AI Aggregator 🤖