close

DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Apache Data Lakehouse Weekly: June 17 to June 24, 2026

Apache Data Lakehouse Weekly: June 17 to June 24, 2026

Comments
22 min read
Apache Iceberg in Production: Compaction, Catalogs, and the Pitfalls Nobody Warns You About

Apache Iceberg in Production: Compaction, Catalogs, and the Pitfalls Nobody Warns You About

Comments
5 min read
Why I set `dynamic: false` on my Elasticsearch audit-log stream

Why I set `dynamic: false` on my Elasticsearch audit-log stream

Comments
6 min read
Less Database, More Files

Less Database, More Files

Image 1
Comments
14 min read
Day 34: Advanced ClickHouse® Aggregating Functions

Day 34: Advanced ClickHouse® Aggregating Functions

Image 1
Comments
5 min read
Understanding Apache Airflow Architecture: How the Engine Runs Your Workflows

Understanding Apache Airflow Architecture: How the Engine Runs Your Workflows

Comments
3 min read
Merges and Mutations in ClickHouse®

Merges and Mutations in ClickHouse®

Image Image 2
Comments 1
4 min read
Your Data Engineering Take-Home Is Now 20 Hours of Free Work

Your Data Engineering Take-Home Is Now 20 Hours of Free Work

Comments
7 min read
Day 33: Understanding ClickHouse® Query Execution Plans

Day 33: Understanding ClickHouse® Query Execution Plans

Image Image 2
Comments
4 min read
Understanding Apache Airflow DAGs: Structure, Communication, and Deployment

Understanding Apache Airflow DAGs: Structure, Communication, and Deployment

Comments
2 min read
Why Payment Data Pipelines Break Under Real-Time Load (And How Banks Fix the Latency Problem)

Why Payment Data Pipelines Break Under Real-Time Load (And How Banks Fix the Latency Problem)

Comments
4 min read
Day 31: Ingesting Data from Kafka into ClickHouse®

Day 31: Ingesting Data from Kafka into ClickHouse®

Image 2
Comments
5 min read
Snowflake vs Databricks, BigQuery vs Redshift? The 2026 Guide to Right-Sizing Your Data Platform

Snowflake vs Databricks, BigQuery vs Redshift? The 2026 Guide to Right-Sizing Your Data Platform

Comments
8 min read
Phase 1: Document Ingestion - The Hidden Complexity Before Embeddings

Phase 1: Document Ingestion - The Hidden Complexity Before Embeddings

Comments
20 min read
Using Mise as a tool development manager when installing Apache Airflow.

Using Mise as a tool development manager when installing Apache Airflow.

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.