A curated collection of the best open source projects tagged "Data Infrastructure". Each listing includes a website screenshot along with a detailed review of its features.
Ultra-fast data transformation for AI with lineage
Stars
9,518
Forks
727
Last commit
11 hours ago
Open-source ETL framework built in Rust for AI workloads. Features incremental processing, data lineage, and observability tools for semantic search and RAG applications.