A curated collection of the best open source projects tagged "Data Infrastructure". Each listing includes a website screenshot along with a detailed review of its features.
Ultra-fast data transformation for AI with lineage
Stars
6,920
Forks
502
Last commit
5 hours ago
Open-source ETL framework built in Rust for AI workloads. Features incremental processing, data lineage, and observability tools for semantic search and RAG applications.