
CocoIndex

Open-source ETL framework built in Rust for AI workloads. Features incremental processing, data lineage, and observability tools for semantic search and RAG applications.

CocoIndex is an open-source ETL framework with a Rust-powered core engine, designed for AI applications such as semantic search, RAG, and knowledge graphs. It transforms data for these workloads with a focus on performance and developer velocity.

Key advantages:

  • Minimal code required - Get started with just ~100 lines of Python using declarative dataflow syntax
  • Incremental processing - Recomputes only the portions affected by a change and reuses cached results for everything else
  • Native building blocks - Standardized interfaces for sources, targets, and transformations with 1-line component switching
  • Single source of truth - Define once, run in multiple modes: batch, live updates, or fast preview runs
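
To make the incremental processing idea concrete, here is a minimal sketch in plain Python. It does not use the CocoIndex API and is not how CocoIndex is implemented internally; the `IncrementalTransformer` class, its `run` method, and the content-hash cache are hypothetical names chosen for illustration. The sketch assumes each source document has a stable key and that a transformation's output depends only on the document's content.

```python
import hashlib
from typing import Callable, Dict, Tuple


class IncrementalTransformer:
    """Hypothetical helper: reuses cached results for unchanged documents."""

    def __init__(self, transform: Callable[[str], str]) -> None:
        self.transform = transform
        # Maps document key -> (content hash, cached transform result).
        self._cache: Dict[str, Tuple[str, str]] = {}
        # Counts how many times the transform actually ran.
        self.recomputed = 0

    def run(self, documents: Dict[str, str]) -> Dict[str, str]:
        results: Dict[str, str] = {}
        for key, text in documents.items():
            digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
            cached = self._cache.get(key)
            if cached is not None and cached[0] == digest:
                # Content unchanged since the last run: reuse the cached result.
                results[key] = cached[1]
                continue
            # New or modified document: recompute and refresh the cache.
            output = self.transform(text)
            self.recomputed += 1
            self._cache[key] = (digest, output)
            results[key] = output
        # Drop cache entries for documents that no longer exist in the source.
        for stale in set(self._cache) - set(documents):
            del self._cache[stale]
        return results


if __name__ == "__main__":
    pipeline = IncrementalTransformer(str.upper)
    pipeline.run({"a.md": "hello", "b.md": "world"})   # both documents processed
    pipeline.run({"a.md": "hello", "b.md": "world!"})  # only b.md reprocessed
    print(pipeline.recomputed)
```

In this sketch the second run invokes the transform only for the document whose content changed. A real engine would also need to track changes to the transformation logic itself, which this example leaves out.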

The CocoInsight companion tool provides data lineage and observability, so you can inspect a pipeline step by step without deep data engineering expertise. This is intended to boost developer velocity and lower the barrier to data engineering.

CocoIndex is positioned as production-ready from day zero, with automatic schema management, a cloud-native architecture, and enterprise features including VPC deployments, a guaranteed SLA, and data governance. It is available as open source (Apache 2.0) for self-hosting, with free personal-use options and enterprise support tiers.
