Data Pipelines

Alternatives to DocETL

Compare healthier or more active tools in the same use case using ToolVitals public evidence.

Why compare

Signals behind this alternatives page

  • DocETL currently shows at least one risk or watch signal in ToolVitals data.
  • The tool is still alive, but signs of slower momentum are visible.
  • Health score: 67/100.
  • Shipping score: 25/100.

Switching guide

Best alternatives by need

Best overallDagster

Highest organic ToolVitals fit for this use case.

Healthier optionsDagster, Apache Airflow, dbt, CocoIndex

Stronger current public-health signal than DocETL.

Verified open/source-visibleDagster, Apache Airflow, dbt, CocoIndex

Useful when portability and inspectability matter.

License unknown, source-visible signalsBenthos, RudderStack, Pathway

Public source signals exist, but ToolVitals has not verified the license class.

Trust note: Sponsors and affiliate links are separate from rankings. ToolVitals does not let monetization change scores, risk labels, organic ordering, or evidence display.

Ranked alternatives

Best-fit data pipelines options

12 tools
01

Dagster

An orchestration platform for the development, production, and observation of data assets.

Active
Health100
Shipping100
Score100
Confidence100
Stars15.7k
ModeSource-visible project
02

Apache Airflow

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Active
Health93
Shipping95
Score95
Confidence100
Stars45.8k
ModeSource-visible project
03

dbt

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

Active
Health98
Shipping95
Score98
Confidence100
Stars13k
ModeSource-visible project
04

CocoIndex

Incremental engine for long horizon agents 🌟 Star if you like it!

Active
Health95
Shipping100
Score98
Confidence100
Stars10.3k
ModeSource-visible project
05

Benthos

Data streaming processor with yaml-driven pipeline configuration

Active
Health95
Shipping100
Score98
Confidence98
Stars8.7k
ModeSource-visible project
07

Sail

Drop-in Apache Spark replacement written in Rust for batch and streaming workloads.

Active
Health93
Shipping100
Score97
Confidence99
Stars2.9k
ModeSource-visible project
08

MLRun

MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.

Active
Health93
Shipping100
Score97
Confidence100
Stars1.7k
ModeSource-visible project
09

Bruin

Data pipeline platform with SQL, Python, and quality checks.

Active
Health91
Shipping100
Score96
Confidence95
Stars1.6k
ModeSource-visible project
10

HPCC Systems

Open-source distributed data processing and analytics platform for large-scale data workflows.

Active
Health91
Shipping100
Score95
Confidence94
ModeSource-visible project
11

Pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Active
Health87
Shipping87
Score90
Confidence100
Stars63k
ModeSource-visible project
12

CloudQuery

Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.

Active
Health87
Shipping95
Score93
Confidence100
Stars6.4k
ModeSource-visible project