Data Pipelines

Alternatives to dbt

Compare healthier or more active tools for the same use case, based on ToolVitals public evidence.

Why compare

Signals behind this alternatives page

  • Maintenance and shipping signals are strong.
  • Health score: 98/100.
  • Shipping score: 95/100.

Switching guide

Best alternatives by need

Best overall: Dagster

Highest organic ToolVitals fit for this use case.

Healthier options: Dagster

Stronger current public-health signal than dbt.

Verified open/source-visible: Dagster, Apache Airflow, CocoIndex, Sail

Useful when portability and inspectability matter.

License unknown, source-visible signals: Benthos, RudderStack, Pathway

Public source signals exist, but ToolVitals has not verified the license class.

Trust note: Sponsors and affiliate links are separate from rankings. ToolVitals does not let monetization change scores, risk labels, organic ordering, or evidence display.

Ranked alternatives

Best-fit data pipeline options

12 tools
01

Dagster

An orchestration platform for the development, production, and observation of data assets.

Active
Health: 100
Shipping: 100
Score: 100
Confidence: 100
Stars: 15.7k
Mode: Source-visible project
02

Apache Airflow

A platform to programmatically author, schedule, and monitor workflows.

Active
Health: 93
Shipping: 95
Score: 95
Confidence: 100
Stars: 45.8k
Mode: Source-visible project
03

CocoIndex

Incremental engine for long-horizon agents.

Active
Health: 95
Shipping: 100
Score: 98
Confidence: 100
Stars: 10.3k
Mode: Source-visible project
04

Benthos

Data streaming processor with YAML-driven pipeline configuration.

Active
Health: 95
Shipping: 100
Score: 98
Confidence: 98
Stars: 8.7k
Mode: Source-visible project
05

RudderStack

Privacy- and security-focused Segment alternative, written in Golang and React.

Active
Health: 96
Shipping: 93
Score: 96
Confidence: 100
Stars: 4.4k
Mode: Source-visible project
06

Sail

Drop-in Apache Spark replacement written in Rust for batch and streaming workloads.

Active
Health: 93
Shipping: 100
Score: 97
Confidence: 99
Stars: 2.9k
Mode: Source-visible project
07

MLRun

MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.

Active
Health: 93
Shipping: 100
Score: 97
Confidence: 100
Stars: 1.7k
Mode: Source-visible project
08

Bruin

Data pipeline platform with SQL, Python, and quality checks.

Active
Health: 91
Shipping: 100
Score: 96
Confidence: 95
Stars: 1.6k
Mode: Source-visible project
09

HPCC Systems

Open-source distributed data processing and analytics platform for large-scale data workflows.

Active
Health: 91
Shipping: 100
Score: 95
Confidence: 94
Mode: Source-visible project
10

Pathway

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Active
Health: 87
Shipping: 87
Score: 90
Confidence: 100
Stars: 63k
Mode: Source-visible project
11

CloudQuery

Data pipelines for cloud config and security data. Build cloud asset inventory, CSPM, FinOps, and vulnerability management solutions. Extract from AWS, Azure, GCP, and 70+ cloud and SaaS sources.

Active
Health: 87
Shipping: 95
Score: 93
Confidence: 100
Stars: 6.4k
Mode: Source-visible project
12

Jitsu

Jitsu is an open-source Segment alternative: a fully scriptable data ingestion engine for modern data teams. Set up a real-time data pipeline in minutes, not days.

Active
Health: 88
Shipping: 95
Score: 93
Confidence: 100
Stars: 4.8k
Mode: Source-visible project