Data Pipelines

Data integration, ETL, ELT, ingestion, and transformation tools.

Open/source-visible picks

Health-first ToolVitals score, then adoption

01
Dagster Top Pick
An orchestration platform for the development, production, and observation of data assets.
OSI-approved OSS
15.6k stars Active 100
02
dbt
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
OSI-approved OSS
13k stars Active 98
03
Apache Airflow
A platform to programmatically author, schedule, and monitor workflows.
OSI-approved OSS
45.7k stars Active 98

Proprietary targets

Replacement targets, not open-tool picks

01
Data streaming processor with yaml-driven pipeline configuration
Active 98
02
Privacy- and security-focused Segment alternative, written in Go and React
Active 96
03
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Active 90

1 tool is on Evidence Watch.

BI & Dashboards

Business intelligence, dashboards, reporting, and analytics workspaces.

Open/source-visible picks

Health-first ToolVitals score, then adoption

01
Grafana Top Pick
Open-source observability platform
Open core
74.3k stars Active 100
02
Open-source business intelligence
Open core
47.6k stars Active 100
03
Thunderbird for Android
Open-source email app for Android (formerly K-9 Mail).
OSI-approved OSS
13.6k stars Active 96

Proprietary targets

Replacement targets, not open-tool picks

01
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer.
Active 99
02
Open-Source Self-Service Analytics Platform
Warning 78
03
AI-powered waste-to-energy platform with real-time monitoring, predictive analytics, and community engagement.
Warning 68

1 tool is on Evidence Watch.

Data Catalogs & Portals

Tools for cataloging, publishing, discovering, and managing metadata for datasets across one or more data sources.

Open/source-visible picks

Health-first ToolVitals score, then adoption

01
Project Nessie Top Pick
Transactional catalog for data lakes with Git-like semantics.
OSI-approved OSS
1.5k stars Active 94
02
Portable data catalog that can run without a server.
OSI-approved OSS
0 stars Active 83
03
A lightweight Python framework for financial data discovery, indexing, retrieval and ingestion.
OSI-approved OSS
0 stars Warning 77

Data Annotation & Labeling

Tools for labeling, annotating, reviewing, and managing datasets for machine learning or data workflows.

Open/source-visible picks

Health-first ToolVitals score, then adoption

01
Open-source research tool for searching, browsing, analyzing, and exploring large document collections, built on a semantic search engine and an open-source text mining and text analytics platform. It integrates ETL for document processing, OCR for images and PDFs, named entity recognition for persons, organizations, and locations, metadata management with thesauri and ontologies, and a search user interface with search apps for full-text search, faceted search, and a knowledge graph.
OSI-approved OSS
1.2k stars Critical 31
02
The Open-Source Data Annotation Platform
OSI-approved OSS
1.2k stars Critical 31

Proprietary targets

Replacement targets, not open-tool picks

01
An On-Chain Open-Source Platform for Rapid AI Model Productization Using Decentralized Resources with Flexibility and Scalability
Critical 37

Clinical Decision Support

Healthcare tools that support clinical decision-making, care review workflows, and specialty medical recommendations.

Proprietary targets

Replacement targets, not open-tool picks

01
Clinical decision support platform for oncology tumor boards.
Warning 73

Data Catalogs & Portals

Tools for cataloging, publishing, discovering, and managing metadata for datasets across one or more data sources.

Open/source-visible picks

Health-first ToolVitals score, then adoption

01
CKAN Top Pick
CKAN is an open-source data management system (DMS) for powering data hubs and data portals. CKAN makes it easy to publish, share, and use data. It powers catalog.data.gov, open.canada.ca/data, and data.humdata.org, among many other sites.
OSI-approved OSS
5k stars Active 87

Inventory & Collection Management

Tools for tracking personal, household, parts, asset, or collection inventories outside full warehouse operations.

Open/source-visible picks

Health-first ToolVitals score, then adoption

01
ItemPlus Top Pick
Open-source inventory and collection management system.
OSI-approved OSS
2 stars Warning 71

Awaiting Use-Case Review

These tools fit this broad category, but still need a more specific use-case assignment.

No scored tools in this use case yet.

13 tools are on Evidence Watch.