PSBigBig + MiniPS onestardao

WFGY, led by the AI Troubleshooting Atlas 🗺

We build WFGY, an open-source reasoning and debugging ecosystem for AI systems.
The strongest practical entry point is now the AI Troubleshooting Atlas.

One lineage, multiple public entry points, one growing ecosystem under the MIT License.

Quick navigation

⭐️ AI Troubleshooting Atlas : the main practical entry for broken RAG, agent, and AI workflows.
⭐️ Global Debug Card : image-first triage for a single failing run.
⭐️ Ecosystem Map : canonical map of how the public WFGY system fits together.
⭐️ WFGY 3.0 : frontier reasoning and long-horizon evaluation surface.

If WFGY helps your workflow or thinking, a ⭐ on the repo helps more people discover it.

Who is WFGY for

WFGY is for people who need structured debugging and serious reasoning, not just another prompt recipe.

RAG and agent teams : pipelines run, infra looks healthy, but answers are still wrong or unstable.
Infra and platform owners : you need a way to inspect and route failures across models, tenants, or deployments.
Researchers and evaluation teams : you study reasoning, robustness, safety, or stress tests and want concrete observables.
Founders, PMs, and domain experts : you carry difficult AI workflows and want a more structured system for treating them.

If you do not fit neatly into any of the above, start with the Atlas or the Global Debug Card.

Entry points

⭐️ AI Troubleshooting Atlas : route-first troubleshooting surface for real AI failures.
⭐️ WFGY RAG 16 Problem Map : classic 16-problem checklist for broken RAG flows.
⭐️ WFGY Global Debug Card : image-first triage layer for diagnosing a single failing run.
⭐️ WFGY 1.0 : earliest public conceptual foundation.
⭐️ WFGY 2.0 : current reasoning and diagnostic kernel.
⭐️ WFGY 3.0 : frontier TXT-based reasoning and evaluation surface.

Why WFGY looks like an ecosystem

WFGY is not a single page, a single chart, or a single claim.

The public system is easiest to read as:

one version lineage : WFGY 1.0 → WFGY 2.0 → WFGY 3.0
one strong practical wedge : AI Troubleshooting Atlas, Problem Map, Global Debug Card, and Global Fix Map
one wider application and evaluation surface : TXTOS, related modules, and WFGY 3.0
one public proof and collaboration layer : Adopters, Case Evidence, Recognition Map, Evidence Timeline, Work with WFGY, and Support

The goal is simple: make reasoning failures more visible, reproducible, and fixable.

Public proof and ecosystem integration

⭐️ Adopters : shortest public adoption summary.
⭐️ Case Evidence : what those integrations imply in real systems.
⭐️ Recognition Map : broader ecosystem record of integrations, citations, curated lists, and public mentions.
⭐️ Evidence Timeline : historical timeline of how WFGY became public, usable, and externally legible.

Most current public references point first to the WFGY ProblemMap / 16-problem failure checklist line.
A smaller but growing set also uses WFGY 3.0 as a long-horizon, TXT-based reasoning and evaluation surface.

This does not mean every project uses the full WFGY ecosystem.
In many cases, WFGY appears first as a practical diagnostic layer for RAG and agent pipelines.

Representative integrations

Project	Segment	How it uses WFGY ProblemMap	Proof (PR / doc)
LlamaIndex	Mainstream RAG infra	Integrates the WFGY 16-problem RAG failure checklist into its official RAG troubleshooting docs as a structured failure mode reference.	PR #20760
RAGFlow	Mainstream RAG engine	Introduced a RAG failure modes checklist guide to the RAGFlow documentation via PR, adapted from the WFGY 16-problem failure map for step-by-step RAG pipeline diagnostics.	PR #13204
FlashRAG (RUC NLPIR Lab)	Academic lab / RAG research toolkit	Adapts the WFGY ProblemMap as a structured RAG failure checklist in its documentation. The 16-mode taxonomy is cited to support reproducible debugging and systematic failure-mode reasoning for RAG experiments.	PR #224
DeepAgent (RUC NLPIR Lab)	Academic lab / agent research	Adds a multi-tool agent failure modes troubleshooting note inspired by WFGY-style debugging concepts for diagnosing tool selection loops, tool misuse, and multi-tool workflow failures in agent pipelines.	PR #15
ToolUniverse (Harvard MIMS Lab)	Academic lab / tools	Provides a `WFGY_triage_llm_rag_failure` tool that wraps the 16 mode map for incident triage.	PR #75
Rankify (University of Innsbruck)	Academic lab / system	Uses the 16 failure patterns in RAG and re-ranking troubleshooting docs.	PR #76
Multimodal RAG Survey (QCRI LLM Lab)	Academic lab / survey	Cites WFGY as a practical diagnostic resource for multimodal RAG.	PR #4
LightAgent	Agent framework	Incorporates WFGY ProblemMap concepts into its documentation via a Multi-agent troubleshooting (failure map) section, providing a structured symptom → failure-mode → debugging checklist for diagnosing role drift, cross-agent memory issues, and coordination failures in multi-agent systems.	PR #24

Work with or support WFGY

If you maintain an AI system, research project, or infra platform and want to explore collaboration around WFGY, start here:

⭐️ Work with WFGY : entry point for pilots, audits, and structured collaboration.
⭐️ Pilot Offer One-Pager : compact view of what a WFGY pilot can look like.
⭐️ Sample Deliverable : sample structure of a WFGY pilot return package.
⭐️ Support WFGY : support the continued development of the public ecosystem.

You can also:

open an issue describing your current system and failure modes
reference the matching WFGY ProblemMap number if you already know it
reach out via Discord for exploratory discussion

We are especially interested in:

RAG or agent teams testing WFGY diagnostics in production-like settings
research groups designing stress tests or observables on the atlas line
platform owners exposing WFGY-style diagnostics to their users

The long-term goal is simple: make reasoning and debugging layers a normal, visible part of AI systems.

Explore more

⭐️ Main WFGY repository
⭐️ AI Troubleshooting Atlas
⭐️ Ecosystem Map
⭐️ Recognition Map

If this ecosystem is useful to you, a ⭐ helps more people discover it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly