GFM-RAG¶
Welcome to the documentation for GFM-RAG project.
Overview¶
The GFM-RAG is the first graph foundation model-powered RAG pipeline that combines the power of graph neural networks to reason over graphs and retrieve relevant documents for question answering.

We first build a graph-index from the documents to capture the relationships between knowledge. Then, we feed the query and constructed graph-index into the pre-trained graph foundation model (GFM) retriever to obtain relevant documents for LLM generation. The GFM retriever experiences large-scale training and can be directly applied to unseen datasets without fine-tuning.
GFM-RAG is designed to be efficient and generalizable. You can bring your own dataset and directly apply the pre-trained GFM retriever to obtain relevant documents for question answering. You can also fine-tune the GFM retriever on your own dataset to improve performance on specific domains.
For more details, please refer to our project and papers: GFM-RAG, G-reasoner.
Features¶
- Graph Foundation Model (GFM): A graph neural network-based retriever that can reason over the graph-index.
- Universal Graph Index: A universal graph index that can represent various types of structural knowledge such as Knowledge Graphs, Document Graphs, and Hierarchical Graphs.
- Efficiency: The GFM-RAG pipeline is efficient in conducting multi-hop reasoning with single-step retrieval.
- Generalizability: The GFM-RAG can be directly applied to unseen datasets without fine-tuning.
- Transferability: The GFM-RAG can be fine-tuned on your own dataset to improve performance on specific domains.
- Compatibility: The GFM-RAG is compatible with arbitrary agent-based framework to conduct multi-step reasoning.
- Interpretability: The GFM-RAG can illustrate the captured reasoning paths for better understanding.
Choose Your Path¶
- Quick Start: shortest runnable path for a new user
- Workflow: general usage from data formatting through training
- Experiment: script-first paper reproduction for
GFM-RAGandG-reasoner - API Reference: developer-facing classes and modules
How GFM-RAG And G-reasoner Relate¶
- GFM-RAG is the original graph foundation model retriever and remains the baseline workflow and published checkpoint family.
- G-reasoner is the latest version of the graph foundation model retriever, which has new architecture and better performance.
- Both lines share the same indexing, dataset, QA, and documentation structure in this site.
Fastest Path¶
If you want to run something with minimal setup:
- Follow Install.
- Prepare a tiny dataset with
raw/documents.json. - Start from Quick Start.