G-reasoner Configs¶
This page groups the task presets under gfmrag/workflow/config/gfm_reasoner/.
Directory Layout¶
| File | Purpose | Typical entrypoint |
|---|---|---|
index_dataset.yaml |
Build processed/stage1/ from raw data |
python -m gfmrag.workflow.index_dataset --config-name gfm_reasoner/index_dataset |
kgc_trianing.yaml |
Run KGC pretraining for the G-reasoner model family |
python -m gfmrag.workflow.kgc_training --config-name gfm_reasoner/kgc_trianing |
qa_inference.yaml |
Run QA from saved retrieval outputs | python -m gfmrag.workflow.qa --config-name gfm_reasoner/qa_inference |
sft_training.yaml |
Run supervised fine-tuning | python -m gfmrag.workflow.sft_training --config-name gfm_reasoner/sft_training |
sft_training_w_answer.yaml |
Run SFT with additional answer supervision | python -m gfmrag.workflow.sft_training --config-name gfm_reasoner/sft_training_w_answer |
stage3_qa_ircot_inference.yaml |
Run retrieval plus IRCOT-style reasoning | python -m gfmrag.workflow.qa_ircot_inference --config-name gfm_reasoner/stage3_qa_ircot_inference |
visualize_path.yaml |
Visualize reasoning paths on dataset examples | visualization workflow |
How To Read This Folder¶
The gfm_reasoner presets follow a stable pattern:
hydra.run.dircontrols the output root.defaultspulls in shared component groups such asner_model,openie_model,el_model,text_emb_model, andwandb.- task-specific sections such as
dataset,datasets,graph_retriever,model,trainer,llm, andtestthen override the shared pieces.