Installation Guide

Prerequisites

Before installing GFM-RAG, make sure your system meets these requirements:

  • Python 3.12 or higher
  • CUDA 12 or higher (for GPU support)
  • Poetry (recommended for development)
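
You can check the first two requirements from a shell (nvcc is only present once a CUDA toolkit is installed):

Bash
python --version  # should print 3.12 or higher
nvcc --version    # should report CUDA 12 or higher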

Installation Methods

Install via Conda

Conda provides an easy way to install the CUDA development toolkit, which is required by GFM-RAG:

Bash
conda create -n gfmrag python=3.12
conda activate gfmrag
conda install cuda-toolkit -c nvidia/label/cuda-12.4.1 # Replace with your desired CUDA version
pip install gfmrag
TORCH=$(python -c "import torch; print(torch.__version__)")
pip install torch_scatter torch_sparse -f https://data.pyg.org/whl/torch-${TORCH}.html
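
After the install completes, you can verify that PyTorch detects CUDA:

Bash
python -c "import torch; print(torch.__version__, torch.cuda.is_available())"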

Install via Pip

Bash
pip install gfmrag

Then install the extension packages. Make sure to install the versions of torch_scatter and torch_sparse that match your PyTorch and CUDA versions:

Bash
TORCH=$(python -c "import torch; print(torch.__version__)")
pip install torch_scatter torch_sparse -f https://data.pyg.org/whl/torch-${TORCH}.html
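
Here TORCH expands to the full PyTorch version string, including the CUDA tag (for example 2.4.0+cu124), so the matching wheel index is selected automatically. A quick import check confirms that both packages installed correctly:

Bash
python -c "import torch_scatter, torch_sparse; print('ok')"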

Install from Source

For contributors or those who want to install from source, follow these steps:

  1. Clone the repository:

    Bash
    git clone https://github.com/RManLuo/gfm-rag.git
    cd gfm-rag
    

  2. Install Poetry:
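
    For example, using pipx (see python-poetry.org for other installation methods):

    Bash
    pipx install poetry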

  3. Create and activate a conda environment:

    Bash
    conda create -n gfmrag python=3.12
    conda activate gfmrag
    conda install cuda-toolkit -c nvidia/label/cuda-12.4.1 # Replace with your desired CUDA version
    

  4. Install project dependencies:

    Bash
    poetry install
    TORCH=$(python -c "import torch; print(torch.__version__)")
    pip install torch_scatter torch_sparse -f https://data.pyg.org/whl/torch-${TORCH}.html
    
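To confirm the installation, try importing the package (assuming the import name matches the PyPI package name, gfmrag):

Bash
python -c "import gfmrag; print('gfmrag imported successfully')"
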

Optional Components

Llama.cpp Integration

If you plan to host LLMs locally via Llama.cpp:

Install llama-cpp-python:

Bash
pip install llama-cpp-python
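
By default this builds a CPU-only wheel. To offload model layers to the GPU, llama-cpp-python can be built with CUDA support via CMake flags; the exact flag depends on the version (GGML_CUDA=on in recent releases), so check the llama-cpp-python documentation:

Bash
CMAKE_ARGS="-DGGML_CUDA=on" pip install llama-cpp-python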

For more information, visit the following resources:

  • LangChain Llama.cpp
  • llama-cpp-python repository

Ollama Integration

If you plan to use Ollama for hosting LLMs:

Install the Ollama Python client packages:

Bash
pip install langchain-ollama
pip install ollama
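
Note that these packages are Python clients only; the Ollama server itself is installed separately. On Linux, one way is the official install script (see ollama.com for macOS and Windows installers):

Bash
curl -fsSL https://ollama.com/install.sh | sh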

For more information, visit the following resources:

  • LangChain Ollama

Troubleshooting

CUDA errors when compiling rspmm kernel

GFM-RAG requires the nvcc compiler to compile the rspmm kernel. If you encounter CUDA-related errors, make sure the CUDA toolkit is installed and the nvcc compiler is on your PATH. Also make sure the CUDA_HOME variable is set correctly to avoid compilation errors, e.g.:

Bash
export CUDA_HOME=/usr/local/cuda-12.4

Usually, if you install CUDA toolkit via conda, the CUDA_HOME variable is set automatically.
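
You can verify the compiler and the variable like this:

Bash
which nvcc       # should print a path inside your CUDA installation
nvcc --version   # should match the CUDA version of your PyTorch build
echo $CUDA_HOME  # should point to the same installation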

Stuck when compiling rspmm kernel

Sometimes the compilation of the rspmm kernel may get stuck. If you encounter this issue, try removing the compilation cache under ~/.cache/torch_extensions/ and recompiling the kernel.
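
For example (this is the default cache location; it differs if TORCH_EXTENSIONS_DIR is set):

Bash
rm -rf ~/.cache/torch_extensions/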

For more help, please check our GitHub issues or create a new one.