Iterative Chain-of-Thought Refinement

  • Tech Stack: PyTorch, Hugging Face, LoRA, LLaMA, Gemma, GPT-2 Embeddings, CUDA, NLTK
  • Github URL: Project Link

This project introduces a self-correcting feedback loop between a LLaMA-3B generator and a Gemma-2B verifier that iteratively improves reasoning chains. Evaluated on GSM8K and REVEAL datasets.