Skip to content
#

rouge-metric

Here is 1 public repository matching this topic...

An end-to-end pipeline for adapting FLAN-T5 for dialogue summarization, exploring the full spectrum of modern LLM tuning. Implements and compares Full Fine-Tuning, PEFT (LoRA), and Reinforcement Learning (RLHF) for performance and alignment. Features a PPO-tuned model to reduce toxicity, in-depth analysis notebooks, and interactive Streamlit demo.

  • Updated Aug 4, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the rouge-metric topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rouge-metric topic, visit your repo's landing page and select "manage topics."

Learn more