rouge-metric

Here is 1 public repository matching this topic...

nabeelshan78 / flanT5-ICL-SFT-PEFT-RLHF

An end-to-end pipeline for adapting FLAN-T5 to dialogue summarization, covering the main approaches to LLM tuning. Implements and compares full fine-tuning, PEFT (LoRA), and reinforcement learning from human feedback (RLHF) in terms of performance and alignment. Includes a PPO-tuned model that reduces toxicity, in-depth analysis notebooks, and an interactive Streamlit demo.

machine-learning reinforcement-learning deep-learning transformers lora rouge-metric fine-tuning huggingface streamlit-webapp prompt-tuning large-language-models prompt-engineering generative-ai rlhf flan-t5 llms-benchmarking peft-fine-tuning-llm

Updated Aug 4, 2025
Jupyter Notebook

