This repository contains code for fine-tuning a GPT-2 model on the IMDB dataset using Proximal Policy Optimization (PPO). The goal is to train the model to generate positive-sentiment reviews. The training process uses the `trl` library for reinforcement learning, the `transformers` library for model handling, and `datasets` for dataset management.
- Installation
- Usage
- Code Overview
- Saving the Model
- Hugging Face Model
- Reference Papers
## Installation

To run the code, install the required packages with `pip`:

```bash
pip install -r requirements.txt
```
## Usage

To start the training process, simply run the script:

```bash
python train.py
```
## Code Overview

- The `build_dataset` function constructs the dataset for training. It tokenizes the IMDB reviews, filters out reviews with fewer than 200 tokens, and truncates each review to a random length to form the input prompt.
- The `collator` function formats the data into batches.
- The main training loop fine-tunes the model using PPO: it generates responses, computes rewards with a sentiment-analysis model, and updates the model. A sketch of how these pieces fit together follows this list.
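Below is a minimal sketch of how these steps are typically wired together with `trl`'s `PPOTrainer` API. The model names, hyperparameters, and generation settings shown here are assumptions for illustration and may differ from what `train.py` actually uses.

```python
import torch
from datasets import load_dataset
from transformers import AutoTokenizer, pipeline
from trl import AutoModelForCausalLMWithValueHead, PPOConfig, PPOTrainer
from trl.core import LengthSampler

# Assumed configuration; the real script may use different values.
config = PPOConfig(model_name="gpt2", learning_rate=1.41e-5, batch_size=16, mini_batch_size=4)
tokenizer = AutoTokenizer.from_pretrained(config.model_name)
tokenizer.pad_token = tokenizer.eos_token

def build_dataset(min_tokens=200, input_min_len=2, input_max_len=8):
    """Tokenize IMDB reviews, keep long ones, and truncate to random prompt lengths."""
    ds = load_dataset("imdb", split="train")
    ds = ds.map(lambda s: {"input_ids": tokenizer.encode(s["text"])})
    ds = ds.filter(lambda s: len(s["input_ids"]) > min_tokens)   # drop short reviews
    input_size = LengthSampler(input_min_len, input_max_len)     # random prompt length
    def truncate(s):
        s["input_ids"] = s["input_ids"][: input_size()]
        s["query"] = tokenizer.decode(s["input_ids"])
        return s
    ds = ds.map(truncate)
    ds.set_format(type="torch")
    return ds

def collator(data):
    """Turn a list of samples into a dict of lists, as PPOTrainer expects."""
    return {key: [d[key] for d in data] for key in data[0]}

model = AutoModelForCausalLMWithValueHead.from_pretrained(config.model_name)
ref_model = AutoModelForCausalLMWithValueHead.from_pretrained(config.model_name)
ppo_trainer = PPOTrainer(config, model, ref_model, tokenizer,
                         dataset=build_dataset(), data_collator=collator)

# Sentiment model used as the reward function (assumed checkpoint).
sentiment_pipe = pipeline("sentiment-analysis", model="lvwerra/distilbert-imdb")
gen_kwargs = {"min_length": -1, "top_k": 0.0, "top_p": 1.0, "do_sample": True,
              "pad_token_id": tokenizer.eos_token_id, "max_new_tokens": 32}

for batch in ppo_trainer.dataloader:
    query_tensors = batch["input_ids"]
    # Generate a continuation for each prompt.
    response_tensors = ppo_trainer.generate(query_tensors, return_prompt=False, **gen_kwargs)
    batch["response"] = tokenizer.batch_decode(response_tensors)
    # Score query + response with the sentiment model; use the POSITIVE score as reward.
    texts = [q + r for q, r in zip(batch["query"], batch["response"])]
    outputs = sentiment_pipe(texts, top_k=None)
    rewards = [torch.tensor(next(o["score"] for o in out if o["label"] == "POSITIVE"))
               for out in outputs]
    # One PPO optimization step on this batch.
    stats = ppo_trainer.step(query_tensors, response_tensors, rewards)
    ppo_trainer.log_stats(stats, batch, rewards)
```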
## Saving the Model

The model and tokenizer are saved after training to the specified directory.
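For reference, the saving step typically looks like the following; the output path here is a placeholder.

```python
# Minimal sketch: persist the fine-tuned policy and tokenizer (placeholder path).
output_dir = "./gpt2-imdb-positive"
model.save_pretrained(output_dir)
tokenizer.save_pretrained(output_dir)
```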
## Hugging Face Model

The fine-tuned model is available on Hugging Face. You can test it and generate responses with custom inputs using the Hugging Face Inference API. The model and tokenizer files can be accessed in the Files section of the model page.
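You can also try the model locally with `transformers`, as sketched below; replace the placeholder repo id with the actual model id from the model page.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "your-username/gpt2-imdb-positive"  # placeholder; use the real model id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

prompt = "This movie was"
inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=40, do_sample=True, top_p=0.95,
                        pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```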
## Reference Papers

This repository includes a collection of reference papers that provide additional context and background on the methods and techniques used. You can find these papers in the `reference_papers` folder.