IMDb Sentiment Classification with DistilBERT

This project fine-tunes a pre-trained DistilBERT model for sentiment classification on the IMDb movie review dataset and evaluates the result on the test set.

Setup

Prerequisites

Python 3.8 or later
An NVIDIA GPU with CUDA installed (optional, but recommended for faster training)
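
Once the packages from the Installation step are in place, a quick check like the one below can confirm whether PyTorch sees the GPU. This is a minimal sketch, not part of the repository; `pick_device` is a hypothetical helper name, and it falls back to the CPU when PyTorch is missing or no CUDA device is visible.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu"."""
    try:
        # Imported lazily so the check also runs before PyTorch is installed.
        import torch
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


# Prints which device training would use under this check.
print(pick_device())
```

If this prints `cpu` on a machine with an NVIDIA GPU, the installed PyTorch build may not match the local CUDA setup.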

Installation

Clone the repository:

```
git clone https://github.com/your-repository/imdb-sentiment-classification.git
cd imdb-sentiment-classification
```

Create and activate a virtual environment:

```
python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`
```

Install the required packages:
```
pip install -r requirements.txt
```

Usage

Training and Evaluation

To train and evaluate the model, run the following command:

```
python tfm_classifier.py
```

This script will:

Load and preprocess the IMDb dataset.
Fine-tune the DistilBERT model.
Evaluate the model on the test set.
Save the fine-tuned model and tokenizer.
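
The four steps above can be sketched with the Hugging Face `Trainer` API. This is an illustrative outline under stated assumptions, not the actual contents of `tfm_classifier.py`: the `distilbert-base-uncased` checkpoint, the output directory, the 256-token limit, and the training hyperparameters are all assumed values, and `accuracy` and `main` are hypothetical names.

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match the gold labels."""
    pairs = list(zip(predictions, labels))
    return sum(int(p == y) for p, y in pairs) / len(pairs)


def main(output_dir="./distilbert-imdb"):  # assumed output path
    # Heavy imports live inside main() so accuracy() stays usable on its own.
    from datasets import load_dataset
    from transformers import (
        AutoModelForSequenceClassification,
        AutoTokenizer,
        Trainer,
        TrainingArguments,
    )

    checkpoint = "distilbert-base-uncased"  # assumed pre-trained checkpoint

    # 1. Load and preprocess the IMDb dataset.
    dataset = load_dataset("imdb")
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)

    def tokenize(batch):
        # Fixed-length padding keeps the default data collator sufficient.
        return tokenizer(
            batch["text"], truncation=True, padding="max_length", max_length=256
        )

    encoded = {
        split: dataset[split].map(tokenize, batched=True)
        for split in ("train", "test")
    }

    # 2. Fine-tune DistilBERT with a two-class classification head.
    model = AutoModelForSequenceClassification.from_pretrained(
        checkpoint, num_labels=2
    )

    def compute_metrics(eval_pred):
        logits, labels = eval_pred
        return {"accuracy": accuracy(logits.argmax(axis=-1).tolist(), labels.tolist())}

    args = TrainingArguments(
        output_dir=output_dir,
        num_train_epochs=1,  # assumed; raise for better results
        per_device_train_batch_size=16,
        per_device_eval_batch_size=32,
    )
    trainer = Trainer(
        model=model,
        args=args,
        train_dataset=encoded["train"],
        eval_dataset=encoded["test"],
        compute_metrics=compute_metrics,
    )
    trainer.train()

    # 3. Evaluate on the test set.
    print(trainer.evaluate())

    # 4. Save the fine-tuned model and tokenizer.
    trainer.save_model(output_dir)
    tokenizer.save_pretrained(output_dir)
```

Calling `main()` downloads the dataset and checkpoint and starts training, so it is left uninvoked here.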

Requirements

torch: PyTorch for model training and inference.
transformers: Hugging Face Transformers library for using the DistilBERT model.
datasets: Hugging Face Datasets library for loading and processing the IMDb dataset.
pandas: Data manipulation library.
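
To confirm that these packages are present in the active environment, a small standard-library check can help. This is a sketch only; `installed_versions` is a hypothetical helper, and the package list simply mirrors the names above rather than the actual contents of `requirements.txt`.

```python
from importlib import metadata

# Package names taken from the Requirements list above.
REQUIRED = ("torch", "transformers", "datasets", "pandas")


def installed_versions(packages=REQUIRED):
    """Map each package name to its installed version, or None if missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


for name, version in installed_versions().items():
    print(f"{name}: {version or 'NOT INSTALLED'}")
```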

Notes:

Replace your-repository in the clone URL with the actual repository owner.
Make sure tfm_classifier.py contains the training and evaluation code before running it.


License

This project is licensed under the MIT License. See the LICENSE file for details.
