This project fine-tunes a pretrained DistilBERT model to check whether a sentence sounds toxic. It trains on a small dataset of online comments and performs binary classification: toxic or not toxic.
```
├── main.py              # Trains the model
├── predict.py           # Runs the trained model
├── download_dataset.py  # Downloads the dataset
└── model/               # Saved model after training (ignored by git)
```
Dataset: Jigsaw Toxic Comment Classification Challenge from Kaggle. Only `train.csv` is needed; `download_dataset.py` downloads it into the project folder.
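The Jigsaw training file labels each comment with six toxicity columns (`toxic`, `severe_toxic`, `obscene`, `threat`, `insult`, `identity_hate`), so a binary target has to be derived from them. A minimal sketch of that step (the column names match the Kaggle dataset; the collapsing rule "toxic if any label is set" is an assumption about how `main.py` prepares its labels):

```python
import pandas as pd

# Tiny stand-in for train.csv from the Jigsaw challenge (same column names).
df = pd.DataFrame({
    "comment_text": ["you are great", "you are an idiot"],
    "toxic": [0, 1], "severe_toxic": [0, 0], "obscene": [0, 1],
    "threat": [0, 0], "insult": [0, 1], "identity_hate": [0, 0],
})

label_cols = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]
# A comment counts as toxic if any of the six labels is set.
df["label"] = (df[label_cols].max(axis=1) > 0).astype(int)
print(df["label"].tolist())  # → [0, 1]
```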
- Install the libraries:
  ```
  pip install transformers datasets scikit-learn pandas accelerate kagglehub
  ```
- Download the dataset:
  ```
  python download_dataset.py
  ```
- Train the model:
  ```
  python main.py
  ```
- Check the tone of a sentence:
  ```
  python predict.py
  ```
Then type a sentence in the console and the model will report whether it is toxic.
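Under the hood, `predict.py` presumably tokenizes the input, runs the classifier, and maps the larger logit to a label. That final logits-to-label step can be sketched without loading the model (the label names, index order, and example logits below are illustrative assumptions, not taken from the trained model):

```python
import math

LABELS = ["not toxic", "toxic"]  # assumed index order: 0 = not toxic, 1 = toxic

def logits_to_label(logits):
    # Softmax gives a readable confidence score; argmax picks the label.
    shifted = [x - max(logits) for x in logits]  # subtract max for stability
    exps = [math.exp(x) for x in shifted]
    probs = [e / sum(exps) for e in exps]
    idx = probs.index(max(probs))
    return LABELS[idx], probs[idx]

# Example logits, as a two-class model might produce for a rude sentence.
label, confidence = logits_to_label([-1.2, 2.3])
print(label)  # → toxic
```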