Toxic-Comment-Classification-Challenge

Kaggle Toxic Comment Classification Challenge: https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge

A single LSTM + GRU model with 10-fold CV yields a ROC-AUC score of 0.9871, against the highest Public LB score of 0.9890. The current solution ranks 300th on the Public LB.
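
As a rough illustration of the evaluation setup, the sketch below computes the competition metric (mean column-wise ROC-AUC over the label columns) in plain Python and generates train/validation index splits for 10-fold CV. The function names `roc_auc`, `mean_columnwise_auc`, and `kfold_indices` are illustrative assumptions and are not taken from the notebooks, which may rely on library helpers instead.

```python
import random


def roc_auc(y_true, y_score):
    """ROC-AUC via the rank-sum (Mann-Whitney U) formulation; ties get average ranks."""
    pairs = sorted(zip(y_score, y_true))
    n = len(pairs)
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of equal scores starting at position i.
        j = i
        while j + 1 < n and pairs[j + 1][0] == pairs[i][0]:
            j += 1
        avg_rank = (i + j) / 2.0 + 1.0  # average 1-based rank of the tied run
        for k in range(i, j + 1):
            ranks[k] = avg_rank
        i = j + 1
    n_pos = sum(1 for _, t in pairs if t == 1)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("ROC-AUC is undefined when only one class is present")
    pos_rank_sum = sum(r for r, (_, t) in zip(ranks, pairs) if t == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2.0) / (n_pos * n_neg)


def mean_columnwise_auc(y_true_rows, y_score_rows):
    """Average the per-label ROC-AUC over all label columns."""
    n_labels = len(y_true_rows[0])
    aucs = [
        roc_auc([row[c] for row in y_true_rows], [row[c] for row in y_score_rows])
        for c in range(n_labels)
    ]
    return sum(aucs) / n_labels


def kfold_indices(n_samples, n_folds=10, seed=0):
    """Yield (train_indices, valid_indices) pairs for shuffled k-fold CV."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    folds = [idx[f::n_folds] for f in range(n_folds)]
    for f in range(n_folds):
        valid = folds[f]
        train = [i for g in range(n_folds) if g != f for i in folds[g]]
        yield train, valid
```

In a setup like this, each fold's model would be trained on `train`, scored on `valid`, and the per-fold test predictions averaged; whether the notebook averages folds in exactly this way is an assumption.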

Additional Details:

Embedding Vectors - fastText & GloVe Twitter (200d)
Implementation Libraries - PyTorch (Model) & Keras (Text Pre-processing)
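
One common way to combine two sets of pretrained vectors is to concatenate them per word into a single embedding matrix. The sketch below shows that idea in plain Python with toy data. It is an assumption that the notebook combines fastText and GloVe this way; the helper name `build_embedding_matrix` and the toy vectors are hypothetical, and the 1-based `word_index` mirrors what a Keras-style tokenizer produces.

```python
def build_embedding_matrix(word_index, vector_tables, dims):
    """Return a list of rows; row i holds the concatenated vectors for word id i.

    Row 0 is reserved for padding, matching the 1-based ids that Keras-style
    tokenizers assign. A word missing from a table gets zeros for that slice.
    """
    total_dim = sum(dims)
    matrix = [[0.0] * total_dim for _ in range(len(word_index) + 1)]
    for word, idx in word_index.items():
        row = []
        for table, dim in zip(vector_tables, dims):
            vec = table.get(word)
            row.extend(vec if vec is not None else [0.0] * dim)
        matrix[idx] = row
    return matrix


# Toy data standing in for the real pretrained files (hypothetical values).
word_index = {"hello": 1, "world": 2, "unseen": 3}
fasttext_toy = {"hello": [0.1, 0.2, 0.3], "world": [0.4, 0.5, 0.6]}
glove_toy = {"hello": [1.0, 2.0], "unseen": [3.0, 4.0]}

embedding_matrix = build_embedding_matrix(
    word_index, [fasttext_toy, glove_toy], dims=[3, 2]
)
```

On the PyTorch side, a matrix like this can be loaded into a model with `torch.nn.Embedding.from_pretrained(torch.tensor(embedding_matrix))`.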

Potential Areas of Improvement:

Modifying model architecture with focus on better regularization
Ensembling (though ensembling with NB-SVM baseline did not help improve the score)
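
For the ensembling idea above, a minimal sketch of a weighted blend of per-label probabilities from several models might look like the following. The function name `blend`, the weights, and the toy predictions are assumptions for illustration only; the README does not state how the NB-SVM blend was actually computed.

```python
def blend(predictions, weights=None):
    """Weighted average of model predictions.

    predictions: list with one entry per model; each entry is a list of rows,
                 and each row is a list of per-label probabilities.
    weights:     optional list of non-negative weights, one per model
                 (defaults to equal weights). Weights are normalised to sum to 1.
    """
    n_models = len(predictions)
    if weights is None:
        weights = [1.0] * n_models
    if len(weights) != n_models:
        raise ValueError("need exactly one weight per model")
    total = float(sum(weights))
    norm = [w / total for w in weights]

    n_rows = len(predictions[0])
    n_labels = len(predictions[0][0])
    blended = []
    for r in range(n_rows):
        row = []
        for c in range(n_labels):
            row.append(sum(norm[m] * predictions[m][r][c] for m in range(n_models)))
        blended.append(row)
    return blended


# Toy predictions for one comment and two labels (hypothetical values).
rnn_preds = [[0.2, 0.8]]
nbsvm_preds = [[0.4, 0.6]]

equal_blend = blend([rnn_preds, nbsvm_preds])
rnn_heavy_blend = blend([rnn_preds, nbsvm_preds], weights=[3, 1])
```

Blend weights would normally be chosen on out-of-fold predictions rather than on the leaderboard, to avoid overfitting to the public split.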

Note - BERT was not used as a baseline because it had not been released at the time of the competition

Repository Files:

old
Fasttext_Word Correction Map.pickle
README.md
Toxic_Comment_Classification_(LSTM+GRU).ipynb
Toxic_Comment_Classification_(NB-SVM).ipynb

Repository: SaumilShah-7/Toxic-Comment-Classification-Challenge-Kaggle
