Kaggle-Twitter-Sentiment-Extraction - https://www.kaggle.com/c/tweet-sentiment-extraction/overview
- Top 2% - 30th Place
- Ensemble of 10 x Electra-Large, 10 x RoBERTa, and XGBoost
- Pre- and post-processing of unknown tokens and grouped punctuation tokens for RoBERTa (see the first sketch after this list)
- Error analysis on predictions (backfired a little)
- XGBoost used to decide when to fall back to the original text for neutral tweets
- Weighted confidence across all models' predictions used to decide the final prediction (see the ensembling sketch after this list)
- Ensembling mostly done by (https://www.kaggle.com/css919)
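The pre/post-processing is needed because RoBERTa's byte-level BPE merges runs of punctuation (e.g. `!!!` or `...`) into single tokens, which makes it harder to recover exact character-level spans. Below is a minimal, hypothetical sketch of the idea: space out the runs before tokenizing and undo the change in the predicted span. The regex and helper names are illustrative and not the exact code from the solution; handling of unknown/rare unicode tokens (e.g. replacing them with a known placeholder and mapping back) is omitted.

```python
import re

# Hypothetical helpers, not the exact competition pipeline.
PUNCT_RUN = re.compile(r"([!?.])\1{1,}")  # runs like "!!!" or "..."

def split_punct_runs(text: str) -> str:
    """Space out repeated punctuation so RoBERTa's BPE sees one token per mark."""
    return PUNCT_RUN.sub(lambda m: " ".join(m.group(0)), text)

def restore_punct_runs(span: str) -> str:
    """Undo the spacing inside the predicted span before scoring."""
    return re.sub(r"([!?.]) (?=[!?.])", r"\1", span)

text = "so happy!!! best day ever..."
processed = split_punct_runs(text)     # "so happy! ! ! best day ever. . ."
# ... tokenize `processed`, predict a span over it, then:
predicted = "happy! ! !"
print(restore_punct_runs(predicted))   # "happy!!!"
```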
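A rough sketch of the final prediction step, assuming each model exposes softmax probabilities over start and end token positions: the distributions are combined with per-model weights, and for neutral tweets an XGBoost classifier decides whether to keep the predicted span or return the whole original text (neutral spans are often the full tweet). `ensemble_span`, `final_answer`, `xgb_model`, and the feature choices are placeholders, not the actual ensembling code; training of the classifier is omitted.

```python
import numpy as np

def ensemble_span(start_probs_list, end_probs_list, weights):
    """Weighted average of per-model start/end distributions, then argmax."""
    start = np.average(np.stack(start_probs_list), axis=0, weights=weights)
    end = np.average(np.stack(end_probs_list), axis=0, weights=weights)
    s = int(start.argmax())
    e = int(end[s:].argmax()) + s          # constrain end >= start
    confidence = float(start[s] * end[e])  # also usable as an XGBoost feature
    return s, e, confidence

def final_answer(text, sentiment, span_text, confidence, xgb_model, features):
    """Fall back to the original text for neutral tweets when the classifier says so."""
    if sentiment == "neutral" and xgb_model.predict(features.reshape(1, -1))[0] == 1:
        return text
    return span_text
```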
- RoBERTa
- Pretrained on SQuAD 2.0
- Multi-Sample Dropout (https://arxiv.org/pdf/1905.09788.pdf)
- Average pooling of the last 4 hidden layers (see the head sketch after this list)
- 5x trained on all data, with tokens that are impossible to predict removed (done by https://www.kaggle.com/css919)
- 5x trained on neutral tweets only
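A minimal PyTorch sketch of a span-extraction head combining the two ideas above: average pooling over the last four hidden layers and multi-sample dropout. The checkpoint name, dropout rate, and number of dropout samples are illustrative; the actual backbone started from a SQuAD2-pretrained RoBERTa.

```python
import torch
import torch.nn as nn
from transformers import RobertaModel

class SpanHead(nn.Module):
    """Span-extraction head: last-4-layer average pooling + multi-sample dropout."""

    def __init__(self, model_name="roberta-base", n_dropout_samples=5, p=0.1):
        super().__init__()
        self.roberta = RobertaModel.from_pretrained(model_name, output_hidden_states=True)
        self.dropouts = nn.ModuleList([nn.Dropout(p) for _ in range(n_dropout_samples)])
        self.qa_outputs = nn.Linear(self.roberta.config.hidden_size, 2)

    def forward(self, input_ids, attention_mask):
        out = self.roberta(input_ids=input_ids, attention_mask=attention_mask)
        # Average the last 4 hidden layers token-wise.
        hidden = torch.stack(out.hidden_states[-4:]).mean(dim=0)
        # Multi-sample dropout: several dropout masks over the same features,
        # logits (equivalently, losses) averaged across the samples.
        logits = torch.stack([self.qa_outputs(d(hidden)) for d in self.dropouts]).mean(dim=0)
        start_logits, end_logits = logits.split(1, dim=-1)
        return start_logits.squeeze(-1), end_logits.squeeze(-1)
```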
- Electra-Large
- Fine-tuned using TF scripts on Colab with TPU and Google Cloud
- Mostly done by my teammates (https://www.kaggle.com/ajinomoto132 and https://www.kaggle.com/tretrausaigon)
- SWA (Stochastic Weight Averaging; see the sketch at the end of this list)
- ALBERT, ALBERT-LARGE
- Label smoothing (probably not implemented correctly; see the sketch at the end of this list)
- Pretrain with More Tweets
- Exploit the original dataset
- Layer-wise LR decay (probably not implemented correctly; see the sketch at the end of this list)
- Reproducing a customizable Electra in PyTorch
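For SWA, a self-contained sketch using PyTorch's built-in utilities on a toy model; the real training loop fine-tuned the transformer instead, and the epoch counts and learning rates below are placeholders.

```python
import torch
import torch.nn as nn
from torch.optim.swa_utils import AveragedModel, SWALR

model = nn.Linear(10, 2)  # stand-in for the fine-tuned transformer
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)
swa_model = AveragedModel(model)
swa_scheduler = SWALR(optimizer, swa_lr=2e-5)
swa_start, num_epochs = 2, 5

x, y = torch.randn(64, 10), torch.randint(0, 2, (64,))
for epoch in range(num_epochs):
    loss = nn.functional.cross_entropy(model(x), y)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    if epoch >= swa_start:
        swa_model.update_parameters(model)  # running average of the weights
        swa_scheduler.step()

# Inference then uses `swa_model` (the averaged weights).
```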
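For label smoothing on span extraction, one common formulation smooths the one-hot start/end target toward a uniform distribution over token positions; the sketch below shows that formulation, which may or may not match what was attempted. The smoothing factor is illustrative.

```python
import torch
import torch.nn.functional as F

def smoothed_span_loss(logits, target_idx, smoothing=0.1):
    """Cross-entropy over token positions with the one-hot target smoothed
    toward uniform; smoothing=0 recovers plain cross-entropy."""
    n = logits.size(-1)
    log_probs = F.log_softmax(logits, dim=-1)
    target = torch.full_like(log_probs, smoothing / (n - 1))
    target.scatter_(-1, target_idx.unsqueeze(-1), 1.0 - smoothing)
    return -(target * log_probs).sum(dim=-1).mean()

# Usage on the start logits of a span model (shapes: [batch, seq_len]).
start_logits = torch.randn(4, 96)
start_positions = torch.randint(0, 96, (4,))
loss = smoothed_span_loss(start_logits, start_positions)
```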
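Layer-wise LR decay is usually set up by giving each encoder layer its own optimizer parameter group, with the learning rate shrinking multiplicatively from the top layer down toward the embeddings. A sketch of that construction on a `transformers` RoBERTa model; the base LR and decay factor are placeholders, and the parameter groups for the task head and pooler are omitted.

```python
import torch
from transformers import RobertaModel

def layerwise_lr_groups(model, base_lr=3e-5, decay=0.95):
    """One parameter group per encoder layer, LR decaying toward the embeddings."""
    n_layers = model.config.num_hidden_layers
    groups = [{"params": model.embeddings.parameters(),
               "lr": base_lr * decay ** n_layers}]
    for i, layer in enumerate(model.encoder.layer):
        groups.append({"params": layer.parameters(),
                       "lr": base_lr * decay ** (n_layers - 1 - i)})
    return groups

model = RobertaModel.from_pretrained("roberta-base")
optimizer = torch.optim.AdamW(layerwise_lr_groups(model), lr=3e-5)
```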