hope speech detection

hope is the beacon that illuminates our darkest moments

Even in the toughest of times, humans manage to find hope. No better way to illustrate this than with a montage of some of the most poignant moments from the last few years, when people from across the world came together and helped each other amidst the coronavirus pandemic.

Social media has a huge impact on society. Everyone is free to state their opinions and they could be both positive and negative. Constructive criticism is always good but the way things are turning in today’s world, social media has laid its roots deep into our society. It has come up with its own pros and cons. Thus, in order to minimize the cons, we have tried to come up with an NLP based model for hope speech detection (Aggarwal et al., 2022). Removing those harsh comments will definitely help in uplifting the society and strengthening our beliefs.

The model is comprised of three components: the pre-processing unit, the feature extraction and data balancing unit, and the classification unit, as illustrated in the figure.

In the pre-processing unit, the dataset undergoes cleanup, involving the removal of unwanted text portions, expansion of abbreviations, and lemmatization for converting words to their root forms. The resulting output is then directed to the feature extraction unit, where the TF-IDF vectorization technique is employed to discern the significance of specific words or phrases within the dataset. Consequently, sentences in the dataset are transformed into vectors with distinct values. These vectors are subsequently utilized to balance the dataset through methods such as SMOTE and ADASYN, involving the addition of synthetic samples.

The outcomes of the above processes are fed into the final unit, which includes various traditional machine learning models like Naïve Bayes, Logistic Regression, and SVM. These models were implemented using the scikit-learn library in Python. We conducted experiments with several traditional models and also incorporated the BERT architecture into our analysis.

references

2022

  1. HSD.png
    Hope Speech Detection on Social Media Platforms
    Pranjal Aggarwal, Pasupuleti Chandana, Jagrut Nemade, and 3 more authors
    In Cybercrime in Social Media, 2022