Comparing the Effectiveness of Support Vector Classifier and Stochastic Gradient Descent in Hate-Speech Detection
DOI:
https://doi.org/ 10.47611/harp.315Keywords:
Support Vector Classifier, Stochastic Gradient Descent, Hate-Speech DetectionAbstract
The increased use of Social Media with easy access to most people in the world has given rise to a multitude of problems; with cyberbullying and online hate-speech standing out as significant issues. With the choice of a user to maintain there anonymity and post most things that would be considered uncivil in a one-to-one real life conversation, has led to a widespread dissemination of online hate-speech, posing significant societal challenges and determinantal effects to an individual’s mental health. In this paper, we explored two simple Classifiers, Support Vector Classifier (SVC) and Stochastic Gradient Descent (SGD) which are compared and analysed through there accuracy score to determine there effectiveness in detecting hate-speech within the context of Twitter data. To train the models, a publicly available dataset by Analytics Vidhya which can be found on Kaggle.com is used which contains 32k tweets labelled with a ‘1’ if it is sexist/racist or ‘0’ if it’s not. The goal of this paper is identifying the differences in performances in hate-speech detection by the two classifiers.
Downloads
Posted
License
Copyright (c) 2024 Dania Ali
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.