Classification Of Twitter’s Data To Get Gender Identification

Authors

  • Waqas Ali, Department of Computer Science, University of Management and Technology, Lahore, Pakistan
  • Malik Tahir Hassan, Department of Computer Science, University of Management and Technology, Lahore, Pakistan
  • Syed Fawad Raza, Department of Computer Science, University of Management and Technology, Lahore, Pakistan
  • Usman Fiaz, Department of Computer Science, University of Management and Technology, Lahore, Pakistan

DOI:

https://doi.org/10.21015/vtcs.v16i1.545

Abstract

This paper reports the accuracy of several algorithms for classifying text by the gender of its author. We examine the knowledge that can be extracted from a corpus of Twitter posts with respect to gender identity. By comparing algorithms on different feature sets, we establish a feature set of 20 distinct attributes that increases the accuracy of gender identification across Twitter. We report the accuracies of three algorithms, obtained using two approaches applied to two gender classes (male and female), together with a model in which a large number of features are reduced using a powerset transformation.

Published

2019-01-16

How to Cite

Ali, W., Hassan, M. T., Raza, S. F., & Fiaz, U. (2019). Classification Of Twitter’s Data To Get Gender Identification. VAWKUM Transactions on Computer Sciences, 7(1), 26–30. https://doi.org/10.21015/vtcs.v16i1.545