Automatic Demographic Classification of Indonesian Twitter Users

by Hashfi Rasis,Alva Erwin,Julius Polar,Maulahikmah Galinium
( Department of Information Technology Faculty of Engineering and Information Technology Swiss German University )

Date Published: 02 Dec 2013
Published In: Information Systems International Conference (ISICO)
Volume: 2013
Publisher: Departemen Sistem Informasi, Institut Teknologi Sepuluh Nopember
Language: id-ID

Keywords: classification,Twitter,Demographics,Indonesia,Users

Abstract

Demographic Classification is a method to classify people by its demographic. Twitter has become one of largest social media which there are millions of tweets posted every day. Indonesia also becomes one of major country who uses Twitter. In Twitter there is no way to know how to classify each user into its demographic attributes. Because in Twitter profile there’s no demographic attributes like gender or age. This research will focus on how to classify Indonesian Twitter users based on their gender, age and occupation. This research will use Naïve Bayes and K-Nearest Neighbor as its classifier algorithm. According to the result, Naïve Bayes performs well only in gender classification while K-Nearest Neighbor does not perform well in any demographic classification. Testing set successfully classified gender but failed with age and occupation


© 2024 Open Access Journal of Information Systems (OAJIS) | created by : radityo p.w (http://about.me/radityopw) and rully a.h (eraha99 [at] gmail.com)