Show simple item record

Text Mining of Twitter Data: Topic Modelling

dc.contributor.authorNjoku, Uchechukwu Fortune
dc.date.accessioned2019-08-08T12:07:25Z
dc.date.available2019-08-08T12:07:25Z
dc.date.issued2019-06-16
dc.identifier.urihttp://repository.aust.edu.ng/xmlui/handle/123456789/4905
dc.description.abstractAccess to the Internet is becoming more affordable especially in Africa and with this the number of active social media users is also on the rise. Twitter is a social media platform on which users post and interact with messages known as "tweets". These tweets are usually short with a limit of 280 characters. With over 100 million Internet users and 6 million active monthly users in Nigeria, lots of data is generated through this medium daily. This thesis aims to gain insights from the ever-growing Nigerian data generated from twitter using Topic modelling. We use Latent Dirichlet Allocation (LDA) on Nigerian heath tweets from verified accounts covering time period of 2015 – 2019 to derive top health topics in Nigeria. We detected the outbreaks of Ebola, Lassa fever and meningitis within this time frame. We also detected reoccurring topics of child immunization/vaccination. Twitter data contains useful information that can give insights to individuals, organizations and the government hence it should be further explored and utilized.en_US
dc.description.sponsorshipAUST and AfDB.en_US
dc.language.isoenen_US
dc.subject2019 Computer Science and Engineering Thesesen_US
dc.subjectNjoku, Uchechukwu Fortuneen_US
dc.subjectDr. Rajesh Prasaden_US
dc.subjectTwitteren_US
dc.subjectText miningen_US
dc.subjectHealthen_US
dc.subjectTopic modellingen_US
dc.subjectLatent Dirichlet Allocationen_US
dc.titleText Mining of Twitter Data: Topic Modellingen_US
dc.typeThesisen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

  • Computer Science105

    This collection contains Computer Science Student's Theses from 2009-2022

Show simple item record