UQ ITEE - INFS7203 Project (Sem2, 2016)
-
Collecting data of users and the Tweets of some specific users
-
TF-IDF and Agglomerative Clustering to find the clusters of Tweets
-
LDA to analysis the topics
- sklearn, gensim, numpy, nltk, pymongo, mongoengine, Tweepy
-
Create your app on Twitter Developer's website and copy your tokens to twitter_client.py
-
(Optional) If you want to collect the information of a list of users, and you have created a Twitter list with users, change the list name and run text/get_members.py. Remember to set up you local MongoDB because it is needed in this step. The MongoDB schema is in mongodb.py.
-
Run text/get_timeline.py. You can get the timeline of multiple users if you have already finished step 2. If you want to collect the timeline of a specific user, just modify the user's screen name.
-
Run text/text_mining.py. If you want to collect the timeline of a specific user, just modify the user's screen name.