Project author: williamo1099

Project description:
Analyze Twitter data using Hadoop MapReduce
Language: Java
Repository: git://github.com/williamo1099/Twitter-Data-Analysis.git
Created: 2020-12-16T05:40:38Z
Project page: https://github.com/williamo1099/Twitter-Data-Analysis



Twitter Data Analysis

This project analyzes a Twitter dataset of Tweet objects using Hadoop MapReduce. A Tweet object is a JSON-encoded record that represents a single Tweet from a user and carries a series of attributes (see more details here). The MapReduce jobs in this repository perform the following analyses.

  • Average tweet length per hour, for each month.
  • Average tweet length per day, for each month.
  • Top 20 most active Twitter users.
  • Top 10 most retweeted tweets.
  • Tweet partitioning.
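
To illustrate how an analysis such as the average tweet length per hour might be split into a map step and a reduce step, here is a minimal sketch in plain Java. It is not the repository's code and does not use the Hadoop API; in a real job the `map` logic would sit in a `Mapper` subclass and the `reduce` logic in a `Reducer` subclass. The class name `AverageTweetLengthSketch`, the key layout `yyyy-MM HHh`, and the assumption that each record has already been reduced to its `created_at` and `text` attributes (in the Twitter API v1.1 timestamp format) are all hypothetical choices made for the example.

```java
import java.time.OffsetDateTime;
import java.time.format.DateTimeFormatter;
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

public class AverageTweetLengthSketch {

    // Assumed timestamp format, e.g. "Wed Oct 10 20:19:24 +0000 2018".
    private static final DateTimeFormatter CREATED_AT =
            DateTimeFormatter.ofPattern("EEE MMM dd HH:mm:ss Z uuuu", Locale.ENGLISH);

    // Map step: emit (month-and-hour key, text length) for one tweet.
    // Length is counted in UTF-16 code units, which is a simplification.
    static Map.Entry<String, Integer> map(String createdAt, String text) {
        OffsetDateTime t = OffsetDateTime.parse(createdAt, CREATED_AT);
        String key = String.format("%04d-%02d %02dh",
                t.getYear(), t.getMonthValue(), t.getHour());
        return new AbstractMap.SimpleEntry<>(key, text.length());
    }

    // Reduce step: average all lengths that share one key.
    static double reduce(List<Integer> lengths) {
        long sum = 0;
        for (int n : lengths) {
            sum += n;
        }
        return (double) sum / lengths.size();
    }

    // Stand-in for the framework's shuffle phase: group mapped values by key,
    // then reduce each group. Each input row is {created_at, text}.
    static Map<String, Double> run(String[][] tweets) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (String[] tweet : tweets) {
            Map.Entry<String, Integer> kv = map(tweet[0], tweet[1]);
            grouped.computeIfAbsent(kv.getKey(), k -> new ArrayList<>())
                   .add(kv.getValue());
        }
        Map<String, Double> averages = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> e : grouped.entrySet()) {
            averages.put(e.getKey(), reduce(e.getValue()));
        }
        return averages;
    }
}
```

Calling `AverageTweetLengthSketch.run` with rows of `{created_at, text}` returns one average per month-and-hour key. Changing the key built in `map` (for example to the day of the month, or to a user identifier with a count as the value) would adapt the same pattern to the other analyses in the list.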

License

Distributed under the MIT License. See LICENSE for more information.