Data mining of Sina-Weibo
Sina-Weibo is a Twitter-like microblogging service in China operated by Sina, one of the largest Chinese Internet content providers. It produces a large volume of microblogs every day. These data are valuable for research in anthropology, social networks, urban planning and other fields.
Our purpose is to investigate whether the crowd’s attention to a site or a region has a hierarchical structure similar to the ego networks proposed by the anthropologist Robin Dunbar. We will use Sina-Weibo data and consider as many data types as possible in our data mining, because we believe that not only the geographical information of a microblog but also its text, its update time and other data types may have a critical connection with the crowd’s site preferences. The results can be used to study human behavior and can be applied to recommender systems, for example a friend recommender that suggests users who share the same interests.
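
As an illustration only, the data types mentioned above (geographical information, text and update time) could be held in one record per microblog, and a user's posts could then be counted per site to see where that user's attention concentrates. The sketch below is a minimal example under stated assumptions: the `Microblog` field names, the `site_of` grid cell used as a stand-in for a "site", and the `site_ranking` helper are all hypothetical and are not part of any Sina-Weibo API or of the method this project will finally adopt.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Microblog:
    """One geotagged microblog. All field names are hypothetical."""
    user_id: str
    text: str
    created_at: datetime  # update time of the microblog
    lat: float            # latitude in degrees
    lon: float            # longitude in degrees


def site_of(post, cell_deg=0.01):
    """Map a post to a grid cell that stands in for a 'site'.

    Using a fixed grid of cell_deg degrees is an assumption made for
    illustration; a real study might use points of interest instead.
    """
    return (int(post.lat // cell_deg), int(post.lon // cell_deg))


def site_ranking(posts, user_id):
    """Return (site, post_count) pairs for one user, most visited first."""
    counts = Counter(site_of(p) for p in posts if p.user_id == user_id)
    return counts.most_common()
```

A ranking like this gives, for each user, a list of sites ordered by how often the user posts there. Whether the counts fall into distinct layers, as in an ego network, is the question the project sets out to examine; the sketch does not answer it.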