Abstract
Spamming has been a widespread problem for social networks. In recent years there is an increasing interest in the analysis of anti-spamming for microblogs, such as Twitter. In this paper we present a systematic research on the analysis of spamming in Sina Weibo platform, which is currently a dominant microblogging service provider in China. Our research objectives are to understand the specific spamming behaviors in Sina Weibo and find approaches to identify and block spammers in Sina Weibo based on spamming behavior classifiers. To start with the analysis of spamming behaviors we devise several effective methods to collect a large set of spammer samples, including uses of proactive honeypots and crawlers, keywords based searching and buying spammer samples directly from online merchants. We processed the database associated with these spammer samples and interestingly we found three representative spamming behaviors: Aggressive advertising, repeated duplicate reposting and aggressive following. We extract various features and compare the behaviors of spammers and legitimate users with regard to these features. It is found that spamming behaviors and normal behaviors have distinct characteristics. Based on these findings we design an automatic online spammer identification system. Through tests with real data it is demonstrated that the system can effectively detect the spamming behaviors and identify spammers in Sina Weibo.
Original language | English |
---|---|
Title of host publication | Proceedings of the 7th Workshop on Social Network Mining and Analysis, SNA-KDD 2013 |
Place of Publication | New York |
Publisher | Association for Computing Machinery (ACM) |
ISBN (Print) | 9781450323307 |
DOIs | |
Publication status | Published - 2013 |
Event | 7th Workshop on Social Network Mining and Analysis, SNA-KDD 2013 - Chicago, IL, United States Duration: 11 Aug 2013 → 14 Aug 2013 |
Conference
Conference | 7th Workshop on Social Network Mining and Analysis, SNA-KDD 2013 |
---|---|
Country/Territory | United States |
City | Chicago, IL |
Period | 11/08/13 → 14/08/13 |
Keywords
- Automatic spammer identification
- Crawlers
- Proactive honeypots
- Sina Weibo
- Spamming behaviors
ASJC Scopus subject areas
- Computer Networks and Communications