Abstract
Various socio-political organizations, from activist groups to propaganda campaigners, create accounts on Twitter to reach out, influence and gain followers. In order to analyze the impact of these organizational accounts, the first step is to identify them. In this paper, we develop and experiment with a set of network-based, behavioral, temporal and spatial characteristics in these accounts, independent of domain or language, to identify features that can be useful in detecting organizational accounts. In order to assess this model, we experimented with a microblog corpus comprised of over 7 million tweets from 150,000 Twitter users in Bangladesh, tweeted between June and October 2016. We sampled 31,139 accounts using cold-start heuristics to locate and label nearly 200 organizational accounts, distributed as 68 NGOs, 62 news outlets, 35 political groups, and 17 public intellectual and iconic figures. The remaining accounts were labeled as individuals. Next, we developed a set of features and experimented with a set of linear and non-linear classifiers. The highest performing sparse logistic regression classifier achieved an accuracy of 68.2% precision and 64.4% recall leading to a 66.2% F1-score in detecting less than 1% rare organizational accounts using a set of content- and language-independent features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Twitter Usage Statistics Report, Internet Live Stats, http://www.internetlivestats.com/twitter-statistics/.
- 2.
Since public intellectuals, leaders and celebrities share similar spatial, behavioral and connectivity related characteristics with group and organizational accounts, we labeled them together as “organizational” - as opposed to “individual” accounts.
- 3.
Internet World Stats. Bangladesh Internet Usage and Telecommunications Reports. http://www.internetworldstats.com/asia/bd.htm.
References
Baringhaus, L., Franz, C.: On a new multivariate two-sample test. J. Multivar. Anal. 88, 190–206 (2004)
Bonacich, P.: Factoring and weighting approaches to status scores and clique identification. J. Math. Sociol. 2(1), 113–120 (1972)
Can, E.F., Oktay, H., Manmatha, R.: Predicting retweet count using visual cues. In: Proceedings of the 22nd ACM International Conference on Conference on Information & Knowledge Management. ACM (2013)
Celli, F., Rossi, L.: The role of emotional stability in twitter conversations. In: Proceedings of WSASM. Association for Computational Linguistics (2012)
De Silva, L., Riloff, E.: User type classification of tweets with implications for event recognition. In: ACL 2014, p. 98 (2014)
McCorriston, J., et al.: Organizations are users too: characterizing and detecting the presence of organizations on twitter. In: ICWSM (2015)
Wagner, C., et al.: The wisdom in tweetonomies: acquiring latent conceptual structures from social awareness streams. In: Proceedings of the 3rd ISSW. ACM (2010)
Conover, M.D., et al.: Predicting the political alignment of twitter users. In: Privacy, Security, Risk and Trust (PASSAT), pp. 192–199. IEEE (2011)
Rao, D., et al.: Classifying latent user attributes in Twitter. In: Proceedings of the 2nd International Workshop on Search and Mining User-generated Contents. ACM (2010)
Burger, J., et al.: Discriminating gender on Twitter. In: EMNLP 2011 (2011)
Kim, S.M., et al.: Distinguishing individuals from organisations on twitter. In: WWW 2017 (2017)
Varol, O., et al.: Online human-bot interactions: Detection, estimation, and characterization (2017) arXiv preprint arXiv:1703.03107
Page, L., et al.: The pagerank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)
Wu, S., et al.: Who says what to whom on twitter. In: Proceedings of the 20th International Conference on World Wide Web. ACM (2011)
Yasugi, N., et al.: Use of twitter as an instrument for disseminating public information in providing public goods and roles of e-government: evidence from Japanese prefectures. Int. J. Eng. Innovative Technol. 3, 128–133 (2013)
Katz, E.: The two-step flow of communication: an up-to-date report on an hypothesis. Public Opin. Q. 21, 61–78 (1957)
Klemm, K., Eguiluz, V.M.: Growing scale-free networks with small-world behavior. Phys. Rev. E 65, 057102 (2002)
Mahmud, J., Nichols, J., Drews, C.: Home location identification of twitter users. ACM Trans. Intell. Syst. Technol. 5, 47 (2014)
Nacos, B.: Politics and the twitter revolution: how tweets influence the relationship between political leaders and the public. Polit. Sci. Q. 128, 178–179 (2013)
Nguyen, D.-P., Gravel, R., Trieschnigg, D., Meder, T.: “How Old Do You Think I Am?” a study of language and age in twitter (2013)
Sabidussi, G.: The centrality index of a graph. Psychometrika 31, 581–606 (1966)
Savage, D., Zhang, X., Yu, X., Chou, P., Wang, Q.: Anomaly detection in online social networks. Soc. Netw. 39, 62–70 (2014)
Seidman, S.: Network structure and minimum degree. Soc. Netw. 5, 269–287 (1983)
Tomek, I.: Two modifications of CNN. IEEE Trans. Syst. Man Cybern. 6, 769–772 (1976)
Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998)
Acknowledgements
This work was supported by the Office of Naval Research under Grant No.: N00014-16-1-2015, and N00014-15-1-2722.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Alzahrani, S., Gore, C., Salehi, A., Davulcu, H. (2018). Finding Organizational Accounts Based on Structural and Behavioral Factors on Twitter. In: Thomson, R., Dancy, C., Hyder, A., Bisgin, H. (eds) Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2018. Lecture Notes in Computer Science(), vol 10899. Springer, Cham. https://doi.org/10.1007/978-3-319-93372-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-93372-6_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93371-9
Online ISBN: 978-3-319-93372-6
eBook Packages: Computer ScienceComputer Science (R0)