Skip to main content

Deep Dive into Anonymity: Large Scale Analysis of Quora Questions

  • Conference paper
  • First Online:
Social Informatics (SocInfo 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11864))

Included in the following conference series:

Abstract

Anonymity forms an integral and important part of our digital life. It enables us to express our true selves without the fear of judgment. In this paper, we investigate the different aspects of anonymity in the social Q&A site Quora. Quora allows users to explicitly post anonymous questions and such activity in this forum has become normative rather than a taboo. Through an analysis of millions of questions, we observe that at a global scale almost no difference manifests between the linguistic structure of the anonymous and the non-anonymous questions posted on Quora. We find that topical mixing at the global scale to be the primary reason for the absence. However, the differences start to feature once we “deep dive” and (topically) cluster the questions and compare them. In particular, we observe that the choice to post the question as anonymous is dependent on the user’s perception of anonymity and they often choose to speak about depression, anxiety, social ties and personal issues under the guise of anonymity. Subsequently, to gain further insights, we build an anonymity grid to identify the differences in the perception on anonymity of the user posting the question and the community of users answering it. We also look into the first response time of the questions and observe that it is lowest for topics which talk about personal and sensitive issues, which hints toward a higher degree of community support.

B. Mathew and R. Dutt—Contributed equally to this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
eBook
USD 39.99
Price excludes VAT (USA)
Softcover Book
USD 54.99
Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    www.quora.com/Why-did-Quora-add-the-anonymous-option-for-answers-and-questions.

  2. 2.

    We did not remove the users whose accounts were deleted. On deletion, the username is simply replaced with “user” placeholder and does not make it anonymous. For more details please visit: https://www.quora.com/What-happens-when-I-deactivate-or-delete-my-Quora-account.

  3. 3.

    https://goo.gl/EFXQJx.

  4. 4.

    We use the POS (Parts of Speech) tagger and NER (Named Entity Recognizer) of SpaCy: https://spacy.io.

  5. 5.

    We use VADER [8].

  6. 6.

    http://liwc.wpengine.com/.

  7. 7.

    Definition available in Appendix.

  8. 8.

    http://universaldependencies.org/u/pos/.

References

  1. Bernstein, M.S., Monroy-Hernández, A., Harry, D., André, P., Panovich, K., Vargas, G.G.: 4chan and/b: an analysis of anonymity and ephemerality in a large online community. In: ICWSM, pp. 50–57 (2011)

    Google Scholar 

  2. Birnholtz, J., Merola, N.A.R., Paul, A.: Is it weird to still be a virgin: anonymous, locally targeted questions on facebook confession boards. In: CHI, pp. 2613–2622. ACM (2015)

    Google Scholar 

  3. Cohen, J.: Statistical Power Analysis for the Behavioral Sciences, 2nd edn. Routledge, Abingdon (1988)

    MATH  Google Scholar 

  4. Correa, D., Silva, L.A., Mondal, M., Benevenuto, F., Gummadi, K.P.: The many shades of anonymity: characterizing anonymous social media content. In: ICWSM, pp. 71–80 (2015)

    Google Scholar 

  5. De Choudhury, M., De, S.: Mental health discourse on reddit: self-disclosure, social support, and anonymity. In: ICWSM, pp. 71–80 (2014)

    Google Scholar 

  6. Faber, J., Fonseca, L.M.: How sample size influences research outcomes. Dent. Press J. Orthod. 19(4), 27–29 (2014)

    Article  Google Scholar 

  7. Huang, Z., Thint, M., Qin, Z.: Question classification using head words and their hypernyms. In: EMNLP, pp. 927–936. ACL (2008)

    Google Scholar 

  8. Hutto, C.J., Gilbert, E.: Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: ICWSM (2014)

    Google Scholar 

  9. Li, F., Zhang, X., Yuan, J., Zhu, X.: Classifying what-type questions by head noun tagging. In: ACL, pp. 481–488 (2008)

    Google Scholar 

  10. Maity, S.K., Kharb, A., Mukherjee, A.: Language use matters: analysis of the linguistic structure of question texts can characterize answerability in Quora. In: ICWSM, pp. 612–615 (2017)

    Google Scholar 

  11. Maity, S.K., Kharb, A., Mukherjee, A.: Analyzing the linguistic structure of question texts to characterize answerability in Quora. IEEE Trans. Comput. Soc. Syst. 5(3), 816–828 (2018)

    Article  Google Scholar 

  12. Maity, S.K., Sahni, J.S.S., Mukherjee, A.: Analysis and prediction of question topic popularity in community Q&A sites: a case study of Quora. In: ICWSM, pp. 238–247 (2015)

    Google Scholar 

  13. Paskuda, M., Lewkowicz, M.: Anonymous Quorans are still Quorans, just anonymous. In: 7th International Conference on Communities and Technologies, pp. 9–18. ACM (2015)

    Google Scholar 

  14. Patil, S., Lee, K.: Detecting experts on Quora: by their activity, quality of answers, linguistic characteristics and temporal behaviors. Soc. Netw. Anal. Min. 6(1), 5 (2016)

    Article  Google Scholar 

  15. Wang, G., Gill, K., Mohanlal, M., Zheng, H., Zhao, B.Y.: Wisdom in the social crowd: an analysis of Quora. In: WWW, pp. 1341–1352. ACM (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Binny Mathew .

Editor information

Editors and Affiliations

A Appendix

A Appendix

Cohen’s d: The Cohen’s d test is primarily used to measure the effect size, which is an estimate of the strength of the relationship of two variables. Given the mean and standard deviation of two populations, denoted by \(\mu _{1}\) and \(\sigma _{1}\) and \(\mu _{2}\) and \(\sigma _{2}\) respectively, the Cohen’s d-test is the ratio of the difference of their means to their pooled standard deviation, more succinctly represented by the following expression

figure a

A Cohen’s d-value of magnitude 0.2 indicates small effect, 0.5 indicates medium effect while 0.8 signifies large effect.

Diff(%): We use the metric \(Diff(\%)\) in a way similar to Correa et al. [4] to quantify the difference in features between the Anon and the Known Group. The Diff(%) metric is simply the percentage mean difference of a feature between the Anon and Known group and is represented as

figure b

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mathew, B., Dutt, R., Maity, S.K., Goyal, P., Mukherjee, A. (2019). Deep Dive into Anonymity: Large Scale Analysis of Quora Questions. In: Weber, I., et al. Social Informatics. SocInfo 2019. Lecture Notes in Computer Science(), vol 11864. Springer, Cham. https://doi.org/10.1007/978-3-030-34971-4_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-34971-4_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-34970-7

  • Online ISBN: 978-3-030-34971-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics