skip to main content
research-article

Infer User Interests via Link Structure Regularization

Published: 30 April 2014 Publication History

Abstract

Learning user interests from online social networks helps to better understand user behaviors and provides useful guidance to design user-centric applications. Apart from analyzing users' online content, it is also important to consider users' social connections in the social Web. Graph regularization methods have been widely used in various text mining tasks, which can leverage the graph structure information extracted from data. Previously, graph regularization methods operate under the cluster assumption that nearby nodes are more similar and nodes on the same structure (typically referred to as a cluster or a manifold) are likely to be similar. We argue that learning user interests from complex, sparse, and dynamic social networks should be based on the link structure assumption under which node similarities are evaluated based on the local link structures instead of explicit links between two nodes. We propose a regularization framework based on the relation bipartite graph, which can be constructed from any type of relations. Using Twitter as our case study, we evaluate our proposed framework from social networks built from retweet relations. Both quantitative and qualitative experiments show that our proposed method outperforms a few competitive baselines in learning user interests over a set of predefined topics. It also gives superior results compared to the baselines on retweet prediction and topical authority identification.

References

[1]
Amr Ahmed, Yucheng Low, Mohamed Aly, Vanja Josifovski, and Alexander J. Smola. 2011. Scalable distributed inference of dynamic user interests for behavioral targeting. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11). ACM, 114--122.
[2]
Aris Anagnostopoulos, Ravi Kumar, and Mohammad Mahdian. 2008. Influence and correlation in social networks. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08). ACM, 7--15.
[3]
Eytan Bakshy, Jake M. Hofman, Winter A. Mason, and Duncan J. Watts. 2011. Everyone's an influencer: Quantifying influence on Twitter. In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11). ACM, 65--74.
[4]
David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent dirichlet allocation. J. Mach. Learn. Res.
[5]
Wei Feng and Jianyong Wang. 2013. Retweet or not?: Personalized tweet re-ranking. In Proceedings of the 6th ACM International Conference on Web Search and Data Mining (WSDM'13). ACM, 577--586.
[6]
T. L. Griffiths and M. Steyvers. 2004. Finding scientific topics. Proc. Nat. Acad. Sci. 101, 5228--5235.
[7]
Ziyu Guan, Jiajun Bu, Qiaozhu Mei, Chun Chen, and Can Wang. 2009. Personalized tag recommendation using graph-based ranking on multi-type interrelated objects. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'09). ACM, 540--547.
[8]
Lei Guo, Enhua Tan, Songqing Chen, Xiaodong Zhang, and Yihong (Eric) Zhao. 2009. Analyzing patterns of user content generation in online social networks. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). ACM, 369--378.
[9]
Jiawei Han, Yizhou Sun, Xifeng Yan, and Philip S. Yu. 2012. Mining knowledge from data: An information network analysis approach. In Proceedings of the International Conference on Data Engineering (ICDE'12). IEEE.
[10]
Liangjie Hong, Amr Ahmed, Siva Gurumurthy, Alexander J. Smola, and Kostas Tsioutsiouliklis. 2012. Discovering geographical topics in the Twitter stream. In Proceedings of the 21st International Conference on World Wide Web (WWW'12). ACM, 769--778.
[11]
Liangjie Hong, Ovidiu Dan, and Brian D. Davison. 2011. Predicting popular messages in Twitter. In Proceedings of the 20th International Conference Companion on World Wide Web (WWW'11). ACM, 57--58.
[12]
Liangjie Hong and Brian D. Davison. 2010. Empirical study of topic modeling in Twitter. In Proceedings of the 1st Workshop on Social Media Analytics (SOMA'10). ACM, 80--88.
[13]
John Hopcroft, Tiancheng Lou, and Jie Tang. 2011. Who will follow you back?: Reciprocal relationship prediction. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11). ACM, 1137--1146.
[14]
Glen Jeh and Jennifer Widom. 2002. SimRank: A measure of structural-context similarity. In Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'02). ACM, 538--543.
[15]
Ming Ji, Jun Yan, Siyu Gu, Jiawei Han, Xiaofei He, Wei Vivian Zhang, and Zheng Chen. 2011. Learning search tasks in queries and web pages via graph regularization. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'11). ACM, 55--64.
[16]
Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue Moon. 2010. What is Twitter, a social network or a news media? In Proceedings of the 19th International Conference on World Wide Web (WWW'10). ACM, 591--600.
[17]
Janette Lehmann, Bruno Gonçalves, José J. Ramasco, and Ciro Cattuto. 2012. Dynamical classes of collective attention in Twitter. In Proceedings of the 21st International Conference on World Wide Web (WWW'12). ACM, 251--260.
[18]
Xiao Li, Ye-Yi Wang, and Alex Acero. 2008. Learning query intent from regularized click graphs. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'08). ACM, 339--346.
[19]
Hao Ma, Irwin King, and Michael R. Lyu. 2009a. Learning to recommend with social trust ensemble. In Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'09). ACM, 203--210.
[20]
Hao Ma, Michael R. Lyu, and Irwin King. 2009b. Learning to recommend with trust and distrust relationships. In Proceedings of the 3rd ACM Conference on Recommender Systems (RecSys'09). ACM, 189--196.
[21]
Hao Ma, Dengyong Zhou, Chao Liu, Michael R. Lyu, and Irwin King. 2011. Recommender systems with social regularization. In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11). ACM, New York, NY, 287--296.
[22]
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schtze. 2008. Introduction to Information Retrieval. Cambridge University Press.
[23]
Qiaozhu Mei, Deng Cai, Duo Zhang, and ChengXiang Zhai. 2008. Topic modeling with network regularization. In Proceedings of the 17th International Conference on World Wide Web (WWW'08). ACM, 101--110.
[24]
Arnold Neumaier. 1998. Solving ill-conditioned and singular linear systems: A tutorial on regularization. Siam Rev. 40, 3, 636--666.
[25]
Daniel M. Romero, Brendan Meeder, and Jon Kleinberg. 2011. Differences in the mechanics of information diffusion across topics: Idioms, political hashtags, and complex contagion on Twitter. In Proceedings of the 20th International Conference on World Wide Web (WWW'11). ACM, 695--704.
[26]
Xiaolin Shi, Jun Zhu, Rui Cai, and Lei Zhang. 2009. User grouping behavior in online forums. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). ACM, 777--786.
[27]
Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, and Thomas Griffiths. 2004. Probabilistic author-topic models for information discovery. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04). ACM, 306--315.
[28]
Jie Tang, Jimeng Sun, Chi Wang, and Zi Yang. 2009. Social influence analysis in large-scale networks. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). ACM, 807--816.
[29]
Robert Tibshirani. 1994. Regression shrinkage and selection via the lasso. J. Royal Stat. Soci. Series B 58, 267--288.
[30]
Chi Wang, Rajat Raina, David Fong, Ding Zhou, Jiawei Han, and Greg Badros. 2011. Learning relevance from heterogeneous social network and its application in online targeting. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'11). ACM, 655--664.
[31]
Michael J. Welch, Uri Schonfeld, Dan He, and Junghoo Cho. 2011. Topical semantics of Twitter links. In Proceedings of the 4th ACM International Conference on Web Search and Data Mining (WSDM'11). ACM, 327--336.
[32]
Jianshu Weng, Ee-Peng Lim, Jing Jiang, and Qi He. 2010. TwitterRank: Finding topic-sensitive influential twitterers. In Proceedings of the 3rd ACM International Conference on Web Search and Data Mining (WSDM'10). ACM, 261--270.
[33]
Shaomei Wu, Jake M. Hofman, Winter A. Mason, and Duncan J. Watts. 2011. Who says what to whom on Twitter. In Proceedings of the 20th International Conference on World Wide Web (WWW'11). ACM, 705--714.
[34]
Lei Yang, Tao Sun, Ming Zhang, and Qiaozhu Mei. 2012. We know what @you #tag: Does the dual role affect hashtag adoption? In Proceedings of the 21st International Conference on World Wide Web (WWW'12). ACM, 261--270.
[35]
Dawei Yin, Liangjie Hong, Zhenzhen Xue, and Brian D. Davison. 2011. Temporal dynamics of user interests in tagging systems. In Proceedings of the 25th AAAI Conference on Artificial Intelligence (AAAI'11).
[36]
Dawei Yin, Zhenzhen Xue, Liangjie Hong, and Brian D. Davison. 2010. A probabilistic model for personalized tag prediction. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'10). ACM, 959--968.
[37]
Zhijun Yin, Rui Li, Qiaozhu Mei, and Jiawei Han. 2009. Exploring social tagging graph for Web object classification. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'09). ACM, 957--966.
[38]
Wayne Xin Zhao, Jing Jiang, Jianshu Weng, Jing He, Ee-Peng Lim, Hongfei Yan, and Xiaoming Li. 2011. Comparing Twitter and traditional media using topic models. In Proceedings of the 33rd European Conference on Advances in Information Retrieval (ECIR'11). Lecture Notes in Computer Science, vol. 6611, Springer-Verlag, 338--349.
[39]
Dengyong Zhou, Olivier Bousquet, Thomas Navin Lal, Jason Weston, and Bernhard Schölkopf. 2004. Learning with local and global consistency. In Proceedings of the Conference on Advances in Neural Information Processing Systems 16 (NIPS'04). MIT Press.
[40]
Dengyong Zhou, Jiayuan Huang, and Bernhard Schölkopf. 2005a. Learning from labeled and unlabeled data on a directed graph. In Proceedings of the 22nd International Conference on Machine Learning (ICML'05). ACM, 1036--1043.
[41]
Dengyong Zhou, Bernhard Schölkopf, and Thomas Hofmann. 2005b. Semi-supervised learning on directed graphs. In Proceedings of the Conference on Advances in Neural Information Processing Systems 17. MIT Press, 1633--1640.

Cited By

View all
  • (2024)Predicting users’ future interests on social networks: A reference frameworkInformation Processing & Management10.1016/j.ipm.2024.10376561:5(103765)Online publication date: Sep-2024
  • (2022)Rough-Set-Based Real-Time Interest Label Extraction over Large-Scale Social NetworksComplexity10.1155/2022/20729502022Online publication date: 1-Jan-2022
  • (2021)Personality-Aware Product Recommendation System Based on User Interests Mining and Metapath DiscoveryIEEE Transactions on Computational Social Systems10.1109/TCSS.2020.30370408:1(86-98)Online publication date: Feb-2021
  • Show More Cited By

Index Terms

  1. Infer User Interests via Link Structure Regularization

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Intelligent Systems and Technology
    ACM Transactions on Intelligent Systems and Technology  Volume 5, Issue 2
    Special Issue on Linking Social Granularity and Functions
    April 2014
    347 pages
    ISSN:2157-6904
    EISSN:2157-6912
    DOI:10.1145/2611448
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 April 2014
    Accepted: 01 June 2013
    Revised: 01 April 2013
    Received: 01 October 2012
    Published in TIST Volume 5, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. User interests
    2. graph regularization
    3. link structure

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 22 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Predicting users’ future interests on social networks: A reference frameworkInformation Processing & Management10.1016/j.ipm.2024.10376561:5(103765)Online publication date: Sep-2024
    • (2022)Rough-Set-Based Real-Time Interest Label Extraction over Large-Scale Social NetworksComplexity10.1155/2022/20729502022Online publication date: 1-Jan-2022
    • (2021)Personality-Aware Product Recommendation System Based on User Interests Mining and Metapath DiscoveryIEEE Transactions on Computational Social Systems10.1109/TCSS.2020.30370408:1(86-98)Online publication date: Feb-2021
    • (2021)Advertisement recommendations for products and community prediction2021 5th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT)10.1109/ISMSIT52890.2021.9604627(755-760)Online publication date: 21-Oct-2021
    • (2020)Learning Semantic Representations from Directed Social Links to Tag Microblog Users at ScaleACM Transactions on Information Systems10.1145/337755038:2(1-30)Online publication date: 7-Mar-2020
    • (2020)Invested or IndebtedACM Transactions on Management Information Systems10.1145/337138811:1(1-26)Online publication date: 20-Jan-2020
    • (2020)The Four Dimensions of Social Network Analysis: An Overview of Research Methods, Applications, and Software ToolsInformation Fusion10.1016/j.inffus.2020.05.009Online publication date: Jun-2020
    • (2020)Extracting User Interests from Operation Logs on Museum Devices for Post-LearningDigital Libraries at Times of Massive Societal Transition10.1007/978-3-030-64452-9_15(176-186)Online publication date: 26-Nov-2020
    • (2019)Extracting, Mining and Predicting Users' Interests from Social NetworksProceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3331184.3331383(1407-1408)Online publication date: 18-Jul-2019
    • (2019)Social User Interest MiningProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3332279(3235-3236)Online publication date: 25-Jul-2019
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media