Skip to main content

Showing 1–10 of 10 results for author: Gurukar, S

  1. arXiv:2404.00816  [pdf, other

    cs.LG cs.AI

    HeteroMILE: a Multi-Level Graph Representation Learning Framework for Heterogeneous Graphs

    Authors: Yue Zhang, Yuntian He, Saket Gurukar, Srinivasan Parthasarathy

    Abstract: Heterogeneous graphs are ubiquitous in real-world applications because they can represent various relationships between different types of entities. Therefore, learning embeddings in such graphs is a critical problem in graph machine learning. However, existing solutions for this problem fail to scale to large heterogeneous graphs due to their high computational complexity. To address this issue,… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  2. arXiv:2306.14357  [pdf, other

    cs.LG cs.SI

    PolicyClusterGCN: Identifying Efficient Clusters for Training Graph Convolutional Networks

    Authors: Saket Gurukar, Shaileshh Bojja Venkatakrishnan, Balaraman Ravindran, Srinivasan Parthasarathy

    Abstract: Graph convolutional networks (GCNs) have achieved huge success in several machine learning (ML) tasks on graph-structured data. Recently, several sampling techniques have been proposed for the efficient training of GCNs and to improve the performance of GCNs on ML tasks. Specifically, the subgraph-based sampling approaches such as ClusterGCN and GraphSAINT have achieved state-of-the-art performanc… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

  3. FairMILE: Towards an Efficient Framework for Fair Graph Representation Learning

    Authors: Yuntian He, Saket Gurukar, Srinivasan Parthasarathy

    Abstract: Graph representation learning models have demonstrated great capability in many real-world applications. Nevertheless, prior research indicates that these models can learn biased representations leading to discriminatory outcomes. A few works have been proposed to mitigate the bias in graph representations. However, most existing works require exceptional time and computing resources for training… ▽ More

    Submitted 17 October, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

  4. arXiv:2205.10666  [pdf, other

    cs.LG cs.SI

    MultiBiSage: A Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest

    Authors: Saket Gurukar, Nikil Pancha, Andrew Zhai, Eric Kim, Samson Hu, Srinivasan Parthasarathy, Charles Rosenberg, Jure Leskovec

    Abstract: Graph Convolutional Networks (GCN) can efficiently integrate graph structure and node features to learn high-quality node embeddings. These embeddings can then be used for several tasks such as recommendation and search. At Pinterest, we have developed and deployed PinSage, a data-efficient GCN that learns pin embeddings from the Pin-Board graph. The Pin-Board graph contains pin and board entities… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  5. FairEGM: Fair Link Prediction and Recommendation via Emulated Graph Modification

    Authors: Sean Current, Yuntian He, Saket Gurukar, Srinivasan Parthasarathy

    Abstract: As machine learning becomes more widely adopted across domains, it is critical that researchers and ML engineers think about the inherent biases in the data that may be perpetuated by the model. Recently, many studies have shown that such biases are also imbibed in Graph Neural Network (GNN) models if the input graph is biased, potentially to the disadvantage of underserved and underrepresented co… ▽ More

    Submitted 20 October, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 14 pages, 3 figures, 6 tables

    Journal ref: EAAMO (2022)

  6. arXiv:2104.02174  [pdf, other

    q-bio.QM

    A Machine Learning Model for Nowcasting Epidemic Incidence

    Authors: Saumya Yashmohini Sahai, Saket Gurukar, Wasiur R. KhudaBukhsh, Srinivasan Parthasarathy, Grzegorz A. Rempala

    Abstract: Due to delay in reporting, the daily national and statewide COVID-19 incidence counts are often unreliable and need to be estimated from recent data. This process is known in economics as nowcasting. We describe in this paper a simple random forest statistical model for nowcasting the COVID - 19 daily new infection counts based on historic data along with a set of simple covariates, such as the cu… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

  7. arXiv:2001.09879  [pdf, other

    cs.CL

    Towards Quantifying the Distance between Opinions

    Authors: Saket Gurukar, Deepak Ajwani, Sourav Dutta, Juho Lauri, Srinivasan Parthasarathy, Alessandra Sala

    Abstract: Increasingly, critical decisions in public policy, governance, and business strategy rely on a deeper understanding of the needs and opinions of constituent members (e.g. citizens, shareholders). While it has become easier to collect a large number of opinions on a topic, there is a necessity for automated tools to help navigate the space of opinions. In such contexts understanding and quantifying… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted in ICWSM '20

  8. Twitter Watch: Leveraging Social Media to Monitor and Predict Collective-Efficacy of Neighborhoods

    Authors: Moniba Keymanesh, Saket Gurukar, Bethany Boettner, Christopher Browning, Catherine Calder, Srinivasan Parthasarathy

    Abstract: Sociologists associate the spatial variation of crime within an urban setting, with the concept of collective efficacy. The collective efficacy of a neighborhood is defined as social cohesion among neighbors combined with their willingness to intervene on behalf of the common good. Sociologists measure collective efficacy by conducting survey studies designed to measure individuals' perception of… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: 10 pages, 7 figures

    Journal ref: Complex Networks XI 2020

  9. arXiv:1905.00987  [pdf, other

    cs.LG cs.SI stat.ML

    Network Representation Learning: Consolidation and Renewed Bearing

    Authors: Saket Gurukar, Priyesh Vijayan, Aakash Srinivasan, Goonmeet Bajaj, Chen Cai, Moniba Keymanesh, Saravana Kumar, Pranav Maneriker, Anasua Mitra, Vedang Patel, Balaraman Ravindran, Srinivasan Parthasarathy

    Abstract: Graphs are a natural abstraction for many problems where nodes represent entities and edges represent a relationship across entities. An important area of research that has emerged over the last decade is the use of graphs as a vehicle for non-linear dimensionality reduction in a manner akin to previous efforts based on manifold learning with uses for downstream database processing, machine learni… ▽ More

    Submitted 15 June, 2019; v1 submitted 2 May, 2019; originally announced May 2019.

  10. arXiv:1802.09612  [pdf, other

    cs.AI cs.SI

    MILE: A Multi-Level Framework for Scalable Graph Embedding

    Authors: Jiongqian Liang, Saket Gurukar, Srinivasan Parthasarathy

    Abstract: Recently there has been a surge of interest in designing graph embedding methods. Few, if any, can scale to a large-sized graph with millions of nodes due to both computational complexity and memory requirements. In this paper, we relax this limitation by introducing the MultI-Level Embedding (MILE) framework -- a generic methodology allowing contemporary graph embedding methods to scale to large… ▽ More

    Submitted 13 August, 2020; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: Accepted in ICWSM 2021