Skip to main content

Showing 1–25 of 25 results for author: Murai, F

  1. arXiv:2409.07424  [pdf, other

    cs.CL cs.CY cs.LG

    Towards Fairer Health Recommendations: finding informative unbiased samples via Word Sense Disambiguation

    Authors: Gavin Butts, Pegah Emdad, Jethro Lee, Shannon Song, Chiman Salavati, Willmar Sosa Diaz, Shiri Dori-Hacohen, Fabricio Murai

    Abstract: There have been growing concerns around high-stake applications that rely on models trained with biased data, which consequently produce biased predictions, often harming the most vulnerable. In particular, biased medical data could cause health-related applications and recommender systems to create outputs that jeopardize patient care and widen disparities in health outcomes. A recent framework t… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: Accepted for long presentation at the FAcctRec @ Recsys 2024

    ACM Class: I.2.7; J.3; K.4

  2. arXiv:2407.17459  [pdf, other

    cs.LG cs.CY

    Hidden or Inferred: Fair Learning-To-Rank with Unknown Demographics

    Authors: Oluseun Olulana, Kathleen Cachel, Fabricio Murai, Elke Rundensteiner

    Abstract: As learning-to-rank models are increasingly deployed for decision-making in areas with profound life implications, the FairML community has been developing fair learning-to-rank (LTR) models. These models rely on the availability of sensitive demographic features such as race or sex. However, in practice, regulatory obstacles and privacy concerns protect this data from collection and use. As a res… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by AAAI/AIES to the AIES 2024 conference

  3. arXiv:2407.12680  [pdf, other

    cs.CY cs.CL

    Reducing Biases towards Minoritized Populations in Medical Curricular Content via Artificial Intelligence for Fairer Health Outcomes

    Authors: Chiman Salavati, Shannon Song, Willmar Sosa Diaz, Scott A. Hale, Roberto E. Montenegro, Fabricio Murai, Shiri Dori-Hacohen

    Abstract: Biased information (recently termed bisinformation) continues to be taught in medical curricula, often long after having been debunked. In this paper, we introduce BRICC, a firstin-class initiative that seeks to mitigate medical bisinformation using machine learning to systematically identify and flag text with potential biases, for subsequent review in an expert-in-the-loop fashion, thus greatly… ▽ More

    Submitted 21 May, 2024; originally announced July 2024.

    Comments: Under review

  4. arXiv:2401.04264  [pdf, ps, other

    cs.GT cs.MA math.CO math.OC

    General Performance Evaluation for Competitive Resource Allocation Games via Unseen Payoff Estimation

    Authors: N'yoma Diamond, Fabricio Murai

    Abstract: Many high-stakes decision-making problems, such as those found within cybersecurity and economics, can be modeled as competitive resource allocation games. In these games, multiple players must allocate limited resources to overcome their opponent(s), while minimizing any induced individual losses. However, existing means of assessing the performance of resource allocation algorithms are highly di… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  5. arXiv:2312.15040  [pdf, other

    cs.SI cs.AI

    Towards Detecting Cascades of Biased Medical Claims on Twitter

    Authors: Libby Tiderman, Juan Sanchez Mercedes, Fiona Romanoschi, Fabricio Murai

    Abstract: Social media may disseminate medical claims that highlight misleading correlations between social identifiers and diseases due to not accounting for structural determinants of health. Our research aims to identify biased medical claims on Twitter and measure their spread. We propose a machine learning framework that uses two models in tandem: RoBERTa to detect medical claims and DistilBERT to clas… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted at 2023 MIT Undergraduate Research Technology Conference (URTC'23)

    ACM Class: I.2.1; I.2.7

  6. arXiv:2308.14782  [pdf, other

    cs.CY

    Helping Fact-Checkers Identify Fake News Stories Shared through Images on WhatsApp

    Authors: Julio C. S. Reis, Philipe Melo, Fabiano Belém, Fabricio Murai, Jussara M. Almeida, Fabricio Benevenuto

    Abstract: WhatsApp has introduced a novel avenue for smartphone users to engage with and disseminate news stories. The convenience of forming interest-based groups and seamlessly sharing content has rendered WhatsApp susceptible to the exploitation of misinformation campaigns. While the process of fact-checking remains a potent tool in identifying fabricated news, its efficacy falters in the face of the unp… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: This is a preprint version of an accepted manuscript on the Brazilian Symposium on Multimedia and the Web (WebMedia). Please, consider to cite it instead of this one

  7. arXiv:2205.10293  [pdf, other

    cs.LG cs.SI

    DELATOR: Money Laundering Detection via Multi-Task Learning on Large Transaction Graphs

    Authors: Henrique S. Assumpção, Fabrício Souza, Leandro Lacerda Campos, Vinícius T. de Castro Pires, Paulo M. Laurentys de Almeida, Fabricio Murai

    Abstract: Money laundering has become one of the most relevant criminal activities in modern societies, as it causes massive financial losses for governments, banks and other institutions. Detecting such activities is among the top priorities when it comes to financial analysis, but current approaches are often costly and labor intensive partly due to the sheer amount of data to be analyzed. Hence, there is… ▽ More

    Submitted 24 October, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in the 2022 IEEE International Conference on Big Data (IEEE BigData) as a short paper

  8. arXiv:2112.03398  [pdf, other

    cs.LG cs.CV

    Top-Down Deep Clustering with Multi-generator GANs

    Authors: Daniel de Mello, Renato Assunção, Fabricio Murai

    Abstract: Deep clustering (DC) leverages the representation power of deep architectures to learn embedding spaces that are optimal for cluster analysis. This approach filters out low-level information irrelevant for clustering and has proven remarkably successful for high dimensional data spaces. Some DC methods employ Generative Adversarial Networks (GANs), motivated by the powerful latent representations… ▽ More

    Submitted 24 December, 2021; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to AAAI 2022

    ACM Class: I.5.3; I.4.10; I.2.6

  9. On the Dynamics of Political Discussions on Instagram: A Network Perspective

    Authors: Carlos H. G. Ferreira, Fabricio Murai, Ana P. C. Silva, Jussara M. Almeida, Martino Trevisan, Luca Vassio, Marco Mellia, Idilio Drago

    Abstract: Instagram has been increasingly used as a source of information especially among the youth. As a result, political figures now leverage the platform to spread opinions and political agenda. We here analyze online discussions on Instagram, notably in political topics, from a network perspective. Specifically, we investigate the emergence of communities of co-commenters, that is, groups of users who… ▽ More

    Submitted 13 September, 2022; v1 submitted 19 September, 2021; originally announced September 2021.

    Journal ref: Online Social Networks and Media, Volume 25, 2021, ISSN 2468-6964

  10. arXiv:2109.08446  [pdf, ps, other

    cs.NI cs.PF

    Heterogeneous download times in bandwidth-homogeneous BitTorrent swarms

    Authors: Fabricio Murai, Antonio A. de A. Rocha, Daniel R. Figueiredo, Edmundo A. de Souza e Silva

    Abstract: Modeling and understanding BitTorrent (BT) dynamics is a recurrent research topic mainly due to its high complexity and tremendous practical efficiency. Over the years, different models have uncovered various phenomena exhibited by the system, many of which have direct impact on its performance. In this paper we identify and characterize a phenomenon that has not been previously observed: homogene… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Published in Computer Networks. arXiv admin note: substantial text overlap with arXiv:1102.3610

    ACM Class: C.4; I.6

  11. arXiv:2109.02202  [pdf, ps, other

    cs.AI cs.CY cs.IR cs.LG

    Fairness via AI: Bias Reduction in Medical Information

    Authors: Shiri Dori-Hacohen, Roberto Montenegro, Fabricio Murai, Scott A. Hale, Keen Sung, Michela Blain, Jennifer Edwards-Johnson

    Abstract: Most Fairness in AI research focuses on exposing biases in AI systems. A broader lens on fairness reveals that AI can serve a greater aspiration: rooting out societal inequities from their source. Specifically, we focus on inequities in health information, and aim to reduce bias in that domain using AI. The AI algorithms under the hood of search engines and social media, many of which are based on… ▽ More

    Submitted 5 September, 2021; originally announced September 2021.

    Comments: To appear in: The 4th FAccTRec Workshop on Responsible Recommendation at RecSys 2021

  12. arXiv:2108.12214  [pdf, other

    cs.DC cs.PF

    Machine Learning for Performance Prediction of Spark Cloud Applications

    Authors: Alexandre Maros, Fabricio Murai, Ana Paula Couto da Silva, Jussara M. Almeida, Marco Lattuada, Eugenio Gianniti, Marjan Hosseini, Danilo Ardagna

    Abstract: Big data applications and analytics are employed in many sectors for a variety of goals: improving customers satisfaction, predicting market behavior or improving processes in public health. These applications consist of complex software stacks that are often run on cloud systems. Predicting execution times is important for estimating the cost of cloud services and for effectively managing the und… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Published in 2019 IEEE 12th International Conference on Cloud Computing (CLOUD)

    ACM Class: B.8.2; I.2

  13. How effective are Graph Neural Networks in Fraud Detection for Network Data?

    Authors: Ronald D. R. Pereira, Fabrício Murai

    Abstract: Graph-based Neural Networks (GNNs) are recent models created for learning representations of nodes (and graphs), which have achieved promising results when detecting patterns that occur in large-scale data relating different entities. Among these patterns, financial fraud stands out for its socioeconomic relevance and for presenting particular challenges, such as the extreme imbalance between the… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: 12 pages, in Portuguese

    Report number: brasnam.2021.16141

    Journal ref: X Brazilian Workshop on Social Network Analysis and Mining (2021)

  14. Evaluating the state-of-the-art in mapping research spaces: a Brazilian case study

    Authors: Francisco Galuppo Azevedo, Fabricio Murai

    Abstract: Scientific knowledge cannot be seen as a set of isolated fields, but as a highly connected network. Understanding how research areas are connected is of paramount importance for adequately allocating funding and human resources (e.g., assembling teams to tackle multidisciplinary problems). The relationship between disciplines can be drawn from data on the trajectory of individual scientists, as re… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: 28 pages, 11 figures

    MSC Class: 68U35 ACM Class: H.4; I.2.1; I.6

    Journal ref: PLoS ONE 16(3): e0248724 (2021)

  15. arXiv:2103.12095  [pdf, other

    cs.LG cs.AI

    Am I fit for this physical activity? Neural embedding of physical conditioning from inertial sensors

    Authors: Davi Pedrosa de Aguiar, Fabricio Murai

    Abstract: Inertial Measurement Unit (IMU) sensors are present in everyday devices such as smartphones and fitness watches. As a result, the array of health-related research and applications that tap onto this data has been growing, but little attention has been devoted to the prediction of an individual's heart rate (HR) from IMU data, when undergoing a physical activity. Would that be even possible? If so,… ▽ More

    Submitted 19 August, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: To be published in 10th Brazilian Conference on Intelligent Systems, BRACIS 2021

    MSC Class: 68T07 ACM Class: I.2.1; I.5.1; J.3

  16. Characterizing (Un)moderated Textual Data in Social Systems

    Authors: Lucas Henrique Costa de Lima, Julio Reis, Philipe Melo, Fabricio Murai, Fabricio Benevenuto

    Abstract: Despite the valuable social interactions that online media promote, these systems provide space for speech that would be potentially detrimental to different groups of people. The moderation of content imposed by many social media has motivated the emergence of a new social system for free speech named Gab, which lacks moderation of content. This article characterizes and compares moderated textua… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: Accepted to IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM, 2020)

  17. arXiv:2005.07473  [pdf, other

    cs.LG cs.CL cs.SI stat.ML

    Predicting User Emotional Tone in Mental Disorder Online Communities

    Authors: Bárbara Silveira, Henrique S. Silva, Fabricio Murai, Ana Paula Couto da Silva

    Abstract: In recent years, Online Social Networks have become an important medium for people who suffer from mental disorders to share moments of hardship, and receive emotional and informational support. In this work, we analyze how discussions in Reddit communities related to mental disorders can help improve the health conditions of their users. Using the emotional tone of users' writing as a proxy for e… ▽ More

    Submitted 27 July, 2021; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: 8 pages, 3 figures, 3 tables

    ACM Class: J.3; I.2.7

    Journal ref: Future Generation Computer Systems, Volume 125, 2021, Pages 641-651, ISSN 0167-739X

  18. Towards Understanding Political Interactions on Instagram

    Authors: Martino Trevisan, Luca Vassio, Idilio Drago, Marco Mellia, Fabricio Murai, Flavio Figueiredo, Ana Paula Couto da Silva, Jussara M. Almeida

    Abstract: Online Social Networks (OSNs) allow personalities and companies to communicate directly with the public, bypassing filters of traditional medias. As people rely on OSNs to stay up-to-date, the political debate has moved online too. We witness the sudden explosion of harsh political debates and the dissemination of rumours in OSNs. Identifying such behaviour requires a deep understanding on how peo… ▽ More

    Submitted 4 May, 2021; v1 submitted 26 April, 2019; originally announced April 2019.

    Comments: 5 pages, 8 figures, Proceedings of the 30th ACM Conference on Hypertext and Social Media, https://dl.acm.org/doi/10.1145/3342220.3343657

    Journal ref: HT19: Proceedings of the 30th ACM Conference on Hypertext and Social Media. September 2019. Pages 247-251. Association for Computing Machinery

  19. arXiv:1807.03688  [pdf, other

    cs.SI

    Inside the Right-Leaning Echo Chambers: Characterizing Gab, an Unmoderated Social System

    Authors: Lucas Lima, Julio C. S. Reis, Philipe Melo, Fabricio Murai, Leandro Araújo, Pantelis Vikatos, Fabrício Benevenuto

    Abstract: The moderation of content in many social media systems, such as Twitter and Facebook, motivated the emergence of a new social network system that promotes free speech, named Gab. Soon after that, Gab has been removed from Google Play Store for violating the company's hate speech policy and it has been rejected by Apple for similar reasons. In this paper we characterize Gab, aiming at understanding… ▽ More

    Submitted 10 July, 2018; originally announced July 2018.

    Comments: This is a preprint of a paper that will appear on ASONAM'18

  20. Modelos de Resposta para Experimentos Randomizados em Redes Sociais de Larga Escala

    Authors: Francisco Galuppo Azevedo, Bruno Demattos Nogueira, Fabricio Murai, Ana Paula Couto da Silva

    Abstract: A/B tests are randomized experiments frequently used by companies that offer services on the Web for assessing the impact of new features. During an experiment, each user is randomly redirected to one of two versions of the website, called treatments. Several response models were proposed to describe the behavior of a user in a social network website, where the treatment assigned to her neighbors… ▽ More

    Submitted 9 March, 2018; originally announced March 2018.

    Comments: 15 pages, in Portuguese, 2 figures, submitted to SBC WPerformance 2018

  21. arXiv:1703.08252  [pdf, other

    cs.SI physics.soc-ph

    Characterizing Directed and Undirected Networks via Multidimensional Walks with Jumps

    Authors: Fabricio Murai, Bruno Ribeiro, Don Towsley, Pinghui Wang

    Abstract: Estimating distributions of node characteristics (labels) such as number of connections or citizenship of users in a social network via edge and node sampling is a vital part of the study of complex networks. Due to its low cost, sampling via a random walk (RW) has been proposed as an attractive solution to this task. Most RW methods assume either that the network is undirected or that walkers can… ▽ More

    Submitted 13 July, 2018; v1 submitted 23 March, 2017; originally announced March 2017.

    Comments: 35 pages, submitted to ACM Transactions on Knowledge Discovery from Data (TKDD)

    ACM Class: G.3

  22. arXiv:1703.05082  [pdf, other

    cs.SI cs.LG stat.ML

    Selective Harvesting over Networks

    Authors: Fabricio Murai, Diogo Rennó, Bruno Ribeiro, Gisele L. Pappa, Don Towsley, Krista Gile

    Abstract: Active search (AS) on graphs focuses on collecting certain labeled nodes (targets) given global knowledge of the network topology and its edge weights under a query budget. However, in most networks, nodes, topology and edge weights are all initially unknown. We introduce selective harvesting, a variant of AS where the next node to be queried must be chosen among the neighbors of the current queri… ▽ More

    Submitted 15 March, 2017; originally announced March 2017.

    Comments: 28 pages, 9 figures

    ACM Class: I.2.6; E.1

  23. arXiv:1302.5847  [pdf, other

    stat.AP

    Characterizing Branching Processes from Sampled Data

    Authors: Fabricio Murai, Bruno Ribeiro, Don Towsley, Krista Gile

    Abstract: Branching processes model the evolution of populations of agents that randomly generate offsprings. These processes, more patently Galton-Watson processes, are widely used to model biological, social, cognitive, and technological phenomena, such as the diffusion of ideas, knowledge, chain letters, viruses, and the evolution of humans through their Y-chromosome DNA or mitochondrial RNA. A practical… ▽ More

    Submitted 23 February, 2013; originally announced February 2013.

  24. arXiv:1209.0736  [pdf, other

    math.ST cs.IT cs.SI

    On Set Size Distribution Estimation and the Characterization of Large Networks via Sampling

    Authors: Fabricio Murai, Bruno Ribeiro, Don Towsley, Pinghui Wang

    Abstract: In this work we study the set size distribution estimation problem, where elements are randomly sampled from a collection of non-overlapping sets and we seek to recover the original set size distribution from the samples. This problem has applications to capacity planning, network theory, among other areas. Examples of real-world applications include characterizing in-degree distributions in large… ▽ More

    Submitted 2 December, 2012; v1 submitted 4 September, 2012; originally announced September 2012.

    Report number: Technical Report UM-CS-2012-023v2

  25. arXiv:1102.3610  [pdf, ps, other

    cs.NI

    Heterogeneous download times in a homogeneous BitTorrent swarm

    Authors: Fabricio Murai, Antonio A de A Rocha, Daniel R. Figueiredo, Edmundo de Souza e Silva

    Abstract: Modeling and understanding BitTorrent (BT) dynamics is a recurrent research topic mainly due to its high complexity and tremendous practical efficiency. Over the years, different models have uncovered various phenomena exhibited by the system, many of which have direct impact on its performance. In this paper we identify and characterize a phenomenon that has not been previously observed: homogene… ▽ More

    Submitted 18 February, 2011; v1 submitted 17 February, 2011; originally announced February 2011.

    ACM Class: C.2.2; I.6.3; I.6.4