Skip to main content

Showing 1–21 of 21 results for author: Shaikh, O

  1. arXiv:2406.00888  [pdf, other

    cs.CL cs.HC

    Show, Don't Tell: Aligning Language Models with Demonstrated Feedback

    Authors: Omar Shaikh, Michelle Lam, Joey Hejna, Yijia Shao, Michael Bernstein, Diyi Yang

    Abstract: Language models are aligned to emulate the collective voice of many, resulting in outputs that align with no one in particular. Steering LLMs away from generic output is possible through supervised finetuning or RLHF, but requires prohibitively large datasets for new ad-hoc tasks. We argue that it is instead possible to align an LLM to a specific setting by leveraging a very small number ($<10$) o… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  2. arXiv:2404.04204  [pdf, other

    cs.CL cs.HC

    Social Skill Training with Large Language Models

    Authors: Diyi Yang, Caleb Ziems, William Held, Omar Shaikh, Michael S. Bernstein, John Mitchell

    Abstract: People rely on social skills like conflict resolution to communicate effectively and to thrive in both work and personal life. However, practice environments for social skills are typically out of reach for most people. How can we make social skill training more available, accessible, and inviting? Drawing upon interdisciplinary research from communication and psychology, this perspective paper id… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  3. arXiv:2311.09144  [pdf, other

    cs.CL cs.HC

    Grounding Gaps in Language Model Generations

    Authors: Omar Shaikh, Kristina Gligorić, Ashna Khetan, Matthias Gerstgrasser, Diyi Yang, Dan Jurafsky

    Abstract: Effective conversation requires common ground: a shared understanding between the participants. Common ground, however, does not emerge spontaneously in conversation. Speakers and listeners work together to both identify and construct a shared basis while avoiding misunderstanding. To accomplish grounding, humans rely on a range of dialogue acts, like clarification (What do you mean?) and acknowle… ▽ More

    Submitted 2 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024; 18 pages, 2 figures

  4. arXiv:2309.12309  [pdf, other

    cs.HC cs.AI cs.CL

    Rehearsal: Simulating Conflict to Teach Conflict Resolution

    Authors: Omar Shaikh, Valentino Chai, Michele J. Gelfand, Diyi Yang, Michael S. Bernstein

    Abstract: Interpersonal conflict is an uncomfortable but unavoidable fact of life. Navigating conflict successfully is a skill -- one that can be learned through deliberate practice -- but few have access to effective training or feedback. To expand this access, we introduce Rehearsal, a system that allows users to rehearse conflicts with a believable simulated interlocutor, explore counterfactual "what if?… ▽ More

    Submitted 29 February, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: CHI 2024

  5. arXiv:2306.02475  [pdf, other

    cs.CL

    Modeling Cross-Cultural Pragmatic Inference with Codenames Duet

    Authors: Omar Shaikh, Caleb Ziems, William Held, Aryan J. Pariani, Fred Morstatter, Diyi Yang

    Abstract: Pragmatic reference enables efficient interpersonal communication. Prior work uses simple reference games to test models of pragmatic reasoning, often with unidentified speakers and listeners. In practice, however, speakers' sociocultural background shapes their pragmatic assumptions. For example, readers of this paper assume NLP refers to "Natural Language Processing," and not "Neuro-linguistic P… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: ACL 2023 Findings

  6. arXiv:2305.03514  [pdf, other

    cs.CL cs.LG

    Can Large Language Models Transform Computational Social Science?

    Authors: Caleb Ziems, William Held, Omar Shaikh, Jiaao Chen, Zhehao Zhang, Diyi Yang

    Abstract: Large Language Models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the Computational Social Science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools… ▽ More

    Submitted 26 February, 2024; v1 submitted 12 April, 2023; originally announced May 2023.

    Comments: To appear in "Computational Linguistics" (CL)

  7. arXiv:2212.08061  [pdf, other

    cs.CL

    On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning

    Authors: Omar Shaikh, Hongxin Zhang, William Held, Michael Bernstein, Diyi Yang

    Abstract: Generating a Chain of Thought (CoT) has been shown to consistently improve large language model (LLM) performance on a wide range of NLP tasks. However, prior work has mainly focused on logical reasoning tasks (e.g. arithmetic, commonsense QA); it remains unclear whether improvements hold for more diverse types of reasoning, especially in socially situated contexts. Concretely, we perform a contro… ▽ More

    Submitted 4 June, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: ACL 2023 Main Conference

  8. arXiv:2210.00160  [pdf, other

    cs.SI cs.CR cs.CY cs.HC

    Explaining Website Reliability by Visualizing Hyperlink Connectivity

    Authors: Seongmin Lee, Sadia Afroz, Haekyu Park, Zijie J. Wang, Omar Shaikh, Vibhor Sehgal, Ankit Peshin, Duen Horng Chau

    Abstract: As the information on the Internet continues growing exponentially, understanding and assessing the reliability of a website is becoming increasingly important. Misinformation has far-ranging repercussions, from sowing mistrust in media to undermining democratic elections. While some research investigates how to alert people to misinformation on the web, much less research has been conducted on ex… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted at IEEE VIS 2022, 5 pages, 4 figures, For a live demo, visit https://poloclub.github.io/MisVis

  9. arXiv:2204.13895  [pdf, other

    cs.HC

    Six Feet Apart: Online Payments During the COVID-19 Pandemic

    Authors: Omar Shaikh, Cassandra Ung, Diyi Yang, Felipe Chacon

    Abstract: Since the COVID-19 pandemic, businesses have faced unprecedented challenges when trying to remain open. Because COVID-19 spreads through aerosolized droplets, businesses were forced to distance their services; in some cases, distancing may have involved moving business services online. In this work, we explore digitization strategies used by small businesses that remained open during the pandemic,… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 33 pages, 8 figures, CSCW 2022

  10. arXiv:2203.16475  [pdf, other

    cs.LG cs.CV

    Concept Evolution in Deep Learning Training: A Unified Interpretation Framework and Discoveries

    Authors: Haekyu Park, Seongmin Lee, Benjamin Hoover, Austin P. Wright, Omar Shaikh, Rahul Duggal, Nilaksh Das, Kevin Li, Judy Hoffman, Duen Horng Chau

    Abstract: We present ConceptEvo, a unified interpretation framework for deep neural networks (DNNs) that reveals the inception and evolution of learned concepts during training. Our work addresses a critical gap in DNN interpretation research, as existing methods primarily focus on post-training interpretation. ConceptEvo introduces two novel technical contributions: (1) an algorithm that generates a unifie… ▽ More

    Submitted 22 August, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted at CIKM'23

  11. arXiv:2108.12931  [pdf, other

    cs.CV

    NeuroCartography: Scalable Automatic Visual Summarization of Concepts in Deep Neural Networks

    Authors: Haekyu Park, Nilaksh Das, Rahul Duggal, Austin P. Wright, Omar Shaikh, Fred Hohman, Duen Horng Chau

    Abstract: Existing research on making sense of deep neural networks often focuses on neuron-level interpretation, which may not adequately capture the bigger picture of how concepts are collectively encoded by multiple neurons. We present NeuroCartography, an interactive system that scalably summarizes and visualizes concepts learned by neural networks. It automatically discovers and groups neurons that det… ▽ More

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: Accepted to IEEE VIS'21

  12. arXiv:2103.16435  [pdf, other

    cs.LG cs.AI cs.HC

    EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML Models

    Authors: Omar Shaikh, Jon Saad-Falcon, Austin P Wright, Nilaksh Das, Scott Freitas, Omar Isaac Asensio, Duen Horng Chau

    Abstract: The advent of larger machine learning (ML) models have improved state-of-the-art (SOTA) performance in various modeling tasks, ranging from computer vision to natural language. As ML models continue increasing in size, so does their respective energy consumption and computational requirements. However, the methods for tracking, reporting, and comparing energy consumption remain limited. We present… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: 7 pages, 5 figures; CHI 2021 Extended Abstracts

  13. arXiv:2102.04427  [pdf, other

    cs.HC cs.CL cs.LG cs.SI

    RECAST: Enabling User Recourse and Interpretability of Toxicity Detection Models with Interactive Visualization

    Authors: Austin P Wright, Omar Shaikh, Haekyu Park, Will Epperson, Muhammed Ahmed, Stephane Pinel, Duen Horng Chau, Diyi Yang

    Abstract: With the widespread use of toxic language online, platforms are increasingly using automated systems that leverage advances in natural language processing to automatically flag and remove toxic comments. However, most automated systems -- when detecting and moderating toxic language -- do not provide feedback to their users, let alone provide an avenue of recourse for these users to make actionabl… ▽ More

    Submitted 10 February, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: 26 pages, 5 figures, CSCW '21

  14. arXiv:2010.04625  [pdf, other

    cs.CL cs.AI cs.LG

    Examining the Ordering of Rhetorical Strategies in Persuasive Requests

    Authors: Omar Shaikh, Jiaao Chen, Jon Saad-Falcon, Duen Horng Chau, Diyi Yang

    Abstract: Interpreting how persuasive language influences audiences has implications across many domains like advertising, argumentation, and propaganda. Persuasion relies on more than a message's content. Arranging the order of the message itself (i.e., ordering specific rhetorical strategies) also plays an important role. To examine how strategy orderings contribute to persuasiveness, we first utilize a V… ▽ More

    Submitted 11 October, 2020; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  15. arXiv:2009.00091  [pdf

    cs.DL cs.CL cs.HC

    Mapping Researchers with PeopleMap

    Authors: Jon Saad-Falcon, Omar Shaikh, Zijie J. Wang, Austin P. Wright, Sasha Richardson, Duen Horng Chau

    Abstract: Discovering research expertise at universities can be a difficult task. Directories routinely become outdated, and few help in visually summarizing researchers' work or supporting the exploration of shared interests among researchers. This results in lost opportunities for both internal and external entities to discover new connections, nurture research collaboration, and explore the diversity of… ▽ More

    Submitted 31 August, 2020; originally announced September 2020.

    Comments: 2020 IEEE Visualization

  16. Argo Lite: Open-Source Interactive Graph Exploration and Visualization in Browsers

    Authors: Siwei Li, Zhiyan Zhou, Anish Upadhayay, Omar Shaikh, Scott Freitas, Haekyu Park, Zijie J. Wang, Susanta Routray, Matthew Hull, Duen Horng Chau

    Abstract: Graph data have become increasingly common. Visualizing them helps people better understand relations among entities. Unfortunately, existing graph visualization tools are primarily designed for single-person desktop use, offering limited support for interactive web-based exploration and online collaborative analysis. To address these issues, we have developed Argo Lite, a new in-browser interacti… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: CIKM'20 Resource Track (October 19-23, 2020), 6 pages, 6 figures

  17. arXiv:2006.06105  [pdf, other

    cs.DL cs.CL cs.HC

    PeopleMap: Visualization Tool for Mapping Out Researchers using Natural Language Processing

    Authors: Jon Saad-Falcon, Omar Shaikh, Zijie J. Wang, Austin P. Wright, Sasha Richardson, Duen Horng Chau

    Abstract: Discovering research expertise at institutions can be a difficult task. Manually curated university directories easily become out of date and they often lack the information necessary for understanding a researcher's interests and past work, making it harder to explore the diversity of research at an institution and identify research talents. This results in lost opportunities for both internal an… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: 7 pages, 3 figures, submission to the 29th ACM International Conference on Information and Knowledge Management (CIKM '20), October 19-23, 2020, Galway, Ireland

  18. arXiv:2004.15004  [pdf, other

    cs.HC cs.AI cs.CV cs.LG

    CNN Explainer: Learning Convolutional Neural Networks with Interactive Visualization

    Authors: Zijie J. Wang, Robert Turko, Omar Shaikh, Haekyu Park, Nilaksh Das, Fred Hohman, Minsuk Kahng, Duen Horng Chau

    Abstract: Deep learning's great success motivates many practitioners and students to learn about this exciting technology. However, it is often challenging for beginners to take their first step due to the complexity of understanding and applying deep learning. We present CNN Explainer, an interactive visualization tool designed for non-experts to learn and examine convolutional neural networks (CNNs), a fo… ▽ More

    Submitted 28 August, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: 11 pages, 14 figures, to be presented at IEEE VIS 2020. For a demo video, see https://youtu.be/HnWIHWFbuUQ . For a live demo, visit https://poloclub.github.io/cnn-explainer/

  19. arXiv:2001.10156  [pdf

    physics.geo-ph cs.LG stat.AP

    Real-Time Well Log Prediction From Drilling Data Using Deep Learning

    Authors: Rayan Kanfar, Obai Shaikh, Mehrdad Yousefzadeh, Tapan Mukerji

    Abstract: The objective is to study the feasibility of predicting subsurface rock properties in wells from real-time drilling data. Geophysical logs, namely, density, porosity and sonic logs are of paramount importance for subsurface resource estimation and exploitation. These wireline petro-physical measurements are selectively deployed as they are expensive to acquire; meanwhile, drilling information is r… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

  20. arXiv:2001.02004  [pdf, other

    cs.HC cs.AI cs.LG

    CNN 101: Interactive Visual Learning for Convolutional Neural Networks

    Authors: Zijie J. Wang, Robert Turko, Omar Shaikh, Haekyu Park, Nilaksh Das, Fred Hohman, Minsuk Kahng, Duen Horng Chau

    Abstract: The success of deep learning solving previously-thought hard problems has inspired many non-experts to learn and understand this exciting technology. However, it is often challenging for learners to take the first steps due to the complexity of deep learning models. We present our ongoing work, CNN 101, an interactive visualization system for explaining and teaching convolutional neural networks.… ▽ More

    Submitted 27 February, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: CHI'20 Late-Breaking Work (April 25-30, 2020), 7 pages, 3 figures

  21. arXiv:2001.01819  [pdf, other

    cs.CL cs.CY cs.LG

    RECAST: Interactive Auditing of Automatic Toxicity Detection Models

    Authors: Austin P. Wright, Omar Shaikh, Haekyu Park, Will Epperson, Muhammed Ahmed, Stephane Pinel, Diyi Yang, Duen Horng Chau

    Abstract: As toxic language becomes nearly pervasive online, there has been increasing interest in leveraging the advancements in natural language processing (NLP), from very large transformer models to automatically detecting and removing toxic comments. Despite the fairness concerns, lack of adversarial robustness, and limited prediction explainability for deep learning systems, there is currently little… ▽ More

    Submitted 1 July, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

    Comments: 8 Pages, 3 figures, The eighth International Workshop of Chinese CHI Proceedings

    ACM Class: I.2; I.7; J.4; K.4