Skip to main content

Showing 1–50 of 75 results for author: Cheung, A

  1. arXiv:2409.10446  [pdf, other

    cs.HC

    KoroT-3E: A Personalized Musical Mnemonics Tool for Enhancing Memory Retention of Complex Computer Science Concepts

    Authors: Xiangzhe Yuan, Jiajun Wang, Siying Hu, Andrew Cheung, Zhicong Lu

    Abstract: As the demand for computer science (CS) skills grows, mastering foundational concepts is crucial yet challenging for novice learners. To address this challenge, we present KoroT-3E, an AI-based system that creates personalized musical mnemonics to enhance both memory retention and understanding of concepts in CS. KoroT-3E enables users to transform complex concepts into memorable lyrics and compos… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 26 pages , 2 tables,7 figures, undereview

  2. arXiv:2409.01770  [pdf, ps, other

    math.OC

    Randomized Submanifold Subgradient Method for Optimization over Stiefel Manifolds

    Authors: Andy Yat-Ming Cheung, Jinxin Wang, Man-Chung Yue, Anthony Man-Cho So

    Abstract: Optimization over Stiefel manifolds has found wide applications in many scientific and engineering domains. Despite considerable research effort, high-dimensional optimization problems over Stiefel manifolds remain challenging, and the situation is exacerbated by nonsmooth objective functions. The purpose of this paper is to propose and study a novel coordinate-type algorithm for weakly convex (po… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  3. arXiv:2408.03408  [pdf, other

    cs.AR cs.LG cs.PL

    LLM-Aided Compilation for Tensor Accelerators

    Authors: Charles Hong, Sahil Bhatia, Altan Haan, Shengjun Kris Dong, Dima Nikiforov, Alvin Cheung, Yakun Sophia Shao

    Abstract: Hardware accelerators, in particular accelerators for tensor processing, have many potential application domains. However, they currently lack the software infrastructure to support the majority of domains outside of deep learning. Furthermore, a compiler that can easily be updated to reflect changes at both application and hardware levels would enable more agile development and design space explo… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 4 page workshop paper

  4. arXiv:2407.06291  [pdf, other

    cs.SD eess.AS

    Transfer Learning with Pseudo Multi-Label Birdcall Classification for DS@GT BirdCLEF 2024

    Authors: Anthony Miyaguchi, Adrian Cheung, Murilo Gustineli, Ashley Kim

    Abstract: We present working notes for the DS@GT team on transfer learning with pseudo multi-label birdcall classification for the BirdCLEF 2024 competition, focused on identifying Indian bird species in recorded soundscapes. Our approach utilizes production-grade models such as the Google Bird Vocalization Classifier, BirdNET, and EnCodec to address representation and labeling challenges in the competition… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Submitted and accepted into CLEF 2024 CEUR-WS proceedings

  5. arXiv:2406.14733  [pdf, other

    cs.PL cs.DC

    Suki: Choreographed Distributed Dataflow in Rust

    Authors: Shadaj Laddad, Alvin Cheung, Joseph M. Hellerstein

    Abstract: Programming models for distributed dataflow have long focused on analytical workloads that allow the runtime to dynamically place and schedule compute logic. Meanwhile, models that enable fine-grained control over placement, such as actors, make global optimization difficult. In this extended abstract, we present Suki, an embedded Rust DSL that lets developers implement streaming dataflow with exp… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2406.14066  [pdf, other

    cs.AI cs.PF

    Optimizing Speculative Decoding for Serving Large Language Models Using Goodput

    Authors: Xiaoxuan Liu, Cade Daniel, Langxiang Hu, Woosuk Kwon, Zhuohan Li, Xiangxi Mo, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang

    Abstract: Reducing the inference latency of large language models (LLMs) is crucial, and speculative decoding (SD) stands out as one of the most effective techniques. Rather than letting the LLM generate all tokens directly, speculative decoding employs effective proxies to predict potential outputs, which are then verified by the LLM without compromising the generation quality. Yet, deploying SD in real on… ▽ More

    Submitted 25 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2406.03003  [pdf, other

    cs.PL

    Verified Code Transpilation with LLMs

    Authors: Sahil Bhatia, Jie Qiu, Niranjan Hasabnis, Sanjit A. Seshia, Alvin Cheung

    Abstract: Domain-specific languages (DSLs) are integral to various software workflows. Such languages offer domain-specific optimizations and abstractions that improve code readability and maintainability. However, leveraging these languages requires developers to rewrite existing code using the specific DSL's API. While large language models (LLMs) have shown some success in automatic code transpilation, n… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  8. arXiv:2404.18249  [pdf, other

    cs.PL

    Tenspiler: A Verified Lifting-Based Compiler for Tensor Operations

    Authors: Jie Qiu, Colin Cai, Sahil Bhatia, Niranjan Hasabnis, Sanjit A. Seshia, Alvin Cheung

    Abstract: Tensor processing infrastructures such as deep learning frameworks and specialized hardware accelerators have revolutionized how computationally intensive code from domains such as deep learning and image processing is executed and optimized. These infrastructures provide powerful and expressive abstractions while ensuring high performance. However, to utilize them, code must be written specifical… ▽ More

    Submitted 28 July, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  9. arXiv:2404.14527  [pdf, other

    cs.DC cs.LG

    Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity

    Authors: Tyler Griggs, Xiaoxuan Liu, Jiaxiang Yu, Doyoung Kim, Wei-Lin Chiang, Alvin Cheung, Ion Stoica

    Abstract: Large language models (LLMs) are increasingly integrated into many online services, yet they remain cost-prohibitive to deploy due to the requirement of expensive GPU instances. Prior work has addressed the high cost of LLM serving by improving the inference engine, but less attention has been given to selecting the most cost-efficient GPU type(s) for a specific LLM service. There is a large and g… ▽ More

    Submitted 22 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  10. arXiv:2404.00786  [pdf, ps, other

    cs.AR cs.PL

    There and Back Again: A Netlist's Tale with Much Egraphin'

    Authors: Gus Henry Smith, Zachary D. Sisco, Thanawat Techaumnuaiwit, Jingtao Xia, Vishal Canumalla, Andrew Cheung, Zachary Tatlock, Chandrakana Nandi, Jonathan Balkind

    Abstract: EDA toolchains are notoriously unpredictable, incomplete, and error-prone; the generally-accepted remedy has been to re-imagine EDA tasks as compilation problems. However, any compiler framework we apply must be prepared to handle the wide range of EDA tasks, including not only compilation tasks like technology mapping and optimization (the "there"} in our title), but also decompilation tasks like… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  11. arXiv:2403.04814  [pdf, other

    cs.CL cs.AI cs.LG cs.SE

    Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks

    Authors: Linyuan Gong, Sida Wang, Mostafa Elhoushi, Alvin Cheung

    Abstract: We introduce Syntax-Aware Fill-In-the-Middle (SAFIM), a new benchmark for evaluating Large Language Models (LLMs) on the code Fill-in-the-Middle (FIM) task. This benchmark focuses on syntax-aware completions of program structures such as code blocks and conditional expressions, and includes 17,720 examples from multiple programming languages, sourced from recent code submissions after April 2022 t… ▽ More

    Submitted 22 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 22 pages; ICML 2024 Oral: https://icml.cc/virtual/2024/oral/35482

  12. General characterisation of Hamiltonians generating velocity-independent forces

    Authors: Fredy Yip, A. C. H. Cheung

    Abstract: Dynamics generated from Hamiltonians enjoy potential pathways to quantisation, but standard Hamiltonians are only capable of generating conservative forces. Classes of Hamiltonians have been proposed in Berry et al. capable of generating non-conservative velocity-independent forces. Such Hamiltonians have been classified in the past, under the strict assumption that they are polynomial in momentum… ▽ More

    Submitted 26 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 27 pages

    Journal ref: J. Phys. A: Math. Theor. 57 275203 (2024)

  13. FPGA Technology Mapping Using Sketch-Guided Program Synthesis

    Authors: Gus Henry Smith, Ben Kushigian, Vishal Canumalla, Andrew Cheung, Steven Lyubomirsky, Sorawee Porncharoenwase, René Just, Gilbert Louis Bernstein, Zachary Tatlock

    Abstract: FPGA technology mapping is the process of implementing a hardware design expressed in high-level HDL (hardware design language) code using the low-level, architecture-specific primitives of the target FPGA. As FPGAs become increasingly heterogeneous, achieving high performance requires hardware synthesis tools that better support mapping to complex, highly configurable primitives like digital sign… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  14. arXiv:2401.03003  [pdf, other

    cs.SE cs.CL cs.LG

    AST-T5: Structure-Aware Pretraining for Code Generation and Understanding

    Authors: Linyuan Gong, Mostafa Elhoushi, Alvin Cheung

    Abstract: Large language models (LLMs) have made significant advancements in code-related tasks, yet many LLMs treat code as simple sequences, neglecting its structured nature. We introduce AST-T5, a novel pretraining paradigm that leverages the Abstract Syntax Tree (AST) for enhanced code generation, transpilation, and understanding. Using dynamic programming, our AST-Aware Segmentation retains code struct… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 15 pages; ICML 2024: https://icml.cc/virtual/2024/poster/33601

  15. arXiv:2310.07177  [pdf, other

    cs.AI cs.CL cs.LG

    Online Speculative Decoding

    Authors: Xiaoxuan Liu, Lanxiang Hu, Peter Bailis, Alvin Cheung, Zhijie Deng, Ion Stoica, Hao Zhang

    Abstract: Speculative decoding is a pivotal technique to accelerate the inference of large language models (LLMs) by employing a smaller draft model to predict the target model's outputs. However, its efficacy can be limited due to the low predictive accuracy of the draft model, particularly when faced with diverse text inputs and a significant capability gap between the draft and target models. We introduc… ▽ More

    Submitted 9 June, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  16. arXiv:2308.06410  [pdf, ps, other

    cs.PL cs.AR

    Code Transpilation for Hardware Accelerators

    Authors: Yuto Nishida, Sahil Bhatia, Shadaj Laddad, Hasan Genc, Yakun Sophia Shao, Alvin Cheung

    Abstract: DSLs and hardware accelerators have proven to be very effective in optimizing computationally expensive workloads. In this paper, we propose a solution to the challenge of manually rewriting legacy or unoptimized code in domain-specific languages and hardware accelerators. We introduce an approach that integrates two open-source tools: Metalift, a code translation framework, and Gemmini, a DNN acc… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

  17. Spatialyze: A Geospatial Video Analytics System with Spatial-Aware Optimizations

    Authors: Chanwut Kittivorawong, Yongming Ge, Yousef Helal, Alvin Cheung

    Abstract: Videos that are shot using commodity hardware such as phones and surveillance cameras record various metadata such as time and location. We encounter such geospatial videos on a daily basis and such videos have been growing in volume significantly. Yet, we do not have data management systems that allow users to interact with such data effectively. In this paper, we describe Spatialyze, a new fra… ▽ More

    Submitted 14 July, 2024; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: Project Page: https://spatialyze.github.io

    Journal ref: Proc. VLDB Endow. 17 (2024) 2136-2148

  18. arXiv:2306.10585  [pdf, other

    cs.PL cs.DC

    Optimizing Stateful Dataflow with Local Rewrites

    Authors: Shadaj Laddad, Conor Power, Tyler Hou, Alvin Cheung, Joseph M. Hellerstein

    Abstract: Optimizing a stateful dataflow language is a challenging task. There are strict correctness constraints for preserving properties expected by downstream consumers, a large space of possible optimizations, and complex analyses that must reason about the behavior of the program over time. Classic compiler techniques with specialized optimization passes yield unpredictable performance and have comple… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: EGRAPHS 2023

  19. arXiv:2305.18513  [pdf, ps, other

    cs.CL

    SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics

    Authors: Arash Ardakani, Altan Haan, Shangyin Tan, Doru Thom Popovici, Alvin Cheung, Costin Iancu, Koushik Sen

    Abstract: Transformer-based models, such as BERT and ViT, have achieved state-of-the-art results across different natural language processing (NLP) and computer vision (CV) tasks. However, these models are extremely memory intensive during their fine-tuning process, making them difficult to deploy on GPUs with limited memory resources. To address this issue, we introduce a new tool called SlimFit that reduc… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

  20. Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers

    Authors: Linyuan Gong, Chenyan Xiong, Xiaodong Liu, Payal Bajaj, Yiqing Xie, Alvin Cheung, Jianfeng Gao, Xia Song

    Abstract: This paper explores the effectiveness of model-generated signals in improving zero-shot generalization of text-to-text Transformers such as T5. We study various designs to pretrain T5 using an auxiliary model to construct more challenging token replacements for the main model to denoise. Key aspects under study include the decoding target, the location of the RTD head, and the masking pattern. Bas… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at ACL 2023. 9 pages

  21. arXiv:2305.09580  [pdf, other

    cs.PL cs.AR

    Generate Compilers from Hardware Models!

    Authors: Gus Henry Smith, Ben Kushigian, Vishal Canumalla, Andrew Cheung, René Just, Zachary Tatlock

    Abstract: Compiler backends should be automatically generated from hardware design language (HDL) models of the hardware they target. Generating compiler components directly from HDL can provide stronger correctness guarantees, ease development effort, and encourage hardware exploration. Past work has already championed this idea; here we argue that advances in program synthesis make the approach more feasi… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 3 pages, 2 figures, to be presented at the 2023 PLARCH Workshop at FCRC

  22. arXiv:2303.14633  [pdf, other

    cs.LG cs.PF

    An Evaluation of Memory Optimization Methods for Training Neural Networks

    Authors: Xiaoxuan Liu, Siddharth Jha, Alvin Cheung

    Abstract: As models continue to grow in size, the development of memory optimization methods (MOMs) has emerged as a solution to address the memory bottleneck encountered when training large models. To comprehensively examine the practical value of various MOMs, we have conducted a thorough analysis of existing literature from a systems perspective. Our analysis has revealed a notable challenge within the r… ▽ More

    Submitted 4 June, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  23. arXiv:2303.03593  [pdf, other

    cs.CL cs.LG

    ADELT: Transpilation Between Deep Learning Frameworks

    Authors: Linyuan Gong, Jiayi Wang, Alvin Cheung

    Abstract: We propose the Adversarial DEep Learning Transpiler (ADELT), a novel approach to source-to-source transpilation between deep learning frameworks. ADELT uniquely decouples code skeleton transpilation and API keyword mapping. For code skeleton transpilation, it uses few-shot prompting on large language models (LLMs), while for API keyword mapping, it uses contextual embeddings from a code-specific B… ▽ More

    Submitted 8 May, 2024; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 19 pages, to be published in the main track of IJCAI 2024

  24. arXiv:2211.07047  [pdf, other

    cs.CL

    Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

    Authors: Grace Yang, Ming Cao, Lavender Y. Jiang, Xujin C. Liu, Alexander T. M. Cheung, Hannah Weiss, David Kurland, Kyunghyun Cho, Eric K. Oermann

    Abstract: Traditional evaluation metrics for classification in natural language processing such as accuracy and area under the curve fail to differentiate between models with different predictive behaviors despite their similar performance metrics. We introduce sensitivity score, a metric that scrutinizes models' behaviors at the vocabulary level to provide insights into disparities in their decision-making… ▽ More

    Submitted 15 November, 2022; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 13 pages

  25. arXiv:2210.12605  [pdf, other

    cs.DB

    Keep CALM and CRDT On

    Authors: Shadaj Laddad, Conor Power, Mae Milano, Alvin Cheung, Natacha Crooks, Joseph M. Hellerstein

    Abstract: Despite decades of research and practical experience, developers have few tools for programming reliable distributed applications without resorting to expensive coordination techniques. Conflict-free replicated datatypes (CRDTs) are a promising line of work that enable coordination-free replication and offer certain eventual consistency guarantees in a relatively simple object-oriented API. Yet CR… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

  26. arXiv:2208.09300  [pdf

    cs.LG cs.AI math.DS math.RT

    Expressing Multivariate Time Series as Graphs with Time Series Attention Transformer

    Authors: William T. Ng, K. Siu, Albert C. Cheung, Michael K. Ng

    Abstract: A reliable and efficient representation of multivariate time series is crucial in various downstream machine learning tasks. In multivariate time series forecasting, each variable depends on its historical values and there are inter-dependencies among variables as well. Models have to be designed to capture both intra- and inter-relationships among the time series. To move towards this goal, we pr… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: IJCAI'22 WORKSHOP AI4TS: AI FOR TIME SERIES ANALYSIS

  27. arXiv:2208.01204  [pdf

    cs.NE cs.AI cs.CV cs.LG

    Making a Spiking Net Work: Robust brain-like unsupervised machine learning

    Authors: Peter G. Stratton, Andrew Wabnitz, Chip Essam, Allen Cheung, Tara J. Hamilton

    Abstract: The surge in interest in Artificial Intelligence (AI) over the past decade has been driven almost exclusively by advances in Artificial Neural Networks (ANNs). While ANNs set state-of-the-art performance for many previously intractable problems, the use of global gradient descent necessitates large datasets and computational resources for training, potentially limiting their scalability for real-w… ▽ More

    Submitted 31 August, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: 12 pages (manuscript), 5 figures, 10 pages (appendix), 11 pages (extended data)

  28. arXiv:2206.14276  [pdf, other

    cs.DC cs.LG cs.MS stat.AP

    NumS: Scalable Array Programming for the Cloud

    Authors: Melih Elibol, Vinamra Benara, Samyu Yagati, Lianmin Zheng, Alvin Cheung, Michael I. Jordan, Ion Stoica

    Abstract: Scientists increasingly rely on Python tools to perform scalable distributed memory array operations using rich, NumPy-like expressions. However, many of these tools rely on dynamic schedulers optimized for abstract task graphs, which often encounter memory and network bandwidth-related bottlenecks due to sub-optimal data and operator placement decisions. Tools built on the message passing interfa… ▽ More

    Submitted 12 July, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

  29. arXiv:2206.11357  [pdf, other

    cs.LG

    GACT: Activation Compressed Training for Generic Network Architectures

    Authors: Xiaoxuan Liu, Lianmin Zheng, Dequan Wang, Yukuo Cen, Weize Chen, Xu Han, Jianfei Chen, Zhiyuan Liu, Jie Tang, Joey Gonzalez, Michael Mahoney, Alvin Cheung

    Abstract: Training large neural network (NN) models requires extensive memory resources, and Activation Compressed Training (ACT) is a promising approach to reduce training memory footprint. This paper presents GACT, an ACT framework to support a broad range of machine learning tasks for generic NN architectures with limited domain knowledge. By analyzing a linearized version of ACT's approximate gradient,… ▽ More

    Submitted 3 September, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  30. arXiv:2205.12425  [pdf, other

    cs.PL cs.DC

    Katara: Synthesizing CRDTs with Verified Lifting

    Authors: Shadaj Laddad, Conor Power, Mae Milano, Alvin Cheung, Joseph M. Hellerstein

    Abstract: Conflict-free replicated data types (CRDTs) are a promising tool for designing scalable, coordination-free distributed systems. However, constructing correct CRDTs is difficult, posing a challenge for even seasoned developers. As a result, CRDT development is still largely the domain of academics, with new designs often awaiting peer review and a manual proof of correctness. In this paper, we pres… ▽ More

    Submitted 21 September, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    ACM Class: D.1.2

  31. arXiv:2205.09452  [pdf, other

    cs.LG eess.SY

    Learning-based AC-OPF Solvers on Realistic Network and Realistic Loads

    Authors: Tsun Ho Aaron Cheung, Min Zhou, Minghua Chen

    Abstract: Deep learning approaches for the Alternating Current-Optimal Power Flow (AC-OPF) problem are under active research in recent years. A common shortcoming in this area of research is the lack of a dataset that includes both a realistic power network topology and the corresponding realistic loads. To address this issue, we construct an AC-OPF formulation-ready dataset called TAS-97 that contains real… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 8 pages, 6 figures

  32. arXiv:2205.07147  [pdf

    cs.DC

    The Sky Above The Clouds

    Authors: Sarah Chasins, Alvin Cheung, Natacha Crooks, Ali Ghodsi, Ken Goldberg, Joseph E. Gonzalez, Joseph M. Hellerstein, Michael I. Jordan, Anthony D. Joseph, Michael W. Mahoney, Aditya Parameswaran, David Patterson, Raluca Ada Popa, Koushik Sen, Scott Shenker, Dawn Song, Ion Stoica

    Abstract: Technology ecosystems often undergo significant transformations as they mature. For example, telephony, the Internet, and PCs all started with a single provider, but in the United States each is now served by a competitive market that uses comprehensive and universal technology standards to provide compatibility. This white paper presents our view on how the cloud ecosystem, barely over fifteen ye… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: 35 pages

  33. arXiv:2205.02954  [pdf, other

    cs.DB cs.PL

    Leveraging Application Data Constraints to Optimize Database-Backed Web Applications

    Authors: Xiaoxuan Liu, Shuxian Wang, Mengzhu Sun, Sicheng Pan, Ge Li, Siddharth Jha, Cong Yan, Junwen Yang, Shan Lu, Alvin Cheung

    Abstract: Exploiting the relationships among data is a classical query optimization technique. As persistent data is increasingly being created and maintained programmatically, prior work that infers data relationships from data statistics misses an important opportunity. We present ConstrOpt, the first tool that identifies data relationships by analyzing database-backed applications. Once identified, Const… ▽ More

    Submitted 28 December, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

  34. Synthesizing Analytical SQL Queries from Computation Demonstration

    Authors: Xiangyu Zhou, Rastislav Bodik, Alvin Cheung, Chenglong Wang

    Abstract: Analytical SQL is widely used in modern database applications and data analysis. However, its partitioning and grouping operators are challenging for novice users. Unfortunately, programming by example, shown effective on standard SQL, are less attractive because examples for analytical queries are more laborious to solve by hand. To make demonstrations easier to create, we designed a new end-us… ▽ More

    Submitted 22 April, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

  35. arXiv:2203.00218  [pdf, other

    cs.AR cs.PL

    Application-Level Validation of Accelerator Designs Using a Formal Software/Hardware Interface

    Authors: Bo-Yuan Huang, Steven Lyubomirsky, Yi Li, Mike He, Gus Henry Smith, Thierry Tambe, Akash Gaonkar, Vishal Canumalla, Andrew Cheung, Gu-Yeon Wei, Aarti Gupta, Zachary Tatlock, Sharad Malik

    Abstract: Ideally, accelerator development should be as easy as software development. Several recent design languages/tools are working toward this goal, but actually testing early designs on real applications end-to-end remains prohibitively difficult due to the costs of building specialized compiler and simulator support. We propose a new first-in-class, mostly automated methodology termed "3LA" to enable… ▽ More

    Submitted 22 August, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

  36. arXiv:2202.05388  [pdf, other

    physics.comp-ph cs.ET physics.optics

    Massively parallel pixel-by-pixel nanophotonic optimization using a Green's function formalism

    Authors: Jiahui Wang, Alfred K. C. Cheung, Aleksandra Spyra, Ian A. D. Williamson, Jian Guan, Martin F. Schubert

    Abstract: We introduce an efficient parallelization scheme to implement pixel-by-pixel nanophotonic optimization using a Green's function based formalism. The crucial insight in our proposal is the reframing of the optimization algorithm as a large-scale data processing pipeline, which allows for the efficient distribution of computational tasks across thousands of workers. We demonstrate the utility of our… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 10 pages, 7 figures

  37. arXiv:2201.12965  [pdf, other

    cs.ET cs.LG physics.optics

    Inverse design of photonic devices with strict foundry fabrication constraints

    Authors: Martin F. Schubert, Alfred K. C. Cheung, Ian A. D. Williamson, Aleksandra Spyra, David H. Alexander

    Abstract: We introduce a new method for inverse design of nanophotonic devices which guarantees that resulting designs satisfy strict length scale constraints - including minimum width and spacing constraints required by commercial semiconductor foundries. The method adopts several concepts from machine learning to transform the problem of topology optimization with strict length scale constraints to an unc… ▽ More

    Submitted 13 June, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: 16 pages, 17 figures

    Journal ref: ACS Photonics, vol. 9, no. 7, pp. 2327-2336, Jun. 2022

  38. arXiv:2103.16604  [pdf, other

    cs.DB

    VSS: A Storage System for Video Analytics [Technical Report]

    Authors: Brandon Haynes, Maureen Daum, Dong He, Amrita Mazumdar, Magdalena Balazinska, Alvin Cheung, Luis Ceze

    Abstract: We present a new video storage system (VSS) designed to decouple high-level video operations from the low-level details required to store and efficiently retrieve video data. VSS is designed to be the storage subsystem of a video data management system (VDBMS) and is responsible for: (1) transparently and automatically arranging the data on disk in an efficient, granular format; (2) caching freque… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

  39. Falx: Synthesis-Powered Visualization Authoring

    Authors: Chenglong Wang, Yu Feng, Rastislav Bodik, Isil Dillig, Alvin Cheung, Amy J. Ko

    Abstract: Modern visualization tools aim to allow data analysts to easily create exploratory visualizations. When the input data layout conforms to the visualization design, users can easily specify visualizations by mapping data columns to visual channels of the design. However, when there is a mismatch between data layout and the design, users need to spend significant effort on data transformation. We… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: CHI 2021

  40. arXiv:2101.01159  [pdf, other

    cs.DC cs.DB cs.OS cs.PL

    New Directions in Cloud Programming

    Authors: Alvin Cheung, Natacha Crooks, Joseph M. Hellerstein, Mae Milano

    Abstract: Nearly twenty years after the launch of AWS, it remains difficult for most developers to harness the enormous potential of the cloud. In this paper we lay out an agenda for a new generation of cloud programming research aimed at bringing research ideas to programmers in an evolutionary fashion. Key to our approach is a separation of distributed programs into a PACT of four facets: Program semant… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Journal ref: CIDR 2021

  41. arXiv:2011.14403  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn

    Navigating the Complex Compositional Landscape of High-Entropy Alloys

    Authors: Jie Qi, Andrew M. Cheung, S. Joseph Poon

    Abstract: High-entropy alloys, which exist in the high-dimensional composition space, provide enormous unique opportunities for realizing unprecedented structural and functional properties. A fundamental challenge, however, lies in how to predict the specific alloy phases and desirable properties accurately. This review article provides an overview of the data-driven methods published to date to tackle this… ▽ More

    Submitted 18 May, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: To appear as a book chapter

  42. arXiv:2008.00748  [pdf, other

    cs.LG eess.IV stat.ML

    Tensorizing GAN with High-Order Pooling for Alzheimer's Disease Assessment

    Authors: Wen Yu, Baiying Lei, Michael K. Ng, Albert C. Cheung, Yanyan Shen, Shuqiang Wang

    Abstract: It is of great significance to apply deep learning for the early diagnosis of Alzheimer's Disease (AD). In this work, a novel tensorizing GAN with high-order pooling is proposed to assess Mild Cognitive Impairment (MCI) and AD. By tensorizing a three-player cooperative game based framework, the proposed model can benefit from the structural information of the brain. By incorporating the high-order… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: 15 pages, 20 figures

  43. arXiv:2003.03133  [pdf, other

    cs.HC cs.AI cs.LG cs.RO

    DeFINE: Delayed Feedback based Immersive Navigation Environment for Studying Goal-Directed Human Navigation

    Authors: Kshitij Tiwari, Ville Kyrki, Allen Cheung, Naohide Yamamoto

    Abstract: With the advent of consumer-grade products for presenting an immersive virtual environment (VE), there is a growing interest in utilizing VEs for testing human navigation behavior. However, preparing a VE still requires a high level of technical expertise in computer graphics and virtual reality, posing a significant hurdle to embracing the emerging technology. To address this issue, this paper pr… ▽ More

    Submitted 15 February, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: 43 pages, 10 figures, 5 tables, Submitted to Behavioral Research Methods

    Journal ref: Behav Res (2021)

  44. arXiv:1911.09668  [pdf, other

    cs.PL cs.HC

    Visualization by Example

    Authors: Chenglong Wang, Yu Feng, Rastislav Bodik, Alvin Cheung, Isil Dillig

    Abstract: While visualizations play a crucial role in gaining insights from data, generating useful visualizations from a complex dataset is far from an easy task. Besides understanding the functionality provided by existing visualization libraries, generating the desired visualization also requires reshaping and aggregating the underlying data as well as composing different visual elements to achieve the i… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

  45. arXiv:1904.09086  [pdf, other

    cs.CL

    Learning Programmatic Idioms for Scalable Semantic Parsing

    Authors: Srinivasan Iyer, Alvin Cheung, Luke Zettlemoyer

    Abstract: Programmers typically organize executable source code using high-level coding patterns or idiomatic structures such as nested loops, exception handlers and recursive blocks, rather than as individual code tokens. In contrast, state of the art (SOTA) semantic parsers still map natural language instructions to source code by building the code syntax tree one node at a time. In this paper, we introdu… ▽ More

    Submitted 6 September, 2019; v1 submitted 19 April, 2019; originally announced April 2019.

    Comments: Accepted at EMNLP 2019

  46. arXiv:1904.08880  [pdf

    cond-mat.dis-nn cond-mat.mtrl-sci

    High Entropy Alloys Mined From Phase Diagrams

    Authors: Qi Jie, Andrew Cheung, S. Joseph Poon

    Abstract: High entropy alloys (HEA) show promise as a new type of high-performance structural material. Their vast degrees of freedom provide for extensive opportunities to design alloys with tailored properties. However, the compositional complexities of HEAs present great challenges for alloy design. Current approaches have shown limited reliability in accounting for the compositional regions of single so… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: 20 pages, 6 figures

  47. arXiv:1902.01372  [pdf, other

    cs.MM cs.DB

    Vignette: Perceptual Compression for Video Storage and Processing Systems

    Authors: Amrita Mazumdar, Brandon Haynes, Magdalena Balazinska, Luis Ceze, Alvin Cheung, Mark Oskin

    Abstract: Compressed videos constitute 70% of Internet traffic, and video upload growth rates far outpace compute and storage improvement trends. Past work in leveraging perceptual cues like saliency, i.e., regions where viewers focus their perceptual attention, reduces compressed video size while maintaining perceptual quality, but requires significant changes to video codecs and ignores the data managemen… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

  48. arXiv:1810.01997  [pdf, other

    cs.DB

    Improving High Contention OLTP Performance via Transaction Scheduling

    Authors: Guna Prasaad, Alvin Cheung, Dan Suciu

    Abstract: Research in transaction processing has made significant progress in improving the performance of multi-core in-memory transactional systems. However, the focus has mainly been on low-contention workloads. Modern transactional systems perform poorly on workloads with transactions accessing a few highly contended data items. We observe that most transactional workloads, including those with high con… ▽ More

    Submitted 3 October, 2018; originally announced October 2018.

  49. Hund's coupling stabilized superconductivity in the presence of spin-orbit interactions

    Authors: Alfred K. C. Cheung, D. F. Agterberg

    Abstract: The intraorbital repulsive Hubbard interaction cannot lead to attractive superconducting pairing states, except through the Kohn-Luttinger mechanism. This situation may change when we include additional local interactions such as the interorbital repulsion $U^\prime$ and Hund's interactions $J$. Adding these local interactions, we study the nature of the superconducting pairs in systems with tetra… ▽ More

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: 9 pages

    Journal ref: Phys. Rev. B 99, 024516 (2019)

  50. arXiv:1808.09588  [pdf, ps, other

    cs.CL

    Mapping Language to Code in Programmatic Context

    Authors: Srinivasan Iyer, Ioannis Konstas, Alvin Cheung, Luke Zettlemoyer

    Abstract: Source code is rarely written in isolation. It depends significantly on the programmatic context, such as the class that the code would reside in. To study this phenomenon, we introduce the task of generating class member functions given English documentation and the programmatic context provided by the rest of the class. This task is challenging because the desired code can vary greatly depending… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: Accepted at EMNLP 2018