Skip to main content

Showing 1–46 of 46 results for author: Min, R

  1. arXiv:2410.16184  [pdf, other

    cs.CL

    RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

    Authors: Yantao Liu, Zijun Yao, Rui Min, Yixin Cao, Lei Hou, Juanzi Li

    Abstract: Reward models are critical in techniques like Reinforcement Learning from Human Feedback (RLHF) and Inference Scaling Laws, where they guide language model alignment and select optimal responses. Despite their importance, existing reward model benchmarks often evaluate models by asking them to distinguish between responses generated by models of varying power. However, this approach fails to asses… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  2. arXiv:2410.09838  [pdf, other

    cs.LG cs.AI cs.CR

    Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense

    Authors: Rui Min, Zeyu Qin, Nevin L. Zhang, Li Shen, Minhao Cheng

    Abstract: Backdoor attacks pose a significant threat to Deep Neural Networks (DNNs) as they allow attackers to manipulate model predictions with backdoor triggers. To address these security vulnerabilities, various backdoor purification methods have been proposed to purify compromised models. Typically, these purified models exhibit low Attack Success Rates (ASR), rendering them resistant to backdoored inpu… ▽ More

    Submitted 16 October, 2024; v1 submitted 13 October, 2024; originally announced October 2024.

    Comments: NeurIPS 2024 Spotlight paper. The first two authors contributed equally

  3. arXiv:2410.08436  [pdf, other

    cs.CL cs.AI

    Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models

    Authors: Zi'ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu

    Abstract: When performing complex multi-step reasoning tasks, the ability of Large Language Models (LLMs) to derive structured intermediate proof steps is important for ensuring that the models truly perform the desired reasoning and for improving models' explainability. This paper is centred around a focused study: whether the current state-of-the-art generalist LLMs can leverage the structures in a few ex… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

    Comments: Accepted by EMNLP2024 main conference

  4. arXiv:2410.08207  [pdf, other

    cs.CV cs.LG

    DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

    Authors: Xiaoxiao He, Ligong Han, Quan Dao, Song Wen, Minhao Bai, Di Liu, Han Zhang, Martin Renqiang Min, Felix Juefei-Xu, Chaowei Tan, Bo Liu, Kang Li, Hongdong Li, Junzhou Huang, Faez Ahmed, Akash Srivastava, Dimitris Metaxas

    Abstract: Discrete diffusion models have achieved success in tasks like image generation and masked language modeling but face limitations in controlled content editing. We introduce DICE (Discrete Inversion for Controllable Editing), the first approach to enable precise inversion for discrete diffusion models, including multinomial diffusion and masked generative models. By recording noise sequences and ma… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  5. arXiv:2409.16145  [pdf, other

    cs.CV

    Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment

    Authors: Yuxiao Chen, Kai Li, Wentao Bao, Deep Patel, Yu Kong, Martin Renqiang Min, Dimitris N. Metaxas

    Abstract: Learning to localize temporal boundaries of procedure steps in instructional videos is challenging due to the limited availability of annotated large-scale training videos. Recent works focus on learning the cross-modal alignment between video segments and ASR-transcripted narration texts through contrastive learning. However, these methods fail to account for the alignment noise, i.e., irrelevant… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: Accepted to ECCV 2024

  6. arXiv:2408.12173  [pdf, other

    cs.IR cs.PF

    Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments

    Authors: Maciej Besta, Robert Gerstenberger, Patrick Iff, Pournima Sonawane, Juan Gómez Luna, Raghavendra Kanakagiri, Rui Min, Onur Mutlu, Torsten Hoefler, Raja Appuswamy, Aidan O Mahony

    Abstract: Knowledge graphs (KGs) have achieved significant attention in recent years, particularly in the area of the Semantic Web as well as gaining popularity in other application domains such as data mining and search engines. Simultaneously, there has been enormous progress in the development of different types of heterogeneous hardware, impacting the way KGs are processed. The aim of this paper is to p… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  7. arXiv:2403.12848  [pdf, other

    cs.CV

    Planner3D: LLM-enhanced graph prior meets 3D indoor scene explicit regularization

    Authors: Yao Wei, Martin Renqiang Min, George Vosselman, Li Erran Li, Michael Ying Yang

    Abstract: Compositional 3D scene synthesis has diverse applications across a spectrum of industries such as robotics, films, and video games, as it closely mirrors the complexity of real-world multi-object environments. Conventional works typically employ shape retrieval based frameworks which naturally suffer from limited shape diversity. Recent progresses have been made in object shape generation with gen… ▽ More

    Submitted 26 August, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 16 pages, 10 figures

  8. arXiv:2403.10893  [pdf, other

    cs.CR

    A Watermark-Conditioned Diffusion Model for IP Protection

    Authors: Rui Min, Sen Li, Hongyang Chen, Minhao Cheng

    Abstract: The ethical need to protect AI-generated content has been a significant concern in recent years. While existing watermarking strategies have demonstrated success in detecting synthetic content (detection), there has been limited exploration in identifying the users responsible for generating these outputs from a single model (owner identification). In this paper, we focus on both practical scenari… ▽ More

    Submitted 16 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  9. arXiv:2403.02782  [pdf, other

    cs.CV

    Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos

    Authors: Kumaranage Ravindu Yasas Nagasinghe, Honglu Zhou, Malitha Gunawardhana, Martin Renqiang Min, Daniel Harari, Muhammad Haris Khan

    Abstract: In this paper, we explore the capability of an agent to construct a logical sequence of action steps, thereby assembling a strategic procedural plan. This plan is crucial for navigating from an initial visual observation to a target visual outcome, as depicted in real-life instructional videos. Existing works have attained partial success by extensively leveraging various sources of information av… ▽ More

    Submitted 15 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 8 pages, 6 figures, (supplementary material: 9 pages, 5 figures), accepted to CVPR 2024

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 , Pages 18816-18826

  10. arXiv:2310.01875  [pdf, other

    cs.LG cs.AI cs.CR

    Towards Stable Backdoor Purification through Feature Shift Tuning

    Authors: Rui Min, Zeyu Qin, Li Shen, Minhao Cheng

    Abstract: It has been widely observed that deep neural networks (DNN) are vulnerable to backdoor attacks where attackers could manipulate the model behavior maliciously by tampering with a small set of training samples. Although a line of defense methods is proposed to mitigate this threat, they either require complicated modifications to the training process or heavily rely on the specific model architectu… ▽ More

    Submitted 21 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 paper. The first two authors contributed equally

  11. arXiv:2304.12536  [pdf, other

    cs.CV

    Exploring Compositional Visual Generation with Latent Classifier Guidance

    Authors: Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min

    Abstract: Diffusion probabilistic models have achieved enormous success in the field of image generation and manipulation. In this paper, we explore a novel paradigm of using the diffusion model and classifier guidance in the latent semantic space for compositional visual tasks. Specifically, we train latent diffusion models and auxiliary latent classifiers to facilitate non-linear navigation of latent repr… ▽ More

    Submitted 24 May, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR Workshop 2023

  12. arXiv:2303.13744  [pdf, other

    cs.CV

    Conditional Image-to-Video Generation with Latent Flow Diffusion Models

    Authors: Haomiao Ni, Changhao Shi, Kai Li, Sharon X. Huang, Martin Renqiang Min

    Abstract: Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video starting from an image (e.g., a person's face) and a condition (e.g., an action class label like smile). The key challenge of the cI2V task lies in the simultaneous generation of realistic spatial appearance and temporal dynamics corresponding to the given image and condition. In this paper, we propose an approac… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  13. arXiv:2303.02162  [pdf, other

    q-bio.QM cs.LG

    T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precesion Immunotherapy

    Authors: Ziqi Chen, Martin Renqiang Min, Hongyu Guo, Chao Cheng, Trevor Clancy, Xia Ning

    Abstract: T cells monitor the health status of cells by identifying foreign peptides displayed on their surface. T-cell receptors (TCRs), which are protein complexes found on the surface of T cells, are able to bind to these peptides. This process is known as TCR recognition and constitutes a key step for immune response. Optimizing TCR sequences for TCR recognition represents a fundamental step towards the… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  14. arXiv:2301.01413  [pdf, other

    cs.CV

    Attribute-Centric Compositional Text-to-Image Generation

    Authors: Yuren Cong, Martin Renqiang Min, Li Erran Li, Bodo Rosenhahn, Michael Ying Yang

    Abstract: Despite the recent impressive breakthroughs in text-to-image generation, generative models have difficulty in capturing the data distribution of underrepresented attribute compositions while over-memorizing overrepresented attribute compositions, which raises public concerns about their robustness and fairness. To tackle this challenge, we propose ACTIG, an attribute-centric compositional text-to-… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

  15. arXiv:2209.05244  [pdf, other

    cs.CV cs.CR

    Universal Backdoor Attacks Detection via Adaptive Adversarial Probe

    Authors: Yuhang Wang, Huafeng Shi, Rui Min, Ruijia Wu, Siyuan Liang, Yichao Wu, Ding Liang, Aishan Liu

    Abstract: Extensive evidence has demonstrated that deep neural networks (DNNs) are vulnerable to backdoor attacks, which motivates the development of backdoor attacks detection. Most detection methods are designed to verify whether a model is infected with presumed types of backdoor attacks, yet the adversary is likely to generate diverse backdoor attacks in practice that are unforeseen to defenders, which… ▽ More

    Submitted 7 December, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 8 pages, 8 figures

  16. arXiv:2203.15799  [pdf, other

    cs.CV

    StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis

    Authors: Zhiheng Li, Martin Renqiang Min, Kai Li, Chenliang Xu

    Abstract: Although progress has been made for text-to-image synthesis, previous methods fall short of generalizing to unseen or underrepresented attribute compositions in the input text. Lacking compositionality could have severe implications for robustness and fairness, e.g., inability to synthesize the face images of underrepresented demographic groups. In this paper, we introduce a new framework, StyleT2… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  17. arXiv:2203.03475  [pdf, other

    stat.ML cs.LG eess.SP

    State space partitioning based on constrained spectral clustering for block particle filtering

    Authors: Rui Min, Christelle Garnier, François Septier, John Klein

    Abstract: The particle filter (PF) is a powerful inference tool widely used to estimate the filtering distribution in non-linear and/or non-Gaussian problems. To overcome the curse of dimensionality of PF, the block PF (BPF) inserts a blocking step to partition the state space into several subspaces or blocks of smaller dimension so that the correction and resampling steps can be performed independently on… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  18. arXiv:2202.12403  [pdf, other

    cs.CV cs.LG

    Learning Transferable Reward for Query Object Localization with Policy Adaptation

    Authors: Tingfeng Li, Shaobo Han, Martin Renqiang Min, Dimitris N. Metaxas

    Abstract: We propose a reinforcement learning based approach to query object localization, for which an agent is trained to localize objects of interest specified by a small exemplary set. We learn a transferable reward signal formulated using the exemplary set by ordinal metric learning. Our proposed method enables test-time policy adaptation to new environments where the reward signals are not readily ava… ▽ More

    Submitted 14 March, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: ICLR 2022

  19. arXiv:2110.08718  [pdf, other

    cs.CV eess.IV

    AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

    Authors: Ligong Han, Sri Harsha Musunuri, Martin Renqiang Min, Ruijiang Gao, Yu Tian, Dimitris Metaxas

    Abstract: StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad hoc after the generator is trained in a two-stage fashion. In this paper, we focus on style-based generators asking a scientific question: Does forcing such a g… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: Accepted at WACV-22

  20. arXiv:2108.09016  [pdf, other

    cs.CV

    Dual Projection Generative Adversarial Networks for Conditional Image Generation

    Authors: Ligong Han, Martin Renqiang Min, Anastasis Stathopoulos, Yu Tian, Ruijiang Gao, Asim Kadav, Dimitris Metaxas

    Abstract: Conditional Generative Adversarial Networks (cGANs) extend the standard unconditional GAN framework to learning joint data-label distributions from samples, and have been established as powerful generative models capable of generating high-fidelity imagery. A challenge of training such a model lies in properly infusing class information into its generator and discriminator. For the discriminator,… ▽ More

    Submitted 29 November, 2021; v1 submitted 20 August, 2021; originally announced August 2021.

    Comments: Accepted at ICCV-21

  21. arXiv:2103.10574  [pdf, other

    cs.CV

    Hopper: Multi-hop Transformer for Spatiotemporal Reasoning

    Authors: Honglu Zhou, Asim Kadav, Farley Lai, Alexandru Niculescu-Mizil, Martin Renqiang Min, Mubbasir Kapadia, Hans Peter Graf

    Abstract: This paper considers the problem of spatiotemporal object-centric reasoning in videos. Central to our approach is the notion of object permanence, i.e., the ability to reason about the location of objects as they move through the video while being occluded, contained or carried by other objects. Existing deep learning based approaches often suffer from spatiotemporal biases when applied to video r… ▽ More

    Submitted 21 March, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

  22. arXiv:2101.07496  [pdf, other

    cs.LG cs.AI

    Disentangled Recurrent Wasserstein Autoencoder

    Authors: Jun Han, Martin Renqiang Min, Ligong Han, Li Erran Li, Xuan Zhang

    Abstract: Learning disentangled representations leads to interpretable models and facilitates data generation with style transfer, which has been extensively studied on static data such as images in an unsupervised learning framework. However, only a few works have explored unsupervised disentangled sequential representation learning due to challenges of generating sequential data. In this paper, we propose… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: ICLR 2021

  23. arXiv:2012.04231  [pdf, other

    cs.LG cs.NE stat.ML

    A Deep Generative Model for Molecule Optimization via One Fragment Modification

    Authors: Ziqi Chen, Martin Renqiang Min, Srinivasan Parthasarathy, Xia Ning

    Abstract: Molecule optimization is a critical step in drug development to improve desired properties of drug candidates through chemical modification. We developed a novel deep generative model Modof over molecular graphs for molecule optimization. Modof modifies a given molecule through the prediction of a single site of disconnection at the molecule and the removal and/or addition of fragments at that sit… ▽ More

    Submitted 13 January, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: This paper has been accepted by Nature Machine Intelligence

    Journal ref: Nat Mach Intell. 3 (2021) 1040-1049

  24. arXiv:2012.02840  [pdf, other

    q-bio.QM cs.AI cs.LG

    Ranking-based Convolutional Neural Network Models for Peptide-MHC Binding Prediction

    Authors: Ziqi Chen, Martin Renqiang Min, Xia Ning

    Abstract: T-cell receptors can recognize foreign peptides bound to major histocompatibility complex (MHC) class-I proteins, and thus trigger the adaptive immune response. Therefore, identifying peptides that can bind to MHC class-I molecules plays a vital role in the design of peptide vaccines. Many computational methods, for example, the state-of-the-art allele-specific method MHCflurry, have been develope… ▽ More

    Submitted 7 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: 17 pages, 5 figures

  25. arXiv:2006.00693  [pdf, other

    cs.LG stat.ML

    Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

    Authors: Pengyu Cheng, Martin Renqiang Min, Dinghan Shen, Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

    Abstract: Learning disentangled representations of natural language is essential for many NLP tasks, e.g., conditional text generation, style transfer, personalized dialogue systems, etc. Similar problems have been studied extensively for other forms of data, such as images and videos. However, the discrete nature of natural language makes the disentangling of textual representations more challenging (e.g.,… ▽ More

    Submitted 12 January, 2022; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: Accepted by the 58th Annual Meeting of the Association for Computational Linguistics (ACL2020)

  26. arXiv:2005.11437  [pdf, other

    cs.CV

    S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

    Authors: Yizhe Zhu, Martin Renqiang Min, Asim Kadav, Hans Peter Graf

    Abstract: We propose a sequential variational autoencoder to learn disentangled representations of sequential data (e.g., videos and audios) under self-supervision. Specifically, we exploit the benefits of some readily accessible supervisory signals from input data itself or some off-the-shelf functional models and accordingly design auxiliary tasks for our model to utilize these signals. With the supervisi… ▽ More

    Submitted 22 May, 2020; originally announced May 2020.

    Comments: to appear in CVPR2020

  27. arXiv:1911.06910  [pdf, other

    cs.CL cs.IR cs.LG

    CNN-based Dual-Chain Models for Knowledge Graph Learning

    Authors: Bo Peng, Renqiang Min, Xia Ning

    Abstract: Knowledge graph learning plays a critical role in integrating domain specific knowledge bases when deploying machine learning and data mining models in practice. Existing methods on knowledge graph learning primarily focus on modeling the relations among entities as translations among the relations and entities, and many of these methods are not able to handle zero-shot problems, when new entities… ▽ More

    Submitted 26 November, 2019; v1 submitted 15 November, 2019; originally announced November 2019.

  28. arXiv:1909.05995  [pdf, other

    cs.CV

    Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective

    Authors: Kai Li, Martin Renqiang Min, Yun Fu

    Abstract: Zero-shot learning (ZSL) aims to recognize instances of unseen classes solely based on the semantic descriptions of the classes. Existing algorithms usually formulate it as a semantic-visual correspondence problem, by learning mappings from one feature space to the other. Despite being reasonable, previous approaches essentially discard the highly precious discriminative power of visual features i… ▽ More

    Submitted 27 November, 2019; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Accepted to ICCV 2019. First update: add project link and correct some typos

  29. A Deep Spatio-Temporal Fuzzy Neural Network for Passenger Demand Prediction

    Authors: Xiaoyuan Liang, Guiling Wang, Martin Renqiang Min, Yi Qi, Zhu Han

    Abstract: In spite of its importance, passenger demand prediction is a highly challenging problem, because the demand is simultaneously influenced by the complex interactions among many spatial and temporal factors and other external factors such as weather. To address this problem, we propose a Spatio-TEmporal Fuzzy neural Network (STEF-Net) to accurately predict passenger demands incorporating the complex… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: https://epubs.siam.org/doi/abs/10.1137/1.9781611975673.12

    Journal ref: Proceedings of the 2019 SIAM International Conference on Data Mining

  30. arXiv:1902.11134  [pdf, other

    cs.CV cs.LG stat.ML

    Disentangled Deep Autoencoding Regularization for Robust Image Classification

    Authors: Zhenyu Duan, Martin Renqiang Min, Li Erran Li, Mingbo Cai, Yi Xu, Bingbing Ni

    Abstract: In spite of achieving revolutionary successes in machine learning, deep convolutional neural networks have been recently found to be vulnerable to adversarial attacks and difficult to generalize to novel test images with reasonably large geometric transformations. Inspired by a recent neuroscience discovery revealing that primate brain employs disentangled shape and appearance representations for… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: 9 pages

  31. arXiv:1811.07950  [pdf, other

    cs.CV

    Optimal Transport Classifier: Defending Against Adversarial Attacks by Regularized Deep Embedding

    Authors: Yao Li, Martin Renqiang Min, Wenchao Yu, Cho-Jui Hsieh, Thomas C. M. Lee, Erik Kruus

    Abstract: Recent studies have demonstrated the vulnerability of deep convolutional neural networks against adversarial examples. Inspired by the observation that the intrinsic dimension of image data is much smaller than its pixel space dimension and the vulnerability of neural networks grows with the input dimension, we propose to embed high-dimensional input images into a low-dimensional space to perform… ▽ More

    Submitted 9 December, 2018; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: 9 pages

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 7496-7505

  32. arXiv:1806.09464  [pdf, other

    cs.LG cs.AI stat.ML

    Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations

    Authors: Ting Chen, Martin Renqiang Min, Yizhou Sun

    Abstract: Conventional embedding methods directly associate each symbol with a continuous embedding vector, which is equivalent to applying a linear transformation based on a "one-hot" encoding of the discrete symbols. Despite its simplicity, such approach yields the number of parameters that grows linearly with the vocabulary size and can lead to overfitting. In this work, we propose a much more compact K-… ▽ More

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: ICML 2018. arXiv admin note: text overlap with arXiv:1711.03067

  33. arXiv:1805.09843  [pdf, other

    cs.CL cs.AI cs.LG

    Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms

    Authors: Dinghan Shen, Guoyin Wang, Wenlin Wang, Martin Renqiang Min, Qinliang Su, Yizhe Zhang, Chunyuan Li, Ricardo Henao, Lawrence Carin

    Abstract: Many deep learning architectures have been proposed to model the compositionality in text sequences, requiring a substantial number of parameters and expensive computations. However, there has not been a rigorous evaluation regarding the added value of sophisticated compositional functions. In this paper, we conduct a point-by-point comparative study between Simple Word-Embedding-based Models (SWE… ▽ More

    Submitted 24 May, 2018; originally announced May 2018.

    Comments: To appear at ACL 2018 (code: https://github.com/dinghanshen/SWEM)

  34. arXiv:1711.03067  [pdf, other

    cs.LG cs.AI stat.ML

    Learning K-way D-dimensional Discrete Code For Compact Embedding Representations

    Authors: Ting Chen, Martin Renqiang Min, Yizhou Sun

    Abstract: Embedding methods such as word embedding have become pillars for many applications containing discrete structures. Conventional embedding methods directly associate each symbol with a continuous embedding vector, which is equivalent to applying linear transformation based on "one-hot" encoding of the discrete symbols. Despite its simplicity, such approach yields number of parameters that grows lin… ▽ More

    Submitted 10 December, 2017; v1 submitted 8 November, 2017; originally announced November 2017.

    Comments: NIPS'17 DISCML

  35. arXiv:1710.08502   

    cs.LG stat.ML

    Convolutional Neural Knowledge Graph Learning

    Authors: Feipeng Zhao, Martin Renqiang Min, Chen Shen, Amit Chakraborty

    Abstract: Previous models for learning entity and relationship embeddings of knowledge graphs such as TransE, TransH, and TransR aim to explore new links based on learned representations. However, these models interpret relationships as simple translations on entity embeddings. In this paper, we try to learn more complex connections between entities and relationships. In particular, we use a Convolutional N… ▽ More

    Submitted 29 March, 2018; v1 submitted 23 October, 2017; originally announced October 2017.

    Comments: evaluation mistake

  36. arXiv:1710.05128  [pdf, ps, other

    cs.LG

    Parametric t-Distributed Stochastic Exemplar-centered Embedding

    Authors: Martin Renqiang Min, Hongyu Guo, Dinghan Shen

    Abstract: Parametric embedding methods such as parametric t-SNE (pt-SNE) have been widely adopted for data visualization and out-of-sample data embedding without further computationally expensive optimization or approximation. However, the performance of pt-SNE is highly sensitive to the hyper-parameter batch size due to conflicting optimization goals, and often produces dramatically different embeddings wi… ▽ More

    Submitted 20 April, 2018; v1 submitted 13 October, 2017; originally announced October 2017.

    Comments: fixed typos

  37. arXiv:1710.00421  [pdf, other

    cs.MM

    Video Generation From Text

    Authors: Yitong Li, Martin Renqiang Min, Dinghan Shen, David Carlson, Lawrence Carin

    Abstract: Generating videos from text has proven to be a significant challenge for existing generative models. We tackle this problem by training a conditional generative model to extract both static and dynamic information from text. This is manifested in a hybrid framework, employing a Variational Autoencoder (VAE) and a Generative Adversarial Network (GAN). The static features, called "gist," are used to… ▽ More

    Submitted 1 October, 2017; originally announced October 2017.

  38. arXiv:1709.08294  [pdf, other

    cs.CL cs.LG stat.ML

    Learning Context-Sensitive Convolutional Filters for Text Processing

    Authors: Dinghan Shen, Martin Renqiang Min, Yitong Li, Lawrence Carin

    Abstract: Convolutional neural networks (CNNs) have recently emerged as a popular building block for natural language processing (NLP). Despite their success, most existing CNN models employed in NLP share the same learned (and static) set of filters for all input sentences. In this paper, we consider an approach of using a small meta network to learn context-sensitive convolutional filters for text process… ▽ More

    Submitted 30 August, 2018; v1 submitted 24 September, 2017; originally announced September 2017.

    Comments: Accepted by EMNLP 2018 as a full paper

  39. arXiv:1702.06602  [pdf, ps, other

    cs.LG stat.ML

    Exemplar-Centered Supervised Shallow Parametric Data Embedding

    Authors: Martin Renqiang Min, Hongyu Guo, Dongjin Song

    Abstract: Metric learning methods for dimensionality reduction in combination with k-Nearest Neighbors (kNN) have been extensively deployed in many classification, data embedding, and information retrieval applications. However, most of these approaches involve pairwise training data comparisons, and thus have quadratic computational complexity with respect to the size of training set, preventing them from… ▽ More

    Submitted 5 July, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

    Comments: accepted to IJCAI2017

  40. A Context-aware Attention Network for Interactive Question Answering

    Authors: Huayu Li, Martin Renqiang Min, Yong Ge, Asim Kadav

    Abstract: Neural network based sequence-to-sequence models in an encoder-decoder framework have been successfully applied to solve Question Answering (QA) problems, predicting answers from statements and questions. However, almost all previous models have failed to consider detailed context information and unknown states under which systems do not have enough information to answer given questions. These sce… ▽ More

    Submitted 3 September, 2017; v1 submitted 21 December, 2016; originally announced December 2016.

    Comments: 9 pages

  41. arXiv:1611.07837  [pdf, other

    cs.CV cs.CL

    Adaptive Feature Abstraction for Translating Video to Text

    Authors: Yunchen Pu, Martin Renqiang Min, Zhe Gan, Lawrence Carin

    Abstract: Previous models for video captioning often use the output from a specific layer of a Convolutional Neural Network (CNN) as video features. However, the variable context-dependent semantics in the video may make it more appropriate to adaptively select features from the multiple CNN layers. We propose a new approach for generating adaptive spatiotemporal representations of videos for the captioning… ▽ More

    Submitted 17 November, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: Accepted to AAAI 2018

  42. arXiv:1608.04689  [pdf, ps, other

    cs.AI cs.LG stat.ML

    A Shallow High-Order Parametric Approach to Data Visualization and Compression

    Authors: Martin Renqiang Min, Hongyu Guo, Dongjin Song

    Abstract: Explicit high-order feature interactions efficiently capture essential structural knowledge about the data of interest and have been used for constructing generative models. We present a supervised discriminative High-Order Parametric Embedding (HOPE) approach to data visualization and compression. Compared to deep embedding models with complicated deep architectures, HOPE generates more effective… ▽ More

    Submitted 16 August, 2016; originally announced August 2016.

  43. arXiv:1603.05544  [pdf, other

    cs.LG cs.DC

    Accelerating Deep Neural Network Training with Inconsistent Stochastic Gradient Descent

    Authors: Linnan Wang, Yi Yang, Martin Renqiang Min, Srimat Chakradhar

    Abstract: SGD is the widely adopted method to train CNN. Conceptually it approximates the population with a randomly sampled batch; then it evenly trains batches by conducting a gradient update on every batch in an epoch. In this paper, we demonstrate Sampling Bias, Intrinsic Image Difference and Fixed Cycle Pseudo Random Sampling differentiate batches in training, which then affect learning speeds on them.… ▽ More

    Submitted 28 March, 2017; v1 submitted 17 March, 2016; originally announced March 2016.

    Comments: The patent of ISGD belongs to NEC Labs

  44. arXiv:1504.08022  [pdf, ps, other

    cs.LG cs.NE

    A Deep Learning Model for Structured Outputs with High-order Interaction

    Authors: Hongyu Guo, Xiaodan Zhu, Martin Renqiang Min

    Abstract: Many real-world applications are associated with structured data, where not only input but also output has interplay. However, typical classification and regression models often lack the ability of simultaneously exploring high-order interaction within input and that within output. In this paper, we present a deep learning model aiming to generate a powerful nonlinear functional mapping from struc… ▽ More

    Submitted 29 April, 2015; originally announced April 2015.

  45. arXiv:1205.5341  [pdf, other

    cs.IT

    Joint Channel Estimation and Data Detection for Multihop OFDM Relaying System under Unknown Channel Orders and Doppler Frequencies

    Authors: Rui Min, Yik-Chung Wu

    Abstract: In this paper, channel estimation and data detection for multihop relaying orthogonal frequency division multiplexing (OFDM) system is investigated under time-varying channel. Different from previous works, which highly depend on the statistical information of the doubly-selective channel (DSC) and noise to deliver accurate channel estimation and data detection results, we focus on more practical… ▽ More

    Submitted 24 May, 2012; originally announced May 2012.

  46. arXiv:0906.1814  [pdf, ps, other

    cs.LG cs.AI

    Large-Margin kNN Classification Using a Deep Encoder Network

    Authors: Martin Renqiang Min, David A. Stanley, Zineng Yuan, Anthony Bonner, Zhaolei Zhang

    Abstract: KNN is one of the most popular classification methods, but it often fails to work well with inappropriate choice of distance metric or due to the presence of numerous class-irrelevant features. Linear feature transformation methods have been widely applied to extract class-relevant information to improve kNN classification, which is very limited in many applications. Kernels have been used to le… ▽ More

    Submitted 9 June, 2009; originally announced June 2009.

    Comments: 13 pages (preliminary version)