Skip to main content

Showing 1–16 of 16 results for author: Koo, G

  1. arXiv:2409.13037  [pdf, other

    cs.CV

    DNI: Dilutional Noise Initialization for Diffusion Video Editing

    Authors: Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo

    Abstract: Text-based diffusion video editing systems have been successful in performing edits with high fidelity and textual alignment. However, this success is limited to rigid-type editing such as style transfer and object overlay, while preserving the original structure of the input video. This limitation stems from an initial latent noise employed in diffusion video editing systems. The diffusion video… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: 17 pages, 11 figures, ECCV 2024

  2. arXiv:2407.17850  [pdf, other

    cs.CV

    FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing

    Authors: Gwanhyeong Koo, Sunjae Yoon, Ji Woo Hong, Chang D. Yoo

    Abstract: Current image editing methods primarily utilize DDIM Inversion, employing a two-branch diffusion approach to preserve the attributes and layout of the original image. However, these methods encounter challenges with non-rigid edits, which involve altering the image's layout or structure. Our comprehensive analysis reveals that the high-frequency components of DDIM latent, crucial for retaining the… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  3. arXiv:2406.06044  [pdf, other

    cs.CV

    FRAG: Frequency Adapting Group for Diffusion Video Editing

    Authors: Sunjae Yoon, Gwanhyeong Koo, Geonwoo Kim, Chang D. Yoo

    Abstract: In video editing, the hallmark of a quality edit lies in its consistent and unobtrusive adjustment. Modification, when integrated, must be smooth and subtle, preserving the natural flow and aligning seamlessly with the original vision. Therefore, our primary focus is on overcoming the current challenges in high quality edit to ensure that each edit enhances the final product without disrupting its… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 16 pages, 16 figures, ICML 2024

  4. arXiv:2401.09794  [pdf, other

    cs.CV

    Wavelet-Guided Acceleration of Text Inversion in Diffusion-Based Image Editing

    Authors: Gwanhyeong Koo, Sunjae Yoon, Chang D. Yoo

    Abstract: In the field of image editing, Null-text Inversion (NTI) enables fine-grained editing while preserving the structure of the original image by optimizing null embeddings during the DDIM sampling process. However, the NTI process is time-consuming, taking more than two minutes per image. To address this, we introduce an innovative method that maintains the principles of the NTI while accelerating th… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: The International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024

  5. arXiv:2312.06708  [pdf, other

    cs.CV

    Neutral Editing Framework for Diffusion-based Video Editing

    Authors: Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo

    Abstract: Text-conditioned image editing has succeeded in various types of editing based on a diffusion framework. Unfortunately, this success did not carry over to a video, which continues to be challenging. Existing video editing systems are still limited to rigid-type editing such as style transfer and object overlay. To this end, this paper proposes Neutral Editing (NeuEdit) framework to enable complex… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 18 pages, 14 figures

  6. arXiv:2310.05241  [pdf, other

    cs.CV

    SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval

    Authors: Sunjae Yoon, Gwanhyeong Koo, Dahyun Kim, Chang D. Yoo

    Abstract: Video moment retrieval aims to localize moments in video corresponding to a given language query. To avoid the expensive cost of annotating the temporal moments, weakly-supervised VMR (wsVMR) systems have been studied. For such systems, generating a number of proposals as moment candidates and then selecting the most appropriate proposal has been a popular approach. These proposals are assumed to… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 11 pages, Accepted in ICCV 2023

  7. arXiv:2307.03077  [pdf, other

    cs.LG cs.SI

    Learning Disentangled Representations in Signed Directed Graphs without Social Assumptions

    Authors: Geonwoo Ko, Jinhong Jung

    Abstract: Signed graphs are complex systems that represent trust relationships or preferences in various domains. Learning node representations in such graphs is crucial for many mining tasks. Although real-world signed relationships can be influenced by multiple latent factors, most existing methods often oversimplify the modeling of signed relationships by relying on social theories and treating them as s… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 26 pages, 11 figures

  8. arXiv:2306.08162  [pdf, other

    cs.CL cs.AI cs.LG

    INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

    Authors: Yuji Chai, John Gkountouras, Glenn G. Ko, David Brooks, Gu-Yeon Wei

    Abstract: We introduce a method that dramatically reduces fine-tuning VRAM requirements and rectifies quantization errors in quantized Large Language Models. First, we develop an extremely memory-efficient fine-tuning (EMEF) method for quantized models using Low-Rank Adaptation (LoRA), and drawing upon it, we construct an error-correcting algorithm designed to minimize errors induced by the quantization pro… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  9. arXiv:2209.12127  [pdf, other

    cs.LG

    SpeedLimit: Neural Architecture Search for Quantized Transformer Models

    Authors: Yuji Chai, Luke Bailey, Yunho Jin, Matthew Karle, Glenn G. Ko, David Brooks, Gu-Yeon Wei, H. T. Kung

    Abstract: While research in the field of transformer models has primarily focused on enhancing performance metrics such as accuracy and perplexity, practical applications in industry often necessitate a rigorous consideration of inference latency constraints. Addressing this challenge, we introduce SpeedLimit, a novel Neural Architecture Search (NAS) technique that optimizes accuracy whilst adhering to an u… ▽ More

    Submitted 13 October, 2023; v1 submitted 24 September, 2022; originally announced September 2022.

  10. arXiv:2107.10480  [pdf, other

    cs.LG cs.CR cs.CV

    Unsupervised Detection of Adversarial Examples with Model Explanations

    Authors: Gihyuk Ko, Gyumin Lim

    Abstract: Deep Neural Networks (DNNs) have shown remarkable performance in a diverse range of machine learning applications. However, it is widely known that DNNs are vulnerable to simple adversarial perturbations, which causes the model to incorrectly classify inputs. In this paper, we propose a simple yet effective method to detect adversarial examples, using methods developed to explain the model's behav… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: AdvML@KDD'21

  11. arXiv:2001.05153  [pdf, other

    cs.CV

    Extending Class Activation Mapping Using Gaussian Receptive Field

    Authors: Bum Jun Kim, Gyogwon Koo, Hyeyeon Choi, Sang Woo Kim

    Abstract: This paper addresses the visualization task of deep learning models. To improve Class Activation Mapping (CAM) based visualization method, we offer two options. First, we propose Gaussian upsampling, an improved upsampling method that can reflect the characteristics of deep learning models. Second, we identify and modify unnatural terms in the mathematical derivation of the existing CAM studies. B… ▽ More

    Submitted 15 January, 2020; originally announced January 2020.

    Comments: 7 pages, 5 figures

  12. CHIPKIT: An agile, reusable open-source framework for rapid test chip development

    Authors: Paul Whatmough, Marco Donato, Glenn Ko, Sae-Kyu Lee, David Brooks, Gu-Yeon Wei

    Abstract: The current trend for domain-specific architectures (DSAs) has led to renewed interest in research test chips to demonstrate new specialized hardware. Tape-outs also offer huge pedagogical value garnered from real hands-on exposure to the whole system stack. However, successful tape-outs demand hard-earned experience, and the design process is time consuming and fraught with challenges. Therefore,… ▽ More

    Submitted 26 May, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

  13. arXiv:1904.07714  [pdf, other

    cs.CV cs.AI cs.PF

    Low-Power Computer Vision: Status, Challenges, Opportunities

    Authors: Sergei Alyamkin, Matthew Ardi, Alexander C. Berg, Achille Brighton, Bo Chen, Yiran Chen, Hsin-Pai Cheng, Zichen Fan, Chen Feng, Bo Fu, Kent Gauen, Abhinav Goel, Alexander Goncharenko, Xuyang Guo, Soonhoi Ha, Andrew Howard, Xiao Hu, Yuanjun Huang, Donghyun Kang, Jaeyoun Kim, Jong Gook Ko, Alexander Kondratyev, Junhyeok Lee, Seungjae Lee, Suwoong Lee , et al. (19 additional authors not shown)

    Abstract: Computer vision has achieved impressive progress in recent years. Meanwhile, mobile phones have become the primary computing platforms for millions of people. In addition to mobile phones, many autonomous systems rely on visual data for making decisions and some of these systems have limited energy (such as unmanned aerial vehicles also called drones and mobile robots). These systems rely on batte… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Preprint, Accepted by IEEE Journal on Emerging and Selected Topics in Circuits and Systems. arXiv admin note: substantial text overlap with arXiv:1810.01732

  14. Selective Distillation of Weakly Annotated GTD for Vision-based Slab Identification System

    Authors: Sang Jun Lee, Sang Woo Kim, Wookyong Kwon, Gyogwon Koo, Jong Pil Yun

    Abstract: This paper proposes an algorithm for recognizing slab identification numbers in factory scenes. In the development of a deep-learning based system, manual labeling to make ground truth data (GTD) is an important but expensive task. Furthermore, the quality of GTD is closely related to the performance of a supervised learning algorithm. To reduce manual work in the labeling process, we generated we… ▽ More

    Submitted 13 December, 2018; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: 10 pages, 12 figures, submitted to a journal

    Journal ref: IEEE Access 7 (2019) 23177-23186

  15. arXiv:1707.08120  [pdf, other

    cs.CY cs.LG

    Proxy Non-Discrimination in Data-Driven Systems

    Authors: Anupam Datta, Matt Fredrikson, Gihyuk Ko, Piotr Mardziel, Shayak Sen

    Abstract: Machine learnt systems inherit biases against protected classes, historically disparaged groups, from training data. Usually, these biases are not explicit, they rely on subtle correlations discovered by training algorithms, and are therefore difficult to detect. We formalize proxy discrimination in data-driven systems, a class of properties indicative of bias, as the presence of protected class c… ▽ More

    Submitted 25 July, 2017; originally announced July 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1705.07807

  16. arXiv:1705.07807  [pdf, other

    cs.CR cs.LG

    Use Privacy in Data-Driven Systems: Theory and Experiments with Machine Learnt Programs

    Authors: Anupam Datta, Matthew Fredrikson, Gihyuk Ko, Piotr Mardziel, Shayak Sen

    Abstract: This paper presents an approach to formalizing and enforcing a class of use privacy properties in data-driven systems. In contrast to prior work, we focus on use restrictions on proxies (i.e. strong predictors) of protected information types. Our definition relates proxy use to intermediate computations that occur in a program, and identify two essential properties that characterize this behavior:… ▽ More

    Submitted 7 September, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: extended CCS 2017 camera-ready: several new discussions, and complexity results added to appendix