Skip to main content

Showing 1–10 of 10 results for author: Ostermann, J

  1. arXiv:2410.03898  [pdf, other

    eess.IV

    On the Rate-Distortion-Complexity Trade-offs of Neural Video Coding

    Authors: Yi-Hsin Chen, Kuan-Wei Ho, Martin Benjak, Jörn Ostermann, Wen-Hsiao Peng

    Abstract: This paper aims to delve into the rate-distortion-complexity trade-offs of modern neural video coding. Recent years have witnessed much research effort being focused on exploring the full potential of neural video coding. Conditional autoencoders have emerged as the mainstream approach to efficient neural video coding. The central theme of conditional autoencoders is to leverage both spatial and t… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted to MMSP 2024

  2. arXiv:2407.10273  [pdf, other

    physics.optics physics.comp-ph

    Quantized Inverse Design for Photonic Integrated Circuits

    Authors: Frederik Schubert, Konrad Bethmann, Yannik Mahlau, Fabian Hartmann, Reinhard Caspary, Marco Munderloh, Jörn Ostermann, Bodo Rosenhahn

    Abstract: The inverse design of photonic integrated circuits (PICs) presents distinctive computational challenges, including their large memory requirements. Advancements in the two-photon polymerization (2PP) fabrication process introduce additional complexity, necessitating the development of more flexible optimization algorithms to enable the creation of multi-material 3D structures with unique propertie… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 19 pages, 10 figures

  3. arXiv:2312.15829  [pdf, other

    eess.IV

    MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression

    Authors: Yi-Hsin Chen, Hong-Sheng Xie, Cheng-Wei Chen, Zong-Lin Gao, Martin Benjak, Wen-Hsiao Peng, Jörn Ostermann

    Abstract: Conditional coding has lately emerged as the mainstream approach to learned video compression. However, a recent study shows that it may perform worse than residual coding when the information bottleneck arises. Conditional residual coding was thus proposed, creating a new school of thought to improve on conditional coding. Notably, conditional residual coding relies heavily on the assumption that… ▽ More

    Submitted 10 July, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted for Publication in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)

  4. arXiv:2302.01585  [pdf, other

    cs.CV

    SegForestNet: Spatial-Partitioning-Based Aerial Image Segmentation

    Authors: Daniel Gritzner, Jörn Ostermann

    Abstract: Aerial image segmentation is the basis for applications such as automatically creating maps or tracking deforestation. In true orthophotos, which are often used in these applications, many objects and regions can be approximated well by polygons. However, this fact is rarely exploited by state-of-the-art semantic segmentation models. Instead, most models allow unnecessary degrees of freedom in the… ▽ More

    Submitted 8 April, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

    ACM Class: I.5.4

  5. arXiv:2002.03399  [pdf, other

    cs.CV cs.LG stat.ML

    Two-Stream Aural-Visual Affect Analysis in the Wild

    Authors: Felix Kuhnke, Lars Rumberg, Jörn Ostermann

    Abstract: Human affect recognition is an essential part of natural human-computer interaction. However, current methods are still in their infancy, especially for in-the-wild data. In this work, we introduce our submission to the Affective Behavior Analysis in-the-wild (ABAW) 2020 competition. We propose a two-stream aural-visual analysis model to recognize affective behavior from videos. Audio and image st… ▽ More

    Submitted 3 March, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: 6 pages, 2 figures, Face and Gesture 2020 Workshop Paper (ABAW2020 competition)

    ACM Class: I.5.5; I.4.9

  6. arXiv:1812.02137  [pdf, other

    cs.MM cs.LG

    HEVC Inter Coding Using Deep Recurrent Neural Networks and Artificial Reference Pictures

    Authors: Felix Haub, Thorsten Laude, Jörn Ostermann

    Abstract: The efficiency of motion compensated prediction in modern video codecs highly depends on the available reference pictures. Occlusions and non-linear motion pose challenges for the motion compensation and often result in high bit rates for the prediction error. We propose the generation of artificial reference pictures using deep recurrent neural networks. Conceptually, a reference picture at the t… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

    Comments: 7 pages, 4 figures, under review for ICME 2019

  7. arXiv:1805.07258  [pdf, other

    cs.CV

    Neural Network Compression using Transform Coding and Clustering

    Authors: Thorsten Laude, Yannick Richter, Jörn Ostermann

    Abstract: With the deployment of neural networks on mobile devices and the necessity of transmitting neural networks over limited or expensive channels, the file size of the trained model was identified as bottleneck. In this paper, we propose a codec for the compression of neural networks which is based on transform coding for convolutional and dense layers and on clustering for biases and normalizations.… ▽ More

    Submitted 18 May, 2018; originally announced May 2018.

  8. arXiv:1805.00780  [pdf, other

    cs.CV

    Unsupervised Features for Facial Expression Intensity Estimation over Time

    Authors: Maren Awiszus, Stella Graßhof, Felix Kuhnke, Jörn Ostermann

    Abstract: The diversity of facial shapes and motions among persons is one of the greatest challenges for automatic analysis of facial expressions. In this paper, we propose a feature describing expression intensity over time, while being invariant to person and the type of performed expression. Our feature is a weighted combination of the dynamics of multiple points adapted to the overall expression traject… ▽ More

    Submitted 3 May, 2018; v1 submitted 2 May, 2018; originally announced May 2018.

    Comments: Accepted for CVPR 2018 Workshop Track

  9. Region of Interest (ROI) Coding for Aerial Surveillance Video using AVC & HEVC

    Authors: Holger Meuel, Florian Kluger, Jörn Ostermann

    Abstract: Aerial surveillance from Unmanned Aerial Vehicles (UAVs), i.e. with moving cameras, is of growing interest for police as well as disaster area monitoring. For more detailed ground images the camera resolutions are steadily increasing. Simultaneously the amount of video data to transmit is increasing significantly, too. To reduce the amount of data, Region of Interest (ROI) coding systems were intr… ▽ More

    Submitted 19 January, 2018; originally announced January 2018.

    Comments: 5 pages, 7 figures, 1 table

    Journal ref: See also our extended OpenAccess article "Mesh-based Piecewise Planar Motion Compensation and Optical Flow Clustering for ROI Coding", APSIPA Transact. on Sig.&Inform. Proc., 2015

  10. arXiv:1305.4094  [pdf, ps, other

    quant-ph cond-mat.quant-gas cs.NE

    Evolutionary optimization of an experimental apparatus

    Authors: I. Geisel, K. Cordes, J. Mahnke, S. Jöllenbeck, J. Ostermann, J. Arlt, W. Ertmer, C. Klempt

    Abstract: In recent decades, cold atom experiments have become increasingly complex. While computers control most parameters, optimization is mostly done manually. This is a time-consuming task for a high-dimensional parameter space with unknown correlations. Here we automate this process using a genetic algorithm based on Differential Evolution. We demonstrate that this algorithm optimizes 21 correlated pa… ▽ More

    Submitted 4 June, 2013; v1 submitted 17 May, 2013; originally announced May 2013.

    Comments: minor revision

    Journal ref: Appl. Phys. Lett. 102, 214105 (2013)