Skip to main content

Showing 1–50 of 181 results for author: Kaup, A

  1. arXiv:2410.15873  [pdf, other

    eess.IV

    Variable Rate Learned Wavelet Video Coding with Temporal Layer Adaptivity

    Authors: Anna Meyer, André Kaup

    Abstract: Learned wavelet video coders provide an explainable framework by performing discrete wavelet transforms in temporal, horizontal, and vertical dimensions. With a temporal transform based on motion-compensated temporal filtering (MCTF), spatial and temporal scalability is obtained. In this paper, we introduce variable rate support and a mechanism for quality adaption to different temporal layers for… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: 5 pages, 4 figures, submitted to ICASSP2025

  2. arXiv:2410.02001  [pdf, other

    eess.IV

    Conditional Optimal Filter Selection for Multispectral Object Classification

    Authors: Katja Kossira, David Schön, Jürgen Seiler, André Kaup

    Abstract: Capturing images using multispectral camera arrays has gained importance in medical, agricultural and environmental processes. However, using all available spectral bands is infeasible and produces much data, while only a fraction is needed for a given task. Nearby bands may contain similar information, therefore redundant spectral bands should not be considered in the evaluation process to keep c… ▽ More

    Submitted 2 October, 2024; originally announced October 2024.

  3. arXiv:2410.01158  [pdf, ps, other

    eess.IV cs.AR

    Modeling the Energy Consumption of the HEVC Software Encoding Process using Processor events

    Authors: Geetha Ramasubbu, Andrè Kaup, Christian Herglotz

    Abstract: Developing energy-efficient video encoding algorithms is highly important due to the high processing complexities and, consequently, the high energy demand of the encoding process. To accomplish this, the energy consumption of the video encoders must be studied, which is only possible with a complex and dedicated energy measurement setup. This emphasizes the need for simple energy estimation model… ▽ More

    Submitted 3 October, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

  4. arXiv:2410.00533  [pdf, ps, other

    eess.IV

    Design Space Exploration at Frame-Level for Joint Decoding Energy and Quality Optimization in VVC

    Authors: Teresa Stürzenhofäcker, Matthias Kränzler, Christian Herglotz, André Kaup

    Abstract: In the pursuit of a reduced energy demand of VVC decoders, it was found that the coding tool configuration has a substantial influence on the bit rate efficiency and the decoding energy demand. The Advanced Design Space Exploration algorithm as proposed in the literature, can derive coding tool configurations that provide optimal trade-offs between rate and energy efficiency. Yet, some trade-off p… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: submitted, accepted and published at EuSipCo 2024, Special Session on Frugality for Video Streaming

    Journal ref: EuSipCo 2024, ISBN: 978-9-4645-9361-7

  5. arXiv:2408.14050  [pdf, other

    eess.IV

    Fast Edge-Aware Occlusion Detection in the Context of Multispectral Camera Arrays

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Multispectral imaging is very beneficial in diverse applications, like healthcare and agriculture, since it can capture absorption bands of molecules in different spectral areas. A promising approach for multispectral snapshot imaging are camera arrays. Image processing is necessary to warp all different views to the same view to retrieve a consistent multispectral datacube. This process is also c… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  6. arXiv:2408.10665  [pdf, other

    eess.IV cs.LG

    End-to-end learned Lossy Dynamic Point Cloud Attribute Compression

    Authors: Dat Thanh Nguyen, Daniel Zieger, Marc Stamminger, Andre Kaup

    Abstract: Recent advancements in point cloud compression have primarily emphasized geometry compression while comparatively fewer efforts have been dedicated to attribute compression. This study introduces an end-to-end learned dynamic lossy attribute coding approach, utilizing an efficient high-dimensional convolution to capture extensive inter-point dependencies. This enables the efficient projection of a… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 6 pages, accepted for presentation at 2024 IEEE International Conference on Image Processing (ICIP) 2024

  7. arXiv:2407.09038  [pdf, other

    eess.IV

    High-Resolution Hyperspectral Video Imaging Using A Hexagonal Camera Array

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Retrieving the reflectance spectrum from objects is an essential task for many classification and detection problems, since many materials and processes have a unique spectral behaviour. In many cases, it is highly desirable to capture hyperspectral images due to the high spectral flexibility. Often, it is even necessary to capture hyperspectral videos or at least to be able to record a hyperspect… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  8. arXiv:2407.05900  [pdf, other

    eess.IV

    SVT-AV1 Encoding Bitrate Estimation Using Motion Search Information

    Authors: Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, Christian Herglotz, André Kaup

    Abstract: Enabling high compression efficiency while keeping encoding energy consumption at a low level, requires prioritization of which videos need more sophisticated encoding techniques. However, the effects vary highly based on the content, and information on how good a video can be compressed is required. This can be measured by estimating the encoded bitstream size prior to encoding. We identified the… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures, accepted for European Signal Processing Conference (EUSIPCO) 2024

  9. arXiv:2406.13709  [pdf, other

    eess.IV cs.CV

    A Study on the Effect of Color Spaces in Learned Image Compression

    Authors: Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup

    Abstract: In this work, we present a comparison between color spaces namely YUV, LAB, RGB and their effect on learned image compression. For this we use the structure and color based learned image codec (SLIC) from our prior work, which consists of two branches - one for the luminance component (Y or L) and another for chrominance components (UV or AB). However, for the RGB variant we input all 3 channels i… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepter pre-print version for ICIP 2024

  10. arXiv:2406.11284  [pdf, other

    eess.IV

    Multispectral Snapshot Image Registration Using Learned Cross Spectral Disparity Estimation and a Deep Guided Occlusion Reconstruction Network

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Multispectral imaging aims at recording images in different spectral bands. This is extremely beneficial in diverse discrimination applications, for example in agriculture, recycling or healthcare. One approach for snapshot multispectral imaging, which is capable of recording multispectral videos, is by using camera arrays, where each camera records a different spectral band. Since the cameras are… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  11. arXiv:2406.07938  [pdf, other

    eess.IV

    On Annotation-free Optimization of Video Coding for Machines

    Authors: Marc Windsheimer, Fabian Brand, André Kaup

    Abstract: Today, image and video data is not only viewed by humans, but also automatically analyzed by computer vision algorithms. However, current coding standards are optimized for human perception. Emerging from this, research on video coding for machines tries to develop coding methods designed for machines as information sink. Since many of these algorithms are based on neural networks, most proposals… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 7 pages, 10 figures

  12. Towards Video Codec Performance Evaluation: A Rate-Energy-Distortion Perspective

    Authors: Geetha Ramasubbu, André Kaup, Christian Herglotz

    Abstract: The Bjøntegaard Delta rate (BD-rate) objectively assesses the coding efficiency of video codecs using the rate-distortion (R-D) performance but overlooks encoding energy, which is crucial in practical applications, especially for those on handheld devices. Although R-D analysis can be extended to incorporate encoding energy as energy-distortion (E-D), it fails to integrate all three parameters sea… ▽ More

    Submitted 1 October, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Proc. 2024 16th International Conference on Quality of Multimedia Experience (QoMEX)

    Journal ref: 2024 16th International Conference on Quality of Multimedia Experience (QoMEX)

  13. arXiv:2405.12631  [pdf, other

    eess.IV

    Efficient Learned Wavelet Image and Video Coding

    Authors: Anna Meyer, Srivatsa Prativadibhayankaram, André Kaup

    Abstract: Learned wavelet image and video coding approaches provide an explainable framework with a latent space corresponding to a wavelet decomposition. The wavelet image coder iWave++ achieves state-of-the-art performance and has been employed for various compression tasks, including lossy as well as lossless image, video, and medical data compression. However, the approaches suffer from slow decoding sp… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 7 pages, 11 figures, submitted to ICIP2024

  14. arXiv:2402.17487  [pdf, other

    cs.CV cs.LG eess.IV

    Bit Rate Matching Algorithm Optimization in JPEG-AI Verification Model

    Authors: Panqi Jia, A. Burakhan Koyuncu, Jue Mao, Ze Cui, Yi Ma, Tiansheng Guo, Timofey Solovyev, Alexander Karabutov, Yin Zhao, Jing Wang, Elena Alshina, Andre Kaup

    Abstract: The research on neural network (NN) based image compression has shown superior performance compared to classical compression frameworks. Unlike the hand-engineered transforms in the classical frameworks, NN-based models learn the non-linear transforms providing more compact bit representations, and achieve faster coding speed on parallel devices over their classical counterparts. Those properties… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted at (IEEE) PCS 2024; 6 pages

  15. arXiv:2402.17470  [pdf, other

    cs.CV cs.LG eess.IV

    Bit Distribution Study and Implementation of Spatial Quality Map in the JPEG-AI Standardization

    Authors: Panqi Jia, Jue Mao, Esin Koyuncu, A. Burakhan Koyuncu, Timofey Solovyev, Alexander Karabutov, Yin Zhao, Elena Alshina, Andre Kaup

    Abstract: Currently, there is a high demand for neural network-based image compression codecs. These codecs employ non-linear transforms to create compact bit representations and facilitate faster coding speeds on devices compared to the hand-crafted transforms used in classical frameworks. The scientific and industrial communities are highly interested in these properties, leading to the standardization ef… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 5 pages, 3 figures, 4 tables

  16. arXiv:2402.10257  [pdf, other

    eess.IV

    Analysis of Neural Video Compression Networks for 360-Degree Video Coding

    Authors: Andy Regensky, Fabian Brand, André Kaup

    Abstract: With the increasing efforts of bringing high-quality virtual reality technologies into the market, efficient 360-degree video compression gains in importance. As such, the state-of-the-art H.266/VVC video coding standard integrates dedicated tools for 360-degree video, and considerable efforts have been put into designing 360-degree projection formats with improved compression efficiency. For the… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures, 1 table, accepted for Picture Coding Symposium 2024 (PCS 2024)

  17. arXiv:2402.09926  [pdf, other

    eess.IV

    Predicting the Energy Demand of a Hardware Video Decoder with Unknown Design Using Software Profiling

    Authors: Matthias Kränzler, Christian Herglotz, André Kaup

    Abstract: Energy efficiency for video communications and video-on-demand streaming is essential for mobile devices with a limited battery capacity. Therefore, hardware decoder implementations are commonly used to significantly reduce the energetic load of video playback. The energy consumption of such a hardware implementation largely depends on a previously published recommendation document of a video codi… ▽ More

    Submitted 4 July, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Submitted to IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS), 13 Pages

  18. arXiv:2402.09001  [pdf, other

    eess.IV

    A Comprehensive Review of Software and Hardware Energy Efficiency of Video Decoders

    Authors: Matthias Kränzler, Christian Herglotz, André Kaup

    Abstract: Energy and compression efficiency are two essential parts of modern video decoder implementations that have to be considered. This work comprehensively studies the following six video coding formats regarding compression and decoding energy efficiency: AVC, VP9, HEVC, AV1, VVC, and AVM. We first evaluate the energy demand of reference and optimized software decoder implementations. Furthermore, we… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: accepted as a conference paper for Picture Coding Symposium (PCS) 2024

  19. arXiv:2401.17246  [pdf, other

    eess.IV cs.CV

    SLIC: A Learned Image Codec Using Structure and Color

    Authors: Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup

    Abstract: We propose the structure and color based learned image codec (SLIC) in which the task of compression is split into that of luminance and chrominance. The deep learning model is built with a novel multi-scale architecture for Y and UV channels in the encoder, where the features from various stages are combined to obtain the latent representation. An autoregressive context model is employed for back… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepter paper for Data Compression Conference 2024

  20. arXiv:2401.16067  [pdf, other

    eess.IV cs.MM

    Encoding Time and Energy Model for SVT-AV1 based on Video Complexity

    Authors: Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, Christian Herglotz, André Kaup

    Abstract: The share of online video traffic in global carbon dioxide emissions is growing steadily. To comply with the demand for video media, dedicated compression techniques are continuously optimized, but at the expense of increasingly higher computational demands and thus rising energy consumption at the video encoder side. In order to find the best trade-off between compression and energy consumption,… ▽ More

    Submitted 30 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 5 pages, 1 figure, accepted for IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2024

  21. Enhanced Color Palette Modeling for Lossless Screen Content Compression

    Authors: Hannah Och, Shabhrish Reddy Uddehal, Tilo Strutz, André Kaup

    Abstract: Soft context formation is a lossless image coding method for screen content. It encodes images pixel by pixel via arithmetic coding by collecting statistics for probability distribution estimation. Its main pipeline includes three stages, namely a context model based stage, a color palette stage and a residual coding stage. Each subsequent stage is only employed if the previous stage can not be ap… ▽ More

    Submitted 7 October, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, 2 tables

    Journal ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  22. arXiv:2312.11209  [pdf, other

    eess.IV

    Quantized Decoder in Learned Image Compression for Deterministic Reconstruction

    Authors: Esin Koyuncu, Timofey Solovyev, Johannes Sauer, Elena Alshina, André Kaup

    Abstract: Learned image compression has a problem of non-bit-exact reconstruction due to different calculations of floating point arithmetic on different devices. This paper shows a method to achieve a deterministic reconstructed image by quantizing only the decoder of the learned image compression model. From the implementation perspective of an image codec, it is beneficial to have the results reproducibl… ▽ More

    Submitted 11 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures, 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)

  23. arXiv:2312.09266  [pdf, other

    eess.IV

    Geometry-Corrected Geodesic Motion Modeling with Per-Frame Camera Motion for 360-Degree Video Compression

    Authors: Andy Regensky, André Kaup

    Abstract: The large amounts of data associated with 360-degree video require highly effective compression techniques for efficient storage and distribution. The development of improved motion models for 360-degree motion compensation has shown significant improvements in compression efficiency. A geodesic motion model representing translational camera motion proved to be one of the most effective models. In… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 5 pages, 2 figures, 3 tables, accepted for IEEE International Conference on Acoustics, Speech and Signal Processing 2024 (IEEE ICASSP 2024)

  24. arXiv:2312.08949  [pdf, other

    eess.IV

    A Guided Upsampling Network for Short Wave Infrared Images Using Graph Regularization

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Exploiting the infrared area of the spectrum for classification problems is getting increasingly popular, because many materials have characteristic absorption bands in this area. However, sensors in the short wave infrared (SWIR) area and even higher wavelengths have a very low spatial resolution in comparison to classical cameras that operate in the visible wavelength area. Thus, in this paper a… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: 2024 IEEE International Conference on Acoustics, Speech and Signal Processing

  25. arXiv:2312.08946  [pdf, other

    eess.IV

    Color Agnostic Cross-Spectral Disparity Estimation

    Authors: Frank Sippel, Nils Genser, Hannah Och, Jürgen Seiler, André Kaup

    Abstract: Since camera modules become more and more affordable, multispectral camera arrays have found their way from special applications to the mass market, e.g., in automotive systems, smartphones, or drones. Due to multiple modalities, the registration of different viewpoints and the required cross-spectral disparity estimation is up to the present extremely challenging. To overcome this problem, we int… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: 2024 IEEE International Conference on Acoustics, Speech and Signal Processing

  26. arXiv:2310.17346  [pdf, ps, other

    eess.IV

    Extended Signaling Methods for Reduced Video Decoder Power Consumption Using Green Metadata

    Authors: Christian Herglotz, Matthias Kränzler, Xixue Chu, Edouard Francois, Yong He, André Kaup

    Abstract: In this paper, we discuss one aspect of the latest MPEG standard edition on energy-efficient media consumption, also known as Green Metadata (ISO/IEC 232001-11), which is the interactive signaling for remote decoder-power reduction for peer-to-peer video conferencing. In this scenario, the receiver of a video, e.g., a battery-driven portable device, can send a dedicated request to the sender which… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 5 pages, 2 figures

  27. Improving HEVC Encoding of Rendered Video Data Using True Motion Information

    Authors: Christian Herglotz, David Müller, Andreas Weinlich, Frank Bauer, Michael Ortner, Marc Stamminger, André Kaup

    Abstract: This paper shows that motion vectors representing the true motion of an object in a scene can be exploited to improve the encoding process of computer generated video sequences. Therefore, a set of sequences is presented for which the true motion vectors of the corresponding objects were generated on a per-pixel basis during the rendering process. In addition to conventional motion estimation meth… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 4 pages, 4 figures

    Journal ref: Proc. 2018 IEEE International Symposium on Multimedia (ISM)

  28. On Versatile Video Coding at UHD with Machine-Learning-Based Super-Resolution

    Authors: Kristian Fischer, Christian Herglotz, André Kaup

    Abstract: Coding 4K data has become of vital interest in recent years, since the amount of 4K data is significantly increasing. We propose a coding chain with spatial down- and upscaling that combines the next-generation VVC codec with machine learning based single image super-resolution algorithms for 4K. The investigated coding chain, which spatially downscales the 4K data before coding, shows superior qu… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Originally published as conference paper at QoMEX 2020

  29. Video Decoding Energy Estimation Using Processor Events

    Authors: Christian Herglotz, André Kaup

    Abstract: In this paper, we show that processor events like instruction counts or cache misses can be used to accurately estimate the processing energy of software video decoders. Therefore, we perform energy measurements on an ARM-based evaluation platform and count processor level events using a dedicated profiling software. Measurements are performed for various codecs and decoder implementations to prov… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures

    Journal ref: IEEE International Conference on Image Processing (ICIP), Beijing, China, 2017, pp. 2493-2497

  30. arXiv:2307.12864  [pdf, other

    eess.IV

    Conditional Residual Coding: A Remedy for Bottleneck Problems in Conditional Inter Frame Coding

    Authors: Fabian Brand, Jürgen Seiler, André Kaup

    Abstract: Conditional coding is a new video coding paradigm enabled by neural-network-based compression. It can be shown that conditional coding is in theory better than the traditional residual coding, which is widely used in video compression standards like HEVC or VVC. However, on closer inspection, it becomes clear that conditional coders can suffer from information bottlenecks in the prediction path, i… ▽ More

    Submitted 26 January, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 12 pages, 8 figures Accepted for Publication in TCSVT

  31. Component-wise Power Estimation of Electrical Devices Using Thermal Imaging

    Authors: Christian Herglotz, Simon Grosche, Akarsh Bharadwaj, André Kaup

    Abstract: This paper presents a novel method to estimate the power consumption of distinct active components on an electronic carrier board by using thermal imaging. The components and the board can be made of heterogeneous material such as plastic, coated microchips, and metal bonds or wires, where a special coating for high emissivity is not required. The thermal images are recorded when the components on… ▽ More

    Submitted 18 July, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 10 pages, 8 figures

    Journal ref: IEEE Transactions on Consumer Electronics, vol. 67, no. 4, pp. 383-392, Nov. 2021,

  32. Power Modeling for Virtual Reality Video Playback Applications

    Authors: Christian Herglotz, Stéphane Coulombe, Ahmad Vakili, André Kaup

    Abstract: This paper proposes a method to evaluate and model the power consumption of modern virtual reality playback and streaming applications on smartphones. Due to the high computational complexity of the virtual reality processing toolchain, the corresponding power consumption is very high, which reduces operating times of battery-powered devices. To tackle this problem, we analyze the power consumptio… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures

    Journal ref: 2019 IEEE 23rd International Symposium on Consumer Technologies (ISCT), Ancona, Italy, 2019, pp. 105-110

  33. Power-Efficient Video Streaming on Mobile Devices Using Optimal Spatial Scaling

    Authors: Christian Herglotz, André Kaup, Stéphane Coulombe, Ahmad Vakili

    Abstract: This paper derives optimal spatial scaling and rate control parameters for power-efficient wireless video streaming on portable devices. A video streaming application is studied, which receives a high-resolution and high-quality video stream from a remote server and displays the content to the end-user.We show that the resolution of the input video can be adjusted such that the quality-power trade… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 6 pages, 7 figures

    Journal ref: Proc. IEEE 9th International Conference on Consumer Electronics (ICCE-Berlin), Berlin, Germany, 2019, pp. 233-238

  34. arXiv:2307.06102  [pdf, other

    eess.IV

    Spatially-Adaptive Learning-Based Image Compression with Hierarchical Multi-Scale Latent Spaces

    Authors: Fabian Brand, Alexander Kopte, Kristian Fischer, André Kaup

    Abstract: Adaptive block partitioning is responsible for large gains in current image and video compression systems. This method is able to compress large stationary image areas with only a few symbols, while maintaining a high level of quality in more detailed areas. Current state-of-the-art neural-network-based image compression systems however use only one scale to transmit the latent space. In previous… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 5 pages, 3 figures Accepted for presentation at ICIP 2023

  35. arXiv:2307.05208  [pdf, other

    eess.IV

    Encoder Complexity Control in SVT-AV1 by Speed-Adaptive Preset Switching

    Authors: Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, André Kaup, Christian Herglotz

    Abstract: Current developments in video encoding technology lead to continuously improving compression performance but at the expense of increasingly higher computational demands. Regarding the online video traffic increases during the last years and the concomitant need for video encoding, encoder complexity control mechanisms are required to restrict the processing time to a sufficient extent in order to… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures, accepted for IEEE International Conference on Image Processing (ICIP) 2023

  36. arXiv:2306.16755  [pdf, ps, other

    eess.IV

    Processing Energy Modeling for Neural Network Based Image Compression

    Authors: Christian Herglotz, Fabian Brand, Andy Regensky, Felix Rievel, André Kaup

    Abstract: Nowadays, the compression performance of neural-networkbased image compression algorithms outperforms state-of-the-art compression approaches such as JPEG or HEIC-based image compression. Unfortunately, most neural-network based compression methods are executed on GPUs and consume a high amount of energy during execution. Therefore, this paper performs an in-depth analysis on the energy consumptio… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, accepted for IEEE International Conference on Image Processing (ICIP) 2023

  37. Cross Spectral Image Reconstruction Using a Deep Guided Neural Network

    Authors: Frank Sippel, Jürgen Seiler, André Kaup

    Abstract: Cross spectral camera arrays, where each camera records different spectral content, are becoming increasingly popular for RGB, multispectral and hyperspectral imaging, since they are capable of a high resolution in every dimension using off-the-shelf hardware. For these, it is necessary to build an image processing pipeline to calculate a consistent image data cube, i.e., it should look like as if… ▽ More

    Submitted 14 September, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Journal ref: 2023 IEEE International Conference on Image Processing (ICIP)

  38. Motion Plane Adaptive Motion Modeling for Spherical Video Coding in H.266/VVC

    Authors: Andy Regensky, Christian Herglotz, André Kaup

    Abstract: Motion compensation is one of the key technologies enabling the high compression efficiency of modern video coding standards. To allow compression of spherical video content, special mapping functions are required to project the video to the 2D image plane. Distortions inevitably occurring in these mappings impair the performance of classical motion models. In this paper, we propose a novel motion… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 5 pages, 4 figures, 1 table, accepted for IEEE International Conference on Image Processing 2023 (IEEE ICIP 2023). arXiv admin note: substantial text overlap with arXiv:2202.03323

  39. Improving Spherical Image Resampling through Viewport-Adaptivity

    Authors: Andy Regensky, Viktoria Heimann, Ruoyu Zhang, André Kaup

    Abstract: The conversion between different spherical image and video projection formats requires highly accurate resampling techniques in order to minimize the inevitable loss of information. Suitable resampling algorithms such as nearest neighbor, linear or cubic resampling are readily available. However, no generally applicable resampling technique exploits the special properties of spherical images so fa… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, 2 tables, accepted for IEEE International Conference on Image Processing 2023 (IEEE ICIP 2023)

  40. Video Decoding Energy Reduction Using Temporal-Domain Filtering

    Authors: Christian Herglotz, Matthias Kränzler, Robert Ludwig, André Kaup

    Abstract: In this paper, we study decoding energy reduction opportunities using temporal-domain filtering and subsampling methods. In particular, we study spatiotemporal filtering using a contrast sensitivity function and temporal downscaling, i.e., frame rate reduction. We apply these concepts as a pre-filtering to the video before compression and evaluate the bitrate, the decoding energy, and the visual q… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 6 pages, 5 figures

  41. Learned Wavelet Video Coding using Motion Compensated Temporal Filtering

    Authors: Anna Meyer, Fabian Brand, André Kaup

    Abstract: We present an end-to-end trainable wavelet video coder based on motion-compensated temporal filtering (MCTF). Thereby, we introduce a different coding scheme for learned video compression, which is currently dominated by residual and conditional coding approaches. By performing discrete wavelet transforms in temporal, horizontal, and vertical dimension, we obtain an explainable framework with spat… ▽ More

    Submitted 12 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 14 pages, 14 figures, Accepted for IEEE Access 2023

  42. arXiv:2305.15117  [pdf, other

    eess.IV

    Power Reduction Opportunities on End-User Devices in Quality-Steady Video Streaming

    Authors: Christian Herglotz, Werner Robitza, Alexander Raake, Tobias Hossfeld, André Kaup

    Abstract: This paper uses a crowdsourced dataset of online video streaming sessions to investigate opportunities to reduce the power consumption while considering QoE. For this, we base our work on prior studies which model both the end-user's QoE and the end-user device's power consumption with the help of high-level video features such as the bitrate, the frame rate, and the resolution. On top of existing… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 4 pages, 3 figures

  43. Image Segmentation For Improved Lossless Screen Content Compression

    Authors: Shabhrish Reddy Uddehal, Tilo Strutz, Hannah Och, André Kaup

    Abstract: In recent years, it has been found that screen content images (SCI) can be effectively compressed based on appropriate probability modelling and suitable entropy coding methods such as arithmetic coding. The key objective is determining the best probability distribution for each pixel position. This strategy works particularly well for images with synthetic (textual) content. However, usually scre… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 5 Pages, 3 Figures

  44. Multiscale Augmented Normalizing Flows for Image Compression

    Authors: Marc Windsheimer, Fabian Brand, André Kaup

    Abstract: Most learning-based image compression methods lack efficiency for high image quality due to their non-invertible design. The decoding function of the frequently applied compressive autoencoder architecture is only an approximated inverse of the encoding transform. This issue can be resolved by using invertible latent variable models, which allow a perfect reconstruction if no quantization is perfo… ▽ More

    Submitted 22 May, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 5 pages, 7 figures

  45. Improved Screen Content Coding in VVC Using Soft Context Formation

    Authors: Hannah Och, Shabhrish Reddy Uddehal, Tilo Strutz, André Kaup

    Abstract: Screen content images typically contain a mix of natural and synthetic image parts. Synthetic sections usually are comprised of uniformly colored areas and repeating colors and patterns. In the VVC standard, these properties are exploited using Intra Block Copy and Palette Mode. In this paper, we show that pixel-wise lossless coding can outperform lossy VVC coding in such areas. We propose an enha… ▽ More

    Submitted 7 October, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 5 pages, 5 figures, 2 tables

    Journal ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  46. The Bjøntegaard Bible -- Why your Way of Comparing Video Codecs May Be Wrong

    Authors: Christian Herglotz, Hannah Och, Anna Meyer, Geetha Ramasubbu, Lena Eichermüller, Matthias Kränzler, Fabian Brand, Kristian Fischer, Dat Thanh Nguyen, Andy Regensky, André Kaup

    Abstract: In this paper, we provide an in-depth assessment on the Bjøntegaard Delta. We construct a large data set of video compression performance comparisons using a diverse set of metrics including PSNR, VMAF, bitrate, and processing energies. These metrics are evaluated for visual data types such as classic perspective video, 360$^\circ$ video, point clouds, and screen content. As compression technology… ▽ More

    Submitted 22 December, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 21 pages, 14 figures

  47. arXiv:2304.12412  [pdf, other

    cs.CV cs.AI

    End-to-End Lidar-Camera Self-Calibration for Autonomous Vehicles

    Authors: Arya Rachman, Jürgen Seiler, André Kaup

    Abstract: Autonomous vehicles are equipped with a multi-modal sensor setup to enable the car to drive safely. The initial calibration of such perception sensors is a highly matured topic and is routinely done in an automated factory environment. However, an intriguing question arises on how to maintain the calibration quality throughout the vehicle's operating duration. Another challenge is to calibrate mul… ▽ More

    Submitted 27 April, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Accepted for The 35th IEEE Intelligent Vehicles Symposium (IV 2023)

  48. Lossless Point Cloud Geometry and Attribute Compression Using a Learned Conditional Probability Model

    Authors: Dat Thanh Nguyen, Andre Kaup

    Abstract: In recent years, we have witnessed the presence of point cloud data in many aspects of our life, from immersive media, autonomous driving to healthcare, although at the cost of a tremendous amount of data. In this paper, we present an efficient lossless point cloud compression method that uses sparse tensor-based deep neural networks to learn point cloud geometry and color probability distribution… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: 12 pages, accepted to IEEE Transactions on Circuits and Systems for Video Technology

    Journal ref: EEE Transactions on Circuits and Systems for Video Technology, vol. 33, no. 8, pp. 4337-4348, Aug. 2023

  49. arXiv:2303.06517  [pdf, other

    eess.IV cs.LG

    Deep probabilistic model for lossless scalable point cloud attribute compression

    Authors: Dat Thanh Nguyen, Kamal Gopikrishnan Nambiar, Andre Kaup

    Abstract: In recent years, several point cloud geometry compression methods that utilize advanced deep learning techniques have been proposed, but there are limited works on attribute compression, especially lossless compression. In this work, we build an end-to-end multiscale point cloud attribute coding method (MNeT) that progressively projects the attributes onto multiscale latent spaces. The multiscale… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 5 pages, accepted for presentation at ICASSP 2023

  50. Multispectral Image Compression Based on HEVC Using Pel-Recursive Inter-Band Prediction

    Authors: Anna Meyer, Nils Genser, André Kaup

    Abstract: Recent developments in optical sensors enable a wide range of applications for multispectral imaging, e.g., in surveillance, optical sorting, and life-science instrumentation. Increasing spatial and spectral resolution allows creating higher quality products, however, it poses challenges in handling such large amounts of data. Consequently, specialized compression techniques for multispectral imag… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 6 pages, 4 figures, 1 table; Originally published as conference paper at IEEE MMSP 2020

    Journal ref: IEEE MMSP 2020