Skip to main content

Showing 1–50 of 302 results for author: Backes, M

  1. arXiv:2410.07670  [pdf, other

    cs.CR

    Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks

    Authors: Minxing Zhang, Michael Backes, Xiao Zhang

    Abstract: Human Pose Estimation (HPE) has been widely applied in autonomous systems such as self-driving cars. However, the potential risks of HPE to adversarial attacks have not received comparable attention with image classification or segmentation tasks. Existing works on HPE robustness focus on misleading an HPE system to provide wrong predictions that still indicate some human poses. In this paper, we… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  2. arXiv:2410.06967  [pdf, other

    cs.CR cs.CY

    $\texttt{ModSCAN}$: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities

    Authors: Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang

    Abstract: Large vision-language models (LVLMs) have been rapidly developed and widely used in various fields, but the (potential) stereotypical bias in the model is largely unexplored. In this study, we present a pioneering measurement framework, $\texttt{ModSCAN}$, to $\underline{SCAN}$ the stereotypical bias within LVLMs from both vision and language $\underline{Mod}$alities. $\texttt{ModSCAN}$ examines s… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: Accepted in EMNLP 2024. 29 pages, 22 figures

  3. arXiv:2410.06744  [pdf, other

    astro-ph.HE

    Hidden by a star: the redshift and the offset broad line of the Flat Spectrum Radio Quasar PKS 0903-57

    Authors: P. Goldoni, C. Boisson, S. Pita, F. D'Ammando, E. Kasai, W. Max-Moerbeck, M. Backes, G. Cotter

    Abstract: Context: PKS 0903-57 is a little-studied gamma-ray blazar which has recently attracted considerable interest due to the strong flaring episodes observed since 2020 in HE (100 MeV < E < 100 GeV) and VHE (100 GeV < E < 10 TeV) gamma-rays. Its nature and properties are still not well determined. In particular, it is unclear whether PKS 0903-57 is a BL Lac or a Flat Spectrum Radio Quasar (FSRQ), while… ▽ More

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: Astronomy and Astrophysics Letters, Accepted

  4. arXiv:2409.19069  [pdf, other

    cs.LG cs.CV

    Localizing Memorization in SSL Vision Encoders

    Authors: Wenhao Wang, Adam Dziedzic, Michael Backes, Franziska Boenisch

    Abstract: Recent work on studying memorization in self-supervised learning (SSL) suggests that even though SSL encoders are trained on millions of images, they still memorize individual data points. While effort has been put into characterizing the memorized data and linking encoder memorization to downstream utility, little is known about where the memorization happens inside SSL encoders. To close this ga… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: Accepted at NeurIPS 2024

  5. arXiv:2409.11423  [pdf, other

    cs.CR cs.LG

    Generated Data with Fake Privacy: Hidden Dangers of Fine-tuning Large Language Models on Generated Data

    Authors: Atilla Akkus, Mingjie Li, Junjie Chu, Michael Backes, Yang Zhang, Sinem Sav

    Abstract: Large language models (LLMs) have shown considerable success in a range of domain-specific tasks, especially after fine-tuning. However, fine-tuning with real-world data usually leads to privacy risks, particularly when the fine-tuning samples exist in the pre-training data. To avoid the shortcomings of real data, developers often employ methods to automatically generate synthetic data for fine-tu… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

  6. arXiv:2409.03741  [pdf, other

    cs.CR cs.LG

    Understanding Data Importance in Machine Learning Attacks: Does Valuable Data Pose Greater Harm?

    Authors: Rui Wen, Michael Backes, Yang Zhang

    Abstract: Machine learning has revolutionized numerous domains, playing a crucial role in driving advancements and enabling data-centric processes. The significance of data in training models and shaping their performance cannot be overstated. Recent research has highlighted the heterogeneous impact of individual data samples, particularly the presence of valuable data that significantly contributes to the… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: To Appear in Network and Distributed System Security (NDSS) Symposium 2025

  7. arXiv:2409.01380  [pdf, other

    cs.CR cs.CL

    Membership Inference Attacks Against In-Context Learning

    Authors: Rui Wen, Zheng Li, Michael Backes, Yang Zhang

    Abstract: Adapting Large Language Models (LLMs) to specific tasks introduces concerns about computational efficiency, prompting an exploration of efficient methods such as In-Context Learning (ICL). However, the vulnerability of ICL to privacy attacks under realistic assumptions remains largely unexplored. In this work, we present the first membership inference attack tailored for ICL, relying solely on gen… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: To Appear in the ACM Conference on Computer and Communications Security, October 14-18, 2024

  8. arXiv:2408.17285  [pdf, other

    cs.CR cs.LG

    Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution

    Authors: Yixin Wu, Yun Shen, Michael Backes, Yang Zhang

    Abstract: Text-to-image models, such as Stable Diffusion (SD), undergo iterative updates to improve image quality and address concerns such as safety. Improvements in image quality are straightforward to assess. However, how model updates resolve existing concerns and whether they raise new questions remain unexplored. This study takes an initial step in investigating the evolution of text-to-image models f… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: To Appear in the ACM Conference on Computer and Communications Security, October 14-18, 2024

  9. arXiv:2408.14854  [pdf, other

    astro-ph.HE astro-ph.GA

    Distance estimation of gamma-ray emitting BL Lac objects from imaging observations

    Authors: K. Nilsson, V. Fallah Ramazani, E. Lindfors, P. Goldoni, J. Becerra González, J. A. Acosta Pulido, R. Clavero, J. Otero-Santos, T. Pursimo, S. Pita, P. M. Kouch, C. Boisson, M. Backes, G. Cotter, F. D'Ammando, E. Kasai

    Abstract: Direct redshift determination of BL Lac objects is highly challenging as the emission in the optical and near-infrared (NIR) bands is largely dominated by the non-thermal emission from the relativistic jet that points very close to our line of sight. Therefore, their optical spectra often show no emission lines from the host galaxy. In this work, we aim to overcome this difficulty by attempting to… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  10. arXiv:2408.14704  [pdf, other

    physics.atom-ph quant-ph

    Performance of Antenna-based and Rydberg Quantum RF Sensors in the Electrically Small Regime

    Authors: K. M. Backes, P. K. Elgee, K. -J. LeBlanc, C. T. Fancher, D. H. Meyer, P. D. Kunz, N. Malvania, K. M. Nicolich, J. C. Hill, B. L. Schmittberger Marlow, K. C. Cox

    Abstract: Rydberg atom electric field sensors are tunable quantum sensors that can perform sensitive radio frequency (RF) measurements. Their qualities have piqued interest at longer wavelengths where their small size compares favorably to impedance-matched antennas. Here, we compare the signal detection sensitivity of cm-scale Rydberg sensors to similarly sized room-temperature electrically small antennas… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 6 pages, 3 figures

  11. arXiv:2408.11046  [pdf, other

    cs.CL

    Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders

    Authors: Yuan Xin, Zheng Li, Ning Yu, Dingfan Chen, Mario Fritz, Michael Backes, Yang Zhang

    Abstract: Despite being prevalent in the general field of Natural Language Processing (NLP), pre-trained language models inherently carry privacy and copyright concerns due to their nature of training on large-scale web-scraped data. In this paper, we pioneer a systematic exploration of such risks associated with pre-trained language encoders, specifically focusing on the membership leakage of pre-training… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: ECAI24

  12. arXiv:2408.00129  [pdf, other

    cs.CR cs.LG

    Vera Verto: Multimodal Hijacking Attack

    Authors: Minxing Zhang, Ahmed Salem, Michael Backes, Yang Zhang

    Abstract: The increasing cost of training machine learning (ML) models has led to the inclusion of new parties to the training pipeline, such as users who contribute training data and companies that provide computing resources. This involvement of such new parties in the ML training process has introduced new attack surfaces for an adversary to exploit. A recent attack in this domain is the model hijacking… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

  13. arXiv:2407.20859  [pdf, other

    cs.CR cs.LG

    Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

    Authors: Boyang Zhang, Yicong Tan, Yun Shen, Ahmed Salem, Michael Backes, Savvas Zannettou, Yang Zhang

    Abstract: Recently, autonomous agents built on large language models (LLMs) have experienced significant development and are being deployed in real-world applications. These agents can extend the base LLM's capabilities in multiple ways. For example, a well-built agent using GPT-3.5-Turbo as its core can outperform the more advanced GPT-4 model by leveraging external components. More importantly, the usage… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  14. Very-high-energy $γ$-ray emission from young massive star clusters in the Large Magellanic Cloud

    Authors: F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, V. Barbosa Martins, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, M. Böttcher, J. Bolmont, M. de Bony de Lavergne, J. Borowska, R. Brose, A. Brown, F. Brun, B. Bruno, C. Burger-Scheidlin, S. Casanova, J. Celic, M. Cerruti, T. Chand, S. Chandra, A. Chen , et al. (107 additional authors not shown)

    Abstract: The Tarantula Nebula in the Large Magellanic Cloud is known for its high star formation activity. At its center lies the young massive star cluster R136, providing a significant amount of the energy that makes the nebula shine so brightly at many wavelengths. Recently, young massive star clusters have been suggested to also efficiently produce high-energy cosmic rays, potentially beyond PeV energi… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 10+11 pages, 4+6 figures. Corresponding authors: L. Mohrmann, N. Komin

    Journal ref: Astrophysical Journal Letters 970, L21 (2024)

  15. arXiv:2407.06955  [pdf, other

    cs.CR cs.CL

    ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization

    Authors: Wai Man Si, Michael Backes, Yang Zhang

    Abstract: In-context learning (ICL) is a recent advancement in the capabilities of large language models (LLMs). This feature allows users to perform a new task without updating the model. Concretely, users can address tasks during the inference time by conditioning on a few input-label pair demonstrations along with the test input. It is different than the conventional fine-tuning paradigm and offers more… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  16. arXiv:2407.03160  [pdf, other

    cs.CR cs.CL cs.LG

    SOS! Soft Prompt Attack Against Open-Source Large Language Models

    Authors: Ziqing Yang, Michael Backes, Yang Zhang, Ahmed Salem

    Abstract: Open-source large language models (LLMs) have become increasingly popular among both the general public and industry, as they can be customized, fine-tuned, and freely used. However, some open-source LLMs require approval before usage, which has led to third parties publishing their own easily accessible versions. Similarly, third parties have been publishing fine-tuned or quantized variants of th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  17. arXiv:2406.18167  [pdf, other

    astro-ph.HE

    H.E.S.S. observations of the 2021 periastron passage of PSR B1259-63/LS 2883

    Authors: H. E. S. S. Collaboration, F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, V. Barbosa Martins, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, J. Borowska, M. Bouyahiaoui, R. Brose, A. Brown, F. Brun, B. Bruno, T. Bulik, C. Burger-Scheidlin, S. Caroff, S. Casanova , et al. (119 additional authors not shown)

    Abstract: PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ day… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: accepted to A&A

  18. arXiv:2405.19103  [pdf, other

    cs.CR cs.LG

    Voice Jailbreak Attacks Against GPT-4o

    Authors: Xinyue Shen, Yixin Wu, Michael Backes, Yang Zhang

    Abstract: Recently, the concept of artificial assistants has evolved from science fiction into real-world applications. GPT-4o, the newest multimodal large language model (MLLM) across audio, vision, and text, has further blurred the line between fiction and reality by enabling more natural human-computer interactions. However, the advent of GPT-4o's voice mode may also introduce a new attack surface. In th… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  19. arXiv:2405.10089  [pdf, other

    cs.PL

    Do You Even Lift? Strengthening Compiler Security Guarantees Against Spectre Attacks

    Authors: Xaver Fabian, Marco Patrignani, Marco Guarnieri, Michael Backes

    Abstract: Mainstream compilers implement different countermeasures to prevent specific classes of speculative execution attacks. Unfortunately, these countermeasures either lack formal guarantees or come with proofs restricted to speculative semantics capturing only a subset of the speculation mechanisms supported by modern CPUs, thereby limiting their practical applicability. Ideally, these security proofs… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  20. arXiv:2405.05784  [pdf, other

    cs.CR cs.LG

    Link Stealing Attacks Against Inductive Graph Neural Networks

    Authors: Yixin Wu, Xinlei He, Pascal Berrang, Mathias Humbert, Michael Backes, Neil Zhenqiang Gong, Yang Zhang

    Abstract: A graph neural network (GNN) is a type of neural network that is specifically designed to process graph-structured data. Typically, GNNs can be implemented in two settings, including the transductive setting and the inductive setting. In the transductive setting, the trained model can only predict the labels of nodes that were observed at the training time. In the inductive setting, the trained mo… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: To appear in the 24th Privacy Enhancing Technologies Symposium (PETS 2024), July 15-20, 2024

  21. arXiv:2405.03486  [pdf, other

    cs.CR cs.CV cs.SI

    UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images

    Authors: Yiting Qu, Xinyue Shen, Yixin Wu, Michael Backes, Savvas Zannettou, Yang Zhang

    Abstract: With the advent of text-to-image models and concerns about their misuse, developers are increasingly relying on image safety classifiers to moderate their generated unsafe images. Yet, the performance of current image safety classifiers remains unknown for both real-world and AI-generated images. In this work, we propose UnsafeBench, a benchmarking framework that evaluates the effectiveness and ro… ▽ More

    Submitted 5 September, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  22. arXiv:2404.00108  [pdf, other

    cs.CR

    Efficient Data-Free Model Stealing with Label Diversity

    Authors: Yiyong Liu, Rui Wen, Michael Backes, Yang Zhang

    Abstract: Machine learning as a Service (MLaaS) allows users to query the machine learning model in an API manner, which provides an opportunity for users to enjoy the benefits brought by the high-performance model trained on valuable data. This interface boosts the proliferation of machine learning based applications, while on the other hand, it introduces the attack surface for model stealing attacks. Exi… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  23. Unveiling extended gamma-ray emission around HESS J1813-178

    Authors: F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, A. Baktash, V. Barbosa Martins, J. Barnard, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, B. Bi, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, J. Borowska, M. Bouyahiaoui, M. Breuhaus, R. Brose, F. Brun, B. Bruno, T. Bulik, C. Burger-Scheidlin , et al. (126 additional authors not shown)

    Abstract: HESS J1813$-$178 is a very-high-energy $γ$-ray source spatially coincident with the young and energetic pulsar PSR J1813$-$1749 and thought to be associated with its pulsar wind nebula (PWN). Recently, evidence for extended high-energy emission in the vicinity of the pulsar has been revealed in the Fermi Large Area Telescope (LAT) data. This motivates revisiting the HESS J1813$-$178 region, taking… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 13+5 pages, 13+11 figures. Accepted for publication in A&A. Corresponding authors: T.Wach, A.Mitchell, V.Joshi, P.Chambéry

    Journal ref: A&A 686, A149 (2024)

  24. Spectrum and extension of the inverse-Compton emission of the Crab Nebula from a combined Fermi-LAT and H.E.S.S. analysis

    Authors: F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, A. Baktash, V. Barbosa Martins, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, B. Bi, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, J. Borowska, F. Bradascio, M. Breuhaus, R. Brose, A. Brown, F. Brun, B. Bruno, T. Bulik, C. Burger-Scheidlin , et al. (137 additional authors not shown)

    Abstract: The Crab Nebula is a unique laboratory for studying the acceleration of electrons and positrons through their non-thermal radiation. Observations of very-high-energy $γ$ rays from the Crab Nebula have provided important constraints for modelling its broadband emission. We present the first fully self-consistent analysis of the Crab Nebula's $γ$-ray emission between 1 GeV and $\sim$100 TeV, that is… ▽ More

    Submitted 21 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 18+6 pages, 15+2 figures. Accepted for publication in A&A. Corresponding authors: M. Meyer, L. Mohrmann, T. Unbehaun. v2: after A&A language editing

    Journal ref: A&A 686, A308 (2024)

  25. arXiv:2403.04857  [pdf, other

    hep-ph astro-ph.HE hep-ex

    Dark Matter Line Searches with the Cherenkov Telescope Array

    Authors: S. Abe, J. Abhir, A. Abhishek, F. Acero, A. Acharyya, R. Adam, A. Aguasca-Cabot, I. Agudo, A. Aguirre-Santaella, J. Alfaro, R. Alfaro, N. Alvarez-Crespo, R. Alves Batista, J. -P. Amans, E. Amato, G. Ambrosi, L. Angel, C. Aramo, C. Arcaro, T. T. H. Arnesen, L. Arrabito, K. Asano, Y. Ascasibar, J. Aschersleben, H. Ashkar , et al. (540 additional authors not shown)

    Abstract: Monochromatic gamma-ray signals constitute a potential smoking gun signature for annihilating or decaying dark matter particles that could relatively easily be distinguished from astrophysical or instrumental backgrounds. We provide an updated assessment of the sensitivity of the Cherenkov Telescope Array (CTA) to such signals, based on observations of the Galactic centre region as well as of sele… ▽ More

    Submitted 23 July, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 44 pages JCAP style (excluding author list and references), 19 figures; minor changes to match published version

    Journal ref: JCAP 07 (2024) 047

  26. arXiv:2402.13330  [pdf, other

    astro-ph.HE astro-ph.IM

    Curvature in the very-high energy gamma-ray spectrum of M87

    Authors: H. E. S. S. Collaboration, F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, V. Barbosa Martins, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, F. Bradascio, R. Brose, F. Brun, B. Bruno, T. Bulik C. Burger-Scheidlin, T. Bylund, S. Casanova, R. Cecil, J. Celic, M. Cerruti , et al. (110 additional authors not shown)

    Abstract: The radio galaxy M87 is a variable very-high energy (VHE) gamma-ray source, exhibiting three major flares reported in 2005, 2008, and 2010. Despite extensive studies, the origin of the VHE gamma-ray emission is yet to be understood. In this study, we investigate the VHE gamma-ray spectrum of M87 during states of high gamma-ray activity, utilizing 20.2$\,$ hours the H.E.S.S. observations. Our findi… ▽ More

    Submitted 25 April, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 10 pages, 7 figures. Accepted for publication in A&A. Corresponding authors: Victor Barbosa Martins, Rahul Cecil, Iryna Lypova, Manuel Meyer, Perri Zilberman. Supplementary material: https://zenodo.org/records/10781524

    Journal ref: A&A, 685, A96 (2024)

  27. arXiv:2402.09179  [pdf, other

    cs.CR cs.LG

    Instruction Backdoor Attacks Against Customized LLMs

    Authors: Rui Zhang, Hongwei Li, Rui Wen, Wenbo Jiang, Yuan Zhang, Michael Backes, Yun Shen, Yang Zhang

    Abstract: The increasing demand for customized Large Language Models (LLMs) has led to the development of solutions like GPTs. These solutions facilitate tailored LLM creation via natural language prompts without coding. However, the trustworthiness of third-party custom versions of LLMs remains an essential concern. In this paper, we propose the first instruction backdoor attacks against applications integ… ▽ More

    Submitted 28 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  28. arXiv:2402.05668  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Comprehensive Assessment of Jailbreak Attacks Against LLMs

    Authors: Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang

    Abstract: Misuse of the Large Language Models (LLMs) has raised widespread concern. To address this issue, safeguards have been taken to ensure that LLMs align with social ethics. However, recent findings have revealed an unsettling vulnerability bypassing the safeguards of LLMs, known as jailbreak attacks. By applying techniques, such as employing role-playing scenarios, adversarial examples, or subtle sub… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 18 pages, 12 figures

  29. arXiv:2402.02987  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models

    Authors: Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang

    Abstract: Significant advancements have recently been made in large language models represented by GPT models. Users frequently have multi-round private conversations with cloud-hosted GPT models for task optimization. Yet, this operational paradigm introduces additional attack surfaces, particularly in custom GPTs and hijacked chat sessions. In this paper, we introduce a straightforward yet potent Conversa… ▽ More

    Submitted 7 October, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted in EMNLP 2024. 14 pages, 10 figures

  30. Acceleration and transport of relativistic electrons in the jets of the microquasar SS 433

    Authors: F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, V. Barbosa Martins, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, B. Bi, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, J. Borowska, M. Bouyahiaou, M. Breuhau, R. Brose, A. M. Brown, F. Brun, B. Bruno, T. Bulik, C. Burger-Scheidlin, S. Caroff , et al. (140 additional authors not shown)

    Abstract: SS 433 is a microquasar, a stellar binary system with collimated relativistic jets. We observed SS 433 in gamma rays using the High Energy Stereoscopic System (H.E.S.S.), finding an energy-dependent shift in the apparent position of the gamma-ray emission of the parsec-scale jets. These observations trace the energetic electron population and indicate the gamma rays are produced by inverse-Compton… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Submitted 20th Apr. 2023, published 25th January 2024 (accepted version)

    Journal ref: Science383,402-406(2024)

  31. arXiv:2401.12233  [pdf, other

    cs.LG

    Memorization in Self-Supervised Learning Improves Downstream Generalization

    Authors: Wenhao Wang, Muhammad Ahmad Kaleem, Adam Dziedzic, Michael Backes, Nicolas Papernot, Franziska Boenisch

    Abstract: Self-supervised learning (SSL) has recently received significant attention due to its ability to train high-performance encoders purely on unlabeled data-often scraped from the internet. This data can still be sensitive and empirical evidence suggests that SSL encoders memorize private information of their training data and can disclose them at inference time. Since existing theoretical definition… ▽ More

    Submitted 18 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted at ICLR 2024

  32. arXiv:2401.07911  [pdf, other

    astro-ph.HE astro-ph.GA

    Optical spectroscopy of blazars for the Cherenkov Telescope Array -- III

    Authors: F. D'Ammando, P. Goldoni, W. Max-Moerbeck, J. Becerra Gonzalez, E. Kasai, D. A. Williams, N. Alvarez-Crespo, M. Backes, U. Barres de Almeida, C. Boisson, G. Cotter, V. Fallah Ramazani, O. Hervet, E. Lindfors, D. Mukhi-Nilo, S. Pita, M. Splettstoesser, B. van Soelen

    Abstract: Due to their almost featureless optical/UV spectra, it is challenging to measure the redshifts of BL Lacs. As a result, about 50% of gamma-ray BL Lacs lack a firm measurement of this property, which is fundamental for population studies, indirect estimates of the EBL, and fundamental physics probes. This paper is the third in a series of papers aimed at determining the redshift of a sample of blaz… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in Astronomy and Astrophysics. 17 pages, 4 Figures, 10 Tables

  33. arXiv:2401.07071  [pdf, other

    astro-ph.HE

    TeV flaring activity of the AGN PKS 0625-354 in November 2018

    Authors: H. E. S. S. Collaboration, F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, A. Baktash, V. Barbosa Martins, J. Barnard, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, B. Bi, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, J. Borowska, F. Bradascio, M. Breuhaus, R. Brose, A. Brown, F. Brun, B. Bruno , et al. (117 additional authors not shown)

    Abstract: Most $γ$-ray detected active galactic nuclei are blazars with one of their relativistic jets pointing towards the Earth. Only a few objects belong to the class of radio galaxies or misaligned blazars. Here, we investigate the nature of the object PKS 0625-354, its $γ$-ray flux and spectral variability and its broad-band spectral emission with observations from H.E.S.S., Fermi-LAT, Swift-XRT, and U… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 9 pages, 6 figures, accepted for publication in Astronomy & Astrophysics

  34. arXiv:2401.05561  [pdf, other

    cs.CL

    TrustLLM: Trustworthiness in Large Language Models

    Authors: Yue Huang, Lichao Sun, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang , et al. (45 additional authors not shown)

    Abstract: Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in… ▽ More

    Submitted 30 September, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: This work is still under work and we welcome your contribution

  35. arXiv:2312.11213  [pdf, other

    cs.CR cs.CV cs.CY

    FAKEPCD: Fake Point Cloud Detection via Source Attribution

    Authors: Yiting Qu, Zhikun Zhang, Yun Shen, Michael Backes, Yang Zhang

    Abstract: To prevent the mischievous use of synthetic (fake) point clouds produced by generative models, we pioneer the study of detecting point cloud authenticity and attributing them to their sources. We propose an attribution framework, FAKEPCD, to attribute (fake) point clouds to their respective generative models (or real-world collections). The main idea of FAKEPCD is to train an attribution model tha… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: To Appear in the 19th ACM ASIA Conference on Computer and Communications Security, July 1-5, 2024

  36. arXiv:2311.14685  [pdf, other

    cs.CY cs.CL cs.CR cs.LG

    Comprehensive Assessment of Toxicity in ChatGPT

    Authors: Boyang Zhang, Xinyue Shen, Wai Man Si, Zeyang Sha, Zeyuan Chen, Ahmed Salem, Yun Shen, Michael Backes, Yang Zhang

    Abstract: Moderating offensive, hateful, and toxic language has always been an important but challenging topic in the domain of safe use in NLP. The emerging large language models (LLMs), such as ChatGPT, can potentially further accentuate this threat. Previous works have discovered that ChatGPT can generate toxic responses using carefully crafted inputs. However, limited research has been done to systemati… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  37. arXiv:2311.12620  [pdf, other

    astro-ph.HE gr-qc

    Investigating the Lorentz Invariance Violation effect using different cosmological backgrounds

    Authors: Hassan Abdalla, Garret Cotter, Michael Backes, Eli Kasai, Markus Böttcher

    Abstract: Familiar concepts in physics, such as Lorentz symmetry, are expected to be broken at energies approaching the Planck energy scale as predicted by several quantum-gravity theories. However, such very large energies are unreachable by current experiments on Earth. Current and future Cherenkov telescope facilities may have the capability to measure the accumulated deformation from Lorentz symmetry fo… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in Class. Quantum Grav., 10 pages, 1 figure

    Report number: IPARCOS-UCM-23-127

  38. arXiv:2310.19410  [pdf, other

    cs.CR cs.CV cs.LG

    Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models

    Authors: Minxing Zhang, Ning Yu, Rui Wen, Michael Backes, Yang Zhang

    Abstract: Generative models have demonstrated revolutionary success in various visual creation tasks, but in the meantime, they have been exposed to the threat of leaking private information of their training data. Several membership inference attacks (MIAs) have been proposed to exhibit the privacy vulnerability of generative models by classifying a query image as a training dataset member or nonmember. Ho… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  39. arXiv:2310.17037  [pdf, other

    physics.data-an hep-ex hep-ph

    Event-by-event Comparison between Machine-Learning- and Transfer-Matrix-based Unfolding Methods

    Authors: Mathias Backes, Anja Butter, Monica Dunford, Bogdan Malaescu

    Abstract: The unfolding of detector effects is a key aspect of comparing experimental data with theoretical predictions. In recent years, different Machine-Learning methods have been developed to provide novel features, e.g. high dimensionality or a probabilistic single-event unfolding based on generative neural networks. Traditionally, many analyses unfold detector effects using transfer-matrix--based algo… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 22 pages, 11 figures

  40. arXiv:2310.16613  [pdf, other

    cs.CR

    On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts

    Authors: Yixin Wu, Ning Yu, Michael Backes, Yun Shen, Yang Zhang

    Abstract: Text-to-image models like Stable Diffusion have had a profound impact on daily life by enabling the generation of photorealistic images from textual prompts, fostering creativity, and enhancing visual experiences across various applications. However, these models also pose risks. Previous studies have successfully demonstrated that manipulated prompts can elicit text-to-image models to generate un… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 18 pages, 31 figures

  41. arXiv:2310.12665  [pdf, other

    cs.CR cs.LG

    SecurityNet: Assessing Machine Learning Vulnerabilities on Public Models

    Authors: Boyang Zhang, Zheng Li, Ziqing Yang, Xinlei He, Michael Backes, Mario Fritz, Yang Zhang

    Abstract: While advanced machine learning (ML) models are deployed in numerous real-world applications, previous works demonstrate these models have security and privacy vulnerabilities. Various empirical research has been done in this field. However, most of the experiments are performed on target ML models trained by the security researchers themselves. Due to the high computational resource requirement f… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: To appear in the 33rd USENIX Security Symposium, August 2024, Philadelphia, PA, USA

  42. arXiv:2310.11970  [pdf, other

    cs.CR

    Quantifying Privacy Risks of Prompts in Visual Prompt Learning

    Authors: Yixin Wu, Rui Wen, Michael Backes, Pascal Berrang, Mathias Humbert, Yun Shen, Yang Zhang

    Abstract: Large-scale pre-trained models are increasingly adapted to downstream tasks through a new paradigm called prompt learning. In contrast to fine-tuning, prompt learning does not update the pre-trained model's parameters. Instead, it only learns an input perturbation, namely prompt, to be added to the downstream task data for predictions. Given the fast development of prompt learning, a well-generali… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: To appear in the 33rd USENIX Security Symposium, August 14-16, 2024

  43. arXiv:2310.11850  [pdf, other

    cs.CR cs.CV cs.LG

    Revisiting Transferable Adversarial Image Examples: Attack Categorization, Evaluation Guidelines, and New Insights

    Authors: Zhengyu Zhao, Hanwei Zhang, Renjue Li, Ronan Sicre, Laurent Amsaleg, Michael Backes, Qi Li, Chao Shen

    Abstract: Transferable adversarial examples raise critical security concerns in real-world, black-box attack scenarios. However, in this work, we identify two main problems in common evaluation practices: (1) For attack transferability, lack of systematic, one-to-one attack comparison and fair hyperparameter settings. (2) For attack stealthiness, simply no comparisons. To address these problems, we establis… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Code is available at https://github.com/ZhengyuZhao/TransferAttackEval

  44. arXiv:2310.11397  [pdf, other

    cs.CR cs.LG

    Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning

    Authors: Rui Wen, Tianhao Wang, Michael Backes, Yang Zhang, Ahmed Salem

    Abstract: Large Language Models (LLMs) are powerful tools for natural language processing, enabling novel applications and user experiences. However, to achieve optimal performance, LLMs often require adaptation with private data, which poses privacy and security challenges. Several techniques have been proposed to adapt LLMs with private data, such as Low-Rank Adaptation (LoRA), Soft Prompt Tuning (SPT), a… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  45. arXiv:2310.08732  [pdf, other

    cs.LG cs.CR

    Provably Robust Cost-Sensitive Learning via Randomized Smoothing

    Authors: Yuan Xin, Michael Backes, Xiao Zhang

    Abstract: We study the problem of robust learning against adversarial perturbations under cost-sensitive scenarios, where the potential harm of different types of misclassifications is encoded in a cost matrix. Existing approaches are either empirical and cannot certify robustness or suffer from inherent scalability issues. In this work, we investigate whether randomized smoothing, a scalable framework for… ▽ More

    Submitted 30 May, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 19 pages, 9 tables, 5 figures

  46. arXiv:2310.07676  [pdf, other

    cs.CR cs.CL cs.LG

    Composite Backdoor Attacks Against Large Language Models

    Authors: Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang

    Abstract: Large language models (LLMs) have demonstrated superior performance compared to previous methods on various tasks, and often serve as the foundation models for many researches and services. However, the untrustworthy third-party LLMs may covertly introduce vulnerabilities for downstream tasks. In this paper, we explore the vulnerability of LLMs through the lens of backdoor attacks. Different from… ▽ More

    Submitted 30 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: To Appear in Findings of the Association for Computational Linguistics: NAACL 2024, June 2024

  47. arXiv:2310.07632  [pdf, other

    cs.CR cs.CV cs.LG

    Prompt Backdoors in Visual Prompt Learning

    Authors: Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang

    Abstract: Fine-tuning large pre-trained computer vision models is infeasible for resource-limited users. Visual prompt learning (VPL) has thus emerged to provide an efficient and flexible alternative to model fine-tuning through Visual Prompt as a Service (VPPTaaS). Specifically, the VPPTaaS provider optimizes a visual prompt given downstream data, and downstream users can use this prompt together with the… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  48. arXiv:2310.07413  [pdf, other

    astro-ph.HE

    Chasing Gravitational Waves with the Cherenkov Telescope Array

    Authors: Jarred Gershon Green, Alessandro Carosi, Lara Nava, Barbara Patricelli, Fabian Schüssler, Monica Seglar-Arroyo, Cta Consortium, :, Kazuki Abe, Shotaro Abe, Atreya Acharyya, Remi Adam, Arnau Aguasca-Cabot, Ivan Agudo, Jorge Alfaro, Nuria Alvarez-Crespo, Rafael Alves Batista, Jean-Philippe Amans, Elena Amato, Filippo Ambrosino, Ekrem Oguzhan Angüner, Lucio Angelo Antonelli, Carla Aramo, Cornelia Arcaro, Luisa Arrabito , et al. (545 additional authors not shown)

    Abstract: The detection of gravitational waves from a binary neutron star merger by Advanced LIGO and Advanced Virgo (GW170817), along with the discovery of the electromagnetic counterparts of this gravitational wave event, ushered in a new era of multimessenger astronomy, providing the first direct evidence that BNS mergers are progenitors of short gamma-ray bursts (GRBs). Such events may also produce very… ▽ More

    Submitted 5 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Presented at the 38th International Cosmic Ray Conference (ICRC 2023), 2023 (arXiv:2309.08219)

    Report number: CTA-ICRC/2023/30

  49. Discovery of a Radiation Component from the Vela Pulsar Reaching 20 Teraelectronvolts

    Authors: The H. E. S. S. Collaboration, :, F. Aharonian, F. Ait Benkhali, J. Aschersleben, H. Ashkar, M. Backes, V. Barbosa Martins, R. Batzofin, Y. Becherini, D. Berge, K. Bernlöhr, B. Bi, M. Böttcher, C. Boisson, J. Bolmont, M. de Bony de Lavergne, J. Borowska, F. Bradascio, M. Breuhaus, R. Brose, F. Brun, B. Bruno, T. Bulik, C. Burger-Scheidlin , et al. (157 additional authors not shown)

    Abstract: Gamma-ray observations have established energetic isolated pulsars as outstanding particle accelerators and antimatter factories in the Galaxy. There is, however, no consensus regarding the acceleration mechanisms and the radiative processes at play, nor the locations where these take place. The spectra of all observed gamma-ray pulsars to date show strong cutoffs or a break above energies of a fe… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 38 pages, 6 figures. This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this article is published in Nature Astronomy, Nat Astron (2023), and is available online at https://doi.org/10.1038/s41550-023-02052-3

  50. arXiv:2310.05141  [pdf, other

    cs.CR cs.LG

    Transferable Availability Poisoning Attacks

    Authors: Yiyong Liu, Michael Backes, Xiao Zhang

    Abstract: We consider availability data poisoning attacks, where an adversary aims to degrade the overall test accuracy of a machine learning model by crafting small perturbations to its training data. Existing poisoning strategies can achieve the attack goal but assume the victim to employ the same learning method as what the adversary uses to mount the attack. In this paper, we argue that this assumption… ▽ More

    Submitted 6 June, 2024; v1 submitted 8 October, 2023; originally announced October 2023.