-
Satellite-Terrestrial Quantum Networks and the Global Quantum Internet
Authors:
Andrea Conti,
Robert Malaney,
Moe Z. Win
Abstract:
This paper will explore the design and implementation of quantum networks in space integrated with quantum networks on Earth. We propose a three-layer approach, involving GEO and LEO satellites integrated with terrestrial ground stations. We first analyze the channel conditions between the three layers, and then highlight the key role of LEO satellites in the integrated space-terrestrial system -…
▽ More
This paper will explore the design and implementation of quantum networks in space integrated with quantum networks on Earth. We propose a three-layer approach, involving GEO and LEO satellites integrated with terrestrial ground stations. We first analyze the channel conditions between the three layers, and then highlight the key role of LEO satellites in the integrated space-terrestrial system - namely the source of entanglement distribution between specified terrestrial stations via direct downlink quantum-optical channels. The GEO satellites in the considered system are used primarily as coordination stations, managing and directing the LEO satellites regarding the positioning and timing of entanglement distribution. Complexity, in the form of entanglement distillation and quantum-state correction, is concentrated at the terrestrial stations, and teleportation is used as the primary quantum channel in the LEO uplinks and the inter-terrestrial channels. Although our designs are futuristic in that they assume limited quantum memory at the transceivers, we also discuss some near-term uses of our network in which no quantum memory is available.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
Exploring Fine-grained Retail Product Discrimination with Zero-shot Object Classification Using Vision-Language Models
Authors:
Anil Osman Tur,
Alessandro Conti,
Cigdem Beyan,
Davide Boscaini,
Roberto Larcher,
Stefano Messelodi,
Fabio Poiesi,
Elisa Ricci
Abstract:
In smart retail applications, the large number of products and their frequent turnover necessitate reliable zero-shot object classification methods. The zero-shot assumption is essential to avoid the need for re-training the classifier every time a new product is introduced into stock or an existing product undergoes rebranding. In this paper, we make three key contributions. Firstly, we introduce…
▽ More
In smart retail applications, the large number of products and their frequent turnover necessitate reliable zero-shot object classification methods. The zero-shot assumption is essential to avoid the need for re-training the classifier every time a new product is introduced into stock or an existing product undergoes rebranding. In this paper, we make three key contributions. Firstly, we introduce the MIMEX dataset, comprising 28 distinct product categories. Unlike existing datasets in the literature, MIMEX focuses on fine-grained product classification and includes a diverse range of retail products. Secondly, we benchmark the zero-shot object classification performance of state-of-the-art vision-language models (VLMs) on the proposed MIMEX dataset. Our experiments reveal that these models achieve unsatisfactory fine-grained classification performance, highlighting the need for specialized approaches. Lastly, we propose a novel ensemble approach that integrates embeddings from CLIP and DINOv2 with dimensionality reduction techniques to enhance classification performance. By combining these components, our ensemble approach outperforms VLMs, effectively capturing visual cues crucial for fine-grained product discrimination. Additionally, we introduce a class adaptation method that utilizes visual prototyping with limited samples in scenarios with scarce labeled data, addressing a critical need in retail environments where product variety frequently changes. To encourage further research into zero-shot object classification for smart retail applications, we will release both the MIMEX dataset and benchmark to the research community. Interested researchers can contact the authors for details on the terms and conditions of use. The code is available: https://github.com/AnilOsmanTur/Zero-shot-Retail-Product-Classification.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor
Authors:
Andrea Conti,
Matteo Poggi,
Valerio Cambareri,
Stefano Mattoccia
Abstract:
High frame rate and accurate depth estimation plays an important role in several tasks crucial to robotics and automotive perception. To date, this can be achieved through ToF and LiDAR devices for indoor and outdoor applications, respectively. However, their applicability is limited by low frame rate, energy consumption, and spatial sparsity. Depth on Demand (DoD) allows for accurate temporal and…
▽ More
High frame rate and accurate depth estimation plays an important role in several tasks crucial to robotics and automotive perception. To date, this can be achieved through ToF and LiDAR devices for indoor and outdoor applications, respectively. However, their applicability is limited by low frame rate, energy consumption, and spatial sparsity. Depth on Demand (DoD) allows for accurate temporal and spatial depth densification achieved by exploiting a high frame rate RGB sensor coupled with a potentially lower frame rate and sparse active depth sensor. Our proposal jointly enables lower energy consumption and denser shape reconstruction, by significantly reducing the streaming requirements on the depth sensor thanks to its three core stages: i) multi-modal encoding, ii) iterative multi-modal integration, and iii) depth decoding. We present extended evidence assessing the effectiveness of DoD on indoor and outdoor video datasets, covering both environment scanning and automotive perception use cases.
△ Less
Submitted 12 September, 2024;
originally announced September 2024.
-
${\cal N}=(4,4)$ supersymmetric AdS$_3$ solutions in $d=11$
Authors:
Andrea Conti,
Niall T. Macpherson
Abstract:
We derive necessary and sufficient conditions for AdS$_3$ solutions of $d=11$ supergravity to preserve ${\cal N}=(1,1)$ supersymmetry in terms of G-structures. Such solutions necessarily support an SU(3)-structure on the internal 8-manifold M$_8$, in terms of which we phrase the conditions for supersymmetry preservation. We use this to derive the local form of all ${\cal N}=(4,4)$ supersymmetric A…
▽ More
We derive necessary and sufficient conditions for AdS$_3$ solutions of $d=11$ supergravity to preserve ${\cal N}=(1,1)$ supersymmetry in terms of G-structures. Such solutions necessarily support an SU(3)-structure on the internal 8-manifold M$_8$, in terms of which we phrase the conditions for supersymmetry preservation. We use this to derive the local form of all ${\cal N}=(4,4)$ supersymmetric AdS$_3$ solutions in $d=11$, for which M$_8$ decomposes as a foliation of a 3-sphere over a 5 dimensional base. There are 3 independent classes, 2 of which preserve the small superconformal algebra and one preserving its large counterpart for which M$_5$ contains a second 3-sphere. We show that for each solution with large (4,4) supersymmetry there are two corresponding solutions with small $(4,4)$, one for which M$_5$ maintains its 3-sphere, one where this blows up to $\mathbb{R}^3$ which can be compactified to $\mathbb{T}^3$. We use our results to construct several new solutions that lie within our derived classes as well as recovering some existing solutions.
△ Less
Submitted 30 August, 2024;
originally announced August 2024.
-
LiDAR-Event Stereo Fusion with Hallucinations
Authors:
Luca Bartolomei,
Matteo Poggi,
Andrea Conti,
Stefano Mattoccia
Abstract:
Event stereo matching is an emerging technique to estimate depth from neuromorphic cameras; however, events are unlikely to trigger in the absence of motion or the presence of large, untextured regions, making the correspondence problem extremely challenging. Purposely, we propose integrating a stereo event camera with a fixed-frequency active sensor -- e.g., a LiDAR -- collecting sparse depth mea…
▽ More
Event stereo matching is an emerging technique to estimate depth from neuromorphic cameras; however, events are unlikely to trigger in the absence of motion or the presence of large, untextured regions, making the correspondence problem extremely challenging. Purposely, we propose integrating a stereo event camera with a fixed-frequency active sensor -- e.g., a LiDAR -- collecting sparse depth measurements, overcoming the aforementioned limitations. Such depth hints are used by hallucinating -- i.e., inserting fictitious events -- the stacks or raw input streams, compensating for the lack of information in the absence of brightness changes. Our techniques are general, can be adapted to any structured representation to stack events and outperform state-of-the-art fusion methods applied to event-based stereo.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
Deconstruction and surface defects in 6d CFTs
Authors:
Andrea Conti,
Giuseppe Dibitetto,
Yolanda Lozano,
Nicolò Petri,
Anayeli Ramírez
Abstract:
We study the two families of AdS$_3\times S^3\times S^2\times Σ_2$ solutions to massive Type IIA supergravity with small and large $(0,4)$ supersymmetries constructed recently in the literature, in connection with the AdS$_7\times S^2\times I$ solutions to massive Type IIA, to which they asymptote locally. Based on our analysis of various observables, that we study holographically, we propose an i…
▽ More
We study the two families of AdS$_3\times S^3\times S^2\times Σ_2$ solutions to massive Type IIA supergravity with small and large $(0,4)$ supersymmetries constructed recently in the literature, in connection with the AdS$_7\times S^2\times I$ solutions to massive Type IIA, to which they asymptote locally. Based on our analysis of various observables, that we study holographically, we propose an interpretation of the first class of solutions as dual to deconstructed 6d (1,0) CFTs dual to AdS$_7$, and of the second class as dual to surface defects in the same 6d theories. Among the observables that we study are baryon vertices and giant graviton configurations in quiver-like constructions.
△ Less
Submitted 11 September, 2024; v1 submitted 31 July, 2024;
originally announced July 2024.
-
Half-BPS Janus solutions in AdS$_7$
Authors:
Andrea Conti,
Giuseppe Dibitetto,
Yolanda Lozano,
Nicolò Petri,
Anayeli Ramírez
Abstract:
We study half-BPS flows in gauged minimal 7d supergravity featured by an AdS$_3\times S^3$ slicing of the metric, supported by a dyonic three-form field. We first present a novel strategy for analytic integration of the BPS equations, which makes use of the integrals of motion. Subsequently, we discuss the suitable choice of integration constants that gives rise to smooth geometries. These flows a…
▽ More
We study half-BPS flows in gauged minimal 7d supergravity featured by an AdS$_3\times S^3$ slicing of the metric, supported by a dyonic three-form field. We first present a novel strategy for analytic integration of the BPS equations, which makes use of the integrals of motion. Subsequently, we discuss the suitable choice of integration constants that gives rise to smooth geometries. These flows are asymptotically locally AdS$_7$ in their UV limit, while their IR geometry is AdS$_3\times \mathbb{R}^4$. We then discuss their uplifts to 11d and massive IIA supergravity and observe that they describe one-parameter deformations of their AdS$_7\times S^4$ and AdS$_7\times S^3$ vacua, respectively, their holographic interpretation being as conformal defect CFT$_2$'s within the corresponding dual SCFT$_6$'s. We conclude with the computation of the holographic central charge, by focussing on the M-theory interpretation.
△ Less
Submitted 11 September, 2024; v1 submitted 31 July, 2024;
originally announced July 2024.
-
Automatic benchmarking of large multimodal models via iterative experiment programming
Authors:
Alessandro Conti,
Enrico Fini,
Paolo Rota,
Yiming Wang,
Massimiliano Mancini,
Elisa Ricci
Abstract:
Assessing the capabilities of large multimodal models (LMMs) often requires the creation of ad-hoc evaluations. Currently, building new benchmarks requires tremendous amounts of manual work for each specific analysis. This makes the evaluation process tedious and costly. In this paper, we present APEx, Automatic Programming of Experiments, the first framework for automatic benchmarking of LMMs. Gi…
▽ More
Assessing the capabilities of large multimodal models (LMMs) often requires the creation of ad-hoc evaluations. Currently, building new benchmarks requires tremendous amounts of manual work for each specific analysis. This makes the evaluation process tedious and costly. In this paper, we present APEx, Automatic Programming of Experiments, the first framework for automatic benchmarking of LMMs. Given a research question expressed in natural language, APEx leverages a large language model (LLM) and a library of pre-specified tools to generate a set of experiments for the model at hand, and progressively compile a scientific report. The report drives the testing procedure: based on the current status of the investigation, APEx chooses which experiments to perform and whether the results are sufficient to draw conclusions. Finally, the LLM refines the report, presenting the results to the user in natural language. Thanks to its modularity, our framework is flexible and extensible as new tools become available. Empirically, APEx reproduces the findings of existing studies while allowing for arbitrary analyses and hypothesis testing.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Stereo-Depth Fusion through Virtual Pattern Projection
Authors:
Luca Bartolomei,
Matteo Poggi,
Fabio Tosi,
Andrea Conti,
Stefano Mattoccia
Abstract:
This paper presents a novel general-purpose stereo and depth data fusion paradigm that mimics the active stereo principle by replacing the unreliable physical pattern projector with a depth sensor. It works by projecting virtual patterns consistent with the scene geometry onto the left and right images acquired by a conventional stereo camera, using the sparse hints obtained from a depth sensor, t…
▽ More
This paper presents a novel general-purpose stereo and depth data fusion paradigm that mimics the active stereo principle by replacing the unreliable physical pattern projector with a depth sensor. It works by projecting virtual patterns consistent with the scene geometry onto the left and right images acquired by a conventional stereo camera, using the sparse hints obtained from a depth sensor, to facilitate the visual correspondence. Purposely, any depth sensing device can be seamlessly plugged into our framework, enabling the deployment of a virtual active stereo setup in any possible environment and overcoming the severe limitations of physical pattern projection, such as the limited working range and environmental conditions. Exhaustive experiments on indoor and outdoor datasets featuring both long and close range, including those providing raw, unfiltered depth hints from off-the-shelf depth sensors, highlight the effectiveness of our approach in notably boosting the robustness and accuracy of algorithms and deep stereo without any code modification and even without re-training. Additionally, we assess the performance of our strategy on active stereo evaluation datasets with conventional pattern projection. Indeed, in all these scenarios, our virtual pattern projection paradigm achieves state-of-the-art performance. The source code is available at: https://github.com/bartn8/vppstereo.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Stoichiometric reconstruction of the Al$_{2}$O$_{3}$(0001) surface
Authors:
Johanna I. Hütner,
Andrea Conti,
David Kugler,
Florian Mittendorfer,
Georg Kresse,
Michael Schmid,
Ulrike Diebold,
Jan Balajka
Abstract:
Macroscopic properties of materials stem from fundamental atomic-scale details, yet for insulators, resolving surface structures remains a challenge. The basal (0001) plane of $α$-Al$_{2}$O$_{3}$ was imaged with noncontact atomic force microscopy with an atomically-defined tip apex. The surface forms a complex $({\sqrt31} {\times} {\sqrt31})R{\pm}9°$ reconstruction. The lateral positions of the in…
▽ More
Macroscopic properties of materials stem from fundamental atomic-scale details, yet for insulators, resolving surface structures remains a challenge. The basal (0001) plane of $α$-Al$_{2}$O$_{3}$ was imaged with noncontact atomic force microscopy with an atomically-defined tip apex. The surface forms a complex $({\sqrt31} {\times} {\sqrt31})R{\pm}9°$ reconstruction. The lateral positions of the individual O and Al surface atoms come directly from experiment; how these connect to the underlying crystal bulk was determined based on computational modeling. Before the restructuring, the surface Al atoms assume an unfavorable, threefold planar coordination; the reconstruction allows a rehybridization with subsurface O that leads to a substantial energy gain. The reconstructed surface remains stoichiometric, Al$_{2}$O$_{3}$.
△ Less
Submitted 16 September, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
NH$_3$ adsorption and competition with H$_2$O on a hydroxylated aluminosilicate surface
Authors:
Giada Franceschi,
Andrea Conti,
Luca Lezuo,
Rainer Abart,
Florian Mittendorfer,
Michael Schmid,
Ulrike Diebold
Abstract:
The interaction between ammonia (NH$_3$) and (alumino)silicates is of fundamental and applied importance, yet the specifics of NH$_3$ adsorption on silicate surfaces remain largely unexplored, mainly because of experimental challenges related to their electrically insulating nature. An example of this knowledge gap is evident in the context of ice nucleation on silicate dust, wherein the role of N…
▽ More
The interaction between ammonia (NH$_3$) and (alumino)silicates is of fundamental and applied importance, yet the specifics of NH$_3$ adsorption on silicate surfaces remain largely unexplored, mainly because of experimental challenges related to their electrically insulating nature. An example of this knowledge gap is evident in the context of ice nucleation on silicate dust, wherein the role of NH$_3$ for ice nucleation remains debated. This study explores the fundamentals of the interaction between NH$_3$ and microcline feldspar (KAlSi$_3$O$_8$), a common aluminosilicate with outstanding ice nucleation abilities. Atomically resolved non-contact atomic force microscopy, x-ray photoelectron spectroscopy, and density functional theory-based calculations elucidate the adsorption geometry of NH$_3$ on the lowest-energy surface of microcline, the (001) facet, and its interplay with surface hydroxyls and molecular water. NH$_3$ and H$_2$O are found to adsorb molecularly in the same adsorption sites, creating H-bonds with the proximate surface silanol (Si-OH) and aluminol (Al-OH) groups. Despite the closely matched adsorption energies of the two molecules, NH$_3$ readily yields to replacement by H$_2$O, challenging the notion that ice nucleation on microcline proceeds via the creation of an ordered H$_2$O layer atop pre-adsorbed NH$_3$ molecules.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Vocabulary-free Image Classification and Semantic Segmentation
Authors:
Alessandro Conti,
Enrico Fini,
Massimiliano Mancini,
Paolo Rota,
Yiming Wang,
Elisa Ricci
Abstract:
Large vision-language models revolutionized image classification and semantic segmentation paradigms. However, they typically assume a pre-defined set of categories, or vocabulary, at test time for composing textual prompts. This assumption is impractical in scenarios with unknown or evolving semantic context. Here, we address this issue and introduce the Vocabulary-free Image Classification (VIC)…
▽ More
Large vision-language models revolutionized image classification and semantic segmentation paradigms. However, they typically assume a pre-defined set of categories, or vocabulary, at test time for composing textual prompts. This assumption is impractical in scenarios with unknown or evolving semantic context. Here, we address this issue and introduce the Vocabulary-free Image Classification (VIC) task, which aims to assign a class from an unconstrained language-induced semantic space to an input image without needing a known vocabulary. VIC is challenging due to the vastness of the semantic space, which contains millions of concepts, including fine-grained categories. To address VIC, we propose Category Search from External Databases (CaSED), a training-free method that leverages a pre-trained vision-language model and an external database. CaSED first extracts the set of candidate categories from the most semantically similar captions in the database and then assigns the image to the best-matching candidate category according to the same vision-language model. Furthermore, we demonstrate that CaSED can be applied locally to generate a coarse segmentation mask that classifies image regions, introducing the task of Vocabulary-free Semantic Segmentation. CaSED and its variants outperform other more complex vision-language models, on classification and semantic segmentation benchmarks, while using much fewer parameters.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Socially Pertinent Robots in Gerontological Healthcare
Authors:
Xavier Alameda-Pineda,
Angus Addlesee,
Daniel Hernández García,
Chris Reinke,
Soraya Arias,
Federica Arrigoni,
Alex Auternaud,
Lauriane Blavette,
Cigdem Beyan,
Luis Gomez Camara,
Ohad Cohen,
Alessandro Conti,
Sébastien Dacunha,
Christian Dondrup,
Yoav Ellinson,
Francesco Ferro,
Sharon Gannot,
Florian Gras,
Nancie Gunson,
Radu Horaud,
Moreno D'Incà,
Imad Kimouche,
Séverin Lemaignan,
Oliver Lemon,
Cyril Liotard
, et al. (19 additional authors not shown)
Abstract:
Despite the many recent achievements in developing and deploying social robotics, there are still many underexplored environments and applications for which systematic evaluation of such systems by end-users is necessary. While several robotic platforms have been used in gerontological healthcare, the question of whether or not a social interactive robot with multi-modal conversational capabilitie…
▽ More
Despite the many recent achievements in developing and deploying social robotics, there are still many underexplored environments and applications for which systematic evaluation of such systems by end-users is necessary. While several robotic platforms have been used in gerontological healthcare, the question of whether or not a social interactive robot with multi-modal conversational capabilities will be useful and accepted in real-life facilities is yet to be answered. This paper is an attempt to partially answer this question, via two waves of experiments with patients and companions in a day-care gerontological facility in Paris with a full-sized humanoid robot endowed with social and conversational interaction capabilities. The software architecture, developed during the H2020 SPRING project, together with the experimental protocol, allowed us to evaluate the acceptability (AES) and usability (SUS) with more than 60 end-users. Overall, the users are receptive to this technology, especially when the robot perception and action skills are robust to environmental clutter and flexible to handle a plethora of different interactions.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Test-Time Zero-Shot Temporal Action Localization
Authors:
Benedetta Liberatori,
Alessandro Conti,
Paolo Rota,
Yiming Wang,
Elisa Ricci
Abstract:
Zero-Shot Temporal Action Localization (ZS-TAL) seeks to identify and locate actions in untrimmed videos unseen during training. Existing ZS-TAL methods involve fine-tuning a model on a large amount of annotated training data. While effective, training-based ZS-TAL approaches assume the availability of labeled data for supervised learning, which can be impractical in some applications. Furthermore…
▽ More
Zero-Shot Temporal Action Localization (ZS-TAL) seeks to identify and locate actions in untrimmed videos unseen during training. Existing ZS-TAL methods involve fine-tuning a model on a large amount of annotated training data. While effective, training-based ZS-TAL approaches assume the availability of labeled data for supervised learning, which can be impractical in some applications. Furthermore, the training process naturally induces a domain bias into the learned model, which may adversely affect the model's generalization ability to arbitrary videos. These considerations prompt us to approach the ZS-TAL problem from a radically novel perspective, relaxing the requirement for training data. To this aim, we introduce a novel method that performs Test-Time adaptation for Temporal Action Localization (T3AL). In a nutshell, T3AL adapts a pre-trained Vision and Language Model (VLM). T3AL operates in three steps. First, a video-level pseudo-label of the action category is computed by aggregating information from the entire video. Then, action localization is performed adopting a novel procedure inspired by self-supervised learning. Finally, frame-level textual descriptions extracted with a state-of-the-art captioning model are employed for refining the action region proposals. We validate the effectiveness of T3AL by conducting experiments on the THUMOS14 and the ActivityNet-v1.3 datasets. Our results demonstrate that T3AL significantly outperforms zero-shot baselines based on state-of-the-art VLMs, confirming the benefit of a test-time adaptation approach.
△ Less
Submitted 11 April, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Lattices in rigid analytic representations
Authors:
Andrea Conti,
Emiliano Torti
Abstract:
For a profinite group $G$ and a rigid analytic space $X$, we study when an $\mathcal O_X(X)$-linear representation $V$ of $G$ admits a lattice, i.e. an $\mathcal O_{\mathcal X}(\mathcal X)$-linear model for a suitable formal model $\mathcal X$ of $X$ in the sense of Berthelot. We give a positive answer, under mild assumptions, when $X$ is a ``wide open'' space. As a consequence, we are able to des…
▽ More
For a profinite group $G$ and a rigid analytic space $X$, we study when an $\mathcal O_X(X)$-linear representation $V$ of $G$ admits a lattice, i.e. an $\mathcal O_{\mathcal X}(\mathcal X)$-linear model for a suitable formal model $\mathcal X$ of $X$ in the sense of Berthelot. We give a positive answer, under mild assumptions, when $X$ is a ``wide open'' space. As a consequence, we are able to describe explicit open rational subdomains of $X$ over which $V$ is constant after reduction modulo a power of $p$. We give applications in two different directions. First, we prove explicit results on the reduction modulo powers of $p$ of sheaves of crystalline and semistable representations of fixed weight. Second, we focus on the sheaves of Galois representations on eigenvarieties, which are important examples of wide open spaces thanks to a result of Bellaïche and Chenevier. We give an application of our main results to the pseudorepresentation carried by the Coleman--Mazur eigencurve, which can be made explicit whenever equations for a rational subdomain of the eigencurve are given.
△ Less
Submitted 1 September, 2024; v1 submitted 29 March, 2024;
originally announced March 2024.
-
$p$-adic rigidity of eigenforms of infinite slope
Authors:
Andrea Conti
Abstract:
We give a notion of $p$-adic families of Hecke eigenforms that allows for the slope of the forms be infinite at $p$. We prove that, contrary to the case of finite slope when every eigenform lives in a Hida or Coleman family, the only families of infinite slope are either twists of Hida or Coleman families with Dirichlet characters of $p$-power conductor, or non-ordinary families with complex multi…
▽ More
We give a notion of $p$-adic families of Hecke eigenforms that allows for the slope of the forms be infinite at $p$. We prove that, contrary to the case of finite slope when every eigenform lives in a Hida or Coleman family, the only families of infinite slope are either twists of Hida or Coleman families with Dirichlet characters of $p$-power conductor, or non-ordinary families with complex multiplication. Our proof goes via a local study of deformations of potentially trianguline Galois representations, relying on work of Berger and Chenevier, and a global input coming from an analogue of a result of Balasubramanyam, Ghate and Vatsal on a Greenberg-type conjecture for families of Hilbert modular forms.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Lifting Galois representations via Kummer flags
Authors:
Andrea Conti,
Cyril Demarche,
Mathieu Florence
Abstract:
Let $Γ$ be either i) the absolute Galois group of a local field $F$, or ii) the topological fundamental group of a closed connected orientable surface of genus $g$. In case i), assume that $μ_{p^2} \subset F$. We give an elementary and unified proof that every representation $ρ_1: Γ\to \mathbf{GL}_d(\mathbb{F}_p)$ lifts to a representation $ρ_2: Γ\to \mathbf{GL}_d(\mathbb{Z}/p^2)$. [In case i), it…
▽ More
Let $Γ$ be either i) the absolute Galois group of a local field $F$, or ii) the topological fundamental group of a closed connected orientable surface of genus $g$. In case i), assume that $μ_{p^2} \subset F$. We give an elementary and unified proof that every representation $��_1: Γ\to \mathbf{GL}_d(\mathbb{F}_p)$ lifts to a representation $ρ_2: Γ\to \mathbf{GL}_d(\mathbb{Z}/p^2)$. [In case i), it is understood these are continuous.] The actual statement is much stronger: for $r \geq 1$, under "suitable" assumptions, triangular representations $ρ_r: Γ\to \mathbf{B}_d(\mathbb{Z}/p^r)$ lift to $ρ_{r+1}: Γ\to \mathbf{B}_d(\mathbb{Z}/p^{r+1})$, in the strongest possible step-by-step sense. Here "suitable" is made precise by the concept of $\textit{Kummer flag}$. An essential aspect of this work, is to identify the common properties of groups i) and ii), that suffice to ensure the existence of such lifts.
△ Less
Submitted 17 September, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Range-Agnostic Multi-View Depth Estimation With Keyframe Selection
Authors:
Andrea Conti,
Matteo Poggi,
Valerio Cambareri,
Stefano Mattoccia
Abstract:
Methods for 3D reconstruction from posed frames require prior knowledge about the scene metric range, usually to recover matching cues along the epipolar lines and narrow the search range. However, such prior might not be directly available or estimated inaccurately in real scenarios -- e.g., outdoor 3D reconstruction from video sequences -- therefore heavily hampering performance. In this paper,…
▽ More
Methods for 3D reconstruction from posed frames require prior knowledge about the scene metric range, usually to recover matching cues along the epipolar lines and narrow the search range. However, such prior might not be directly available or estimated inaccurately in real scenarios -- e.g., outdoor 3D reconstruction from video sequences -- therefore heavily hampering performance. In this paper, we focus on multi-view depth estimation without requiring prior knowledge about the metric range of the scene by proposing RAMDepth, an efficient and purely 2D framework that reverses the depth estimation and matching steps order. Moreover, we demonstrate the capability of our framework to provide rich insights about the quality of the views used for prediction. Additional material can be found on our project page https://andreaconti.github.io/projects/range_agnostic_multi_view_depth.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Revisiting Depth Completion from a Stereo Matching Perspective for Cross-domain Generalization
Authors:
Luca Bartolomei,
Matteo Poggi,
Andrea Conti,
Fabio Tosi,
Stefano Mattoccia
Abstract:
This paper proposes a new framework for depth completion robust against domain-shifting issues. It exploits the generalization capability of modern stereo networks to face depth completion, by processing fictitious stereo pairs obtained through a virtual pattern projection paradigm. Any stereo network or traditional stereo matcher can be seamlessly plugged into our framework, allowing for the depl…
▽ More
This paper proposes a new framework for depth completion robust against domain-shifting issues. It exploits the generalization capability of modern stereo networks to face depth completion, by processing fictitious stereo pairs obtained through a virtual pattern projection paradigm. Any stereo network or traditional stereo matcher can be seamlessly plugged into our framework, allowing for the deployment of a virtual stereo setup that is future-proof against advancement in the stereo field. Exhaustive experiments on cross-domain generalization support our claims. Hence, we argue that our framework can help depth completion to reach new deployment scenarios.
△ Less
Submitted 14 December, 2023;
originally announced December 2023.
-
How water binds to microcline feldspar (001)
Authors:
Giada Franceschi,
Andrea Conti,
Luca Lezuo,
Rainer Abart,
Florian Mittendorfer,
Michael Schmid,
Ulrike Diebold
Abstract:
Microcline feldspar (KAlSi$_3$O$_8$) is a common mineral with important roles for Earth's ecological balance. It participates in the carbon, potassium, and water cycles, contributing to CO$_2$ sequestration, soil formation, and atmospheric ice nucleation. To understand the fundamentals of these processes, it is essential to establish microcline's surface atomic structure and its interaction with t…
▽ More
Microcline feldspar (KAlSi$_3$O$_8$) is a common mineral with important roles for Earth's ecological balance. It participates in the carbon, potassium, and water cycles, contributing to CO$_2$ sequestration, soil formation, and atmospheric ice nucleation. To understand the fundamentals of these processes, it is essential to establish microcline's surface atomic structure and its interaction with the omnipresent water molecules. This work presents atomic-scale results on microcline's lowest-energy surface and its interaction with water, combining ultrahigh vacuum investigations by non-contact atomic force microscopy and X-ray photoelectron spectroscopy with density functional theory calculations. An ordered array of hydroxyls bonded to silicon or aluminum readily forms on the cleaved surface at room temperature. The distinct proton affinities of these hydroxyls influence the arrangement and orientation of the first water molecules binding to the surface, holding potential implications for the subsequent condensation of water.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Active Stereo Without Pattern Projector
Authors:
Luca Bartolomei,
Matteo Poggi,
Fabio Tosi,
Andrea Conti,
Stefano Mattoccia
Abstract:
This paper proposes a novel framework integrating the principles of active stereo in standard passive camera systems without a physical pattern projector. We virtually project a pattern over the left and right images according to the sparse measurements obtained from a depth sensor. Any such devices can be seamlessly plugged into our framework, allowing for the deployment of a virtual active stere…
▽ More
This paper proposes a novel framework integrating the principles of active stereo in standard passive camera systems without a physical pattern projector. We virtually project a pattern over the left and right images according to the sparse measurements obtained from a depth sensor. Any such devices can be seamlessly plugged into our framework, allowing for the deployment of a virtual active stereo setup in any possible environment, overcoming the limitation of pattern projectors, such as limited working range or environmental conditions. Experiments on indoor/outdoor datasets, featuring both long and close-range, support the seamless effectiveness of our approach, boosting the accuracy of both stereo algorithms and deep networks.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Resolving the intrinsic short-range ordering of K$^+$ ions on cleaved muscovite mica
Authors:
Giada Franceschi,
Pavel Kocán,
Andrea Conti,
Sebastian Brandstetter,
Jan Balajka,
Igor Sokolović,
Markus Valtiner,
Florian Mittendorfer,
Michael Schmid,
Martin Setvín,
Ulrike Diebold
Abstract:
Muscovite mica, KAl$_2$(Si$_3$Al)O$_{10}$(OH)$_2$, is a common layered phyllosilicate with perfect cleavage planes. The atomically flat surfaces obtained through cleaving lend themselves to scanning probe techniques with atomic resolution and are ideal to model minerals and clays. Despite the importance of the cleaved mica surfaces, several questions remain unresolved. It is established that K…
▽ More
Muscovite mica, KAl$_2$(Si$_3$Al)O$_{10}$(OH)$_2$, is a common layered phyllosilicate with perfect cleavage planes. The atomically flat surfaces obtained through cleaving lend themselves to scanning probe techniques with atomic resolution and are ideal to model minerals and clays. Despite the importance of the cleaved mica surfaces, several questions remain unresolved. It is established that K$^+$ ions decorate the cleaved surface, but their intrinsic ordering -- unaffected by the interaction with the environment -- is not known. This work presents clear images of the K$^+$ distribution of cleaved mica obtained with low-temperature non-contact atomic force microscopy (AFM) under ultra-high vacuum (UHV) conditions. The data unveil the presence of short-range ordering, contrasting previous assumptions of random or fully ordered distributions. Density functional theory (DFT) calculations and Monte Carlo simulations show that the substitutional subsurface Al$^{3+}$ ions have an important role for the surface K$^+$ ion arrangement.
△ Less
Submitted 27 August, 2023;
originally announced August 2023.
-
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation
Authors:
Giacomo Zara,
Alessandro Conti,
Subhankar Roy,
Stéphane Lathuilière,
Paolo Rota,
Elisa Ricci
Abstract:
Source-Free Video Unsupervised Domain Adaptation (SFVUDA) task consists in adapting an action recognition model, trained on a labelled source dataset, to an unlabelled target dataset, without accessing the actual source data. The previous approaches have attempted to address SFVUDA by leveraging self-supervision (e.g., enforcing temporal consistency) derived from the target data itself. In this wo…
▽ More
Source-Free Video Unsupervised Domain Adaptation (SFVUDA) task consists in adapting an action recognition model, trained on a labelled source dataset, to an unlabelled target dataset, without accessing the actual source data. The previous approaches have attempted to address SFVUDA by leveraging self-supervision (e.g., enforcing temporal consistency) derived from the target data itself. In this work, we take an orthogonal approach by exploiting "web-supervision" from Large Language-Vision Models (LLVMs), driven by the rationale that LLVMs contain a rich world prior surprisingly robust to domain-shift. We showcase the unreasonable effectiveness of integrating LLVMs for SFVUDA by devising an intuitive and parameter-efficient method, which we name Domain Adaptation with Large Language-Vision models (DALL-V), that distills the world prior and complementary source model information into a student network tailored for the target. Despite the simplicity, DALL-V achieves significant improvement over state-of-the-art SFVUDA methods.
△ Less
Submitted 22 August, 2023; v1 submitted 17 August, 2023;
originally announced August 2023.
-
AdS$_3$ T-duality and evidence for ${\cal N}=5,6$ superconformal quantum mechanics
Authors:
Andrea Conti
Abstract:
We construct two families of AdS$_2$ vacua in Type IIB Supergravity performing U(1) and SL(2) T-dualities on the $\text{AdS}_3 \times \text{$ \widehat{\mathbb{CP}}\!\!~^3$} \times $ I solutions to Type IIA recently reported in arXiv:2304.12207. Depending on the T-duality we operate, we find two different classes of solutions of the type $\text{AdS}_2 \times \text{$ \widehat{\mathbb{CP}}\!\!~^3…
▽ More
We construct two families of AdS$_2$ vacua in Type IIB Supergravity performing U(1) and SL(2) T-dualities on the $\text{AdS}_3 \times \text{$ \widehat{\mathbb{CP}}\!\!~^3$} \times $ I solutions to Type IIA recently reported in arXiv:2304.12207. Depending on the T-duality we operate, we find two different classes of solutions of the type $\text{AdS}_2 \times \text{$ \widehat{\mathbb{CP}}\!\!~^3$} \times $ I $\times$ I and $\text{AdS}_3 \times \text{$ \widehat{\mathbb{CP}}\!\!~^3$} \times $ I $\times$ S$^1$. This provides evidence for more general classes of solutions $\text{AdS}_2 \times \text{$ \widehat{\mathbb{CP}}\!\!~^3$} \times Σ$, dual to superconformal quantum mechanics with ${\cal N}=5,6$ supersymmetry.
△ Less
Submitted 23 November, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
Vocabulary-free Image Classification
Authors:
Alessandro Conti,
Enrico Fini,
Massimiliano Mancini,
Paolo Rota,
Yiming Wang,
Elisa Ricci
Abstract:
Recent advances in large vision-language models have revolutionized the image classification paradigm. Despite showing impressive zero-shot capabilities, a pre-defined set of categories, a.k.a. the vocabulary, is assumed at test time for composing the textual prompts. However, such assumption can be impractical when the semantic context is unknown and evolving. We thus formalize a novel task, term…
▽ More
Recent advances in large vision-language models have revolutionized the image classification paradigm. Despite showing impressive zero-shot capabilities, a pre-defined set of categories, a.k.a. the vocabulary, is assumed at test time for composing the textual prompts. However, such assumption can be impractical when the semantic context is unknown and evolving. We thus formalize a novel task, termed as Vocabulary-free Image Classification (VIC), where we aim to assign to an input image a class that resides in an unconstrained language-induced semantic space, without the prerequisite of a known vocabulary. VIC is a challenging task as the semantic space is extremely large, containing millions of concepts, with hard-to-discriminate fine-grained categories. In this work, we first empirically verify that representing this semantic space by means of an external vision-language database is the most effective way to obtain semantically relevant content for classifying the image. We then propose Category Search from External Databases (CaSED), a method that exploits a pre-trained vision-language model and an external vision-language database to address VIC in a training-free manner. CaSED first extracts a set of candidate categories from captions retrieved from the database based on their semantic similarity to the image, and then assigns to the image the best matching candidate category according to the same vision-language model. Experiments on benchmark datasets validate that CaSED outperforms other complex vision-language frameworks, while being efficient with much fewer parameters, paving the way for future research in this direction.
△ Less
Submitted 12 January, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
New $\text{AdS}_2/\text{CFT}_1$ pairs from $\text{AdS}_3$ and monopole bubbling
Authors:
Andrea Conti,
Yolanda Lozano,
Niall T. Macpherson
Abstract:
We present general results on generating $\text{AdS}_2$ solutions to Type II supergravity from $\text{AdS}_3$ solutions via U(1) and SL(2) T-dualities. We focus on a class of Type IIB solutions with small $\mathcal{N}=4$ supersymmetry, that we show can be embedded into a more general class of solutions obtained by double analytical continuation from $\text{AdS}_3$ geometries with small…
▽ More
We present general results on generating $\text{AdS}_2$ solutions to Type II supergravity from $\text{AdS}_3$ solutions via U(1) and SL(2) T-dualities. We focus on a class of Type IIB solutions with small $\mathcal{N}=4$ supersymmetry, that we show can be embedded into a more general class of solutions obtained by double analytical continuation from $\text{AdS}_3$ geometries with small $\mathcal{N}=(0,4)$ supersymmetry constructed in the literature. We then start the analysis of the superconformal quantum mechanics dual to the $\mathcal{N}=4$ backgrounds focusing on a subclass of $\text{AdS}_2\times\text{S}^3\times\mathbb{T}^3$ solutions foliated over a Riemann surface. We show that the associated supersymmetric quantum mechanics describes monopole bubbling in 4d $\mathcal{N}=2$ supersymmetric gauge theories living in D3-D7 branes, as previously discussed in the literature. Therefore, we propose that our solutions provide a geometrical description via holography of monopole bubbling in 4d $\mathcal{N}=2$ SCFTs. We check our proposal with the computation of the central charge.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Sparsity Agnostic Depth Completion
Authors:
Andrea Conti,
Matteo Poggi,
Stefano Mattoccia
Abstract:
We present a novel depth completion approach agnostic to the sparsity of depth points, that is very likely to vary in many practical applications. State-of-the-art approaches yield accurate results only when processing a specific density and distribution of input points, i.e. the one observed during training, narrowing their deployment in real use cases. On the contrary, our solution is robust to…
▽ More
We present a novel depth completion approach agnostic to the sparsity of depth points, that is very likely to vary in many practical applications. State-of-the-art approaches yield accurate results only when processing a specific density and distribution of input points, i.e. the one observed during training, narrowing their deployment in real use cases. On the contrary, our solution is robust to uneven distributions and extremely low densities never witnessed during training. Experimental results on standard indoor and outdoor benchmarks highlight the robustness of our framework, achieving accuracy comparable to state-of-the-art methods when tested with density and distribution equal to the training one while being much more accurate in the other cases. Our pretrained models and further material are available in our project page.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
Multi-View Guided Multi-View Stereo
Authors:
Matteo Poggi,
Andrea Conti,
Stefano Mattoccia
Abstract:
This paper introduces a novel deep framework for dense 3D reconstruction from multiple image frames, leveraging a sparse set of depth measurements gathered jointly with image acquisition. Given a deep multi-view stereo network, our framework uses sparse depth hints to guide the neural network by modulating the plane-sweep cost volume built during the forward step, enabling us to infer constantly m…
▽ More
This paper introduces a novel deep framework for dense 3D reconstruction from multiple image frames, leveraging a sparse set of depth measurements gathered jointly with image acquisition. Given a deep multi-view stereo network, our framework uses sparse depth hints to guide the neural network by modulating the plane-sweep cost volume built during the forward step, enabling us to infer constantly much more accurate depth maps. Moreover, since multiple viewpoints can provide additional depth measurements, we propose a multi-view guidance strategy that increases the density of the sparse points used to guide the network, thus leading to even more accurate results. We evaluate our Multi-View Guided framework within a variety of state-of-the-art deep multi-view stereo networks, demonstrating its effectiveness at improving the results achieved by each of them on BlendedMVG and DTU datasets.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition
Authors:
Alessandro Conti,
Paolo Rota,
Yiming Wang,
Elisa Ricci
Abstract:
Automatically understanding emotions from visual data is a fundamental task for human behaviour understanding. While models devised for Facial Expression Recognition (FER) have demonstrated excellent performances on many datasets, they often suffer from severe performance degradation when trained and tested on different datasets due to domain shift. In addition, as face images are considered highl…
▽ More
Automatically understanding emotions from visual data is a fundamental task for human behaviour understanding. While models devised for Facial Expression Recognition (FER) have demonstrated excellent performances on many datasets, they often suffer from severe performance degradation when trained and tested on different datasets due to domain shift. In addition, as face images are considered highly sensitive data, the accessibility to large-scale datasets for model training is often denied. In this work, we tackle the above-mentioned problems by proposing the first Source-Free Unsupervised Domain Adaptation (SFUDA) method for FER. Our method exploits self-supervised pretraining to learn good feature representations from the target data and proposes a novel and robust cluster-level pseudo-labelling strategy that accounts for in-cluster statistics. We validate the effectiveness of our method in four adaptation setups, proving that it consistently outperforms existing SFUDA methods when applied to FER, and is on par with methods addressing FER in the UDA setting.
△ Less
Submitted 11 October, 2022;
originally announced October 2022.
-
Unsupervised confidence for LiDAR depth maps and applications
Authors:
Andrea Conti,
Matteo Poggi,
Filippo Aleotti,
Stefano Mattoccia
Abstract:
Depth perception is pivotal in many fields, such as robotics and autonomous driving, to name a few. Consequently, depth sensors such as LiDARs rapidly spread in many applications. The 3D point clouds generated by these sensors must often be coupled with an RGB camera to understand the framed scene semantically. Usually, the former is projected over the camera image plane, leading to a sparse depth…
▽ More
Depth perception is pivotal in many fields, such as robotics and autonomous driving, to name a few. Consequently, depth sensors such as LiDARs rapidly spread in many applications. The 3D point clouds generated by these sensors must often be coupled with an RGB camera to understand the framed scene semantically. Usually, the former is projected over the camera image plane, leading to a sparse depth map. Unfortunately, this process, coupled with the intrinsic issues affecting all the depth sensors, yields noise and gross outliers in the final output. Purposely, in this paper, we propose an effective unsupervised framework aimed at explicitly addressing this issue by learning to estimate the confidence of the LiDAR sparse depth map and thus allowing for filtering out the outliers. Experimental results on the KITTI dataset highlight that our framework excels for this purpose. Moreover, we demonstrate how this achievement can improve a wide range of tasks.
△ Less
Submitted 6 October, 2022;
originally announced October 2022.
-
An intertwined neural network model for EEG classification in brain-computer interfaces
Authors:
Andrea Duggento,
Mario De Lorenzo,
Stefano Bargione,
Allegra Conti,
Vincenzo Catrambone,
Gaetano Valenza,
Nicola Toschi
Abstract:
The brain computer interface (BCI) is a nonstimulatory direct and occasionally bidirectional communication link between the brain and a computer or an external device. Classically, EEG-based BCI algorithms have relied on models such as support vector machines and linear discriminant analysis or multiclass common spatial patterns. During the last decade, however, more sophisticated machine learning…
▽ More
The brain computer interface (BCI) is a nonstimulatory direct and occasionally bidirectional communication link between the brain and a computer or an external device. Classically, EEG-based BCI algorithms have relied on models such as support vector machines and linear discriminant analysis or multiclass common spatial patterns. During the last decade, however, more sophisticated machine learning architectures, such as convolutional neural networks, recurrent neural networks, long short-term memory networks and gated recurrent unit networks, have been extensively used to enhance discriminability in multiclass BCI tasks. Additionally, preprocessing and denoising of EEG signals has always been key in the successful decoding of brain activity, and the determination of an optimal and standardized EEG preprocessing activity is an active area of research. In this paper, we present a deep neural network architecture specifically engineered to a) provide state-of-the-art performance in multiclass motor imagery classification and b) remain robust to preprocessing to enable real-time processing of raw data as it streams from EEG and BCI equipment. It is based on the intertwined use of time-distributed fully connected (tdFC) and space-distributed 1D temporal convolutional layers (sdConv) and explicitly addresses the possibility that interaction of spatial and temporal features of the EEG signal occurs at all levels of complexity. Numerical experiments demonstrate that our architecture provides superior performance compared baselines based on a combination of 3D convolutions and recurrent neural networks in a six-class motor imagery network, with a subjectwise accuracy that reaches 99%. Importantly, these results remain unchanged when minimal or extensive preprocessing is applied, possibly paving the way for a more transversal and real-time use of deep learning architectures in EEG classification.
△ Less
Submitted 4 August, 2022;
originally announced August 2022.
-
Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss
Authors:
Riccardo Franceschini,
Enrico Fini,
Cigdem Beyan,
Alessandro Conti,
Federica Arrigoni,
Elisa Ricci
Abstract:
Emotion recognition is involved in several real-world applications. With an increase in available modalities, automatic understanding of emotions is being performed more accurately. The success in Multimodal Emotion Recognition (MER), primarily relies on the supervised learning paradigm. However, data annotation is expensive, time-consuming, and as emotion expression and perception depends on seve…
▽ More
Emotion recognition is involved in several real-world applications. With an increase in available modalities, automatic understanding of emotions is being performed more accurately. The success in Multimodal Emotion Recognition (MER), primarily relies on the supervised learning paradigm. However, data annotation is expensive, time-consuming, and as emotion expression and perception depends on several factors (e.g., age, gender, culture) obtaining labels with a high reliability is hard. Motivated by these, we focus on unsupervised feature learning for MER. We consider discrete emotions, and as modalities text, audio and vision are used. Our method, as being based on contrastive loss between pairwise modalities, is the first attempt in MER literature. Our end-to-end feature learning approach has several differences (and advantages) compared to existing MER methods: i) it is unsupervised, so the learning is lack of data labelling cost; ii) it does not require data spatial augmentation, modality alignment, large number of batch size or epochs; iii) it applies data fusion only at inference; and iv) it does not require backbones pre-trained on emotion recognition task. The experiments on benchmark datasets show that our method outperforms several baseline approaches and unsupervised learning methods applied in MER. Particularly, it even surpasses a few supervised MER state-of-the-art.
△ Less
Submitted 23 July, 2022;
originally announced July 2022.
-
Monitoring social distancing with single image depth estimation
Authors:
Alessio Mingozzi,
Andrea Conti,
Filippo Aleotti,
Matteo Poggi,
Stefano Mattoccia
Abstract:
The recent pandemic emergency raised many challenges regarding the countermeasures aimed at containing the virus spread, and constraining the minimum distance between people resulted in one of the most effective strategies. Thus, the implementation of autonomous systems capable of monitoring the so-called social distance gained much interest. In this paper, we aim to address this task leveraging a…
▽ More
The recent pandemic emergency raised many challenges regarding the countermeasures aimed at containing the virus spread, and constraining the minimum distance between people resulted in one of the most effective strategies. Thus, the implementation of autonomous systems capable of monitoring the so-called social distance gained much interest. In this paper, we aim to address this task leveraging a single RGB frame without additional depth sensors. In contrast to existing single-image alternatives failing when ground localization is not available, we rely on single image depth estimation to perceive the 3D structure of the observed scene and estimate the distance between people. During the setup phase, a straightforward calibration procedure, leveraging a scale-aware SLAM algorithm available even on consumer smartphones, allows us to address the scale ambiguity affecting single image depth estimation. We validate our approach through indoor and outdoor images employing a calibrated LiDAR + RGB camera asset. Experimental results highlight that our proposal enables sufficiently reliable estimation of the inter-personal distance to monitor social distancing effectively. This fact confirms that despite its intrinsic ambiguity, if appropriately driven single image depth estimation can be a viable alternative to other depth perception techniques, more expensive and not always feasible in practical applications. Our evaluation also highlights that our framework can run reasonably fast and comparably to competitors, even on pure CPU systems. Moreover, its practical deployment on low-power systems is around the corner.
△ Less
Submitted 29 April, 2022; v1 submitted 4 April, 2022;
originally announced April 2022.
-
Lifting trianguline Galois representations along isogenies
Authors:
Andrea Conti
Abstract:
Given a central isogeny $π\colon G\to H$ of connected reductive $\overline{\mathbb Q}_p$-groups, and a local Galois representation $ρ$ valued in $H(\overline{\mathbb Q}_p)$ that is trianguline in the sense of Daruvar, we study whether a lift of $ρ$ along $π$ is still trianguline. We give a positive answer under weak conditions on the Hodge--Tate--Sen weights of $ρ$, and the assumption that the tri…
▽ More
Given a central isogeny $π\colon G\to H$ of connected reductive $\overline{\mathbb Q}_p$-groups, and a local Galois representation $ρ$ valued in $H(\overline{\mathbb Q}_p)$ that is trianguline in the sense of Daruvar, we study whether a lift of $ρ$ along $π$ is still trianguline. We give a positive answer under weak conditions on the Hodge--Tate--Sen weights of $ρ$, and the assumption that the trianguline parameter of $ρ$ can be lifted along $π$. This is an analogue of the results proved by Wintenberger, Conrad, Patrikis, and Hoang Duc for $p$-adic Hodge-theoretic properties of $ρ$. We describe a Tannakian framework for all such lifting problems, and we reinterpret the existence of a lift with prescribed local properties in terms of the simple connectedness of a certain pro-semisimple group. While applying this formalism to the case of trianguline representations, we extend a result of Berger and Di Matteo on triangulable tensor products of $B$-pairs.
△ Less
Submitted 10 January, 2022; v1 submitted 6 January, 2021;
originally announced January 2021.
-
Piggybacking on Quantum Streams
Authors:
Marco Chiani,
Andrea Conti,
Moe Z. Win
Abstract:
This paper shows that it is possible to piggyback classical information on a stream of qubits protected by quantum error correcting codes. The piggyback channel can be created by introducing intentional errors corresponding to a controlled sequence of syndromes. These syndromes are further protected, when quantum noise is present, by classical error correcting codes according to a performance-dela…
▽ More
This paper shows that it is possible to piggyback classical information on a stream of qubits protected by quantum error correcting codes. The piggyback channel can be created by introducing intentional errors corresponding to a controlled sequence of syndromes. These syndromes are further protected, when quantum noise is present, by classical error correcting codes according to a performance-delay trade-off. Classical information can thus be added and extracted at arbitrary epochs without consuming additional quantum resources and without disturbing the quantum stream.
△ Less
Submitted 25 May, 2020;
originally announced May 2020.
-
The Case for Probe-class NASA Astrophysics Missions
Authors:
Martin Elvis,
Jon Arenberg,
David Ballantyne,
Mark Bautz,
Charles Beichman,
Jeffrey Booth,
James Buckley,
Jack O. Burns,
Jordan Camp,
Alberto Conti,
Asantha Cooray,
William Danchi,
Jacques Delabrouille,
Gianfranco De Zotti,
Raphael Flauger,
Jason Glenn,
Jonathan Grindlay,
Shaul Hanany,
Dieter Hartmann,
George Helou,
Diego Herranz,
Johannes Hubmayr,
Bradley R. Johnson,
William Jones,
N. Jeremy Kasdin
, et al. (23 additional authors not shown)
Abstract:
Astrophysics spans an enormous range of questions on scales from individual planets to the entire cosmos. To address the richness of 21st century astrophysics requires a corresponding richness of telescopes spanning all bands and all messengers. Much scientific benefit comes from having the multi-wavelength capability available at the same time. Most of these bands,or measurement sensitivities, re…
▽ More
Astrophysics spans an enormous range of questions on scales from individual planets to the entire cosmos. To address the richness of 21st century astrophysics requires a corresponding richness of telescopes spanning all bands and all messengers. Much scientific benefit comes from having the multi-wavelength capability available at the same time. Most of these bands,or measurement sensitivities, require space-based missions. Historically, NASA has addressed this need for breadth with a small number of flagship-class missions and a larger number of Explorer missions. While the Explorer program continues to flourish, there is a large gap between Explorers and strategic missions. A fortunate combination of new astrophysics technologies with new, high capacity, low dollar-per-kg to orbit launchers, and new satellite buses allow for cheaper missions with capabilities approaching strategic mission levels. NASA has recognized these developments by calling for Probe-class mission ideas for mission studies, spanning most of the electromagnetic spectrum from GeV gamma-rays to the far infrared, and the new messengers of neutrinos and ultra-high energy cosmic rays. The key insight from the Probes exercise is that order-of-magnitude advances in science performance metrics are possible across the board for initial total cost estimates in the range 500M-1B dollars.
△ Less
Submitted 12 February, 2020;
originally announced February 2020.
-
Peregrine: Network Localization and Navigation with Scalable Inference and Efficient Operation
Authors:
Bryan Teague,
Zhenyu Liu,
Florian Meyer,
Andrea Conti,
Moe Z. Win
Abstract:
Location-aware networks will enable new services and applications in fields such as autonomous driving, smart cities, and the Internet-of-Things. One promising solution for ubiquitous localization is network localization and navigation (NLN), where devices form a network that cooperatively localizes itself, reducing the infrastructure needed for accurate localization. This paper introduces a real-…
▽ More
Location-aware networks will enable new services and applications in fields such as autonomous driving, smart cities, and the Internet-of-Things. One promising solution for ubiquitous localization is network localization and navigation (NLN), where devices form a network that cooperatively localizes itself, reducing the infrastructure needed for accurate localization. This paper introduces a real-time NLN system named Peregrine, which combines distributed NLN algorithms with commercially available ultra-wideband (UWB) sensing and communication technology. The Peregrine software application, for the first time, integrates three NLN algorithms to jointly perform the tasks of localization and network operation in a technology agnostic manner, leveraging both spatial and temporal cooperation. Peregrine hardware is composed of low-cost, compact devices that comprise a microprocessor and a commercial UWB radio. This paper presents the design of the Peregrine system and characterizes the performance impact of each algorithmic component. Indoor experiments validate that our approach to realizing NLN is both reliable and scalable, and maintains sub-meter-level accuracy even in challenging indoor scenarios.
△ Less
Submitted 30 January, 2020;
originally announced January 2020.
-
Big images of two-dimensional pseudorepresentations
Authors:
Andrea Conti,
Jaclyn Lang,
Anna Medvedovsky
Abstract:
Bellaïche has recently applied Pink-Lie theory to prove that, under mild conditions, the image of a continuous 2-dimensional pseudorepresentation $ρ$ of a profinite group on a local pro-$p$ domain $A$ contains a nontrivial congruence subgroup of ${\rm SL}_2(B)$ for a certain subring $B$ of $A$. We enlarge Bellaïche's ring and give this new $B$ a conceptual interpretation in terms of conjugate self…
▽ More
Bellaïche has recently applied Pink-Lie theory to prove that, under mild conditions, the image of a continuous 2-dimensional pseudorepresentation $ρ$ of a profinite group on a local pro-$p$ domain $A$ contains a nontrivial congruence subgroup of ${\rm SL}_2(B)$ for a certain subring $B$ of $A$. We enlarge Bellaïche's ring and give this new $B$ a conceptual interpretation in terms of conjugate self-twists of $ρ$, symmetries that naturally constrain its image. As a corollary, this new $B$ is optimal among congruence subgroups contained in the image. We also interpret the new $B$ vis-a-vis the adjoint trace ring of $ρ$, which we show is a more natural ring for these questions in general. Finally, we use our purely algebraic result to recover and extend a variety of arithmetic big-image results for ${\rm GL}_2$ Galois representations arising from elliptic, Hilbert, and Bianchi modular forms and $p$-adic Hida or Coleman families of elliptic and Hilbert modular forms.
△ Less
Submitted 14 December, 2021; v1 submitted 23 April, 2019;
originally announced April 2019.
-
Trianguline Galois representations and Schur functors
Authors:
Andrea Conti
Abstract:
Given a $B$-pair $W$ and a Schur functor $S$, we show under some general assumptions that $W$ is trianguline if and only if $S(W)$ is. This is an extension of earlier work of Di Matteo. We derive some consequences on the behavior of local Galois representations under morphisms of Langlands dual groups. We attach to a Schur functor a map between the trianguline deformation spaces defined by Hellman…
▽ More
Given a $B$-pair $W$ and a Schur functor $S$, we show under some general assumptions that $W$ is trianguline if and only if $S(W)$ is. This is an extension of earlier work of Di Matteo. We derive some consequences on the behavior of local Galois representations under morphisms of Langlands dual groups. We attach to a Schur functor a map between the trianguline deformation spaces defined by Hellmann, and we study congruence loci on the Hecke-Taylor-Wiles varieties constructed by Breuil, Hellmann and Schraen for unitary groups.
△ Less
Submitted 14 January, 2021; v1 submitted 6 November, 2017;
originally announced November 2017.
-
Galois level and congruence ideal for $p$-adic families of finite slope Siegel modular forms
Authors:
Andrea Conti
Abstract:
We consider $p$-adic families of Siegel eigenforms of genus $2$ and finite slope, defined as local pieces of an eigenvariety and equipped with a suitable integral structure. Under some assumptions on the residual image, we show that the image of the Galois representation associated with a family is big, in the sense that a Lie algebra attached to it contains a congruence subalgebra of non-zero lev…
▽ More
We consider $p$-adic families of Siegel eigenforms of genus $2$ and finite slope, defined as local pieces of an eigenvariety and equipped with a suitable integral structure. Under some assumptions on the residual image, we show that the image of the Galois representation associated with a family is big, in the sense that a Lie algebra attached to it contains a congruence subalgebra of non-zero level. We call Galois level of the family the largest such level. We show that it is trivial when the residual representation has full image. When the residual representation is a symmetric cube, the zero locus defined by the Galois level of the family admits an automorphic description: it is the locus of points that arise from overconvergent eigenforms for $\mathrm{GL}_2$, via a $p$-adic Langlands lift attached to the symmetric cube representation. Our proof goes via the comparison of the Galois level with a "fortuitous" congruence ideal, that describes the zero- and one-dimensional subvarieties of symmetric cube type appearing in the family. We show that some of the $p$-adic lifts are interpolated by a morphism of rigid analytic spaces from an eigencurve for $\mathrm{GL}_2$ to an eigenvariety for $\mathrm{GSp}_4$. The remaining lifts appear as isolated points on the eigenvariety.
△ Less
Submitted 30 November, 2016;
originally announced December 2016.
-
Big image of Galois representations associated with finite slope $p$-adic families of modular forms
Authors:
Andrea Conti,
Adrian Iovita,
Jacques Tilouine
Abstract:
We consider the Galois representation associated with a finite slope $p$-adic family of modular forms. We prove that the Lie algebra of its image contains a congruence Lie subalgebra of a non-trivial level. We describe the largest such level in terms of the congruences of the family with $p$-adic CM forms.
We consider the Galois representation associated with a finite slope $p$-adic family of modular forms. We prove that the Lie algebra of its image contains a congruence Lie subalgebra of a non-trivial level. We describe the largest such level in terms of the congruences of the family with $p$-adic CM forms.
△ Less
Submitted 7 December, 2016; v1 submitted 7 August, 2015;
originally announced August 2015.
-
The Ultraviolet Sky: An Overview from the GALEX Surveys
Authors:
Luciana Bianchi,
Alberto Conti,
Bernie Shiao
Abstract:
The Galaxy Evolution Explorer (GALEX) has performed the first surveys of the sky in the Ultraviolet (UV). Its legacy is an unprecedented database with more than 200 million source measurements in far-UV (FUV) and near-UV (NUV), as well as wide-field imaging of extended objects, filling an important gap in our view of the sky across the electromagnetic spectrum. The UV surveys offer unique sensitiv…
▽ More
The Galaxy Evolution Explorer (GALEX) has performed the first surveys of the sky in the Ultraviolet (UV). Its legacy is an unprecedented database with more than 200 million source measurements in far-UV (FUV) and near-UV (NUV), as well as wide-field imaging of extended objects, filling an important gap in our view of the sky across the electromagnetic spectrum. The UV surveys offer unique sensitivity for identifying and studying selected classes of astrophysical objects, both stellar and extra-galactic. We examine the overall content and distribution of UV sources over the sky, and with magnitude and color. For this purpose, we have constructed final catalogs of UV sources with homogeneous quality, eliminating duplicate measurements of the same source. Such catalogs can facilitate a variety of investigations on UV-selected samples, as well as planning of observations with future missions.
We describe the criteria used to build the catalogs, their coverage and completeness. We included observations in which both the far-UV and near-UV detectors were exposed; 28,707 fields from the All-Sky Imaging survey (AIS) cover a unique area of 22,080 square degrees (after we restrict the catalogs to the central 1-degree diameter of the field), with a typical depth of about 20/21 mag (FUV/NUV, in the AB mag system), and 3,008 fields from the Medium-depth Imaging Survey (MIS) cover a total of 2,251 square degrees at a depth of about 22.7mag. The catalogs contain about 71 and 16.6 million sources respectively. The density of hot stars reflects the Galactic structure, and the number counts of both Galactic and extra-galactic sources are modulated by the Milky Way dust extinction, to which the UV data are very sensitive.
△ Less
Submitted 11 December, 2013;
originally announced December 2013.
-
Stellar Imager (SI): developing and testing a predictive dynamo model for the Sun by imaging other stars
Authors:
Kenneth G. Carpenter,
Carolus J. Schrijver,
Margarita Karovska,
Steve Kraemer,
Richard Lyon,
David Mozurkewich,
Vladimir Airapetian,
John C. Adams,
Ronald J. Allen,
Alex Brown,
Fred Bruhweiler,
Alberto Conti,
Joergen Christensen-Dalsgaard,
Steve Cranmer,
Manfred Cuntz,
William Danchi,
Andrea Dupree,
Martin Elvis,
Nancy Evans,
Mark Giampapa,
Graham Harper,
Kathy Hartman,
Antoine Labeyrie,
Jesse Leitner,
Chuck Lillie
, et al. (17 additional authors not shown)
Abstract:
The Stellar Imager mission concept is a space-based UV/Optical interferometer designed to resolve surface magnetic activity and subsurface structure and flows of a population of Sun-like stars, in order to accelerate the development and validation of a predictive dynamo model for the Sun and enable accurate long-term forecasting of solar/stellar magnetic activity.
The Stellar Imager mission concept is a space-based UV/Optical interferometer designed to resolve surface magnetic activity and subsurface structure and flows of a population of Sun-like stars, in order to accelerate the development and validation of a predictive dynamo model for the Sun and enable accurate long-term forecasting of solar/stellar magnetic activity.
△ Less
Submitted 23 November, 2010;
originally announced November 2010.
-
The staircase structure of the Southern Brazilian Continental Shelf
Authors:
M. S. Baptista,
L. A. Conti
Abstract:
We show some evidences that the Southeastern Brazilian Continental Shelf (SBCS) has a devil's staircase structure, with a sequence of scarps and terraces with widths that obey fractal formation rules. Since the formation of these features are linked with the sea level variations, we say that the sea level changes in an organized pulsating way. Although the proposed approach was applied in a part…
▽ More
We show some evidences that the Southeastern Brazilian Continental Shelf (SBCS) has a devil's staircase structure, with a sequence of scarps and terraces with widths that obey fractal formation rules. Since the formation of these features are linked with the sea level variations, we say that the sea level changes in an organized pulsating way. Although the proposed approach was applied in a particular region of the Earth, it is suitable to be applied in an integrated way to other Shelves around the world, since the analyzes favor the revelation of the global sea level variations.
△ Less
Submitted 24 October, 2008;
originally announced October 2008.
-
Spatially Resolved Galaxy Star Formation and its Environmental Dependence I
Authors:
Niraj Welikala,
Andrew J. Connolly,
Andrew M. Hopkins,
Ryan Scranton,
Alberto Conti
Abstract:
We use the photometric information contained in individual pixels of 44,964 (0.019<z<0.125 and -23.5<M_r<-20.5) galaxies in the Fourth Data Release (DR4) of the Sloan Digital Sky Survey to investigate the effects of environment on galaxy star formation (SF). We use the pixel-z technique, which combines stellar population synthesis models with photometric redshift template fitting on the scale of…
▽ More
We use the photometric information contained in individual pixels of 44,964 (0.019<z<0.125 and -23.5<M_r<-20.5) galaxies in the Fourth Data Release (DR4) of the Sloan Digital Sky Survey to investigate the effects of environment on galaxy star formation (SF). We use the pixel-z technique, which combines stellar population synthesis models with photometric redshift template fitting on the scale of individual pixels in galaxy images. Spectral energy distributions are constructed, sampling a wide range of properties such as age, star formation rate (SFR), dust obscuration and metallicity. By summing the SFRs in the pixels, we demonstrate that the distribution of total galaxy SFR shifts to lower values as the local density of surrounding galaxies increases, as found in other studies. The effect is most prominent in the galaxies with the highest star formation, and we see the break in the SFR-density relation at a local galaxy density of $\approx 0.05 $(Mpc/h)$^{-3}$. Since our method allows us to spatially resolve the SF distribution within galaxies, we can calculate the mean SFR of each galaxy as a function of radius. We find that on average the mean SFR is dominated by SF in the central regions of galaxies, and that the trend for suppression of SFR in high density environments is driven by a reduction in this nuclear SF. We also find that the mean SFR in the outskirts is largely independent of environmental effects. This trend in the mean SFR is shared by galaxies which are highly star forming, while those which are weakly star forming show no statistically significant correlation between their environment and the mean SFR at any radius.
△ Less
Submitted 25 December, 2007; v1 submitted 7 November, 2007;
originally announced November 2007.
-
Log-concavity property of the error probability with application to local bounds for wireless communications
Authors:
Andrea Conti,
Dmitry Panchenko,
Sergiy Sidenko,
Velio Tralli
Abstract:
A clear understanding the behavior of the error probability (EP) as a function of signal-to-noise ratio (SNR) and other system parameters is fundamental for assessing the design of digital wireless communication systems.We propose an analytical framework based on the log-concavity property of the EP which we prove for a wide family of multidimensional modulation formats in the presence of Gaussi…
▽ More
A clear understanding the behavior of the error probability (EP) as a function of signal-to-noise ratio (SNR) and other system parameters is fundamental for assessing the design of digital wireless communication systems.We propose an analytical framework based on the log-concavity property of the EP which we prove for a wide family of multidimensional modulation formats in the presence of Gaussian disturbances and fading. Based on this property, we construct a class of local bounds for the EP that improve known generic bounds in a given region of the SNR and are invertible, as well as easily tractable for further analysis. This concept is motivated by the fact that communication systems often operate with performance in a certain region of interest (ROI) and, thus, it may be advantageous to have tighter bounds within this region instead of generic bounds valid for all SNRs. We present a possible application of these local bounds, but their relevance is beyond the example made in this paper.
△ Less
Submitted 20 February, 2009; v1 submitted 6 October, 2007;
originally announced October 2007.
-
Sky in Google Earth: The Next Frontier in Astronomical Data Discovery and Visualization
Authors:
Ryan Scranton,
Andrew Connolly,
Simon Krughoff,
Jeremy Brewer,
Alberto Conti,
Carol Christian,
Brian McLean,
Craig Sosin,
Greg Coombe,
Paul Heckbert
Abstract:
Astronomy began as a visual science, first through careful observations of the sky using either an eyepiece or the naked eye, then on to the preservation of those images with photographic media and finally the digital encoding of that information via CCDs. This last step has enabled astronomy to move into a fully automated era -- where data is recorded, analyzed and interpreted often without any…
▽ More
Astronomy began as a visual science, first through careful observations of the sky using either an eyepiece or the naked eye, then on to the preservation of those images with photographic media and finally the digital encoding of that information via CCDs. This last step has enabled astronomy to move into a fully automated era -- where data is recorded, analyzed and interpreted often without any direct visual inspection. Sky in Google Earth completes that circle by providing an intuitive visual interface to some of the largest astronomical imaging surveys covering the full sky. By streaming imagery, catalogs, time domain data, and ancillary information directly to a user, Sky can provide the general public as well as professional and amateur astronomers alike with a wealth of information for use in education and research. We provide here a brief introduction to Sky in Google Earth, focusing on its extensible environment, how it may be integrated into the research process and how it can bring astronomical research to a broader community. With an open interface available on Linux, Mac OS X and Windows, applications developed within Sky are accessible not just within the Google framework but through any visual browser that supports the Keyhole Markup Language. We present Sky as the embodiment of a virtual telescope.
△ Less
Submitted 10 September, 2007; v1 submitted 5 September, 2007;
originally announced September 2007.
-
On Punctured Pragmatic Space-Time Codes in Block Fading Channel
Authors:
Samuele Bandi,
Luca Stabellini,
Andrea Conti,
Velio Tralli
Abstract:
This paper considers the use of punctured convolutional codes to obtain pragmatic space-time trellis codes over block-fading channel. We show that good performance can be achieved even when puncturation is adopted and that we can still employ the same Viterbi decoder of the convolutional mother code by using approximated metrics without increasing the complexity of the decoding operations.
This paper considers the use of punctured convolutional codes to obtain pragmatic space-time trellis codes over block-fading channel. We show that good performance can be achieved even when puncturation is adopted and that we can still employ the same Viterbi decoder of the convolutional mother code by using approximated metrics without increasing the complexity of the decoding operations.
△ Less
Submitted 2 April, 2007;
originally announced April 2007.
-
Pragmatic Space-Time Trellis Codes for Block Fading Channels
Authors:
Marco Chiani,
Andrea Conti,
Velio Tralli
Abstract:
A pragmatic approach for the construction of space-time codes over block fading channels is investigated. The approach consists in using common convolutional encoders and Viterbi decoders with suitable generators and rates, thus greatly simplifying the implementation of space-time codes. For the design of pragmatic space-time codes a methodology is proposed and applied, based on the extension of…
▽ More
A pragmatic approach for the construction of space-time codes over block fading channels is investigated. The approach consists in using common convolutional encoders and Viterbi decoders with suitable generators and rates, thus greatly simplifying the implementation of space-time codes. For the design of pragmatic space-time codes a methodology is proposed and applied, based on the extension of the concept of generalized transfer function for convolutional codes over block fading channels. Our search algorithm produces the convolutional encoder generators of pragmatic space-time codes for various number of states, number of antennas and fading rate. Finally it is shown that, for the investigated cases, the performance of pragmatic space-time codes is better than that of previously known space-time codes, confirming that they are a valuable choice in terms of both implementation complexity and performance.
△ Less
Submitted 28 March, 2007;
originally announced March 2007.
-
Statistical Properties of the GALEX/SDSS matched source catalogs, and classification of the UV sources
Authors:
Luciana Bianchi,
Lino Rodriguez-Merino,
Maurice Viton,
Michel Laget,
Boryana Efremova,
James Herald,
Alberto Conti,
Bernie Shiao,
Armando Gil de Paz,
Samir Salim,
A. Thakar,
Peter G. Friedman,
S. C. Rey,
David Thilker,
Tom A. Barlow,
Tamas Budavari,
Jose Donas,
Karl Forster,
Timothy M. Heckman,
Young-Wook Lee,
Barry F. Madore,
D. Christopher Martin,
Bruno Milliard,
Patrick Morrissey,
Susan G. Neff
, et al. (8 additional authors not shown)
Abstract:
We use the Galaxy Evolution Explorer (GALEX) Medium and All-Sky-Imaging Survey (MIS & AIS) data from the first public data release (GR1), matched to the Sloan Digital Sky Survey (SDSS) DR3 catalog, to perform source classification. The GALEX surveys provide photometry in far- and near-UV bands and the SDSS in five optical bands (u,g,r,i,z). The GR1/DR3 overlapping areas are 363[83]deg^2 for the…
▽ More
We use the Galaxy Evolution Explorer (GALEX) Medium and All-Sky-Imaging Survey (MIS & AIS) data from the first public data release (GR1), matched to the Sloan Digital Sky Survey (SDSS) DR3 catalog, to perform source classification. The GALEX surveys provide photometry in far- and near-UV bands and the SDSS in five optical bands (u,g,r,i,z). The GR1/DR3 overlapping areas are 363[83]deg^2 for the GALEX AIS[MIS], for sources within the 0.5deg central area of the GALEX fields. Our sample covers mostly |b|>30deg galactic latitudes. We present statistical properties of the GALEX/SDSS matched sources catalog, containing >2x10^6 objects detected in at least one UV band. We classify the matched sources by comparing the seven-band photometry to model colors constructed for different classes of astrophysical objects. For sources with photometric errors <0.3 mag, the corresponding typical AB-magnitude limits are m_FUV~21.5, m_NUV~22.5 for AIS, and m_FUV~24, m_NUV~24.5 for MIS. At AIS depth, the number of Galactic and extragalactic objects are comparable, but the latter predominate in the MIS. Based on our stellar models, we estimate the GALEX surveys detect hot White Dwarfs throughout the Milky Way halo (down to a radius of 0.04 R_sun at MIS depth), providing an unprecedented improvement in the Galactic WD census. Their observed surface density is consistent with Milky Way model predictions. We also select low-redshift QSO candidates, extending the known QSO samples to lower magnitudes, and providing candidates for detailed z~1 follow-up investigations. SDSS optical spectra available for a large subsample confirm the classification for the photometrically selected candidates with 97% purity for single hot stars, ~45%(AIS)/31%(MIS) for binaries containing a hot star and a cooler companion, and about 85% for QSOs.
△ Less
Submitted 30 November, 2006;
originally announced November 2006.