Skip to main content

Showing 1–11 of 11 results for author: Manzini, T

  1. arXiv:2407.17673  [pdf, other

    cs.CV cs.AI cs.RO

    CRASAR-U-DROIDs: A Large Scale Benchmark Dataset for Building Alignment and Damage Assessment in Georectified sUAS Imagery

    Authors: Thomas Manzini, Priyankari Perali, Raisa Karnik, Robin Murphy

    Abstract: This document presents the Center for Robot Assisted Search And Rescue - Uncrewed Aerial Systems - Disaster Response Overhead Inspection Dataset (CRASAR-U-DROIDs) for building damage assessment and spatial alignment collected from small uncrewed aerial systems (sUAS) geospatial imagery. This dataset is motivated by the increasing use of sUAS in disaster response and the lack of previous work in ut… ▽ More

    Submitted 29 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: 16 Pages, 7 Figures, 6 Tables

  2. arXiv:2405.06593  [pdf, other

    cs.CV

    Non-Uniform Spatial Alignment Errors in sUAS Imagery From Wide-Area Disasters

    Authors: Thomas Manzini, Priyankari Perali, Raisa Karnik, Mihir Godbole, Hasnat Abdullah, Robin Murphy

    Abstract: This work presents the first quantitative study of alignment errors between small uncrewed aerial systems (sUAS) geospatial imagery and a priori building polygons and finds that alignment errors are non-uniform and irregular. The work also introduces a publicly available dataset of imagery, building polygons, and human-generated and curated adjustments that can be used to evaluate existing strateg… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 6 pages, 5 figures, 1 table

  3. arXiv:2309.09882  [pdf, other

    cs.RO

    Differentiable Boustrophedon Paths That Enable Optimization Via Gradient Descent

    Authors: Thomas Manzini, Robin Murphy

    Abstract: This paper introduces a differentiable representation for the optimization of boustrophedon path plans in convex polygons, explores an additional parameter of these path plans that can be optimized, discusses the properties of this representation that can be leveraged during the optimization process and shows that the previously published attempt at optimization of these path plans was too coarse… ▽ More

    Submitted 19 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 6 pages, 5 figures, 1 table

  4. arXiv:2309.01904  [pdf, other

    cs.RO cs.CV

    Improving Drone Imagery For Computer Vision/Machine Learning in Wilderness Search and Rescue

    Authors: Robin Murphy, Thomas Manzini

    Abstract: This paper describes gaps in acquisition of drone imagery that impair the use with computer vision/machine learning (CV/ML) models and makes five recommendations to maximize image suitability for CV/ML post-processing. It describes a notional work process for the use of drones in wilderness search and rescue incidents. The large volume of data from the wide area search phase offers the greatest op… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 6 pages, 4 figures

  5. arXiv:2308.14577  [pdf, other

    cs.RO

    Quantitative Data Analysis: CRASAR Small Unmanned Aerial Systems at Hurricane Ian

    Authors: Thomas Manzini, Robin Murphy, David Merrick

    Abstract: This paper provides a summary of the 281 sorties that were flown by the 10 different models of small unmanned aerial systems (sUAS) at Hurricane Ian, and the failures made in the field. These 281 sorties, supporting 44 missions, represents the largest use of sUAS in a disaster to date (previously Hurricane Florence with 260 sorties). The sUAS operations at Hurricane Ian differ slightly from prior… ▽ More

    Submitted 16 October, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 6 pages, 4 figures, 3 tables

  6. arXiv:2307.14527  [pdf, other

    cs.CV cs.AI cs.LG

    Open Problems in Computer Vision for Wilderness SAR and The Search for Patricia Wu-Murad

    Authors: Thomas Manzini, Robin Murphy

    Abstract: This paper details the challenges in applying two computer vision systems, an EfficientDET supervised learning model and the unsupervised RX spectral classifier, to 98.9 GB of drone imagery from the Wu-Murad wilderness search and rescue (WSAR) effort in Japan and identifies 3 directions for future research. There have been at least 19 proposed approaches and 3 datasets aimed at locating missing pe… ▽ More

    Submitted 9 August, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: 10 pages, 10 figures

  7. arXiv:2303.12937  [pdf, other

    cs.RO cs.NI

    Wireless Network Demands of Data Products from Small Uncrewed Aerial Systems at Hurricane Ian

    Authors: Thomas Manzini, Robin Murphy, David Merrick, Justin Adams

    Abstract: Data collected at Hurricane Ian (2022) quantifies the demands that small uncrewed aerial systems (UAS), or drones, place on the network communication infrastructure and identifies gaps in the field. Drones have been increasingly used since Hurricane Katrina (2005) for disaster response, however getting the data from the drone to the appropriate decision makers throughout incident command in a time… ▽ More

    Submitted 4 September, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: 6 pages, 8 figures

  8. arXiv:2006.05469  [pdf, other

    cs.CL cs.LG

    Examination and Extension of Strategies for Improving Personalized Language Modeling via Interpolation

    Authors: Liqun Shao, Sahitya Mantravadi, Tom Manzini, Alejandro Buendia, Manon Knoertzer, Soundar Srinivasan, Chris Quirk

    Abstract: In this paper, we detail novel strategies for interpolating personalized language models and methods to handle out-of-vocabulary (OOV) tokens to improve personalized language models. Using publicly available data from Reddit, we demonstrate improvements in offline metrics at the user level by interpolating a global LSTM-based authoring model with a user-personalized n-gram model. By optimizing thi… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

    Comments: ACL Natural Language Interface Workshop 2020, short paper

  9. arXiv:1904.04047  [pdf, other

    cs.CL cs.LG stat.ML

    Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings

    Authors: Thomas Manzini, Yao Chong Lim, Yulia Tsvetkov, Alan W Black

    Abstract: Online texts -- across genres, registers, domains, and styles -- are riddled with human stereotypes, expressed in overt or subtle ways. Word embeddings, trained on these texts, perpetuate and amplify these stereotypes, and propagate biases to machine learning models that use word embeddings as features. In this work, we propose a method to debias word embeddings in multiclass settings such as race… ▽ More

    Submitted 1 July, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted as a conference paper at NAACL. 5 Pages excluding references, additional page for appendix

  10. arXiv:1812.07809  [pdf, other

    cs.LG cs.CL cs.CV cs.HC stat.ML

    Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities

    Authors: Hai Pham, Paul Pu Liang, Thomas Manzini, Louis-Philippe Morency, Barnabas Poczos

    Abstract: Multimodal sentiment analysis is a core research area that studies speaker sentiment expressed from the language, visual, and acoustic modalities. The central challenge in multimodal learning involves inferring joint representations that can process and relate information from these modalities. However, existing work learns joint representations by requiring all modalities as input and as a result… ▽ More

    Submitted 28 February, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: AAAI 2019, code available at https://github.com/hainow/MCTN

  11. arXiv:1807.03915  [pdf, other

    cs.CL cs.LG stat.ML

    Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis

    Authors: Hai Pham, Thomas Manzini, Paul Pu Liang, Barnabas Poczos

    Abstract: Multimodal machine learning is a core research area spanning the language, visual and acoustic modalities. The central challenge in multimodal learning involves learning representations that can process and relate information from multiple modalities. In this paper, we propose two methods for unsupervised learning of joint multimodal representations using sequence to sequence (Seq2Seq) methods: a… ▽ More

    Submitted 6 August, 2018; v1 submitted 10 July, 2018; originally announced July 2018.

    Comments: 8 pages of content, 11 pages total, 2 figures. Published as a workshop paper at ACL 2018, Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML). 2018