Showing 1–10 of 10 results for author: Daum, M

  1. arXiv:2305.02375

    cs.DB cs.LG cs.MM

    MaskSearch: Querying Image Masks at Scale

    Authors: Dong He, Jieyu Zhang, Maureen Daum, Alexander Ratner, Magdalena Balazinska

    Abstract: Machine learning tasks over image databases often generate masks that annotate image content (e.g., saliency maps, segmentation maps, depth maps) and enable a variety of applications (e.g., determine if a model is learning spurious correlations or if an image was maliciously modified to mislead a model). While queries that retrieve examples based on mask properties are valuable to practitioners, e…

    Submitted 8 January, 2024; v1 submitted 3 May, 2023; originally announced May 2023.

  2. arXiv:2303.17719

    cs.CV cs.LG

    Why is the winner the best?

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Sharib Ali, Vincent Andrearczyk, Marc Aubreville, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, Jorge Bernal, Sebastian Bodenstedt, Alessandro Casella, Veronika Cheplygina, Marie Daum, Marleen de Bruijne, Adrien Depeursinge, Reuben Dorent, Jan Egger, David G. Ellis, Sandy Engelhardt, Melanie Ganz, et al. (100 additional authors not shown)

    Abstract: International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and successful participation strategies? What makes a solution superior to a competing method? To addre…

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: accepted to CVPR 2023

  3. arXiv:2303.04068

    cs.DB cs.CV cs.SD eess.AS

    VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building [Technical Report]

    Authors: Maureen Daum, Enhao Zhang, Dong He, Stephen Mussmann, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska

    Abstract: We introduce VOCALExplore, a system designed to support users in building domain-specific models over video datasets. VOCALExplore supports interactive labeling sessions and trains models using user-supplied labels. VOCALExplore maximizes model quality by automatically deciding how to select samples based on observed skew in the collected labels. It also selects the optimal video representations t…

    Submitted 29 September, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  4. arXiv:2301.00929

    cs.DB

    EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report]

    Authors: Enhao Zhang, Maureen Daum, Dong He, Brandon Haynes, Ranjay Krishna, Magdalena Balazinska

    Abstract: We introduce EQUI-VOCAL: a new system that automatically synthesizes queries over videos from limited user interactions. The user only provides a handful of positive and negative examples of what they are looking for. EQUI-VOCAL utilizes these initial examples and additional ones collected through active learning to efficiently synthesize complex user queries. Our approach enables users to find ev…

    Submitted 8 August, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: This is an extended technical report for the following paper: "Enhao Zhang, Maureen Daum, Dong He, Brandon Haynes, Ranjay Krishna, and Magdalena Balazinska. EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions. PVLDB, 16(11): 2714-2727, 2023. doi:10.14778/3611479.3611482"

  5. arXiv:2212.08568

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,…

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  6. arXiv:2104.02234

    cs.DB cs.LG

    DeepEverest: Accelerating Declarative Top-K Queries for Deep Neural Network Interpretation

    Authors: Dong He, Maureen Daum, Walter Cai, Magdalena Balazinska

    Abstract: We design, implement, and evaluate DeepEverest, a system for the efficient execution of interpretation by example queries over the activation values of a deep neural network. DeepEverest consists of an efficient indexing technique and a query execution algorithm with various optimizations. We prove that the proposed query execution algorithm is instance optimal. Experiments with our prototype show…

    Submitted 2 April, 2023; v1 submitted 5 April, 2021; originally announced April 2021.

    Comments: This is an extended technical report for the following paper: "DeepEverest: Accelerating Declarative Top-K Queries for Deep Neural Network Interpretation. PVLDB, 15(1): 98 - 111, 2021. doi:10.14778/3485450.3485460"

  7. arXiv:2103.16604

    cs.DB

    VSS: A Storage System for Video Analytics [Technical Report]

    Authors: Brandon Haynes, Maureen Daum, Dong He, Amrita Mazumdar, Magdalena Balazinska, Alvin Cheung, Luis Ceze

    Abstract: We present a new video storage system (VSS) designed to decouple high-level video operations from the low-level details required to store and efficiently retrieve video data. VSS is designed to be the storage subsystem of a video data management system (VDBMS) and is responsible for: (1) transparently and automatically arranging the data on disk in an efficient, granular format; (2) caching freque…

    Submitted 30 March, 2021; originally announced March 2021.

  8. arXiv:2006.02958

    cs.DB

    TASM: A Tile-Based Storage Manager for Video Analytics

    Authors: Maureen Daum, Brandon Haynes, Dong He, Amrita Mazumdar, Magdalena Balazinska

    Abstract: Modern video data management systems store videos as a single encoded file, which significantly limits possible storage level optimizations. We design, implement, and evaluate TASM, a new tile-based storage manager for video data. TASM uses a feature in modern video codecs called "tiles" that enables spatial random access into encoded videos. TASM physically tunes stored videos by optimizing their…

    Submitted 26 February, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

  9. arXiv:1904.05387

    cs.PL cs.HC cs.MS

    Tea: A High-level Language and Runtime System for Automating Statistical Analysis

    Authors: Eunice Jun, Maureen Daum, Jared Roesch, Sarah E. Chasins, Emery D. Berger, Rene Just, Katharina Reinecke

    Abstract: Though statistical analyses are centered on research questions and hypotheses, current statistical analysis tools are not. Users must first translate their hypotheses into specific statistical tests and then perform API calls with functions and parameters. To do so accurately requires that users have statistical expertise. To lower this barrier to valid, replicable statistical analysis, we introdu…

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: 11 pages

  10. Chiefly Symmetric: Results on the Scalability of Probabilistic Model Checking for Operating-System Code

    Authors: Christel Baier, Marcus Daum, Benjamin Engel, Hermann Härtig, Joachim Klein, Sascha Klüppelholz, Steffen Märcker, Hendrik Tews, Marcus Völp

    Abstract: Reliability in terms of functional properties from the safety-liveness spectrum is an indispensable requirement of low-level operating-system (OS) code. However, with evermore complex and thus less predictable hardware, quantitative and probabilistic guarantees become more and more important. Probabilistic model checking is one technique to automatically obtain these guarantees. First experiences…

    Submitted 26 November, 2012; originally announced November 2012.

    Comments: In Proceedings SSV 2012, arXiv:1211.5873

    Journal ref: EPTCS 102, 2012, pp. 156-166