Skip to main content

Showing 1–18 of 18 results for author: Jayasumana, S

  1. arXiv:2406.17968  [pdf, other

    cs.IR cs.AI cs.LG stat.ML

    Efficient Document Ranking with Learnable Late Interactions

    Authors: Ziwei Ji, Himanshu Jain, Andreas Veit, Sashank J. Reddi, Sadeep Jayasumana, Ankit Singh Rawat, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar

    Abstract: Cross-Encoder (CE) and Dual-Encoder (DE) models are two fundamental approaches for query-document relevance in information retrieval. To predict relevance, CE models use joint query-document embeddings, while DE models maintain factorized query and document embeddings; usually, the former has higher quality while the latter benefits from lower latency. Recently, late-interaction models have been p… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2401.09603  [pdf, other

    cs.CV

    Rethinking FID: Towards a Better Evaluation Metric for Image Generation

    Authors: Sadeep Jayasumana, Srikumar Ramalingam, Andreas Veit, Daniel Glasner, Ayan Chakrabarti, Sanjiv Kumar

    Abstract: As with many machine learning problems, the progress of image generation methods hinges on good evaluation metrics. One of the most popular is the Frechet Inception Distance (FID). FID estimates the distance between a distribution of Inception-v3 features of real images, and those of images generated by the algorithm. We highlight important drawbacks of FID: Inception's poor representation of the… ▽ More

    Submitted 25 January, 2024; v1 submitted 30 November, 2023; originally announced January 2024.

    Comments: Code is available at: https://github.com/google-research/google-research/tree/master/cmmd

  3. arXiv:2308.10997  [pdf, other

    cs.CV cs.AI cs.LG

    MarkovGen: Structured Prediction for Efficient Text-to-Image Generation

    Authors: Sadeep Jayasumana, Daniel Glasner, Srikumar Ramalingam, Andreas Veit, Ayan Chakrabarti, Sanjiv Kumar

    Abstract: Modern text-to-image generation models produce high-quality images that are both photorealistic and faithful to the text prompts. However, this quality comes at significant computational cost: nearly all of these models are iterative and require running sampling multiple times with large models. This iterative process is needed to ensure that different regions of the image are not only aligned wit… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  4. arXiv:2301.12005  [pdf, other

    cs.LG

    EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval

    Authors: Seungyeon Kim, Ankit Singh Rawat, Manzil Zaheer, Sadeep Jayasumana, Veeranjaneyulu Sadhanala, Wittawat Jitkrittum, Aditya Krishna Menon, Rob Fergus, Sanjiv Kumar

    Abstract: Large neural models (such as Transformers) achieve state-of-the-art performance for information retrieval (IR). In this paper, we aim to improve distillation methods that pave the way for the resource-efficient deployment of such models in practice. Inspired by our theoretical analysis of the teacher-student generalization gap for IR models, we propose a novel distillation approach that leverages… ▽ More

    Submitted 3 July, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  5. arXiv:2210.16413  [pdf, other

    cs.LG

    When does mixup promote local linearity in learned representations?

    Authors: Arslan Chaudhry, Aditya Krishna Menon, Andreas Veit, Sadeep Jayasumana, Srikumar Ramalingam, Sanjiv Kumar

    Abstract: Mixup is a regularization technique that artificially produces new samples using convex combinations of original training points. This simple technique has shown strong empirical performance, and has been heavily used as part of semi-supervised learning techniques such as mixmatch~\citep{berthelot2019mixmatch} and interpolation consistent training (ICT)~\citep{verma2019interpolation}. In this pape… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Journal ref: NeurIPS 2022 (First Workshop on Interpolation and Beyond)

  6. arXiv:2105.05736  [pdf, other

    cs.LG stat.ML

    Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces

    Authors: Ankit Singh Rawat, Aditya Krishna Menon, Wittawat Jitkrittum, Sadeep Jayasumana, Felix X. Yu, Sashank Reddi, Sanjiv Kumar

    Abstract: Negative sampling schemes enable efficient training given a large number of classes, by offering a means to approximate a computationally expensive loss function that takes all labels into account. In this paper, we present a new connection between these schemes and loss modification techniques for countering label imbalance. We show that different negative sampling schemes implicitly trade-off pe… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: To appear in ICML 2021

  7. arXiv:2104.12835  [pdf, other

    cs.CV cs.AI cs.LG

    Less is more: Selecting informative and diverse subsets with balancing constraints

    Authors: Srikumar Ramalingam, Daniel Glasner, Kaushal Patel, Raviteja Vemulapalli, Sadeep Jayasumana, Sanjiv Kumar

    Abstract: Deep learning has yielded extraordinary results in vision and natural language processing, but this achievement comes at a cost. Most models require enormous resources during training, both in terms of computation and in human labeling effort. We show that we can identify informative and diverse subsets of data that lead to deep learning models with similar performance as the ones trained with the… ▽ More

    Submitted 8 October, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Added error bars to the experiments

  8. arXiv:2012.09607  [pdf, other

    cs.LG cs.CV

    Kernelized Classification in Deep Networks

    Authors: Sadeep Jayasumana, Srikumar Ramalingam, Sanjiv Kumar

    Abstract: We propose a kernelized classification layer for deep networks. Although conventional deep networks introduce an abundance of nonlinearity for representation (feature) learning, they almost universally use a linear classifier on the learned feature vectors. We advocate a nonlinear classification layer by using the kernel trick on the softmax cross-entropy loss function during training and the scor… ▽ More

    Submitted 18 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

  9. arXiv:2007.07314  [pdf, other

    cs.LG stat.ML

    Long-tail learning via logit adjustment

    Authors: Aditya Krishna Menon, Sadeep Jayasumana, Ankit Singh Rawat, Himanshu Jain, Andreas Veit, Sanjiv Kumar

    Abstract: Real-world classification problems typically exhibit an imbalanced or long-tailed label distribution, wherein many labels are associated with only a few samples. This poses a challenge for generalisation on such labels, and also makes naïve learning biased towards dominant labels. In this paper, we present two simple modifications of standard softmax cross-entropy training to cope with these chall… ▽ More

    Submitted 9 July, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: Published as a conference paper in ICLR 2021

  10. arXiv:1912.05307  [pdf, other

    cs.CV eess.IV

    Bipartite Conditional Random Fields for Panoptic Segmentation

    Authors: Sadeep Jayasumana, Kanchana Ranasinghe, Mayuka Jayawardhana, Sahan Liyanaarachchi, Harsha Ranasinghe

    Abstract: We tackle the panoptic segmentation problem with a conditional random field (CRF) model. Panoptic segmentation involves assigning a semantic label and an instance label to each pixel of a given image. At each pixel, the semantic label and the instance label should be compatible. Furthermore, a good panoptic segmentation should have a number of other desirable properties such as the spatial and col… ▽ More

    Submitted 21 August, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  11. arXiv:1512.01192  [pdf, other

    cs.CV

    Prototypical Priors: From Improving Classification to Zero-Shot Learning

    Authors: Saumya Jetley, Bernardino Romera-Paredes, Sadeep Jayasumana, Philip Torr

    Abstract: Recent works on zero-shot learning make use of side information such as visual attributes or natural language semantics to define the relations between output visual classes and then use these relationships to draw inference on new unseen classes at test time. In a novel extension to this idea, we propose the use of visual prototypical concepts as side information. For most real-world visual objec… ▽ More

    Submitted 25 April, 2018; v1 submitted 3 December, 2015; originally announced December 2015.

    Comments: 12 Pages, 6 Figures, 2 Tables, in British Machine Vision Conference (BMVC), 2015

  12. arXiv:1511.08119  [pdf, other

    cs.CV

    Higher Order Conditional Random Fields in Deep Neural Networks

    Authors: Anurag Arnab, Sadeep Jayasumana, Shuai Zheng, Philip Torr

    Abstract: We address the problem of semantic segmentation using deep learning. Most segmentation systems include a Conditional Random Field (CRF) to produce a structured output that is consistent with the image's visual features. Recent deep learning approaches have incorporated CRFs into Convolutional Neural Networks (CNNs), with some even training the CRF end-to-end with the rest of the network. However,… ▽ More

    Submitted 29 July, 2016; v1 submitted 25 November, 2015; originally announced November 2015.

    Comments: ECCV 2016

  13. Conditional Random Fields as Recurrent Neural Networks

    Authors: Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr

    Abstract: Pixel-level labelling tasks, such as semantic segmentation, play a central role in image understanding. Recent approaches have attempted to harness the capabilities of deep learning techniques for image recognition to tackle pixel-level labelling tasks. One central issue in this methodology is the limited capacity of deep learning techniques to delineate visual objects. To solve this problem, we i… ▽ More

    Submitted 13 April, 2016; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: This paper is published in IEEE ICCV 2015

  14. arXiv:1412.4175  [pdf, other

    cs.CV

    Optimizing Over Radial Kernels on Compact Manifolds

    Authors: Sadeep Jayasumana, Richard Hartley, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

    Abstract: We tackle the problem of optimizing over all possible positive definite radial kernels on Riemannian manifolds for classification. Kernel methods on Riemannian manifolds have recently become increasingly popular in computer vision. However, the number of known positive definite kernels on manifolds remain very limited. Furthermore, most kernels typically depend on at least one parameter that needs… ▽ More

    Submitted 12 December, 2014; originally announced December 2014.

    Comments: Published in CVPR 2014

  15. arXiv:1412.4174  [pdf, other

    cs.CV

    A Framework for Shape Analysis via Hilbert Space Embedding

    Authors: Sadeep Jayasumana, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

    Abstract: We propose a framework for 2D shape analysis using positive definite kernels defined on Kendall's shape manifold. Different representations of 2D shapes are known to generate different nonlinear spaces. Due to the nonlinearity of these spaces, most existing shape classification algorithms resort to nearest neighbor methods and to learning distances on shape spaces. Here, we propose to map shapes o… ▽ More

    Submitted 12 December, 2014; originally announced December 2014.

    Comments: Published in ICCV 2013

  16. arXiv:1412.4172  [pdf, other

    cs.CV

    Kernel Methods on the Riemannian Manifold of Symmetric Positive Definite Matrices

    Authors: Sadeep Jayasumana, Richard Hartley, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

    Abstract: Symmetric Positive Definite (SPD) matrices have become popular to encode image information. Accounting for the geometry of the Riemannian manifold of SPD matrices has proven key to the success of many algorithms. However, most existing methods only approximate the true shape of the manifold locally by its tangent plane. In this paper, inspired by kernel methods, we propose to map SPD matrices to a… ▽ More

    Submitted 12 December, 2014; originally announced December 2014.

    Comments: Published in CVPR 2013. arXiv admin note: substantial text overlap with arXiv:1412.0265

  17. Kernel Methods on Riemannian Manifolds with Gaussian RBF Kernels

    Authors: Sadeep Jayasumana, Richard Hartley, Mathieu Salzmann, Hongdong Li, Mehrtash Harandi

    Abstract: In this paper, we develop an approach to exploiting kernel methods with manifold-valued data. In many computer vision problems, the data can be naturally represented as points on a Riemannian manifold. Due to the non-Euclidean geometry of Riemannian manifolds, usual Euclidean computer vision and machine learning algorithms yield inferior results on such data. In this paper, we define Gaussian radi… ▽ More

    Submitted 17 March, 2015; v1 submitted 30 November, 2014; originally announced December 2014.

  18. arXiv:1407.1123  [pdf, other

    cs.CV cs.LG stat.ML

    Expanding the Family of Grassmannian Kernels: An Embedding Perspective

    Authors: Mehrtash T. Harandi, Mathieu Salzmann, Sadeep Jayasumana, Richard Hartley, Hongdong Li

    Abstract: Modeling videos and image-sets as linear subspaces has proven beneficial for many visual recognition tasks. However, it also incurs challenges arising from the fact that linear subspaces do not obey Euclidean geometry, but lie on a special type of Riemannian manifolds known as Grassmannian. To leverage the techniques developed for Euclidean spaces (e.g, support vector machines) with subspaces, sev… ▽ More

    Submitted 4 July, 2014; originally announced July 2014.