
Showing 1–5 of 5 results for author: Laddha, P

  1. arXiv:2406.10247  [pdf, other]

    cs.CL cs.AI

    QCQA: Quality and Capacity-aware grouped Query Attention

    Authors: Vinay Joshi, Prashant Laddha, Shambhavi Sinha, Om Ji Omer, Sreenivas Subramoney

    Abstract: Excessive memory requirements of key and value features (KV-cache) present significant challenges in the autoregressive inference of large language models (LLMs), restricting both the speed and length of text generation. Approaches such as Multi-Query Attention (MQA) and Grouped Query Attention (GQA) mitigate these challenges by grouping query heads and consequently reducing the number of correspo…

    Submitted 8 June, 2024; originally announced June 2024.

  2. arXiv:2111.08434  [pdf, other]

    cs.CV

    Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion

    Authors: Anirud Thyagharajan, Benjamin Ummenhofer, Prashant Laddha, Om J Omer, Sreenivas Subramoney

    Abstract: 3D semantic segmentation is a fundamental building block for several scene understanding applications such as autonomous driving, robotics and AR/VR. Several state-of-the-art semantic segmentation models suffer from the part misclassification problem, wherein parts of the same object are labelled incorrectly. Previous methods have utilized hierarchical, iterative methods to fuse semantic and insta…

    Submitted 16 November, 2021; originally announced November 2021.

  3. arXiv:2011.12669  [pdf, other]

    cs.AR

    AccSS3D: Accelerator for Spatially Sparse 3D DNNs

    Authors: Om Ji Omer, Prashant Laddha, Gurpreet S Kalsi, Anirud Thyagharajan, Kamlesh R Pillai, Abhimanyu Kulkarni, Anbang Yao, Yurong Chen, Sreenivas Subramoney

    Abstract: Semantic understanding and completion of real world scenes is a foundational primitive of 3D Visual perception widely used in high-level applications such as robotics, medical imaging, autonomous driving and navigation. Due to the curse of dimensionality, compute and memory requirements for 3D scene understanding grow in cubic complexity with voxel resolution, posing a huge impediment to realizing…

    Submitted 25 November, 2020; originally announced November 2020.

  4. arXiv:1103.2741  [pdf]

    cs.NE

    Memory Retrieval in the B-Matrix Neural Network

    Authors: Prerana Laddha

    Abstract: This paper is an extension to the memory retrieval procedure of the B-Matrix approach [6],[17] to neural network learning. The B-Matrix is a part of the interconnection matrix generated from the Hebbian neural network, and in memory retrieval, the B-matrix is clamped with a small fragment of the memory. The fragment gradually enlarges by means of feedback, until the entire vector is obtained. In t…

    Submitted 14 March, 2011; originally announced March 2011.

    Comments: 8 Pages, 4 Figures

  5. arXiv:1007.5476  [pdf]

    cs.SI physics.soc-ph

    Degree of Separation in Social Networks

    Authors: Prerana Laddha

    Abstract: According to the small-world concept, the entire world is connected through short chains of acquaintances. In popular imagination this is captured in the phrase six degrees of separation, implying that any two individuals are, at most, six handshakes away. Social network analysis is the understanding of concepts and information on relationships among interacting units in an ecological system. In t…

    Submitted 30 July, 2010; originally announced July 2010.

    Comments: 9 pages, 7 figures