San Francisco, California, United States Contact Info
20K followers 500+ connections

Join to view profile

About

Having been a startup founder practically all my professional life, I am an entrepreneur…

Articles by Sachin

Activity

Join now to see all activity

Experience & Education

  • HackerEarth

View Sachin’s full experience

See their title, tenure and more.

or

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Publications

  • Efficient Variable Size Template matching Using Fast Normalized Cross Correlation on Multicore Processors

    LNCS Springer

    Normalized Cross Correlation (NCC) is an efficient and robust way for finding the location of a template in given image. However NCC is computationally expensive. Fast normalized cross correlation (FNCC) makes use of pre-computed sum-tables to improve the computational efficiency of NCC. In this paper we propose a strategy for parallel implementation of FNCC algorithm using NVIDIA’s Compute Unified Device Architecture (CUDA) for real-time template matching. We also present an approach to make…

    Normalized Cross Correlation (NCC) is an efficient and robust way for finding the location of a template in given image. However NCC is computationally expensive. Fast normalized cross correlation (FNCC) makes use of pre-computed sum-tables to improve the computational efficiency of NCC. In this paper we propose a strategy for parallel implementation of FNCC algorithm using NVIDIA’s Compute Unified Device Architecture (CUDA) for real-time template matching. We also present an approach to make proposed method adaptable to variable size templates which is an important challenge to tackle. Efficient parallelization strategies adopted for pre-computing sum-tables and extracting data parallelism by dividing the image into series of blocks substantially reduce required computational time. We show that by optimal utilization different memories available on CUDA and using idling time of host CPU to perform independent tasks we can obtain the speedup of the order of 17X as compared to the sequential implementation.

    Other authors
    See publication
  • Motion Detection in Low Resolution Grayscale Videos Using Fast Normalized Cross Correrelation on GP-GPU

    ICAISC, Bhuvaneshwar

    Motion estimation (ME) has been widely used in many computer vision applications, such as object tracking, object detection, pattern recognition and video compression. The most popular block based similarity measures are the sum of absolute differences (SAD), the sum of squared differences (SSD) and the normalized cross correlation (NCC). Similarity measure obtained using NCC is more robust under varying illumination changes as compared to SAD and SSD. However NCC is computationally expensive…

    Motion estimation (ME) has been widely used in many computer vision applications, such as object tracking, object detection, pattern recognition and video compression. The most popular block based similarity measures are the sum of absolute differences (SAD), the sum of squared differences (SSD) and the normalized cross correlation (NCC). Similarity measure obtained using NCC is more robust under varying illumination changes as compared to SAD and SSD. However NCC is computationally expensive and application of NCC using full or exhaustive search method further increases required computational time. Relatively efficient way of calculating the NCC is to pre-compute sum-tables to perform the normalization referred to as fast NCC (FCC). In this paper we propose real time implementation of full search FCC algorithm applied to gray scale videos using NVIDIA’s Compute Unified Device Architecture (CUDA). We present fine-grained optimization techniques for fully exploiting computational capacity of CUDA. Novel parallelization strategies adopted for extracting data parallelism substantially reduce computational time of exhaustive FCC. We show that by efficient utilization of global, shared and texture memories available on CUDA, we can obtain the speedup of the order of 10x as compared to the sequential implementation of FCC.

    Other authors
    See publication

Courses

  • Compilers

    -

  • Database Management Systems

    -

  • Operating System

    -

  • Operating System

    -

Honors & Awards

  • Forbes 30 under 30

    Forbes

    Awarded as Forbes 30 under 30 in the Enterprise Tech category for Asia.

  • Forbes 30 under 30

    Forbes

    Recognized in Forbes 30 under 30 for Enterprise software.

Languages

  • English

    Native or bilingual proficiency

  • Hindi

    Native or bilingual proficiency

More activity by Sachin

View Sachin’s full profile

  • See who you know in common
  • Get introduced
  • Contact Sachin directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Others named Sachin Gupta in United States

Add new skills with these courses