I am an MS by Research candidate at CVIT, IIIT Hyderabad. I’m guided by Prof. C.V. Jawahar and co-guided by Prof. Chetan Arora. My research interest lies in Computer Vision, Pattern Recognition, and Machine Learning. My graduate research focuses on devising learning-based methods for understanding and exploring various aspects of first-person (egocentric) vision. Earlier, I worked on improving word recognition and retrieval in large document collection under the guidance of Prof. C.V. Jawahar. Previously, I worked with Prof. Shanmuganathan Raman on 3D Computer Vision.
My ultimate goal is to contribute to the development of systems capable of understanding the world as we do. I’m an inquisitive person, and I’m always willing to learn about fields including, but not limited to, science, technology, astrophysics, and physics.
Oct, 2020 : Improving Word Recognition using Multiple Hypotheses and Deep Embeddings got accepted to ICPR 2020!
April, 2020 : Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval got accepted to DAS 2020!
Center for Visual Information and Technology (CVIT), International Institute of Information Technology (IIIT), Hyderabad, India.
We propose to fuse recognition-based and recognition-free approaches for word recognition using learning-based methods.
International Conference on Pattern Recognition (ICPR), 2020
Fusing recognition-based and recognition-free approaches using rule-based methods for improving word recognition and retrieval. (ORAL)
IAPR International Workshop on Document Analysis and System (DAS), 2020
This paper proposes a Generative Adversarial Network (GAN) based architecture called Deep Future Gaze (DFG) for addressing the task of gaze anticipation in egocentric videos.
This paper proposes a three-stream convolutional neural network architecture for the task of action recognition in first-person videos.
This paper proposes a two-stream convolutional neural network architecture for the task of action recognition in a video.