I am an MS by Research candidate at CVIT, IIIT Hyderabad. I’m guided by Prof. C.V. Jawahar and co-guided by Prof. Chetan Arora. My research interest lies in Computer Vision, Pattern Recognition, and Machine Learning. My graduate research focuses on devising learning-based methods for understanding and exploring various aspects of first-person (egocentric) vision. Earlier, I worked on improving word recognition and retrieval in large document collection under the guidance of Prof. C.V. Jawahar. Previously, I worked with Prof. Shanmuganathan Raman on 3D Computer Vision.
My ultimate goal is to contribute to the development of systems capable of understanding the world as we do. I’m an inquisitive person, and I’m always willing to learn about fields including, but not limited to, science, technology, astrophysics, and physics.
April, 2020 : Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval got accepted to DAS 2020!
June, 2019 : Completed my B.E. in ECE from Vishwakarma Government Engineering College.
Center for Visual Information and Technology (CVIT), International Institute of Information Technology (IIIT), Hyderabad, India.
Fusing recognition-based and recognition-free approaches using rule-based methods for improving word recognition and retrieval. (ORAL)
IAPR International Workshop on Document Analysis and System (DAS), 2020
This paper proposes a Generative Adversarial Network (GAN) based architecture called Deep Future Gaze (DFG) for addressing the task of gaze anticipation in egocentric videos.
This paper proposes a three-stream convolutional neural network architecture for the task of action recognition in first-person videos.
This paper proposes a two-stream convolutional neural network architecture for the task of action recognition in a video.