We are excited to showcase the accepted regular papers for presentation at ICVGIP 2024.
Explore the full list of accepted papers below, and join us at the conference to engage
with the authors!
The list doubles as a Detailed Technical Program so Authors will know which time slot their
presentation is according to the Conference Program.
For list of Accepted Tiny Papers jump here.
For list of Accepted Symposium Papers jump here.
For list of Vision India Session Papers jump here.
Paper ID | Title | Authors |
---|---|---|
163 | MoCoMER: A Self Supervised Representation Learning for Handwritten Mathematical Expression Recognition | Sandip Pramanik (Jadavpur University)*; Shree Mitra (Indian Institute of Information Technology Guwahati); Dr. Nibaran Das (Jadavpur University) |
50 | Enhancing Generalization Ability in Deepfake Detection via Continual Learning | Shaheen Usmani (ABV-IIITM Gwalior); SUNIL KUMAR (ABV-IIITM Gwalior); Debanjan Sadhya (ABV-IIITM Gwalior)* |
68 | GraphVL: Graph-Enhanced Semantic Modeling via Vision-Language Models for Generalized Class Discovery | Bhupendra Mr. Solanki (Indian Institute of Technology, Bombay); Ashwin R Nair (Indian Institute of Science Education and Research Thiruvananthapuram); Mainak Singha (Indian Institute of Technology Bombay); Souradeep Mukhopadhyay (Indian Institute of Science); Ankit Jha (INRIA, Grenoble, France)*; Biplab Banerjee (Indian Institute of Technology, Bombay) |
115 | Covariance-Controlled Feature Space Augmentation and Rectification for Long-Tailed Class Incremental Learning | Riya Verma (IIT Madras)*; Sukhendu Das (Indian Institute of Technology, Madras) |
120 | Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networks | Sanchar Palit (IIT Bombay)*; Biplab Banerjee (Indian Institute of Technology, Bombay); Subhasis Chaudhuri (Indian Institute of Technology Bombay) |
141 | Robust Speech Recognition with Unsupervised Frame and Character Level Adversarial Domain Matching | SOUMEN PAUL (IIT KGP)*; Partha Pratim Das (IIT Kharagpur); Krothapalli Sreenivasa Rao (Indian Institute of Technology Kharagpur) |
70 | EXACFS - A CIL Method to Mitigate Catastrophic Forgetting | Balasubramanian S (SSSIHL)*; Sai Subramaniam M (Sri Sathya Sai Institute of Higher Learning); Sai Sriram Talasu (Sri Sathya Sai Institute of Higher Learning); Pranav Phanindra Sai Manepalli (Sri Sathya Sai Institute of Higher Learning); Yedu Krishna P (Sri Sathya Sai Institute of Higher Learning); Ravi Mukkamala (Old Dominion University); Darshan Gera (SSSIHL) |
Paper ID | Title | Authors |
---|---|---|
29 | An Improved Framework for Precision Grading of Renal Cell Carcinoma using Histopathological Images | Rashika Bagri (Department of Computer Science, Faculty of Mathematical Sciences, University of Delhi); Ankit Rajpal (Department of Computer Science, Faculty of Mathematical Sciences, University of Delhi)*; Naveen Kumar (University of Delhi) |
45 | Multi-Scale Transformer-CNN Network for Brain Tumor Segmentation and Survival Prediction | Indrajit Mazumdar (Indian Institute of Technology Kharagpur)*; Jayanta Mukhopadhyay (IIT Kharagpur) |
93 | A Novel Spatial Attention Module (SAM) for Alzheimer's Detection from MRI Images | Santanu Roy (Pandit Deendayal Energy University (PDEU) )*; Archit Gupta (NIIT University); Shubhi Tiwari (NIIT UNIVERSITY); Himaanshi Sharma (NIIT University); Mehak Kapoor (NIIT University); Samay Singh (NIIT University) |
84 | Diffusion-Based Generative Model for subject-specific Amyloid Spread Prediction | ALPHIN J THOTTUPATTU (TCS)*; Jayavardhana Gubbi (TCS Research); Pavan Kumar Reddy K (TCS Research); murali poduval (TCS); Arpan Pal (Tata Consultancy Services) |
119 | EEG classification for visual brain decoding with spatio-temporal and transformer based paradigms. | Akanksha Sharma (Indian Institute of Technology, Mandi)*; Jyoti Nigam (IITMANDI); Abhishek Rathore (Indian Insititute of Technology Mandi); Arnav Bhavsar (IIT Mandi) |
124 | Hierarchical Feature Integrated BoT-UNet with contextual feature enhancement for retinal vessel segmentation | Ananya Bose (St Thomas' College of Engineering and Technology, Kolkata); Prerana Mukherjee (Jawaharlal Nehru University)*; Anasua Sarkar (Jadavpur Univesity, India) |
149 | Phase Synchronization-based Topological Features for Autism and ADHD Classification | Karamjot Kaur (Department of Computer Science, University of Delhi); Vaishali Chawla (Department of Computer Science, University of Delhi); BHARTI - (DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF DELHI)* |
Paper ID | Title | Authors |
---|---|---|
94 | No Prompting Frozen Foundation Models: Interactive Medical Volume Segmentation using Continual Test Time Adaptation of Compact Models | Kushal Borkar (International Institute of Information Technology, Hyderabad)*; Abhilaksh Singh Reen (Indian Institute of Technology, Delhi); C.V. Jawahar (IIIT-Hyderabad); Chetan Arora (Indian Institute of Technology Delhi) |
96 | Med-SeAM: Medical Context Aware Self-Supervised Learning Framework for Anomaly Classification in Knee MRI | Akshay Daydar (Indian Institute of Technology Guwahati)*; Ajay Kumar Reddy (Indian Institute of Technology Guwahati); SONAL KUMAR (INDIAN INSTITUTE OF TECHNOLOGY GUWAHATI); Arijit Sur (IIT Guwahati); Hanif Laskar (Guwahati Neurological Research Centre (GNRC)) |
28 | IDA-UIE: An Iterative Framework for Deep Network based Degradation Aware Underwater Image Enhancement | Pranjali Singh (Indian Institute of Technology, Guwahati)*; Prithwijit Guha (Indian Institute of Technology Guwahati) |
56 | A Computer Vision Framework on Biomechanical Analysis of Jump Landings | Srishti U Sharma (Ahmedabad University)*; Srikrishnan Divakaran (Krea University); Tolga Kaya (Sacred Heart University); Mehul S Raval (Ahmedabad University) |
104 | ViDAS: Vision-based Danger Assessment and Scoring | Pranav Gupta (SRM Institute of Science and Technology)*; Advith Krishnan (SRM Institute of Science and Technology); Naman Nanda (SRM Institute of Science and Technology ); Ananth Eswar (VIT Chennai); Deeksha Agrawal (SRM Institute of Science and Technology); Pratham Gohil (SRM Institute of Science and Technology ); Pratyush Goel (SRM Institute of Science and Technology) |
80 | APMSA: Crossmodal Remote Sensing Image Retrieval using Attention Pooling and Multimodal Semantic Alignment | Aparna H (National Institute of Technology, Tiruchirappalli)*; Avik Hati (NIT Tiruchirappalli) |
130 | Deep Learning Based Compressive Domain Analytics Framework for Seismic Images | Siddharath Narayan Shakya (Indian Institute of Technology, Mandi)*; Parimala Kancharla (Indian Institute of Technology Mandi) |
Paper ID | Title | Authors |
---|---|---|
64 | Internal Embeddings of Multi-modal LLMs as Generalizable Representations for Image Quality Assessment | Sanjot Sagar Totade (IISc); Nithin C Babu (Indian Institute of Science, Bangalore)*; Shika Rao (Indian Institute of Science); Rajiv Soundararajan (Indian Institute of Science) |
77 | Zero-Shot Pose Estimation and Tracking of Autonomous Mobile Robots using Infrastructure Vision Sensors - An End-to-End Perception Framework | Dharini Raghavan (ARTPARK, Indian Institute of Science)*; Raghu Krishnapuram (IISc Bangalore); Bharadwaj Amrutur (IISc Bangalore) |
106 | Variational Distribution and Experience Replay for 3D Reconstruction in a Continual Learning Framework | Sanchar Palit (IIT Bombay)*; Sandika Biswas (Monash University) |
135 | Refine3DNet: Scaling Precision in 3D Object Reconstruction from Multi-View RGB Images using Attention | Ajith KB (College of Engineering, Trivandrum)*; Linu Shine (College of Engineering, Trivandrum); SREEJA S (COLLEGE OF ENGINEERING TRIVANDRUM) |
67 | Can Commonsense Knowledge Improve CLIP’s Performance in Cross-Domain VQA? | Mohamad Hassan N C (Indian Institute of Technology, Bombay); Ankit Jha (INRIA, Grenoble, France)*; Moloud Abdar (Deakin University); Biplab Banerjee (Indian Institute of Technology, Bombay) |
105 | Spectrogrand: Computational Creativity Driven Audiovisuals' Generation From Text Prompts | Vijay Jaisankar (International Institute of Information Technology, Bangalore)*; Dinesh Babu Jayagopi (International Institute of Information Technology, Bangalore) |
153 | Addressing Diffusion Model Based Counter-Forensic Image Manipulation for Synthetic Image Detection | Aryan N Herur (National Institute of Technology Karnataka Surathkal)*; Vaibhav Santhosh (National Institute of Technology Karnataka Surathkal); Nishanth Shetty (Indian Institute of Science); Chandra Sekhar Seelamantula (IISc Bangalore) |
Paper ID | Title | Authors |
---|---|---|
10 | Enhancing Face Quality Assessment through Age and Expression Analysis | Prateek Jaiswal (International Institute of Information Technology Hyderabad)*; Anoop Namboodiri (IIIT Hyderabad) |
30 | Lost in Context: The Influence of Context on Feature Attribution Methods for Object Recognition | Sayanta Adhikari (Indian Institute of Technology Hyderabad); Rishav Kumar (Indian Institute of Technology, Hyderabad)*; Konda Reddy Mopuri (Indian Institute of Technology Hyderabad); Rajalakshmi Pachamuthu (IIT Hyderabad) |
33 | Vision-Language Modeling with Regularized Spatial Transformer Networks for All Weather Crosswind Landing of Aircraft | Debabrata Pal (Honeywell)*; Anvita Singh (Honeywell); Saumya Saumya (Honeywell); Shouvik Das (Honeywell) |
34 | On The Efficacy of Guidance Tasks in Panoptic Segmentation | Pranjal Agarwal (IIIT Bangalore)*; Shivansh Sethi (IIITB); Viswanath Gopalakrishnan (IIIT Bangalore); Biplab Chandra Das (Samsung R&D Institute Bangalore); Shouvik Das (Samsung) |
58 | TRAQID - Traffic-Related Air Quality Image Dataset | Om Rajendra Kathalkar (IIIT Hyderabad)*; Nitin Nilesh (IIIT-Hyderabad); Sachin Chaudhari (IIITH); Anoop Namboodiri (IIIT Hyderabad) |
61 | LiAGE : Light-weight Adaptive Gaze Estimation | Navneeth S Holla (Samsung Research India)*; Anup Kushwaha (Samsung); Chandramouli Sanchi (Samsung R&D Institute India Bangalore); Saravana Balaji (Samsung R&D Institute India) |
62 | Pick and Pack: revitalizing mandalas through digital artistry | Tusita Sarkar (IIT Kharagpur); Tushar Abhishek (Indian Institute of Technology Kharagpur); Partha Bhowmick, IIT Kharagpur Bhowmick (IIT Kharagpur)* |
65 | Accurate and Real-time LiDAR Point Cloud Forecasting for Autonomous Driving | Soham Dasgupta (IIT Jodhpur)*; Kshitij J Aphale (Indian Institute of Technology Jodhpur); Kaustab Pal (IIIT Hyderabad); Avinash Sharma (IIT Jodhpur) |
71 | Simultaneous Segmentation and Anatomical Landmark Detection on 3D Data using Context-aware Multi-task Learning | Pavan Kumar Reddy K (TCS Research)*; Noorjaha Bhanu Mahammad (Tata Consultancy Services); Aparna Kanakatte (TCS); DIVYA M BHATIA (TCS); Jayavardhana Gubbi (TCS Research); Murali Poduval (TCS); Arpan Pal (Tata Consultancy Services) |
82 | Certainty in Uncertainty! An Improved GI Bleeding Detection Pipeline with Uncertainty Estimation | Sasidhar Alavala (IIT Tirupati)*; Subrahmanyam VRMS Gorthi (Indian Institute of Technology - Tirupati) |
89 | Influence of Evaluating Cross-Stimulation on the EEG Biometric Verification Performance: Benchmarking Study | Marissa L de Ataide (Goa University)*; Krishna Patel (Goa University); Narayan Vetrekar (Goa University); Rajendra Gad (UoG, India) |
97 | AppleV: A dataset for Apple fruit Volume Estimation | Seema Barda (IIT Ropar)*; Aditya _ (IIT Ropar); Rohit Kinha (IIT Ropar); Neeraj Goel (IIT Ropar) |
100 | Semi-Decoupled Distillation for Brain Tumor Segmentation using multimodal MRI Scans | Surajit Kundu (Indian Institute of Technology Kharagpur)*; Ankita Chatterjee (Indian Institute of Technology, Kharagpur); Jayanta Mukhopadhyay (IIT Kharagpur); Nishant Chakravorty (IIT Kharagpur) |
107 | CHAPVIDMR: Chapter-based Video Moment Retrieval using Natural Language Queries | Uday Agarwal (IIT Jodhpur); Yogesh Kumar (IIT Jodhpur)*; Abu Shahid (IIT Jodhpur); Prajwal Gatti (University of Bristol); Manish Gupta (Microsoft,India); Anand Mishra (Indian Institute of Technology, Jodhpur) |
109 | Transferring Appearance with Mask-Guided Attention | Manikanta Bandla (IISc, Bangalore)* |
114 | Leveraging Auxiliary Classification for Rib Fracture Segmentation | Harini G (Indian Institute of Technology Jodhpur); Aiman Farooq (Indian Institute of Technology Jodhpur)*; Deepak Mishra (IIT Jodhpur) |
121 | Dementia Disease Progression Analysis from 2D Cellular Automata Architecture | Siva Manohar Reddy Kesu (Centre for Brain Research)*; Neelam Sinha (Centre for Brain Research); Hariharan Ramasangu (Relecura) |
134 | Contextual Self-Attention Based UNet Architecture for Fluid Segmentation in Retinal OCT B-scans | Himashree Kalita (Indian Institute of Technology Guwahati)*; Samarendra Dandapat (Indian Institute of Technology Guwahati); Prabin Bora (IIT Guwahati) |
145 | Paddy Pest Detection with a Modified SE-YOLO Model Using the TPD-20 Dataset | Md Mansoor (Roomi); Priya K Kannpiran (Thiagarajar College of Engineering)*; Uma Maheswari Pandyan (Velammal College of Engineering and Technology); Vaishali V (Thiagarajar College of Engineering); Sasithradevi Anablagan (Vellore Institute of Technology) |
155 | LW-Dehazer: Deep Learning for Dense and Non-Homogeneous Haze Removal | Kalaivani P (GCE Bodi)*; Md Mansoor (Roomi); Senthilarasi Marimuthu (Thiagarajar College of Engineering); Annalakshmi M (Velammal College of Engineering and Technology); Prakash P (Madras Institute of Technology) |
159 | Exploring the Limits of VLMs: A Dataset for Evaluating Text-to-Video Generation | Avnish Srivastava (Indian Institute of Technology Kharagpur)*; Raviteja Sista (Indian Institute of Technology Kharagpur); Partha P Chakrabarti (Indian Institute of Technology Kharagpur); Debdoot Sheet (Indian Institute of Technology Kharagpur) |