PaperID.pdf before uploading.The following workshops will be held at ICVGIP 2025, on Dec 17, 2025. More information about each workshop is available on the individual webpages, linked to the titles below.
Note that some workshops may also have an associated call for papers.
The following tutorials will be held at ICVGIP 2025, on Dec 17, 2025. More information about each tutorial is available on the individual webpages, linked to the titles below.
The results for Vision India 2025 are now announced.
| Category | Early Registration | Late Registration |
|---|---|---|
| Students (includes accommodation) | *INR 5,900 | *INR 7,080 |
| Faculty / Researcher | *INR 9,440 | *INR 10,620 |
| Industry | *INR 16,520 | *INR 17,700 |
*INR amounts are inclusive of 18% GST
| Category | Early Registration | Late Registration |
|---|---|---|
| Students (includes accommodation) | *USD 260 | *USD 320 |
| Faculty / Researcher | *USD 390 | *USD 450 |
*USD amounts are inclusive of 18% GST
Title: A Guided Tour of Generative and Diagnostic AI in Medical Imaging
Artificial intelligence is transforming medical imaging, driven by rapid advances in computer vision and multimodal foundation models. Over the past 15 years, our group has led the ENIGMA Consortium — the world’s largest collaborative neuroimaging initiative — integrating data from 2,000 imaging centers across 47 countries to study over 30 brain disorders. Leveraging this unprecedented scale, we have systematically benchmarked key computer vision architectures — convolutional neural networks, Vision Transformers, and Diffusion Transformers — for clinical tasks spanning diagnosis, prognosis, and the discovery of disease mechanisms in the brain. The lecture will examine how generative AI is redefining the medical imaging workflow and empowering biomedical discovery. Vision-language models (VLMs) can now perform semantically grounded reasoning on brain scans, supporting visual question answering (“What disease does this scan indicate?”) with calibrated confidence and interpretability. Meanwhile, image-generating models based on latent diffusion and flow matching are enabling cross-modal translation (e.g., MRI↔PET), super-resolution, and counterfactual synthesis — generating individualized predictions such as “How would this patient’s brain look after treatment?”, We will also highlight mathematical frontiers in medical AI, including privacy-preserving training, memorization detection, uncertainty quantification, and calibration of probabilistic outputs. Methods grounded in stochastic partial differential equations (sPDEs) — including Malliavin derivatives, Stein variational flows, and fractional diffusion operators — offer principled frameworks to model uncertainty and transport distributions in generative learning. Finally, I will introduce a triple scaling law that characterizes how diagnostic AI performance scales jointly with data volume, model capacity, and algorithmic diversity, illustrating how AI–human co-learning networks such as ENIGMA provide a new paradigm for scalable, interpretable, and collaborative medical intelligence.
Paul Thompson is a British-American neuroscientist and mathematician who develops advanced AI and mathematical models to map the human brain, integrating large-scale multimodal data to quantify disease processes.
As Associate Director of USC’s Stevens Neuroimaging & Informatics Institute in Los Angeles, he leads national and global projects that combine deep learning, statistical genomics, and multi-resolution structural, diffusion, and functional MRI to dissect brain organization and pathology.
His group builds explainable and generative AI systems—including latent diffusion models, flow-matched transformations, and continuum mechanics frameworks—to characterize normative and pathological brain changes, microstructure, and connectivity across more than 30 diseases. His team also develops vision-language models to support data-driven radiologic interpretation.
Thompson directs the $18M NIH-funded AI4AD initiative on machine learning prediction of Alzheimer’s trajectories and co-founded ENIGMA, the largest brain imaging-genetics consortium worldwide, analyzing data from over 200,000 individuals across 45 countries, with over 2,000 participating scientists.
ENIGMA’s work has produced the largest international MRI-genetic studies of schizophrenia, depression, anorexia, epilepsy, and Parkinson’s, uncovering multiscale biomarkers of disease risk and progression. Ranked among the world’s most cited scientists (h-index=220), Thompson has published over 1,000 papers and delivered keynote lectures such as the Talairach Lecture at OHBM.
His honors include the Gold Medal from the Society of Biological Psychiatry, and with colleagues at NIMHANS (Bangalore), he co-leads the India ENIGMA Initiative to understand drivers of human brain aging and dementia.
Title: Cross-modal generation and understanding of multimodal content
In the first part of the presentation, we will present our work on video generation without annotations or prior object-specific information. Trained on videos of similar objects (e.g. faces, bodies), our method generalizes across the category. Building on this, we introduce a Learnable Game Engine (LGE), trained from monocular annotated videos, that maintains scene and object states and renders environments from controllable viewpoints. Like a game engine, it simulates physics and logic, allowing users to control the game play or use a director mode to guide agents via high-level language and goals, enabled by learned game AI. The second part will investigate the safety and fairness of the current generative models. While most of the existing research focuses on detecting closed sets of biases defined a priori, we tackle the challenge of open-set bias detection in text-to-image generative models. For this we proposed OpenBias, a new pipeline that agnostically identifies and quantifies the severity of biases without access to any precompiled set. We study the behavior of Stable Diffusion 1.5, 2, and XL emphasizing new biases, never investigated before. Via quantitative experiments, we demonstrate that OpenBias agrees with current closed-set bias detection methods and human judgement.
Nicu Sebe is a professor in the University of Trento, Italy, where he is leading the research in the areas of multimedia information retrieval and human-computer interaction in computer vision applications. He received his PhD from the University of Leiden, The Netherlands and has been in the past with the University of Amsterdam, The Netherlands and the University of Illinois at Urbana-Champaign, USA. He was involved in the organization of the major conferences and workshops addressing the computer vision and human-centered aspects of multimedia information retrieval, among which as a General Co-Chair of the IEEE Automatic Face and Gesture Recognition Conference, FG 2008, ACM International Conference on Multimedia Retrieval (ICMR) 2017 and ACM Multimedia 2013. He was a program chair of ACM Multimedia 2011 and 2007, ECCV 2016, ICCV 2017, ICPR 2020 and a general chair of ACM Multimedia 2022. He is a PC of CVPR 2027 and a GC of ACM Multimedia 2027 and ECCV 2028. He is a fellow of ELLIS, IAPR and a Senior member of ACM and IEEE. He is the co-editor in chief of Computer Vision and Image Understanding journal.
*To be updated*
Srinath Sridhar is the John E. Savage Assistant Professor of Computer Science at Brown University, where he leads the Interactive 3D Vision & Learning Lab (https://ivl.cs.brown.edu). He received his PhD at the Max Planck Institute for Informatics and was subsequently a postdoctoral researcher at Stanford. His research interests are in 3D computer vision and artificial intelligence. Specifically, his group builds foundational methods for 3D spatiotemporal (4D) visual understanding of the world including objects in it, humans in motion, and human-object interactions, with applications ranging from robotics to mixed reality. He is the recipient of an NSF CAREER award, a Google Research Scholar award, and his work received a Best Student Paper award at WACV and a Best Paper Honorable Mention at Eurographics. He spends part of his time as an Amazon Scholar and a visiting faculty at the Indian Institute of Science (IISc).
*To be updated*
Prof. C. V. Jawahar is the Head of the Centre for Visual Information Technology-CVIT, at the International Institute of Information Technology, Hyderabad (IIITH), India. At IIIT Hyderabad, he leads the research group focusing on computer vision, machine learning, and multimedia systems. In recent years, he has been actively involved in research questions in Computer Vision with emphasis on mobility, healthcare, and Indian language computing.. He is also interested in large-scale multimedia systems with a special focus on assistive technology solutions. Prof. Jawahar is an elected Fellow of the Indian National Academy of Engineers (INAE) and the International Association of Pattern Recognition(IAPR). His prolific research is globally recognized in the Artificial Intelligence and Computer Vision research community with more than 300 publications in top tier conferences and journals in computer vision, robotics and document image processing to his credit with over 25000 citations. He is awarded the ACM India Outstanding Contribution to Computing Education (OCCE) 2021. He is actively engaged with several government agencies, ministries, and leading companies around innovating at scale through research.
Title: Translating AI Innovations into Solutions in the Global South
According to Prof. Kentaro Toyama, technology predominantly acts as an amplifier of human effort and the strength of institutions. In this presentation, I will share examples from India and other developing nations that demonstrate how AI-powered solutions are being designed, implemented, and scaled to improve people's lives, in manner consistent with Kentaro’s observation. Each project narrative will explore how specific problems were selected, the significant constraints and opportunities encountered during the design and development phases, and the approaches taken for implementation. The discussion will also address the challenges and possibilities associated with scaling these solutions. A central lesson from these experiences is the necessity for what I call “radical collaboration.” This involves bringing together a diverse group of contributors—technology specialists, implementing organizations, subject matter experts, donors, and government agencies. Through these case studies, I will illustrate how such deep collaboration can turn AI-driven innovations into effective solutions for challenges in the developing world.
Padmanabhan Anandan is a scientist in the domain of Artificial Intelligence with over 40 years of experience in basic and applied research in AI as well as in leading teams of AI scientists and building multiple applied research institutions. He is the ex-CEO of Wadhwani Institute for Artificial Intelligence, an independent not-for-profit Research institute focused on developing artificial intelligence based applications for social good. He was formerly vice president for research at Adobe Systems and prior to that a distinguished scientist and managing director of Microsoft Research. He was the managing director of Microsoft Research India, which he founded in January 2005 in Bangalore. He joined Microsoft Research in Redmond, Washington in 1997, where he founded and built the Interactive Visual Media group. He was also previously a professor of Computer Science at Yale University and the head of the video information processing group at Sarnoff Corporation.
The Indian Conference on Computer Vision, Graphics, and Image Processing (ICVGIP) stands as India's premier forum for these dynamic fields. Since its inception in 1998, ICVGIP has been a crucial platform for showcasing cutting-edge technological advancements and pivotal research findings. The upcoming 16th ICVGIP 2025 is proudly hosted by IIT Mandi in collaboration with the Indian Unit for Pattern Recognition and Artificial Intelligence (IUPRAI), an affiliate of the esteemed International Association for Pattern Recognition (IAPR). Renowned for nurturing a vibrant community of researchers and enthusiasts both domestically and internationally, ICVGIP transitioned to an annual cadence in 2022, a testament to the significant expansion of this community. ICVGIP 2025 continues to embody our unwavering commitment to the evolving landscapes of computer vision, graphics, and image processing. The conference offers a dynamic environment for academics and industry experts to converge, exchange insights, and explore the latest innovations shaping these domains.
ICVGIP has consistently cultivated strong industry relationships and actively encourages engagement from diverse sectors. The conference presents multifaceted opportunities for industry participation, spanning the technical program, specialized research symposiums, and exhibitions. By engaging with ICVGIP as a sponsor, organizations not only gain significant visibility but also acquire the chance to influence the trajectory of these rapidly progressing fields. We earnestly encourage companies and R&D laboratories to present their groundbreaking work and actively connect with the community at ICVGIP. This engagement is not only invaluable for fostering networking and knowledge dissemination but also pivotal in molding the future directions of computer vision, graphics, and image processing. We offer a spectrum of sponsorship tiers, detailed below, each accompanied by a corresponding allocation of complimentary registrations as outlined below:
Sponsorship cost
INR 10 Lakh
Sponsorship cost
INR 5 Lakh
Sponsorship cost
INR 3 Lakh
Sponsorship cost
INR 1 Lakh
Sponsorship cost
INR 50 Thousand
Sponsorship cost
Contact Sponsors
IIT Mandi
IIT Delhi
IIT Delhi
IIIT Bangalore
IIIT Hyderabad
IIT Madras
IIT Mandi
IIT Mandi
IIT Mandi
IIT Mandi
IIT Mandi
IIT Mandi
IIT Mandi
IIT Mandi
IIT Ropar
IIT Hyderabad
IIT Kharagpur
IIT Gandhinagar
IIT Hyderabad
IIIT Hyderabad
IIT Jodhpur
IIT Kharagpur
IIT BHU
IIT Ropar
Walmart Global Tech
IISc Bangalore
Ashoka University
IBM Research
The conference will be held at IIT Mandi's North campus in Kamand, Mandi District, Himachal Pradesh, India. Please read about the geography of this location here. The location is very close to a number of very popular tourist attractions.
| Hotel Name | Location | Contact No. | Room Charges per night |
|---|---|---|---|
| Hotel Parashar Valley View | Kataula, Mandi - 5 km from campus | 9816296621 | INR 1700 + GST |
| Hotel Name | Location | Link |
|---|---|---|
| Munish Resorts | Mandi City | Click here |
| Raj Mahal Palace | Mandi City | Click here |
| Hotel Park View | Mandi City | Click here |
| The Regent Palms Hotel | Mandi City | Click here |
| Visco Resorts | Mandi City | Click here |
| Hotel River Bank | Mandi City | Click here |
| Classio | Mandi City | Click here |
We eagerly look forward to hosting you at IIT Mandi during the early-mid phase of the Himachali winters.
To make your visit comfortable, it is important that one respects the mountain weather.
Hence, we would earnestly request you to take notice of the points mentioned below:
Weather conditions
Temperatures at Mandi in December can range below 15°C during the day, and can go around or below 5°C in the evenings and nights.
Especially nights and early mornings can be quite cold.
Although there is less chance of rain (based on the current forecast), Mandi can get occasional rain or drizzles, when it snows at higher altitudes.
Clothing Recommendations
A three-layered clothing may be required at times. Please carry thermal innerwear (as a first layer), sweaters / jacket (as a third layer). It's good to have woolen accessories like caps, scarves, mufflers, gloves, to protect exposed areas from the cold. These may be needed especially if you plan to walk or hike in the early morning hours, before the conference sessions. If you plan to do some easy walks and hikes around the campus, ensure to wear shoes that you are usually used to. It is advisable to carry a small umbrella to be prepared for occasional rain or drizzle.
Medicine and Health Precautions
The institute has a health center, and a pharmacy, if you require professional care. However, just as a precautionary measure: Bring basic medicines for cold, cough, and fever Due to the air being dry and cold, carry moisturizer if you tend to use it. Carry any personal prescription medicines in sufficient quantity.
With these measures, we are confident that your stay at IIT Mandi during the ICVGIP will be very pleasant.
Stay warm, Breathe in the fresh air, and Enjoy the conference !
P.S. It does not snow at Mandi, which is at the altitude of approximately 3000 ft. However, if you plan to travel pre / post conference, higher places around Manali, Atal Tunnel, and Lahaul valley, Kinnaur valley usually get a lot of snowfall. Hence, you should carry an extra sweater or jacket for such travel plans.
Thanks
ICVGIP 2025 team
IIT Mandi is surrounded by several scenic and culturally rich tourist spots, perfect for exploration.
A serene lake with views of the Dhauladhar range, located 32 km from IIT Mandi. Perfect for a peaceful retreat in nature.
A religiously significant and scenic lake located 45 km from IIT Mandi, popular with both tourists and pilgrims.
Famous hill stations known for adventure, natural beauty, and vibrant culture, located around 55-95 km from IIT Mandi.
Renowned for paragliding and Tibetan monasteries, located 75 km from IIT Mandi.
A tranquil valley with lush green landscapes, located around 70 km from IIT Mandi.
A breathtaking waterfall in the Lahaul region, located about 40 km from Manali.
IIT Mandi North Campus
Parashar Road, Tehsil Sadar, Near Kataula, Kamand, Himachal Pradesh 175005