CVPR 2024 Workshop on
Representation Learning with Very Limited Images
-Zero-shot, Unsupervised, and Synthetic Learning in the Era of Big Models-
June 18th (PM), 2024 at Summit 324
The 2nd Workshop on LIMIT
We propose the 2nd workshop on “Representation Learning with Very Limited Images: Zero-shot, Unsupervised, and Synthetic Learning in the Era of Big Models” in conjunction with CVPR 2024. The current era of ‘foundation models’ relies heavily on enormous training datasets, often on the order of 100M samples or more. Such large-scale datasets tend to incur ethical issues, including societal bias, copyright infringement, and privacy violations, because data at that scale are effectively impossible to curate. In contrast, very-limited-data settings, such as self-supervised learning with a single image or synthetic pre-training with generated images, are free of these issues. Efforts to train visual/multi-modal models on very limited data resources have emerged independently from academic and industry communities around the world. This workshop aims to bring these communities together to form a collaborative effort and find brave new ideas.
Broader Impact
It was long an established assumption that learned representations had to be acquired from “human-annotated labels” on a “large amount of real data”, with the trained models then fine-tuned and applied to each visual task. However, large-scale datasets come with recurring problems: (i) biased datasets can lead to, e.g., gender and racial discrimination; (ii) access to public datasets has been suspended due to offensive labels; and (iii) ethically problematic images can be mixed into large-scale collections. As long as large-scale datasets of real images are used, these problems persist. Recent studies reveal that learning strategies using only very few real images [1], or supervision derived from a mathematical formula [2], can successfully acquire representations of how to see the real world. Moreover, models pre-trained on artificially generated data have outperformed ImageNet-21k pre-training [3] and have been found to be more robust [4]. Thus, self-supervised learning (SSL), formula-driven supervised learning (FDSL; see the sketch after the reference list below), and synthetic training in the very-limited-data setting can also yield DNN models with high accuracy and safety. In the era of foundation models, while these critical issues remain unresolved, there is growing attention on how pre-training can be achieved with very limited data, whether it is possible with synthetic images or generative models without any real images, and how adaptation can be carried out using zero/one/few-shot or very limited data. Although these topics have not yet attracted much attention in the computer vision field, they deserve focus: they are expected to provide a means to replace learning with uncurated real data and to resolve its ethical issues in the future.
- [1] Yuki M. Asano, Christian Rupprecht, Andrea Vedaldi, "A critical analysis of self-supervision, or what we can learn from a single image," ICLR 2020. [PDF] [Project]
- [2] Hirokatsu Kataoka, Kazushige Okayasu, Asato Matsumoto, Eisuke Yamagata, Ryosuke Yamada, Nakamasa Inoue, Akio Nakamura, Yutaka Satoh, "Pre-training without Natural Images," IJCV 2022. [PDF] [Project]
- [3] Hirokatsu Kataoka, Ryo Hayamizu, Ryosuke Yamada, Kodai Nakashima, Sora Takashima, Xinyu Zhang, Edgar Josafat Martinez-Noriega, Nakamasa Inoue, Rio Yokota, "Replacing Labeled Real-image Datasets with Auto-generated Contours," CVPR 2022. [PDF] [Project]
- [4] Dan Hendrycks, Andy Zou, Mantas Mazeika, Leonard Tang, Bo Li, Dawn Song, Jacob Steinhardt, "PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures," CVPR 2022. [PDF] [Project]
* The listed papers are by the workshop organizers or invited speakers
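To make the idea of formula-driven supervised learning [2] concrete, the sketch below generates labeled training images from an iterated function system (IFS) alone, with no real photographs and no human annotation. The map-sampling scheme, rendering constants, and helper names are illustrative assumptions, not the authors' exact FractalDB recipe.

# Minimal sketch of formula-driven supervised learning (FDSL) data generation,
# in the spirit of [2]. Parameter ranges and rendering details are illustrative
# assumptions, not the authors' exact FractalDB implementation.
import numpy as np

def sample_ifs(rng, n_maps=4):
    """Sample a random iterated function system (IFS): a set of 2D affine maps.
    In FDSL, the sampled parameters themselves define the class label."""
    maps = []
    for _ in range(n_maps):
        A = rng.uniform(-1.0, 1.0, size=(2, 2))
        A *= 0.8 / max(np.linalg.norm(A, 2), 1e-6)  # keep each map contractive
        b = rng.uniform(-1.0, 1.0, size=2)
        maps.append((A, b))
    return maps

def render_fractal(ifs, rng, n_points=50_000, size=256):
    """Render a binary fractal image with the 'chaos game': iterate a point
    through randomly chosen affine maps and rasterize its orbit."""
    img = np.zeros((size, size), dtype=np.uint8)
    x = np.zeros(2)
    for t in range(n_points):
        A, b = ifs[rng.integers(len(ifs))]
        x = A @ x + b
        if t > 20:  # skip burn-in iterations before the orbit reaches the attractor
            u, v = ((x + 5.0) / 10.0 * (size - 1)).astype(int)
            if 0 <= u < size and 0 <= v < size:
                img[v, u] = 255
    return img

# Each sampled IFS is one "class": labels come from the formula, not from humans.
rng = np.random.default_rng(42)
dataset = [(render_fractal(sample_ifs(rng), rng), label) for label in range(10)]

A pre-training dataset built this way is reproducible from a random seed and, by construction, free of the privacy and copyright concerns discussed above; a network can be trained on it with standard supervised classification before fine-tuning on the downstream task.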
Invited Talk 1: Phillip Isola (MIT)
Title: N=0: Learning Vision with Zero Visual Data [Slide]
Bio: Phillip Isola is the Class of 1948 Career Development Associate Professor in EECS at MIT. He studies computer vision, machine learning, robotics, and AI. He completed his Ph.D. in Brain & Cognitive Sciences at MIT, and has since spent time at UC Berkeley, OpenAI, and Google Research. His work has particularly impacted generative AI and self-supervised representation learning. Dr. Isola's research has been recognized by a Google Faculty Research Award, a PAMI Young Researcher Award, a Samsung AI Researcher of the Year Award, a Packard Fellowship, and a Sloan Fellowship. His teaching has been recognized by the Ruth and Joel Spira Award for Distinguished Teaching. His current research focuses on trying to scientifically understand human-like intelligence.
(Adapted from: http://web.mit.edu/phillipi/www/bio.html)
References:
- [1] Manel Baradad, Jonas Wulff, Tongzhou Wang, Phillip Isola, Antonio Torralba, "Learning to See by Looking at Noise," NeurIPS 2021. [PDF] [Project]
- [2] Manel Baradad, Chun-Fu Chen, Jonas Wulff, Tongzhou Wang, Rogerio Feris, Antonio Torralba, Phillip Isola, "Procedural Image Programs for Representation Learning," NeurIPS 2022. [PDF] [Project]
- [3] Pratyusha Sharma, Tamar Rott Shaham, Manel Baradad, Stephanie Fu, Adrian Rodriguez-Munoz, Shivam Duggal, Phillip Isola, Antonio Torralba, "A Vision Check-up for Language Models," CVPR 2024. [PDF] [Project]
Invited Talk 2: Zeynep Akata (Helmholtz Munich/TUM)
Title: Learning with Small Number of Images in Multimodal Large Language Models [Video]
Bio: Zeynep Akata is a Liesel Beckmann Distinguished Professor of Computer Science at the Technical University of Munich and the director of the Institute for Explainable Machine Learning at Helmholtz Munich. After completing her PhD at INRIA Rhône-Alpes with Prof Cordelia Schmid (2014), she worked as a post-doctoral researcher at the Max Planck Institute for Informatics with Prof Bernt Schiele (2014-17) and at the University of California, Berkeley with Prof Trevor Darrell (2016-17), and as an assistant professor at the University of Amsterdam with Prof Max Welling (2017-19). Before moving to Munich in 2024, she was a professor of computer science (W3) within the Cluster of Excellence Machine Learning at the University of Tübingen. She received a Lise Meitner Award for Excellent Women in Computer Science from the Max Planck Society in 2014, a young scientist honour from the Werner-von-Siemens-Ring foundation in 2019, an ERC-2019 Starting Grant from the European Commission, the DAGM German Pattern Recognition Award in 2021, the ECVA Young Researcher Award in 2022, and the Alfried Krupp Award in 2023. Her research interests include multimodal learning and explainable AI. (Adapted from: https://www.eml-unitue.de/people/zeynep-akata)
References:
- [1] Lisa Anne Hendricks, Ronghang Hu, Trevor Darrell, Zeynep Akata, "Grounding Visual Explanations," ECCV 2018. [PDF]
- [2] Stephan Alaniz, Diego Marcos, Bernt Schiele, Zeynep Akata, "Learning Decision Trees Recurrently Through Communication," CVPR 2021. [PDF]
- [3] Stephan Alaniz, Massimiliano Mancini, Anjan Dutta, Diego Marcos, Zeynep Akata, "Abstracting Sketches through Simple Primitives," ECCV 2022. [PDF]
- [4] Shyamgopal Karthik, Karsten Roth, Massimiliano Mancini, Zeynep Akata, "Vision-by-Language for Training-Free Compositional Image Retrieval," ICLR 2024. [PDF]
Program (Date: June 18th PM, Room: Summit 324)
Oral Session (14:20 - 15:00; Room: Summit 324)
Oral presenters have 10 minutes including questions. Oral presenters are also required to present at the poster session.
- Oral/Poster 1 [Panel#74]:
Kevin Zhang (Peking University), Zhiqiang Shen (MBZUAI)
- Oral/Poster 2 [Panel#75]:
Zhiwen Fan (University of Texas at Austin), Panwang Pan (ByteDance Inc), Peihao Wang (University of Texas at Austin), Yifan Jiang (University of Texas at Austin), Dejia Xu (University of Texas at Austin), Zhangyang Wang (University of Texas at Austin)
- Oral/Poster 3 [Panel#76]:
Sunny Soni (Universiteit van Amsterdam), Aaqib Saeed (Eindhoven University of Technology), Yuki M Asano (University of Amsterdam)
- Oral/Poster 4 [Panel#77]:
Xin Xing (University of Nebraska Omaha), Zhexiao Xiong (Washington University in St. Louis), Abby Stylianou (Saint Louis University), Srikumar Sastry (Washington University in St. Louis), Liyu Gong (Oracle Inc), Nathan Jacobs (Washington University in St. Louis)
Poster Session (16:30 - 17:30; Room: Arch Building 4E)
Posters will be 84” x 42” = 213 cm x 107 cm (WxH, aspect ratio 2:1, landscape format). If you want to use the CVPR logo on your poster, you can download it as a zip file here (from the CVPR official website).
- Poster 5 [Panel#78]:
Luyao Shi (IBM Research), Prashanth Vijayaraghavan (IBM Research), Ehsan Degan (IBM Research)
- Poster 6 [Panel#79]:
Mehmet Aygün (University of Edinburgh), Prithviraj Dhar (Meta), Zhicheng Yan (Meta Reality Labs), Oisin Mac Aodha (University of Edinburgh), Rakesh Ranjan (Meta)
- Poster 7 [Panel#80]:
Yuki Hirohashi (OMRON Corporation), Tsubasa Hirakawa (Chubu University), Takayoshi Yamashita (Chubu University), Hironobu Fujiyoshi (Chubu University)
- Poster 8 [Panel#81]:
Joshua Feinglass (Arizona State University), Jayaraman J. Thiagarajan (Lawrence Livermore National Laboratory), Rushil Anirudh (Amazon), T.S. Jayram (Lawrence Livermore National Laboratory), Yezhou Yang (Arizona State University)
- Poster 9 [Panel#82]:
Michael Majurski (NIST), Sumeet Menon (University of Maryland, Baltimore County), Parniyan Favardin (University of Miami), David R Chapman (University of Miami)
- Poster 10 [Panel#83]:
Anant Khandelwal (IIT Delhi)
- Poster 11 [Invited poster 1; ICCV 2023 accepted paper; Panel#84]:
Risa Shinoda (AIST/Kyoto University), Ryo Hayamizu (AIST/TDU), Kodai Nakashima (AIST/University of Tsukuba), Nakamasa Inoue (AIST/Tokyo Tech), Rio Yokota (AIST/Tokyo Tech), Hirokatsu Kataoka (AIST)
- Poster 12 [Invited poster 2; CVPR 2024 accepted paper; Panel#85]:
Stanislaw Szymanowicz, Christian Rupprecht, Andrea Vedaldi (University of Oxford)
- Poster 13 [Invited poster 3; CVPR 2024 accepted paper; Panel#86]:
Lukas Knobel (University of Amsterdam), Tengda Han (University of Oxford), Yuki M. Asano (University of Amsterdam)
- Poster 14 [Invited poster 4; CVPR 2024 accepted paper; Panel#87]:
Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek, Yuki M. Asano (University of Amsterdam)
- Poster 15 [Invited poster 5; CVPR 2024 accepted paper; Panel#88]:
Benjamin N. Chiche, Yuto Horikawa, Ryo Fujita (Rist Inc.)
Paper Submission / Call For Papers
- Guide for Authors. We invite original research papers. All submissions must be anonymized and formatted according to the CVPR 2024 template (recommended length: 4-8 pages excluding references): https://cvpr.thecvf.com/Conferences/2024/CallForPapers.
- Microsoft CMT Site. Please submit your paper here: https://cmt3.research.microsoft.com/CVPRLIMIT2024
Topics of interest include, but are not limited to:
- {Self, Semi, Weakly, Un}-supervised learning with very limited data
- {Zero, One, Few}-shot learning with very limited data
- Synthetic training with computer graphics
- Model training from generative models and/or formula-generated data
- Training ConvNets, vision transformers, and other models with relatively few resources; Vision+X and multi-modality to improve learning efficiency
- New datasets, benchmarks, and meta analysis
- Brave new ideas related to the above topics, or ideas that open a novel field
Important Dates
- Submission deadline: March 21st, 2024
- Notification: April 4th, 2024
- Camera-ready deadline: April 11th, 2024
- All dates follow Pacific Time (PT).
Organizers