Ausgabe 4/2024
Inhalt (22 Artikel)
Guest Editorial: Special Issue on the Promises and Dangers of Large Vision Models
Kaiyang Zhou, Ziwei Liu, Xiaohua Zhai, Chunyuan Li, Kate Saenko
Towards a Unified Network for Robust Monocular Depth Estimation: Network Architecture, Training Strategy and Dataset
Mochu Xiang, Yuchao Dai, Feiyu Zhang, Jiawei Shi, Xinyu Tian, Zhensong Zhang
A General Paradigm with Detail-Preserving Conditional Invertible Network for Image Fusion
Wu Wang, Liang-Jian Deng, Ran Ran, Gemine Vivone
FlowNAS: Neural Architecture Search for Optical Flow Estimation
Zhiwei Lin, Tingting Liang, Taihong Xiao, Yongtao Wang, Ming-Hsuan Yang
DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes
Shengyu Hao, Peiyuan Liu, Yibing Zhan, Kaixun Jin, Zuozhu Liu, Mingli Song, Jenq-Neng Hwang, Gaoang Wang
Symmetry-aware Neural Architecture for Embodied Visual Navigation
Shuang Liu, Masanori Suganuma, Takayuki Okatani
Language-Aware Soft Prompting: Text-to-Text Optimization for Few- and Zero-Shot Adaptation of V &L Models
Adrian Bulat, Georgios Tzimiropoulos
SegViT v2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers
Bowen Zhang, Liyang Liu, Minh Hieu Phan, Zhi Tian, Chunhua Shen, Yifan Liu
A Deeper Analysis of Volumetric Relightable Faces
Pramod Rao, B. R. Mallikarjun, Gereon Fox, Tim Weyrich, Bernd Bickel, Hanspeter Pfister, Wojciech Matusik, Fangneng Zhan, Ayush Tewari, Christian Theobalt, Mohamed Elgharib
InstaFormer++: Multi-Domain Instance-Aware Image-to-Image Translation with Transformer
Soohyun Kim, Jongbeom Baek, Jihye Park, Eunjae Ha, Homin Jung, Taeyoung Lee, Seungryong Kim
Local Compressed Video Stream Learning for Generic Event Boundary Detection
Libo Zhang, Xin Gu, Congcong Li, Tiejian Luo, Heng Fan
Learning Portrait Drawing with Unsupervised Parts
Burak Tasdemir, Mustafa Goktan Gudukbay, Dogac Eldenk, Adil Meric, Aysegul Dundar
Skeleton Ground Truth Extraction: Methodology, Annotation Tool and Benchmarks
Cong Yang, Bipin Indurkhya, John See, Bo Gao, Yan Ke, Zeyd Boukhers, Zhenyu Yang, Marcin Grzegorzek
Cascaded Iterative Transformer for Jointly Predicting Facial Landmark, Occlusion Probability and Head Pose
Yaokun Li, Guang Tan, Chao Gou
Universal Object Detection with Large Vision Model
Feng Lin, Wenze Hu, Yaowei Wang, Yonghong Tian, Guangming Lu, Fanglin Chen, Yong Xu, Xiaoyu Wang
Harmonizing Base and Novel Classes: A Class-Contrastive Approach for Generalized Few-Shot Segmentation
Weide Liu, Zhonghua Wu, Yang Zhao, Yuming Fang, Chuan-Sheng Foo, Jun Cheng, Guosheng Lin
Going Deeper into Recognizing Actions in Dark Environments: A Comprehensive Benchmark Study
Yuecong Xu, Haozhi Cao, Jianxiong Yin, Zhenghua Chen, Xiaoli Li, Zhengguo Li, Qianwen Xu, Jianfei Yang
Learning Robust Multi-scale Representation for Neural Radiance Fields from Unposed Images
Nishant Jain, Suryansh Kumar, Luc Van Gool
CCR: Facial Image Editing with Continuity, Consistency and Reversibility
Nan Yang, Xin Luan, Huidi Jia, Zhi Han, Xiaofeng Li, Yandong Tang
PartCom: Part Composition Learning for 3D Open-Set Recognition
Tingyu Weng, Jun Xiao, Hao Pan, Haiyong Jiang
Adapting Across Domains via Target-Oriented Transferable Semantic Augmentation Under Prototype Constraint
Mixue Xie, Shuang Li, Kaixiong Gong, Yulin Wang, Gao Huang