Issue 4/2022
Special issue on cross-modal retrieval and analysis
Content (20 Articles)
Contrastive self-supervised learning: review, progress, challenges and future research directions
Pranjal Kumar, Piyush Rawat, Siddhartha Chauhan
Human pose estimation using deep learning: review, methodologies, progress and future research directions
Pranjal Kumar, Siddhartha Chauhan, Lalit Kumar Awasthi
Prototype local–global alignment network for image–text retrieval
Lingtao Meng, Feifei Zhang, Xi Zhang, Changsheng Xu
Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods
Zhengjie Huang, Zhenguang Liu, Jianhai Chen, Qinming He, Shuang Wu, Lei Zhu, Meng Wang
Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition
Ren Zhang, Ning He, Shengjie Liu, Ying Wu, Kang Yan, Yuzhe He, Ke Lu
Multi-aware coreference relation network for visual dialog
Zefan Zhang, Tianling Jiang, Chunping Liu, Yi Ji
Video deblurring and flow-guided feature aggregation for obstacle detection in agricultural videos
Keyang Cheng, Xuesen Zhu, Yongzhao Zhan, Yunshen Pei
TCKGE: Transformers with contrastive learning for knowledge graph embedding
Xiaowei Zhang, Quan Fang, Jun Hu, Shengsheng Qian, Changsheng Xu
FDAM: full-dimension attention module for deep convolutional neural networks
Silin Cai, Changping Wang, Jiajun Ding, Jun Yu, Jianping Fan
FCT: fusing CNN and transformer for scene classification
Yuxiang Xie, Jie Yan, Lai Kang, Yanming Guo, Jiahui Zhang, Xidao Luan
Semantic-aware visual scene representation
Mohammad Javad Parseh, Mohammad Rahmanimanesh, Parviz Keshavarzi, Zohreh Azimifar
Generative adversarial networks for 2D-based CNN pose-invariant face recognition
M. Kas, Y. El-merabet, Y. Ruichek, R. Messoussi
A novel method for video shot boundary detection using CNN-LSTM approach
Abdelhalim Benoughidene, Faiza Titouna
Visual and semantic ensemble for scene text recognition with gated dual mutual attention
Zhiguang Liu, Liangwei Wang, Jian Qiao
MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning
Junyan Yang, Jie Jiang, Yanming Guo
Gender classification from face images using central difference convolutional networks
Mohammadreza Sheikh Fathollahi, Rezvan Heidari
Tri-RAT: optimizing the attention scores for image captioning
You Yang, Yongzhi An, Juntao Hu, Longyue Pan
Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products
Stefanos-Iordanis Papadopoulos, Christos Koutlis, Symeon Papadopoulos, Ioannis Kompatsiaris
Similar interior coordination image retrieval with multi-view features
Ren Togo, Yuki Honma, Maiku Abe, Takahiro Ogawa, Miki Haseyama