Issue 6/2023
Content (53 Articles)
A customizable framework for multimodal emotion recognition using ensemble of deep neural network models
Chhavi Dixit, Shashank Mouli Satapathy
Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning
Ce Zhang, Xiao Yao, Changfeng Shi, Min Gu
Dental radiology: a convolutional neural network-based approach to detect dental disorders from dental images in a real-time environment
Humaira Shafiq, Ghulam Gilanie, Muhammad Sajid, Muhammad Ahsan
A deep learning image inpainting method based on stationary wavelet transform
Yuhan Huang, Jiacheng Lu, Nianzhe Chen, Hui Ding, Yuanyuan Shang
A LiDAR point cloud registration method combining linear feature extraction and TrICP algorithm
Chuanwang Wen, Shucheng Huang
Image captioning for cultural artworks: a case study on ceramics
Baoying Zheng, Fang Liu, Mohan Zhang, Tongqing Zhou, Shenglan Cui, Yunfan Ye, Yeting Guo
Hotspot defect detection for photovoltaic modules under complex backgrounds
Huimin Qian, Wenyu Shen, Zhengqi Wang, Shuwei Xu
TFA-CNN: an efficient method for dealing with crowding and noise problems in crowd counting
Liyan Xiong, Zhida Li, Xiaohui Huang, Yijuan Zeng, Peng Huang
Adversarial training in logit space against tiny perturbations
Xiaohui Guan, Qiqi Shao, Yaguan Qian, Tengteng Yao, Bin Wang
Generative adversarial text-to-image generation with style image constraint
Zekang Wang, Li Liu, Huaxiang Zhang, Dongmei Liu, Yu Song
“Tomato-Village”: a dataset for end-to-end tomato disease detection in a real-world environment
Mamta Gehlot, Rakesh Kumar Saxena, Geeta Chhabra Gandhi
YOLO-ERF: lightweight object detector for UAV aerial images
Xin Wang, Ning He, Chen Hong, Fengxi Sun, Wenjing Han, Qi Wang
Clustering by sparse orthogonal NMF and interpretable neural network
Yongwei Gai, Jinglei Liu
Class-agnostic counting with feature augmentation and similarity comparison
Mingju Shao, Guodong Wang
Image compression with learned lifting-based DWT and learned tree-based entropy models
Ugur Berk Sahin, Fatih Kamisli
A new adaptive VR-based exergame for hand rehabilitation after stroke
Amal Bouatrous, Abdelkrim Meziane, Nadia Zenati, Chafiaa Hamitouche
Bio-Inspired ensemble feature selection and deep auto-encoder approach for rapid diagnosis of breast cancer
V. Praveena, L. R. Sujithra, S. Karthik, M. S. Kavitha
Narrowing the variance of variational cross-encoder for cross-modal hashing
Dayong Tian, Yiqin Cao, Yiwen Wei, Deyun Zhou
G-UNeXt: a lightweight MLP-based network for reducing semantic gap in medical image segmentation
Xin Zhang, Xiaotian Cao, Jun Wang, Lei Wan
Integrated document segmentation and region identification: textual, equation and graphical
Jennil Thiyam, Sanasam Ranbir Singh, Prabin Kumar Bora
Improving transferable adversarial attack for vision transformers via global attention and local drop
Tuo Li, Yahong Han
Interactive video retrieval in the age of effective joint embedding deep models: lessons from the 11th VBS
Jakub Lokoč, Stelios Andreadis, Werner Bailer, Aaron Duane, Cathal Gurrin, Zhixin Ma, Nicola Messina, Thao-Nhu Nguyen, Ladislav Peška, Luca Rossetto, Loris Sauter, Konstantin Schall, Klaus Schoeffmann, Omar Shahbaz Khan, Florian Spiess, Lucia Vadicamo, Stefanos Vrochidis
A social-aware video sharing solution using demand prediction of epidemic-based propagation in wireless networks
Shijie Jia, Yan Cui, Xiaoyan Su, Zongzheng Liang
Lite general network and MagFace CNN for micro-expression spotting in long videos
Quan-Lin Gu, Sai Yang, Tianxing Yu
A viewpoint-guided prototype network for 3D shape classification
Li Han, Jinhai He, Feng Dou, Huiwen Ma, Xinyang Xie, Wanwen Yang
Student engagement detection in online environment using computer vision and multi-dimensional feature fusion
Nan Xie, Zhaojie Liu, Zhengxu Li, Wei Pang, Beier Lu
Dual-branch spectral–spatial feature extraction network for multispectral image compression
Fanqiang Kong, Jiahui Tang, Yunsong Li, Dan Li, Kedi Hu
Hierarchical multiples self-attention mechanism for multi-modal analysis
Wu Jun, Zhu Tianliang, Zhu Jiahui, Li Tianyi, Wang Chunzhi
A multi-scale feature fusion spatial–channel attention model for background subtraction
Yizhong Yang, Tingting Xia, Dajin Li, Zhang Zhang, Guangjun Xie
Audio–text retrieval based on contrastive learning and collaborative attention mechanism
Tao Hu, Xuyu Xiang, Jiaohua Qin, Yun Tan
Workpiece tracking based on improved SiamFC++ and virtual dataset
Kaisi Yang, Lianyu Zhao, Chenglin Wang
Multi-aggregation network based on non-separable lifting wavelet for single image deraining
Bin Liu, Siyan Fang
A two-stage attention augmented fully convolutional network-based dynamic video summarization
Deeksha Gupta, Akashdeep Sharma
MAF-Net: multidimensional attention fusion network for multichannel speech separation
Honglin Li, Qinghua Huang
Compression of face images using meta-heuristic algorithms based on curvelet transform with variable bit allocation
Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu
Artistic image adversarial attack via style perturbation
Haiyan Zhang, Quan Wang, Guorui Feng
Owner named entity recognition in website based on multidimensional text guidance and space alignment co-attention
Xin Zheng, Xin He, Yimo Ren, Jinfa Wang, Junyang Yu
Learning intra-inter-modality complementary for brain tumor segmentation
Jiangpeng Zheng, Fan Shi, Meng Zhao, Chen Jia, Congcong Wang
A comprehensive survey on deep-learning-based visual captioning
Bowen Xin, Ning Xu, Yingchen Zhai, Tingting Zhang, Zimu Lu, Jing Liu, Weizhi Nie, Xuanya Li, An-An Liu
Asymmetric bi-encoder for image–text retrieval
Wei Xiong, Haoliang Liu, Siya Mi, Yu Zhang
CTNet: hybrid architecture based on CNN and transformer for image inpainting detection
Fengjun Xiao, Zhuxi Zhang, Ye Yao
Identification of haploid and diploid maize seeds using hybrid transformer model
Emrah Dönmez, Serhat Kılıçarslan, Cemil Közkurt, Aykut Diker, Fahrettin Burak Demir, Abdullah Elen
LET-Net: locally enhanced transformer network for medical image segmentation
Na Ta, Haipeng Chen, Xianzhu Liu, Nuo Jin
Inceptr: micro-expression recognition integrating inception-CBAM and vision transformer
Haoliang Zhou, Shucheng Huang, Yuqiao Xu
Images denoising for COVID-19 chest X-ray based on multi-scale parallel convolutional neural network
Noor Ahmed, Rozina, Ahmad Ali, Abdul Raziq
View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer
Jiacheng Chang, Lanyong Zhang, Zhuang Shao
Variable bit allocation method based on meta-heuristic algorithms for facial image compression
Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu
A novel study for automatic two-class COVID-19 diagnosis (between COVID-19 and Healthy, Pneumonia) on X-ray images using texture analysis and 2-D/3-D convolutional neural networks
Huseyin Yaşar, Murat Ceylan
Stories of love and violence: zero-shot interesting events’ classification for unsupervised TV series summarization
Alison Reboud, Ismail Harrando, Pasquale Lisena, Raphaël Troncy
Correction to: DL‑CNN‑based approach with image processing techniques for diagnosis of retinal diseases
Akash Tayal, Jivansha Gupta, Arun Solanki, Khyati Bisht, Anand Nayyar, Mehedi Masud
Correction: Comprehensive systematic review on virtual reality for cultural heritage practices: coherent taxonomy and motivations
Hwei Teeng Chong, Chen Kim Lim, Ahmad Rafi, Kian Lam Tan, Mazlin Mokhtar