Multimedia Systems 6/2023

Regular Paper

A customizable framework for multimodal emotion recognition using ensemble of deep neural network models

Chhavi Dixit, Shashank Mouli Satapathy

Regular Paper

Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning

Ce Zhang, Xiao Yao, Changfeng Shi, Min Gu

Regular Paper

Dental radiology: a convolutional neural network-based approach to detect dental disorders from dental images in a real-time environment

Humaira Shafiq, Ghulam Gilanie, Muhammad Sajid, Muhammad Ahsan

Regular Paper

A deep learning image inpainting method based on stationary wavelet transform

Yuhan Huang, Jiacheng Lu, Nianzhe Chen, Hui Ding, Yuanyuan Shang

Regular Paper

A LiDAR point cloud registration method combining linear feature extraction and TrICP algorithm

Chuanwang Wen, Shucheng Huang

Regular Paper

Image captioning for cultural artworks: a case study on ceramics

Baoying Zheng, Fang Liu, Mohan Zhang, Tongqing Zhou, Shenglan Cui, Yunfan Ye, Yeting Guo

Regular Paper

Hotspot defect detection for photovoltaic modules under complex backgrounds

Huimin Qian, Wenyu Shen, Zhengqi Wang, Shuwei Xu

Regular Paper

TFA-CNN: an efficient method for dealing with crowding and noise problems in crowd counting

Liyan Xiong, Zhida Li, Xiaohui Huang, Yijuan Zeng, Peng Huang

Regular Paper

Adversarial training in logit space against tiny perturbations

Xiaohui Guan, Qiqi Shao, Yaguan Qian, Tengteng Yao, Bin Wang

Regular Paper

Generative adversarial text-to-image generation with style image constraint

Zekang Wang, Li Liu, Huaxiang Zhang, Dongmei Liu, Yu Song

Regular Paper

“Tomato-Village”: a dataset for end-to-end tomato disease detection in a real-world environment

Mamta Gehlot, Rakesh Kumar Saxena, Geeta Chhabra Gandhi

Regular Paper

YOLO-ERF: lightweight object detector for UAV aerial images

Xin Wang, Ning He, Chen Hong, Fengxi Sun, Wenjing Han, Qi Wang

Regular Paper

Clustering by sparse orthogonal NMF and interpretable neural network

Yongwei Gai, Jinglei Liu

Regular Paper

Class-agnostic counting with feature augmentation and similarity comparison

Mingju Shao, Guodong Wang

Regular Paper

Image compression with learned lifting-based DWT and learned tree-based entropy models

Ugur Berk Sahin, Fatih Kamisli

Regular Paper

A new adaptive VR-based exergame for hand rehabilitation after stroke

Amal Bouatrous, Abdelkrim Meziane, Nadia Zenati, Chafiaa Hamitouche

Regular Paper

Bio-Inspired ensemble feature selection and deep auto-encoder approach for rapid diagnosis of breast cancer

V. Praveena, L. R. Sujithra, S. Karthik, M. S. Kavitha

Regular Paper

Narrowing the variance of variational cross-encoder for cross-modal hashing

Dayong Tian, Yiqin Cao, Yiwen Wei, Deyun Zhou

Regular Paper

G-UNeXt: a lightweight MLP-based network for reducing semantic gap in medical image segmentation

Xin Zhang, Xiaotian Cao, Jun Wang, Lei Wan

Regular Paper

Integrated document segmentation and region identification: textual, equation and graphical

Jennil Thiyam, Sanasam Ranbir Singh, Prabin Kumar Bora

Regular Paper

Improving transferable adversarial attack for vision transformers via global attention and local drop

Tuo Li, Yahong Han

Regular Paper

Interactive video retrieval in the age of effective joint embedding deep models: lessons from the 11th VBS

Jakub Lokoč, Stelios Andreadis, Werner Bailer, Aaron Duane, Cathal Gurrin, Zhixin Ma, Nicola Messina, Thao-Nhu Nguyen, Ladislav Peška, Luca Rossetto, Loris Sauter, Konstantin Schall, Klaus Schoeffmann, Omar Shahbaz Khan, Florian Spiess, Lucia Vadicamo, Stefanos Vrochidis

Open Access Regular Paper

A social-aware video sharing solution using demand prediction of epidemic-based propagation in wireless networks

Shijie Jia, Yan Cui, Xiaoyan Su, Zongzheng Liang

Regular Paper

Lite general network and MagFace CNN for micro-expression spotting in long videos

Quan-Lin Gu, Sai Yang, Tianxing Yu

Regular Paper

A viewpoint-guided prototype network for 3D shape classification

Li Han, Jinhai He, Feng Dou, Huiwen Ma, Xinyang Xie, Wanwen Yang

Regular Paper

Deep portrait matting via double-grained segmentation

Zhiwei Ma, Guilin Yao

Regular Paper

Student engagement detection in online environment using computer vision and multi-dimensional feature fusion

Nan Xie, Zhaojie Liu, Zhengxu Li, Wei Pang, Beier Lu

Regular Paper

Dual-branch spectral–spatial feature extraction network for multispectral image compression

Fanqiang Kong, Jiahui Tang, Yunsong Li, Dan Li, Kedi Hu

Regular Paper

Hierarchical multiples self-attention mechanism for multi-modal analysis

Wu Jun, Zhu Tianliang, Zhu Jiahui, Li Tianyi, Wang Chunzhi

Regular Paper

A multi-scale feature fusion spatial–channel attention model for background subtraction

Yizhong Yang, Tingting Xia, Dajin Li, Zhang Zhang, Guangjun Xie

Regular Paper

Audio–text retrieval based on contrastive learning and collaborative attention mechanism

Tao Hu, Xuyu Xiang, Jiaohua Qin, Yun Tan

Regular Paper

Workpiece tracking based on improved SiamFC++ and virtual dataset

Kaisi Yang, Lianyu Zhao, Chenglin Wang

Regular Paper

Multi-behavior recommendation based on intent learning

Xinglin Pan, Mingxin Gan

Regular Paper

Multi-aggregation network based on non-separable lifting wavelet for single image deraining

Bin Liu, Siyan Fang

Regular Paper

A two-stage attention augmented fully convolutional network-based dynamic video summarization

Deeksha Gupta, Akashdeep Sharma

Regular Paper

MAF-Net: multidimensional attention fusion network for multichannel speech separation

Honglin Li, Qinghua Huang

Regular Paper

Compression of face images using meta-heuristic algorithms based on curvelet transform with variable bit allocation

Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu

Regular Paper

Artistic image adversarial attack via style perturbation

Haiyan Zhang, Quan Wang, Guorui Feng

Regular Paper

Owner named entity recognition in website based on multidimensional text guidance and space alignment co-attention

Xin Zheng, Xin He, Yimo Ren, Jinfa Wang, Junyang Yu

Special Issue Paper

Learning intra-inter-modality complementary for brain tumor segmentation

Jiangpeng Zheng, Fan Shi, Meng Zhao, Chen Jia, Congcong Wang

Special Issue Paper

A comprehensive survey on deep-learning-based visual captioning

Bowen Xin, Ning Xu, Yingchen Zhai, Tingting Zhang, Zimu Lu, Jing Liu, Weizhi Nie, Xuanya Li, An-An Liu

Special Issue Paper

Asymmetric bi-encoder for image–text retrieval

Wei Xiong, Haoliang Liu, Siya Mi, Yu Zhang

Special Issue Paper

CTNet: hybrid architecture based on CNN and transformer for image inpainting detection

Fengjun Xiao, Zhuxi Zhang, Ye Yao

Special Issue Paper

Identification of haploid and diploid maize seeds using hybrid transformer model

Emrah Dönmez, Serhat Kılıçarslan, Cemil Közkurt, Aykut Diker, Fahrettin Burak Demir, Abdullah Elen

Open Access Special Issue Paper

LET-Net: locally enhanced transformer network for medical image segmentation

Na Ta, Haipeng Chen, Xianzhu Liu, Nuo Jin

Special Issue Paper

Inceptr: micro-expression recognition integrating inception-CBAM and vision transformer

Haoliang Zhou, Shucheng Huang, Yuqiao Xu

Special Issue Paper

Images denoising for COVID-19 chest X-ray based on multi-scale parallel convolutional neural network

Noor Ahmed, Rozina, Ahmad Ali, Abdul Raziq

Open Access Special Issue Paper

View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer

Jiacheng Chang, Lanyong Zhang, Zhuang Shao

Special Issue Paper

Variable bit allocation method based on meta-heuristic algorithms for facial image compression

Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu

Special Issue Paper

A novel study for automatic two-class COVID-19 diagnosis (between COVID-19 and Healthy, Pneumonia) on X-ray images using texture analysis and 2-D/3-D convolutional neural networks

Huseyin Yaşar, Murat Ceylan

Special Issue Paper

Stories of love and violence: zero-shot interesting events’ classification for unsupervised TV series summarization

Alison Reboud, Ismail Harrando, Pasquale Lisena, Raphaël Troncy

Correction

Correction to: DL-CNN-based approach with image processing techniques for diagnosis of retinal diseases

Akash Tayal, Jivansha Gupta, Arun Solanki, Khyati Bisht, Anand Nayyar, Mehedi Masud

Correction

Correction: Comprehensive systematic review on virtual reality for cultural heritage practices: coherent taxonomy and motivations

Hwei Teeng Chong, Chen Kim Lim, Ahmad Rafi, Kian Lam Tan, Mazlin Mokhtar

Content (53 Articles)