Issue 1/2024
Content (60 Articles)
SwinCT: feature enhancement based low-dose CT images denoising with swin transformer
Muwei Jian, Xiaoyang Yu, Haoran Zhang, Chengdong Yang
Improving the application performance of Loki via algorithm optimization
Wenming Zhu, Wenjing Su, Kai Yang, Hao Chen
Yolov5s-MSD: a multi-scale ship detector for visible video image
Yan-Tong Chen, Yan-Yan Zhang, Jia-Liang Wang, Yang Liu
An automatic music generation method based on RSCLN_Transformer network
Yumei Zhang, Xiaojiao Lv, Qi Li, Xiaojun Wu, Yuping Su, Honghong Yang
Occluded pedestrian re-identification via Res-ViT double-branch hybrid network
Yunbin Zhao, Songhao Zhu
Rendering acceleration based on JND-guided sampling prediction
Ripei Zhang, Chunyi Chen, Zhongye Shen, Jun Peng, Minghui Ma
Tensorial multi-view subspace clustering with side constraints for elevator security warning
Huangzhen Xu, Licheng Ruan, Yuzhou Ni, Hongwei Yin, Ping Yu, Xinmin Cheng
Multi-label neural architecture search for chest radiography image classification
Yi Yang, Jiaxuan Wei, Zhixuan Yu, Ruisheng Zhang
Frequency disentangled residual network
Satya Rajendra Singh, Roshan Reddy Yedla, Shiv Ram Dubey, Rakesh Kumar Sanodiya, Wei-Ta Chu
Balanced sentimental information via multimodal interaction model
Yuanyi Luo, Rui Wu, Jiafeng Liu, Xianglong Tang
Dy-MIL: dynamic multiple-instance learning framework for video anomaly detection
Chen Li, Mo Chen
Underwater acoustic target recognition based on knowledge distillation under working conditions mismatching
Shuang Yang, Anqi Jin, Xiangyang Zeng, Haitao Wang, Xi Hong, Menghui Lei
Facial expression intensity estimation using label-distribution-learning-enhanced ordinal regression
Ruyi Xu, Zhun Wang, Jingying Chen, Longpu Zhou
Video question answering via traffic knowledge database and question classification
Xiaoyong Sun, Yu Dai, Yuchen Wang, Weifeng Ma, Xuefen Lin
A real-time camera-based gaze-tracking system involving dual interactive modes and its application in gaming
He Zhang, Lu Yin, Hanling Zhang
SV2-SQL: a text-to-SQL transformation mechanism based on BERT models for slot filling, value extraction, and verification
Chih-Yung Chang, Yuan-Lin Liang, Shih-Jung Wu, Diptendu Sinha Roy
Universal unsupervised cross-domain 3D shape retrieval
Heyu Zhou, Fan Wang, Qipei Liu, Jiayu Li, Wen Liu, Xuanya Li, An-An Liu
AF-FPN: an attention-guided enhanced feature pyramid network for breakwater armor layer unit segmentation
Linchun Gao, Shoujun Wang, Songgui Chen, Yuanye Hu
DiffuseRoll: multi-track multi-attribute music generation based on diffusion model
Hongfei Wang, Yi Zou, Haonan Cheng, Long Ye
Learning scale-aware relationships via Laplacian decomposition-based transformer for 3D human pose estimation
Jeonghwan Kim, Hyukmin Kwon, Seong Yong Lim, Wonjun Kim
ITrans: generative image inpainting with transformers
Wei Miao, Lijun Wang, Huchuan Lu, Kaining Huang, Xinchu Shi, Bocong Liu
A novel hashing-inverted index for secure content-based retrieval with massive encrypted speeches
Yingjie Hu, Qiuyu Zhang, Qiwen Zhang, Yugui Jia
Coarse registration of point cloud base on deep local extremum detection and attentive description
Haotian Lu, Jianhui Nie
Enhanced 3D reconstruction with all-neighbor-first philosophy and Ricci flow-based mesh smoothing approach
Mriganka Sarmah, Arambam Neelima
AI and data-driven media analysis of TV content for optimised digital content marketing
Lyndon Nixon, Konstantinos Apostolidis, Evlampios Apostolidis, Damianos Galanopoulos, Vasileios Mezaris, Basil Philipp, Rasa Bocyte
Underwater image enhancement method based on a cross attention mechanism
Sunhan Xu, Jinhua Wang, Ning He, Xin Hu, Fengxi Sun
A plug-and-play image enhancement model for end-to-end object detection in low-light condition
Jiaojiao Yuan, Yongli Hu, Yanfeng Sun, Boyue Wang, Baocai Yin
A simple spatial domain method for quality evaluation of blurred images
Md Amir Baig, Athar A. Moinuddin, E. Khan
Event log anomaly detection method based on auto-encoder and control flow
Daoyu Kan, Xianwen Fang
NDAM-YOLOseg: a real-time instance segmentation model based on multi-head attention mechanism
Chengang Dong, Yuhao Tang, Liyan Zhang
Generalizing to unseen domains via PatchMix
Juncheng Yang, Zuchao Li, Chao Li, Shuai Xie, Wei Yu, Shijun Li
One-step graph-based incomplete multi-view clustering
Baishun Zhou, Jintian Ji, Zhibin Gu, Zihao Zhou, Gangyi Ding, Songhe Feng
Detecting facial manipulated images via one-class domain generalization
Pengxiang Xu, Zhiyuan Ma, Xue Mei, Jie Shen
Locally controllable network based on visual–linguistic relation alignment for text-to-image generation
Zaike Li, Li Liu, Huaxiang Zhang, Dongmei Liu, Yu Song, Boqun Li
Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network
Gang Lv, Yining Sun, Fudong Nian
HCNNet: hybrid convolution neural network for automatic identification of ischaemia in diabetic foot ulcer wounds
Sujit Kumar Das, Suyel Namasudra, Arun Kumar Sangaiah
MCLEMCD: multimodal collaborative learning encoder for enhanced music classification from dances
Wenjuan Gong, Qingshuang Yu, Haoran Sun, Wendong Huang, Peng Cheng, Jordi Gonzàlez
Lightweight image super-resolution based on stepwise feedback mechanism and multi-feature maps fusion
Xu Yao, Houjin Chen, Yanfeng Li, Jia Sun, Jiayu Wei
A comparative study of color quantization methods using various image quality assessment indices
María-Luisa Pérez-Delgado, M. Emre Celebi
Weighted bilinear factorization of low-rank matrix with structural smoothness for image denoising
Wanhong Wu, Zikai Wu, Hongjuan Zhang
Generalizing sentence-level lipreading to unseen speakers: a two-stream end-to-end approach
Yu Li, Feng Xue, Lin Wu, Yincen Xie, Shujie Li
STSD: spatial–temporal semantic decomposition transformer for skeleton-based action recognition
Hu Cui, Tessai Hayama
Real-walk modelling: deep learning model for user mobility in virtual reality
Murtada Dohan, Mu Mu, Suraj Ajit, Gary Hill
Adequately hierarchical patterns based on pairwise regions
Thanh Tuan Nguyen, Thanh Phuong Nguyen, Frédéric Bouchara
An ensemble pruning method considering classifiers’ interaction based on information theory for facial expression recognition
Yiqing Wu, Danyang Li, Xing Chen, Yumei Tang, Shisong Huang
Bag of states: a non-sequential approach to video-based engagement measurement
Ali Abedi, Chinchu Thomas, Dinesh Babu Jayagopi, Shehroz S. Khan
BENet: bi-directional enhanced network for image captioning
Peixin Yan, Zuoyong Li, Rong Hu, Xinrong Cao
An entropy-weighted local intensity clustering-based model for segmenting intensity inhomogeneous images
Wei-Ting Liao, Suh-Yuh Yang, Cheng-Shu You
GVA: guided visual attention approach for automatic image caption generation
Md. Bipul Hossen, Zhongfu Ye, Amr Abdussalam, Md. Imran Hossain
Depth alignment interaction network for camouflaged object detection
Hongbo Bi, Yuyu Tong, Jiayuan Zhang, Cong Zhang, Jinghui Tong, Wei Jin
A multi-layer mesh synchronized reversible data hiding algorithm on the 3D model
Guoyou Zhang, Zheyu Sui, Chaoli Sun, Qi Liu, Xiaoxue Cheng
You watch once more: a more effective CNN architecture for video spatio-temporal action localization
Yefeng Qin, Lei Chen, Xianye Ben, Mingqiang Yang
Object-based video anomaly detection using multi-attention and adaptive velocity attribute representation learning
Xiaopeng Ren, Huifen Xia, Yongzhao Zhan
Ecarnet: enhanced clue-ambiguity reasoning network for multimodal fake news detection
Shannan Zhong, Shujuan Peng, Xin Liu, Lei Zhu, Xing Xu, Taihao Li
A defensive attention mechanism to detect deepfake content across multiple modalities
S. Asha, P. Vinod, Varun G. Menon
An insight into topological, machine and Deep Learning-based approaches for influential node identification in social media networks: a systematic review
Yasir Rashid, Javaid Iqbal Bhat
MVIndEmo: a dataset for micro video public-induced emotion prediction on social media
Zhenhua Guo, Qi Jia, Baoyu Fan, Di Wang, Cong Xu, Yanwei Wang, Yaqian Zhao, Rengang Li
Weighted sparse gradient reconstruction model with a robust fidelity for edge-aware image smoothing
Lanling Zeng, Yucheng Chen, Yang Yang
Attribute- and attention-guided few-shot classification
Ziquan Wang, Hui Li, Zikai Zhang, Feng Chen, Jia Zhai