Skip to main content

International Journal of Data Science and Analytics OnlineFirst articles

22-04-2024 | Regular Paper

Enhancing author assessment: an advanced modified recursive elimination technique (MRET) for ranking key parameters and conducting statistical analysis of top-ranked parameter

Assessing the impact of authors in scientific research is crucial for evaluating scholarly contributions. Various parameters exist in the literature to quantify researchers’ productivity, such as publication count, citation count, and the h index.

Authors:
Ghulam Mustafa, Abid Rauf, Muhammad Tanvir Afzal

20-04-2024 | Regular Paper

Building the interpolating model for interval time series based on the fuzzy clustering technique

In the development of the social-economics of countries, time series is a data type stored commonly nowadays. For these data, forecasting has always received the attention of statisticians and managers because it brings to great advantages.

Authors:
Dan Nguyen-Thihong, Loc Tran-Phuoc, Tai Vo-Van

20-04-2024 | Review

Real-time anomaly detection in sky quality meter data using probabilistic exponential weighted moving average

Light pollution is a problem that impacts many elements of human life and the environment, including astronomical observations. The authors of this work offer a unique method for detecting anomalies in night sky brightness data recorded using a …

Authors:
Lala Septem Riza, Zulfikar Ali Yunara Putra, Muhammad Iqbal Zain, Fajar Zuliansyah Trihutama, Judhistira Aria Utama, Khyrina Airin Fariza Abu Samah, Dhani Herdiwijaya, Rinto Anugraha NQZ, Emanuel Sungging Mumpuni, Rhorom Priyatikanto

Open Access 16-04-2024 | Regular Paper

Enhancing the Vietoris–Rips simplicial complex for topological data analysis: applications in cancer gene expression datasets

The aim of this study is to enhance the extraction of informative features from complex data through the application of topological data analysis (TDA) using novel topological overlapping measures. Topological data analysis has emerged as a …

Authors:
Lebohang Mashatola, Zubayr Kader, Naaziyah Abdulla, Mandeep Kaur

15-04-2024 | Review

Predicting the pharmaceutical needs of hospitals using machine learning algorithms

People’s lives are always threatened by various diseases. The role of health and medical services, in particular medicine, is undeniable in protecting their lives. Timely preparation and providing medicine for patients is vital since medicine …

Authors:
Amir Hossein Nabizadeh, Mohammad Mehdi Ghaemi, Daniel Goncalves

14-04-2024 | Regular Paper

K-means DTW Barycenter Averaging: a clustering analysis of COVID-19 cases and deaths on the Brazilian federal units

A challenge faced while monitoring the COVID-19 pandemic in Brazil is the identification of patterns of incidence and mortality, which can help prioritize interventions to avoid excessive disease transmission and associated deaths. This study …

Authors:
Jonatas Silva do Espirito Santo, Jackson Santos da Conceição, Lilia Carolina Carneiro da Costa, Rosemeire Leovigildo Fiaccone, Marcos Ennes Barreto, Maria Yury Ichihara, Anderson Ara

13-04-2024 | Regular Paper

A common-specific feature cross-fusion attention mechanism for KGVQA

Knowledge graph-based visual question answering aims to utilize the information in the knowledge graph to assist in answering complex questions that are difficult to answer based on image features alone. However, using knowledge graphs increases …

Authors:
Mingyang Ma, Turdi Tohti, Askar Hamdulla

Open Access 13-04-2024 | Review

Objective metrics for ethical AI: a systematic literature review

The field of AI Ethics has recently gained considerable attention, yet much of the existing academic research lacks practical and objective contributions for the development of ethical AI systems. This systematic literature review aims to identify …

Authors:
Guilherme Palumbo, Davide Carneiro, Victor Alves

12-04-2024 | Regular Paper

An efficient join operations for utility list-based high-utility mining approaches using hybrid search technique

Frequent itemset mining (FIM) has firmly established itself as a pivotal and reliable tool in the realm of business analytics, enabling the systematic discovery of valuable patterns and association rules within extensive datasets. However, FIM has …

Authors:
Rashmin Gajera, Suresh Patel, Khushbu Madhani, Ayush Solanki

Open Access 10-04-2024 | Regular Paper

Pseudo datasets explain artificial neural networks

Machine learning enhances predictive ability in various research compared to conventional statistical approaches. However, the advantage of the regression model is that it can effortlessly interpret the effect of each predictor. Therefore …

Authors:
Yi-Chi Chu, Yi-Hau Chen, Chao-Yu Guo

09-04-2024 | Regular Paper

Decision making for selection of smart vehicle transportation system using VIKOR approach

The advent of technology has facilitated communication among individuals, structures, and transport, thereby revolutionizing contemporary living. In this regard, the Internet of Things has emerged as a transformative approach. Smart vehicles and …

Authors:
Habib Ullah Khan, Farhad Ali, Muhammad Sohail, Shah Nazir, Mohammad Arif

Open Access 09-04-2024 | Editorial

Introduction to the special issue on PAKDD’2021

Author:
P. Krishna Reddy

08-04-2024 | Review

A comprehensive and analytical review of text clustering techniques

Document clustering involves grouping together documents so that similar documents are grouped together in the same cluster and different documents in the different clusters. Clustering of documents is considered a fundamental problem in the field …

Authors:
Vivek Mehta, Mohit Agarwal, Rohit Kumar Kaliyar

06-04-2024 | Regular Paper

K-Trickle: performance evaluation and impact on quality of service in resource-constrained networks

The Internet of Things is an emerging domain in the field of establishing the effective communications. Routing protocols are crucial for ensuring dependable data transmission in networks with limited resources. RPL is a standardized protocol used …

Authors:
P. Arivubrakan, G. R. Kanagachidambaresan

29-03-2024 | Regular Paper

Stopping fake news: Who should be banned?

Fake news and misinformation spread in online social networks in a manner similar to contagious diseases. One possibility to thwart the contagion cascade is to selectively remove a small number of nodes from the network. Although most of the …

Authors:
Pablo Ignacio Fierens, Leandro Chaves Rêgo

27-03-2024 | Regular Paper

An efficient machine learning approach for extracting eSports players’ distinguishing features and classifying their skill levels using symbolic transfer entropy and consensus nested cross-validation

Discovering features that set elite players apart is of great significance for eSports coaches as it enables them to arrange a more effective training program focused on improving those features. Moreover, finding such features results in a better …

Authors:
Amin Noroozi, Mohammad S. Hasan, Maryam Ravan, Elham Norouzi, Ying-Ying Law

Open Access 26-03-2024 | Regular Paper

Alternative feature selection with user control

Feature selection is popular for obtaining small, interpretable, yet highly accurate prediction models. Conventional feature-selection methods typically yield one feature set only, which does not suffice in certain scenarios. For example, users …

Authors:
Jakob Bach, Klemens Böhm

Open Access 25-03-2024 | Regular Paper

Forecasting implied volatilities of currency options with machine learning techniques and econometrics models

Developing an effective modeling framework to minimize foreign exchange (FX) risk is of vital importance for hedgers and traders in FX markets. In this study, we compare the ability of long short-term memory (LSTM) models to that of random forest …

Authors:
Asbjørn Olsen, Gard Djupskås, Petter Eilif de Lange, Morten Risstad

21-03-2024 | Regular Paper

Symmetric contrastive learning for robust fault detection in time-series traffic sensor data

Traffic sensor data are prone to malfunctions caused by various factors such as manufacturing defects, harsh environmental conditions, improper installation, and maintenance. While fault data detection is a well-established practice in many …

Authors:
Yongcan Huang, Jidong J. Yang

Open Access 21-03-2024 | Regular Paper

A probabilistic spatio-temporal neural network to forecast COVID-19 counts

Geo-referenced and temporal data are becoming more and more ubiquitous in a wide range of fields such as medicine and economics. Particularly in the realm of medical research, spatio-temporal data play a pivotal role in tracking and understanding …

Authors:
Federico Ravenda, Mirko Cesarini, Stefano Peluso, Antonietta Mira