Journal of Big Data

Papers
(The H4-Index of Journal of Big Data is 49. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
Prognostic stratification based on HIF-1α signaling for evaluating hypoxia status and immune landscape in hepatocellular carcinoma600
Domain-relevance of influence: characterizing variations in online influence across multiple domains on social media413
Integrating deep learning and transfer learning: optimizing white blood cells classification in medical educational institutions402
Long-term survival prediction in patients with acute brain lesions using ensemble machine learning algorithms: a cohort study with combined national health insurance service and its self-run hospital 374
A new dimensionality reduction technique based on the Wavelet Transform for cancer classification333
A universal approach for multi-model schema inference312
Context-aware prediction of active and passive user engagement: Evidence from a large online social platform251
Gene selection via improved nuclear reaction optimization algorithm for cancer classification in high-dimensional data247
Identification of tumor antigens and anoikis-based molecular subtypes in the hepatocellular carcinoma immune microenvironment: implications for mRNA vaccine development and precision treatment240
Data provider research overview from a public management perspective: a bibliometric analysis utilizing CiteSpace210
GB-AFS: graph-based automatic feature selection for multi-class classification via Mean Simplified Silhouette163
Value-at-risk student prescription trees for price personalization159
FONDUE—Fine-Tuned Optimization: Nurturing Data Usability & Efficiency157
A multi-granular hybrid neural architecture for detecting abusive content in online social networks (OSNs) with contextual awareness148
A proposed hybrid framework to improve the accuracy of customer churn prediction in telecom industry147
Efficient pollen grain classification using pre-trained Convolutional Neural Networks: a comprehensive study141
Hybrid beluga whale optimization algorithm with multi-strategy for functions and engineering optimization problems138
Designing and evaluating a big data analytics approach for predicting students’ success factors138
An artificial intelligence platform for predicting postoperative complications in metastatic spinal surgery: development and validation study129
Defining user spectra to classify Ethereum users based on their behavior118
Comprehensive study of driver behavior monitoring systems using computer vision and machine learning techniques118
The stability of different aggregation techniques in ensemble feature selection104
COA_DNN: a hybrid crayfish optimization with deep neural network for detection of rapid eye movement behaviour disorder102
Breast cancer prediction using gated attentive multimodal deep learning97
Big data in human behavior research: a contextual turn94
Exploring differential privacy in CNNs, LSTMs, GRUs, and RNNs for heartbeat detection from multimodal data91
Traffic and road conditions monitoring system using extracted information from Twitter91
The adaptive community-response (ACR) method for collecting misinformation on social media90
An adaptive k-means clustering algorithm based on grid and domain centroid weights for digital twins in the context of digital transformation86
A model for investment type recommender system based on the potential investors based on investors and experts feedback using ANFIS and MNN82
Artificial intelligence for improving Nitrogen Dioxide forecasting of Abu Dhabi environment agency ground-based stations80
Deep-Eware: spatio-temporal social event detection using a hybrid learning model79
A unified IoT architectural model for smart hospitals: enhancing interoperability, security, and efficiency through clinical information systems (CIS)78
Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis76
An efficient binary spider wasp optimizer for multi-dimensional knapsack instances: experimental validation and analysis74
Pre-trained transformer-based language models for Sundanese73
Advancing multimodal emotion recognition in big data through prompt engineering and deep adaptive learning70
Risk and UCON-based access control model for healthcare big data67
DiabSense: early diagnosis of non-insulin-dependent diabetes mellitus using smartphone-based human activity recognition and diabetic retinopathy analysis with Graph Neural Network66
Survey on terminology extraction from texts64
Artificial intelligence models for prediction of monthly rainfall without climatic data for meteorological stations in Ethiopia63
Distributed fuzzy clustering algorithm for mixed-mode data in Apache SPARK63
The use of Big Data Analytics in healthcare56
Surface defect detection on bolt surface using a real-time fine-tuned YOLOv6 model55
SMT efficiency in supervised ML methods: a throughput and interference analysis55
Review of deep learning methods for remote sensing satellite images classification: experimental survey and comparative analysis55
Developing insights from the collective voice of target users in Twitter54
Fast agglomerative clustering using approximate traveling salesman solutions50
Machine learning based customer churn prediction in home appliance rental business49
Novel mathematical model for the classification of music and rhythmic genre using deep neural network49
Hajj pilgrimage abnormal crowd movement monitoring using optical flow and FCNN49
Advancing stock price prediction through the development of hybrid ensembles: a comprehensive comparative analysis of machine learning approaches49
Short-term photovoltaic power production forecasting based on novel hybrid data-driven models49
Predicting startup success using two bias-free machine learning: resolving data imbalance using generative adversarial networks49
From distributed machine to distributed deep learning: a comprehensive survey49
0.11956286430359