Data Mining and Knowledge Discovery

Papers
(The median citation count of Data Mining and Knowledge Discovery is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Joint dynamic topic model for recognition of lead-lag relationship in two text corpora228
Knowledge graph completion based on asymmetric translation and automatic entity type representation191
A probabilistic model for API contract specification retrieval focusing on the openAPI standard188
Structural-temporal coupling anomaly detection with dynamic graph transformer180
Counterfactual explanations as interventions in latent space153
Traffic forecasting on new roads using spatial contrastive pre-training (SCPT)133
Discord-based counterfactual explanations for time series classification90
Thompson sampling-based recursive block elimination for dynamic assignment under limited budget in pure-exploration73
Representing ensembles of networks for fuzzy cluster analysis: a case study69
TCMI: a non-parametric mutual-dependence estimator for multivariate continuous distributions68
Exploiting sensor data in professional road cycling: personalized data-driven approach for frequent fitness monitoring65
The grammar of interactive explanatory model analysis62
Exploring zero-shot essay scoring: from feature-based to LLM-based approaches54
Correction: TSelect: selecting relevant and non-redundant channels for multivariate time series classification46
Quantitative evaluation of motif sets in time series45
Correction: Marginal effects for non-linear prediction functions42
Combating confirmation bias: a unified pseudo-labeling framework for entity alignment34
Hydra: competing convolutional kernels for fast and accurate time series classification32
VEM$$^2$$L: an easy but effective framework for fusing text and structure knowledge on sparse knowledge graph completion32
MMA: metadata supported multi-variate attention for onset detection and prediction31
Leveraging internal representations of GNNs with Shapley values29
Fitter: post-mining user-preferred co-location patterns interactively28
Improve contrastive clustering performance by multiple fusing-augmenting ViT blocks27
Wisdom of the contexts: active ensemble learning for contextual anomaly detection27
Neural content-aware collaborative filtering for cold-start music recommendation25
Fine-grained multi-prompt essay scoring with multi-level disentanglement24
Reflective-net: learning from explanations24
TenGAN: adversarially generating multiplex tensor graphs23
Approximation trees: statistical reproducibility in model distillation22
Optirefine: densest subgraphs and maximum cuts with k refinements22
Correction: Deep anomaly detection with partition contrastive learning for tabular data22
OLIVANDER: a counterfactual-based method to generate adversarial Windows PE malware22
SALτ: efficiently stopping TAR by improving priors estimates22
On computing exact means of time series using the move-split-merge metric22
Improving neural network’s robustness on tabular data with D-layers22
Explainable decomposition of nested dense subgraphs21
Interpretable representations in explainable AI: from theory to practice20
Correction: Bake off redux: a review and experimental evaluation of recent time series classification algorithms19
AA-forecast: anomaly-aware forecast for extreme events18
Robust explainer recommendation for time series classification17
What do anomaly scores actually mean? Dynamic characteristics beyond accuracy17
On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles17
MultiRocket: multiple pooling operators and transformations for fast and effective time series classification17
Contextualization of soccer analysis with tactical periodization and machine learning16
Robust and sparse multinomial regression in high dimensions16
On GNN explainability with activation rules16
Efficient algorithms for fair clustering with a new notion of fairness15
Exploiting second-order dissimilarity representations for hierarchical clustering and visualization15
Explanatory artificial intelligence (YAI): human-centered explanations of explainable AI and complex data15
Does user-end work? User-item-aware knowledge graph convolutional networks for recommendation15
EmbAssi: embedding assignment costs for similarity search in large graph databases15
Explainable and interpretable machine learning and data mining15
A multi-class imbalanced data stream classification algorithm based on sample weighting and adaptive oversampling15
Sky-signatures: detecting and characterizing recurrent behavior in sequential data15
Multilayer horizontal visibility graphs for multivariate time series analysis15
SimHawNet: a modified Hawkes process for temporal network simulation14
A comprehensive taxonomy for explainable artificial intelligence: a systematic survey of surveys on methods and concepts14
Coupled block diagonal regularization for multi-view subspace clustering14
Algorithmic fairness datasets: the story so far14
Metadata supported scale space attention networks for multivariate timeseries prediction13
When subgraphs outperform graphs: a scalable training strategy for churn prediction on large class-imbalanced networks13
Mondrian forest for data stream classification under memory constraints13
Random walks with variable restarts for negative-example-informed label propagation13
Bounding the family-wise error rate in local causal discovery using Rademacher averages13
Bijective graph learning architecture with multi-level attributes interaction13
Hypercore decomposition for non-fragile hyperedges: concepts, algorithms, observations, and applications12
Unsupervised feature based algorithms for time series extrinsic regression12
Locality adaptive incomplete multi-view subspace clustering12
Missing value replacement in strings and applications12
NICE: an algorithm for nearest instance counterfactual explanations12
Grouped feature importance and combined features effect plot12
ClaSP: parameter-free time series segmentation12
Dynamic self-paced sampling ensemble for highly imbalanced and class-overlapped data classification12
Randomnet: clustering time series using untrained deep neural networks12
Beyond additivity: sparse isotonic shapley regression toward nonlinear explainability12
Structural learning of simple staged trees12
Intersectional fair ranking via subgroup divergence11
When graph convolution meets double attention: online privacy disclosure detection with multi-label text classification11
Hamming encoder: mining discriminative k-mers for discrete sequence classification11
Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach11
Inferring tie strength in temporal networks11
Detach-ROCKET: sequential feature selection for time series classification with random convolutional kernels11
Making clusterings fairer by post-processing: algorithms, complexity results and experiments11
SFC: a time series decomposition attention network with continuous nature for time series analysis11
Robust subgroup discovery10
JammyTS: joint attention and memory network for temporal scoping of facts10
Stable graph based decision route explanation in siamese neural networks10
Central node identification via weighted kernel density estimation10
A tale of two roles: exploring topic-specific susceptibility and influence in cascade prediction10
Sentiment analysis in tweets: an assessment study from classical to modern word representation models10
Knowledge graph embedding closed under composition10
Modelling event sequence data by type-wise neural point process10
Sequential pattern detection: similarities and differences across various fields10
Predicted motion pressure—metricizing pressure created by pass rushers in the NFL and predicting their motions using weighted K-nearest neighbors machine learning models10
RMIDDM: an unsupervised and interpretable concept drift detection method for data streams9
Bake off redux: a review and experimental evaluation of recent time series classification algorithms9
Marginal effects for non-linear prediction functions9
Structural iterative lexicographic autoencoded node representation8
Binary quantification and dataset shift: an experimental investigation8
Data-driven learning optimal K values for K-nearest neighbour matching in causal inference8
Topic-aware influence maximization with deep reinforcement learning and graph attention networks8
One-shot relational learning for extrapolation reasoning on temporal knowledge graphs8
MrTF: model refinery for transductive federated learning8
Benchmarking and survey of explanation methods for black box models8
Large language models are zero-shot point-of-interest recommenders8
An external stability audit framework to test the validity of personality prediction in AI hiring7
Overlapping community detection with a new modularity measure in directed weighted networks7
Sequential query prediction based on multi-armed bandits with ensemble of transformer experts and immediate feedback7
i-Align: an interpretable knowledge graph alignment model7
Temporal motif-based representation learning on continuous-time dynamic graphs7
TSelect: selecting relevant and non-redundant channels for multivariate time series classification7
Trace alignment algorithm optimization7
Entity completion for industrial knowledge graph based on zero-shot learning6
MIRACLE: Malware image recognition and classification by layered extraction6
Fastere: a fast framework for entity relation extractions6
On regime changes in text data using hidden Markov model of contaminated vMF distribution6
Guardnet: an imbalance-aware graph neural network for fraud detection6
Entropy-regularized multimodal fusion for robust and explainable knowledge graph completion6
Z-Time: efficient and effective interpretable multivariate time series classification6
Detecting and reacting to smart home novelties6
Deep clustering for large-scale interpretable time series segmentation6
Online concept evolution detection based on active learning6
ArcMatch: high-performance subgraph matching for labeled graphs by exploiting edge domains6
Improving position encoding of transformers for multivariate time series classification6
Session-based recommendation by exploiting substitutable and complementary relationships from multi-behavior data6
Community detection in interval-weighted networks6
GeoRF: a geospatial random forest5
Large scale K-means clustering using GPUs5
An attention matrix for every decision: faithfulness-based arbitration among multiple attention-based interpretations of transformers in text classification5
On the number of iterations of the DBA algorithm5
MERLIN++: parameter-free discovery of time series anomalies5
Instance space of clustering validation measures5
Improving the core resilience of real-world hypergraphs5
Negative-sample-free knowledge graph embedding5
Regularization-based methods for ordinal quantification5
Exploring potential biases towards blockbuster items in ranking-based recommendations5
Fairness in vulnerable attribute prediction on social media5
Using differential evolution for an attribute-weighted inverted specific-class distance measure for nominal attributes5
ContE: contextualized knowledge graph embedding for circular relations5
SOKNL: A novel way of integrating K-nearest neighbours with adaptive random forest regression for data streams5
OEC: an online ensemble classifier for mining data streams with noisy labels5
Knowledge graph embedding methods for entity alignment: experimental review5
HyEED: embedding learning of knowledge graphs with entity description in hyperbolic space5
BDRI: block decomposition based on relational interaction for knowledge graph completion5
ConvMOS: climate model output statistics with deep learning5
Effective interpretable learning for large-scale categorical data5
Large language models are few-shot multivariate time series classifiers5
Model-agnostic variable importance for predictive uncertainty: an entropy-based approach5
Efficient outlier detection in numerical and categorical data5
Bias characterization, assessment, and mitigation in location-based recommender systems4
Multi-hop clustering for reasoning chain extraction in multi-hop question answering4
Multi-neighbor social recommendation with attentional graph convolutional network4
Relation-aware multimodal data hashing for scalable recommendation systems4
A multi-scale time series forecasting framework with temporal hierarchical information fusion and reconciliation4
Explainable contextual anomaly detection using quantile regression forests4
ARL: analogical reinforcement learning for knowledge graph reasoning4
Interplay between topology and edge weights in real-world graphs: concepts, patterns, and an algorithm4
Enhancing cluster analysis via topological manifold learning4
ARM-stream: active recovery of miscategorizations in clustering-based data stream classifiers4
Uplift modeling with quasi-loss-functions4
Improving graph-based recommendation with unraveled graph learning4
The impact of variable ordering on Bayesian network structure learning4
Tractable probabilistic models and computational complexity4
A spatiotemporal deep neural network for fine-grained multi-horizon wind prediction4
Stratify: unifying multi-step forecasting strategies4
Explaining deep convolutional models by measuring the influence of interpretable features in image classification4
Crypsis: an elitism-driven observer-based approach for detection and mitigation of concept drift, concept evolution, and label drift4
Certifying robustness of graph convolutional networks for node perturbation with polyhedra abstract interpretation4
Conclusive local interpretation rules for random forests4
Differentiated matching for individual and average treatment effect estimation4
kNN matrix profile for knowledge discovery from time series4
A two-step anomaly detection based method for PU classification in imbalanced data sets3
MODE-Bi-GRU: orthogonal independent Bi-GRU model with multiscale feature extraction3
A systematic review of deep learning for structural geological interpretation3
Regularized impurity reduction: accurate decision trees with complexity guarantees3
Learning a consensus sub-network with polarization regularization and one pass training3
Correction to: Studying bias in visual features through the lens of optimal transport3
Structure-aware decoupled imputation network for multivariate time series3
tPARAFAC2: tracking evolving patterns in (incomplete) temporal data3
Fed-FUEL: fairness and utility enhancing agnostic federated learning framework3
Hybrid federated continual graph contrastive learning for evolving money laundering threats3
Design and evaluation of highly accurate smart contract code vulnerability detection framework3
Distilcyphergpt: enhancing large language models for knowledge graph question answering in cypher through knowledge distillation3
Series2vec: similarity-based self-supervised representation learning for time series classification3
Multi-relational knowledge graph contrastive learning for link prediction3
Fast block-wise partitioning for extreme multi-label classification3
CSCN: an efficient snapshot ensemble learning based sparse transformer model for long-range spatial-temporal traffic flow prediction3
LoCoMotif: discovering time-warped motifs in time series3
Relation prediction based on the attention-enhanced fusion of graph strcuture and multi-hop neighborhood information in knowledge graphs3
Unpacking the trend: decomposition as a catalyst to enhance time series forecasting models3
0.072635889053345