Data Mining and Knowledge Discovery

Papers
(The TQCC of Data Mining and Knowledge Discovery is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-11-01 to 2025-11-01.)
ArticleCitations
A probabilistic model for API contract specification retrieval focusing on the openAPI standard215
Knowledge graph completion based on asymmetric translation and automatic entity type representation161
Joint dynamic topic model for recognition of lead-lag relationship in two text corpora137
Who can receive the pass? A computational model for quantifying availability in soccer127
Counterfactual explanations as interventions in latent space110
Correction: Marginal effects for non-linear prediction functions92
Combating confirmation bias: a unified pseudo-labeling framework for entity alignment87
Representing ensembles of networks for fuzzy cluster analysis: a case study67
Thompson sampling-based recursive block elimination for dynamic assignment under limited budget in pure-exploration55
TCMI: a non-parametric mutual-dependence estimator for multivariate continuous distributions51
Discord-based counterfactual explanations for time series classification49
The grammar of interactive explanatory model analysis48
Traffic forecasting on new roads using spatial contrastive pre-training (SCPT)46
Hydra: competing convolutional kernels for fast and accurate time series classification45
Exploiting sensor data in professional road cycling: personalized data-driven approach for frequent fitness monitoring44
VEM$$^2$$L: an easy but effective framework for fusing text and structure knowledge on sparse knowledge graph completion38
Fine-grained multi-prompt essay scoring with multi-level disentanglement31
MMA: metadata supported multi-variate attention for onset detection and prediction28
Dynamic cyber risk estimation with competitive quantile autoregression27
Leveraging internal representations of GNNs with Shapley values24
Neural content-aware collaborative filtering for cold-start music recommendation24
Wisdom of the contexts: active ensemble learning for contextual anomaly detection24
SALτ: efficiently stopping TAR by improving priors estimates23
Reflective-net: learning from explanations23
TenGAN: adversarially generating multiplex tensor graphs23
Correction: Bake off redux: a review and experimental evaluation of recent time series classification algorithms23
Explainable decomposition of nested dense subgraphs21
Approximation trees: statistical reproducibility in model distillation21
On computing exact means of time series using the move-split-merge metric21
OLIVANDER: a counterfactual-based method to generate adversarial Windows PE malware20
Optirefine: densest subgraphs and maximum cuts with k refinements20
Correction: Deep anomaly detection with partition contrastive learning for tabular data20
Robust explainer recommendation for time series classification19
Interpretable representations in explainable AI: from theory to practice18
On the evaluation of outlier detection and one-class classification: a comparative study of algorithms, model selection, and ensembles18
Improving neural network’s robustness on tabular data with D-layers18
AA-forecast: anomaly-aware forecast for extreme events18
MultiRocket: multiple pooling operators and transformations for fast and effective time series classification18
Explanatory artificial intelligence (YAI): human-centered explanations of explainable AI and complex data17
Contextualization of soccer analysis with tactical periodization and machine learning17
Multilayer horizontal visibility graphs for multivariate time series analysis17
On GNN explainability with activation rules16
Explainable and interpretable machine learning and data mining16
What do anomaly scores actually mean? Dynamic characteristics beyond accuracy16
Does user-end work? User-item-aware knowledge graph convolutional networks for recommendation16
Robust and sparse multinomial regression in high dimensions16
Exploiting second-order dissimilarity representations for hierarchical clustering and visualization15
EmbAssi: embedding assignment costs for similarity search in large graph databases15
Sky-signatures: detecting and characterizing recurrent behavior in sequential data15
Algorithmic fairness datasets: the story so far14
A comprehensive taxonomy for explainable artificial intelligence: a systematic survey of surveys on methods and concepts14
Coupled block diagonal regularization for multi-view subspace clustering14
SimHawNet: a modified Hawkes process for temporal network simulation14
Efficient algorithms for fair clustering with a new notion of fairness13
Random walks with variable restarts for negative-example-informed label propagation13
Hypercore decomposition for non-fragile hyperedges: concepts, algorithms, observations, and applications13
PAC-Bayesian lifelong learning for multi-armed bandits12
Metadata supported scale space attention networks for multivariate timeseries prediction12
Bounding the family-wise error rate in local causal discovery using Rademacher averages12
Mondrian forest for data stream classification under memory constraints12
Unsupervised feature based algorithms for time series extrinsic regression11
Making clusterings fairer by post-processing: algorithms, complexity results and experiments11
When graph convolution meets double attention: online privacy disclosure detection with multi-label text classification11
Randomnet: clustering time series using untrained deep neural networks11
Dynamic self-paced sampling ensemble for highly imbalanced and class-overlapped data classification11
NICE: an algorithm for nearest instance counterfactual explanations11
Inferring tie strength in temporal networks11
Locality adaptive incomplete multi-view subspace clustering11
An eager splitting strategy for online decision trees in ensembles10
SFC: a time series decomposition attention network with continuous nature for time series analysis10
Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach10
Grouped feature importance and combined features effect plot10
Structural learning of simple staged trees10
Detach-ROCKET: sequential feature selection for time series classification with random convolutional kernels9
Knowledge graph embedding closed under composition9
Missing value replacement in strings and applications9
Synwalk: community detection via random walk modelling9
Sequential pattern detection: similarities and differences across various fields9
Hamming encoder: mining discriminative k-mers for discrete sequence classification9
Temporal state change Bayesian networks for modeling of evolving multivariate state sequences: model, structure discovery and parameter estimation9
PETSC: pattern-based embedding for time series classification9
Intersectional fair ranking via subgroup divergence9
ClaSP: parameter-free time series segmentation9
Stable graph based decision route explanation in siamese neural networks8
Robust subgroup discovery8
JammyTS: joint attention and memory network for temporal scoping of facts8
Sentiment analysis in tweets: an assessment study from classical to modern word representation models8
A tale of two roles: exploring topic-specific susceptibility and influence in cascade prediction8
Modelling event sequence data by type-wise neural point process8
Central node identification via weighted kernel density estimation8
Bake off redux: a review and experimental evaluation of recent time series classification algorithms8
0.04560399055481