Data Mining and Knowledge Discovery

Papers
(The TQCC of Data Mining and Knowledge Discovery is 6. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
Article | Citations
InceptionTime: Finding AlexNet for time series classification | 562
ROCKET: exceptionally fast and accurate time series classification using random convolutional kernels | 364
The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances | 195
A survey of community detection methods in multilayer networks | 76
Challenges in benchmarking stream learning algorithms with real-world data | 60
Counterfactual explanations and how to find them: literature review and benchmarking | 58
MultiRocket: multiple pooling operators and transformations for fast and effective time series classification | 49
Deep graph similarity learning: a survey | 37
An efficient K-means clustering algorithm for tall data | 36
Deep soccer analytics: learning an action-value function for evaluating soccer players | 36
A comprehensive taxonomy for explainable artificial intelligence: a systematic survey of surveys on methods and concepts | 35
Matrix profile goes MAD: variable-length motif and discord discovery in data series | 34
Time series extrinsic regression | 34
TEASER: early and accurate time series classification | 32
Fake review detection on online E-commerce platforms: a systematic literature review | 30
Relational Learning Analysis of Social Politics using Knowledge Graph Embedding | 30
Smoothed dilated convolutions for improved dense prediction | 29
Time series motifs discovery under DTW allows more robust discovery of conserved structure | 28
Scalable attack on graph data by injecting vicious nodes | 26
ColluEagle: collusive review spammer detection using Markov random fields | 26
Forecast evaluation for data scientists: common pitfalls and best practices | 24
Comparison of novelty detection methods for multispectral images in rover-based planetary exploration missions | 24
Word-class embeddings for multiclass text classification | 23
Data-driven detection of counterpressing in professional football | 22
End-to-end deep representation learning for time series clustering: a comparative study | 22
XEM: An explainable-by-design ensemble method for multivariate time series classification | 22
Improving embedded knowledge graph multi-hop question answering by introducing relational chain reasoning | 19
Active learning for hierarchical multi-label classification | 19
ABBA: adaptive Brownian bridge-based symbolic aggregation of time series | 17
Treant: training evasion-aware decision trees | 16
Gaussian bandwidth selection for manifold learning and classification | 16
Algorithmic fairness datasets: the story so far | 16
User preference and embedding learning with implicit feedback for recommender systems | 15
A framework for deep constrained clustering | 15
Multi-label learning with missing and completely unobserved labels | 15
Efficient mining of the most significant patterns with permutation testing | 15
A survey of deep network techniques all classifiers can adopt | 14
The area under the ROC curve as a measure of clustering quality | 14
Benchmarking and survey of explanation methods for black box models | 14
struc2gauss: Structural role preserving network embedding via Gaussian embedding | 14
INK: knowledge graph embeddings for node classification | 14
MIDIA: exploring denoising autoencoders for missing data imputation | 14
Graph convolutional networks for traffic forecasting with missing values | 13
Grouped feature importance and combined features effect plot | 13
Dataset2Vec: learning dataset meta-features | 13
Efficient set-valued prediction in multi-class classification | 12
Extending greedy feature selection algorithms to multiple solutions | 12
Sequential recommendation with metric models based on frequent sequences | 12
Detecting virtual concept drift of regressors without ground truth values | 11
Cost-sensitive ensemble learning: a unifying framework | 11
Expected passes | 11
An efficient procedure for mining egocentric temporal motifs | 10
Hydra: competing convolutional kernels for fast and accurate time series classification | 10
PETSC: pattern-based embedding for time series classification | 10
VFC-SMOTE: very fast continuous synthetic minority oversampling for evolving data streams | 10
A deep multimodal model for bug localization | 10
Boosting house price predictions using geo-spatial network embedding | 10
An ultra-fast time series distance measure to allow data mining in more complex real-world deployments | 10
Robust subgroup discovery | 9
Who can receive the pass? A computational model for quantifying availability in soccer | 9
Large-scale network motif analysis using compression | 9
Bayesian mean-parameterized nonnegative binary matrix factorization | 9
BROCCOLI: overlapping and outlier-robust biclustering through proximal stochastic gradient descent | 9
For real: a thorough look at numeric attributes in subgroup discovery | 9
Controlling hallucinations at word level in data-to-text generation | 9
SPEck: mining statistically-significant sequential patterns efficiently with exact sampling | 8
Feature extraction from unequal length heterogeneous EHR time series via dynamic time warping and tensor decomposition | 8
Hierarchical message-passing graph neural networks | 8
Simplification of genetic programs: a literature survey | 8
Mining communities and their descriptions on attributed graphs: a survey | 8
Chebyshev approaches for imbalanced data streams regression models | 8
Introducing time series snippets: a new primitive for summarizing long time series | 7
Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach | 7
Interpretability, personalization and reliability of a machine learning based clinical decision support system | 7
Natural language techniques supporting decision modelers | 7
Early abandoning and pruning for elastic distances including dynamic time warping | 7
On GNN explainability with activation rules | 7
Simple and effective neural-free soft-cluster embeddings for item cold-start recommendations | 7
Time series clustering in linear time complexity | 6
Detecting singleton spams in reviews via learning deep anomalous temporal aspect-sentiment patterns | 6
An overlap sensitive neural network for class imbalanced data | 6
The minimum description length principle for pattern mining: a survey | 6
ClaSP: parameter-free time series segmentation | 6
Adversarial balancing-based representation learning for causal effect inference with observational data | 6
Sequence graph transform (SGT): a feature embedding function for sequence data mining | 6
Explanatory artificial intelligence (YAI): human-centered explanations of explainable AI and complex data | 6
Sufficient dimension reduction for average causal effect estimation | 6
POI recommendation with queuing time and user interest awareness | 6
Neural content-aware collaborative filtering for cold-start music recommendation | 6
What’s in a name? – gender classification of names with character based machine learning models | 6
Improving position encoding of transformers for multivariate time series classification | 6
Novel features for time series analysis: a complex networks approach | 6
A recurrent neural network architecture to model physical activity energy expenditure in older people | 6
Recurring concept memory management in data streams: exploiting data stream concept evolution to improve performance and transparency | 6
Mining full, inner and tail periodic patterns with perfect, imperfect and asynchronous periodicity simultaneously | 6
SMILE: a feature-based temporal abstraction framework for event-interval sequence classification | 6
TEAGS: time-aware text embedding approach to generate subgraphs | 6