VLDB Journal

Papers
(The TQCC of VLDB Journal is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-04-01 to 2024-04-01.)
ArticleCitations
Data collection and quality challenges in deep learning: a data-centric AI perspective47
TADOC: Text analytics directly on compression38
A survey of RDF stores & SPARQL engines for querying knowledge graphs35
Fast-adapting and privacy-preserving federated recommender system32
Managing bias and unfairness in data for decision support: a survey of machine learning and data engineering approaches to identify and mitigate bias and unfairness within data management and analytic31
Fairness in rankings and recommendations: an overview30
A model and query language for temporal graph databases30
MDDE: multitasking distributed differential evolution for privacy-preserving database fragmentation29
Efficient approximation algorithms for adaptive influence maximization26
A survey on outlier explanations22
A benchmark and comprehensive survey on knowledge graph entity alignment via representation learning21
Location- and keyword-based querying of geo-textual data: a survey20
Tidy Tuples and Flying Start: fast compilation and fast execution of relational queries in Umbra20
LineageChain: a fine-grained, secure and efficient data provenance system for blockchains20
Unsupervised and scalable subsequence anomaly detection in large data series19
Memory-aware framework for fast and scalable second-order random walk over billion-edge natural graphs19
Data dependencies for query optimization: a survey18
Building blocks for persistent memory18
Towards efficient solutions of bitruss decomposition for large-scale bipartite graphs17
RDF graph summarization for first-sight structure discovery17
Cross-chain deals and adversarial commerce16
A survey on deep learning approaches for text-to-SQL16
Scalable data series subsequence matching with ULISSE15
Dragoon: a hybrid and efficient big trajectory management system for offline and online analytics14
GeoSparkViz: a cluster computing system for visualizing massive-scale geospatial data13
Visually aware recommendation with aesthetic features13
$$\hbox {CDBTune}^{+}$$: An efficient deep reinforcement learning-based automatic cloud database tuning system13
Distributed temporal graph analytics with GRADOOP13
Answering reachability and K-reach queries on large graphs with label constraints12
A survey on semantic schema discovery12
eRiskCom: an e-commerce risky community detection platform11
Privacy and efficiency guaranteed social subgraph matching11
PrefixFPM: a parallel framework for general-purpose mining of frequent and closed patterns11
In-Memory Interval Joins11
DIFF: a relational interface for large-scale data explanation11
I/O efficient k-truss community search in massive graphs10
Efficient and effective ER with progressive blocking10
On entity alignment at scale10
VIP: A SIMD vectorized analytical query engine10
A dataspace-based framework for OLAP analyses in a high-variety multistore10
Efficient Hop-constrained s-t Simple Path Enumeration10
Autoscaling tiered cloud storage in Anna9
Continuous top-k spatial–keyword search on dynamic objects8
Data distribution debugging in machine learning pipelines8
Temporal locality-aware sampling for accurate triangle counting in real graph streams8
RHEEMix in the data jungle: a cost-based optimizer for cross-platform systems8
A cost model for random access queries in document stores8
Algorithms for the discovery of embedded functional dependencies8
Better database cost/performance via batched I/O on programmable SSD7
Fast data series indexing for in-memory data7
EI-LSH: An early-termination driven I/O efficient incremental c-approximate nearest neighbor search7
Leveraging range joins for the computation of overlap joins6
Accelerated butterfly counting with vertex priority on bipartite graphs6
Cache-efficient sweeping-based interval joins for extended Allen relation predicates6
A design space for RDF data representations6
ABSTAT-HD: a scalable tool for profiling very large knowledge graphs6
Model averaging in distributed machine learning: a case study with Apache Spark6
Effective entity matching with transformers6
Finding skyline communities in multi-valued networks6
A game-based framework for crowdsourced data labeling6
G-thinker: a general distributed framework for finding qualified subgraphs in a big graph with load balancing6
VolcanoML: speeding up end-to-end AutoML via scalable search space decomposition6
0.029000997543335