EPJ Data Science

Papers
(The median citation count of EPJ Data Science is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
ArticleCitations
Does United Kingdom parliamentary attention follow social media posts?51
Drivers of hate speech in political conversations on Twitter: the case of the 2022 Italian general election48
Estimating work engagement from online chat tools39
The microvelocity of money in Ethereum37
How floods may affect the spatial spread of respiratory pathogens: the case of Emilia-Romagna, Italy in May 202328
Effective strategies for targeted attacks to the network of Cosa Nostra affiliates27
Modeling international mobility using roaming cell phone traces during COVID-19 pandemic25
A hybrid stock prediction method based on periodic/non-periodic features analyses23
Assessing geographic polarisation in Britain’s digital landscape through stable dynamic embedding of spatial web data22
Detecting coordinated and bot-like behavior in Twitter: the Jürgen Conings case22
Analyzing image-based political propaganda in referendum campaigns: from elements to strategies22
Generating mobility networks with generative adversarial networks21
Online advertisement in a pink-colored market20
Which sport is becoming more predictable? A cross-discipline analysis of predictability in team sports20
Origin and destination attachment: study of cultural integration on Twitter20
Quantifying polarization in online political discourse19
Disentangling degree and tie strength heterogeneity in egocentric social networks19
Dream content discovery from social media using natural language processing18
A path-based approach to analyzing the global liner shipping network18
Human mobility reshaped? Deciphering the impacts of the Covid-19 pandemic on activity patterns, spatial habits, and schedule habits18
Suspended accounts align with the Internet Research Agency misinformation campaign to influence the 2016 US election18
Leveraging WiFi network logs to infer student collocation and its relationship with academic performance17
On the duration of face-to-face contacts17
Evolution of sample-based music authorship network16
Identifying urban features for vulnerable road user safety in Europe16
Developing a hierarchical model for unraveling conspiracy theories16
Classifying social position with social media behavioral data16
Companies under stress: the impact of shocks on the production network15
Correction: Temporal network analysis using zigzag persistence15
Computational social science is growing up: why puberty consists of embracing measurement validation, theory development, and open science practices13
Adaptation of student behavioural routines during Covid-19: a multimodal approach13
Academic support network reflects doctoral experience and productivity13
Multifaceted online coordinated behavior in the 2020 US presidential election13
Science as exploration in a knowledge landscape: tracing hotspots or seeking opportunity?12
Rhythm of the streets: a street classification framework based on street activity patterns12
Identifying the temporal dynamics of densification and sparsification in human contact networks12
Analysis and classification of privacy-sensitive content in social media posts12
Open data and quantitative techniques for anthropology of road traffic12
Fair automated assessment of noncompliance in cargo ship networks11
Milgram’s experiment in the knowledge space: individual navigation strategies11
Critical computational social science11
Temporal patterns of reciprocity in communication networks10
Can Google Trends predict asylum-seekers’ destination choices?10
Misinformation is not about bad facts: an analysis of the production and consumption of fringe content10
Entropy-based text feature engineering approach for forecasting financial liquidity changes10
Learning to cluster urban areas: two competitive approaches and an empirical validation10
Large scale analysis of gender bias and sexism in song lyrics10
The simpliciality of higher-order networks10
Endogenous labour flow networks10
News sharing on Twitter reveals emergent fragmentation of media agenda and persistent polarization9
Social media warfare: investigating human-bot engagement in English, Japanese and German during the Russo-Ukrainian war on Twitter and Reddit9
The Russian invasion of Ukraine selectively depolarized the Finnish NATO discussion on Twitter9
Explaining human mobility predictions through a pattern matching algorithm9
Studying social networks in the age of computational social science8
A new methodology to measure faultlines at scale leveraging digital traces8
Large-scale digital signatures of emotional response to the COVID-19 vaccination campaign8
Forecasting patient flows with pandemic induced concept drift using explainable machine learning8
Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter7
Mapping language literacy at scale: a case study on Facebook7
Connectivity and community structure of online and register-based social networks7
UTDRM: unsupervised method for training debunked-narrative retrieval models7
Linking physical violence to women’s mobility in Chile7
On the adoption of e-moped sharing systems7
When dialects collide: how socioeconomic mixing affects language use7
Detecting political biases of named entities and hashtags on Twitter7
Tackling racial bias in automated online hate detection: Towards fair and accurate detection of hateful users with geometric deep learning6
Measuring user engagement with low credibility media sources in a controversial online debate6
Evaluating Twitter’s algorithmic amplification of low-credibility content: an observational study6
Quantifying interdisciplinary synergy in higher STEM education6
Design and analysis of tweet-based election models for the 2021 Mexican legislative election6
Construction and analysis of corporate greenwashing index: a deep learning approach6
Characterizing partisan political narrative frameworks about COVID-19 on Twitter6
Uncovering large inconsistencies between machine learning derived gridded settlement datasets6
LEIA: Linguistic Embeddings for the Identification of Affect6
The right to audit and power asymmetries in algorithm auditing6
Structural gender imbalances in ballet collaboration networks6
Sweet tweets! Evaluating a new approach for probability-based sampling of Twitter6
Spatio-temporal changes in racial segregation and diversity in large US cities from 1990 to 2020: a visual data analysis5
Analyzing user ideologies and shared news during the 2019 argentinian elections5
Garbage in garbage out? Impacts of data quality on criminal network intervention5
Detection of anomalous spatio-temporal patterns of app traffic in response to catastrophic events5
Consensus formation on heterogeneous networks5
Early warning signals for stock market crashes: empirical and analytical insights utilizing nonlinear methods5
Charting mobility patterns in the scientific knowledge landscape5
The impact of playlist characteristics on coherence in user-curated music playlists5
Investigating the contribution of author- and publication-specific features to scholars’ h-index prediction5
The role of transport systems in housing insecurity: a mobility-based analysis5
Has Covid-19 permanently changed online purchasing behavior?5
Quantifying participation biases on social media5
Mental health concerns precede quits: shifts in the work discourse during the Covid-19 pandemic and great resignation5
Untangling pair synergy in the evolution of collaborative scientific impact4
Unsupervised detection of coordinated information operations in the wild4
Understanding China’s urban system evolution from web search index data4
Shift in house price estimates during COVID-19 reveals effect of crisis on collective speculation4
Downscaling spatial interaction with socioeconomic attributes4
Unveiling the silent majority: stance detection and characterization of passive users on social media using collaborative filtering and graph convolutional networks4
Profile update: the effects of identity disclosure on network connections and language4
Multiple gravity laws for human mobility within cities4
Comparison of home detection algorithms using smartphone GPS data4
Cryptocurrency co-investment network: token returns reflect investment patterns4
Journalists are most likely to receive abuse: analysing online abuse of UK public figures across sport, politics, and journalism on Twitter4
Public perception of generative AI on Twitter: an empirical study based on occupation and usage4
A new approach to estimate neighborhood socioeconomic status using supermarket transactions and GNNs4
Corruption red flags in public procurement: new evidence from Italian calls for tenders4
Using mobile money data and call detail records to explore the risks of urban migration in Tanzania4
Holistic approach to analysing debates on ecological sustainability over time on X4
Exposure to parks through the lens of urban mobility3
Higher-order structures of local collaboration networks are associated with individual scientific productivity3
A generative model for age and income distribution3
Leveraging augmentation techniques for tasks with unbalancedness within the financial domain: a two-level ensemble approach3
Assessing the complexity of a path search optimization method based on clustering for a transport graph3
Correction to: Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter3
Designing transit routes based on vehicle routing behavior determined through location-based services data3
Identifying latent activity behaviors and lifestyles using mobility data to describe urban dynamics3
Computational approaches for cyber social threats3
A novel activity space approach to discover displacement patterns via mobile phone data: an analysis of the 2023 Türkiye-Syria earthquakes3
Enhancing short-term crime prediction with human mobility flows and deep learning architectures3
Impact and dynamics of hate and counter speech online3
Allotaxonometry and rank-turbulence divergence: a universal instrument for comparing complex systems3
Safe spaces or toxic places? Content moderation and social dynamics of online eating disorder communities3
Using word embeddings to analyse audience effects and individual differences in parenting Subreddits3
The shock, the coping, the resilience: smartphone application use reveals Covid-19 lockdown effects on human behaviors3
Computational reproducibility in computational social science3
Socioeconomic disparities in mobility behavior during the COVID-19 pandemic in developing countries3
Bridging the digital divide: mapping Internet connectivity evolution, inequalities, and resilience in six Brazilian cities3
Computational social science with confidence3
Do poverty and wealth look the same the world over? A comparative study of 12 cities from five high-income countries using street images3
CORAL: COde RepresentAtion learning with weakly-supervised transformers for analyzing data analysis3
Bibliometric cartography of data science: a large-scale analysis on knowledge integration and diffusion3
The presence of occupational structure in online texts based on word embedding NLP models3
Tainted ties: the structure and dynamics of corruption networks extracted from deferred prosecution agreements3
Temporal network analysis using zigzag persistence3
Scaling law of real traffic jams under varying travel demand2
Comparing GPS and cell-based mobile phone data to identify activity participation during the COVID-19 pandemic2
Reaching the bubble may not be enough: news media role in online political polarization2
Correction to: Brexit and bots: characterizing the behaviour of automated accounts on Twitter during the UK election2
A large scale study of reader interactions with images on Wikipedia2
Unsupervised detection of coordinated fake-follower campaigns on social media2
Modelling railway delay propagation as diffusion-like spreading2
Robustness of topological persistence in knowledge distillation for wearable sensor data2
Sustainability of Stack Exchange Q&A communities: the role of trust2
Glitter or gold? Deriving structured insights from sustainability reports via large language models2
Measuring close proximity interactions in summer camps during the COVID-19 pandemic2
Emotions in online rumor diffusion2
Longitudinal modularity, a modularity for link streams2
Correction: Impact and dynamics of hate and counter speech online2
The geometry of suspicious money laundering activities in financial networks2
Human mobility prediction with causal and spatial-constrained multi-task network2
Exposing influence campaigns in the age of LLMs: a behavioral-based AI approach to detecting state-sponsored trolls2
Novel embeddings improve the prediction of risk perception2
AGECovP: identifying ageism and analyzing COVID-19 discourse on older adults in YouTube2
0.049586057662964