EPJ Data Science

Papers
(The median citation count of EPJ Data Science is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
Does United Kingdom parliamentary attention follow social media posts?51
Drivers of hate speech in political conversations on Twitter: the case of the 2022 Italian general election44
Estimating work engagement from online chat tools36
The microvelocity of money in Ethereum35
How floods may affect the spatial spread of respiratory pathogens: the case of Emilia-Romagna, Italy in May 202327
Effective strategies for targeted attacks to the network of Cosa Nostra affiliates25
A hybrid stock prediction method based on periodic/non-periodic features analyses23
Modeling international mobility using roaming cell phone traces during COVID-19 pandemic23
Assessing geographic polarisation in Britain’s digital landscape through stable dynamic embedding of spatial web data22
Detecting coordinated and bot-like behavior in Twitter: the Jürgen Conings case22
Human mobility reshaped? Deciphering the impacts of the Covid-19 pandemic on activity patterns, spatial habits, and schedule habits22
Origin and destination attachment: study of cultural integration on Twitter21
Online advertisement in a pink-colored market21
Which sport is becoming more predictable? A cross-discipline analysis of predictability in team sports20
A path-based approach to analyzing the global liner shipping network19
Disentangling degree and tie strength heterogeneity in egocentric social networks19
Analyzing image-based political propaganda in referendum campaigns: from elements to strategies18
Suspended accounts align with the Internet Research Agency misinformation campaign to influence the 2016 US election18
Quantifying polarization in online political discourse18
Dream content discovery from social media using natural language processing18
Extracting complements and substitutes from sales data: a network perspective17
Developing a hierarchical model for unraveling conspiracy theories17
Generating mobility networks with generative adversarial networks17
On the duration of face-to-face contacts16
Identifying urban features for vulnerable road user safety in Europe16
Leveraging WiFi network logs to infer student collocation and its relationship with academic performance16
Correction: Temporal network analysis using zigzag persistence15
Evolution of sample-based music authorship network15
Science as exploration in a knowledge landscape: tracing hotspots or seeking opportunity?13
Companies under stress: the impact of shocks on the production network13
Academic support network reflects doctoral experience and productivity13
Analysis and classification of privacy-sensitive content in social media posts13
Computational social science is growing up: why puberty consists of embracing measurement validation, theory development, and open science practices12
Multifaceted online coordinated behavior in the 2020 US presidential election12
Adaptation of student behavioural routines during Covid-19: a multimodal approach12
Open data and quantitative techniques for anthropology of road traffic12
Rhythm of the streets: a street classification framework based on street activity patterns12
Identifying the temporal dynamics of densification and sparsification in human contact networks12
Milgram’s experiment in the knowledge space: individual navigation strategies11
Critical computational social science11
Can Google Trends predict asylum-seekers’ destination choices?11
Fair automated assessment of noncompliance in cargo ship networks11
Large scale analysis of gender bias and sexism in song lyrics10
The simpliciality of higher-order networks10
Endogenous labour flow networks10
Temporal patterns of reciprocity in communication networks10
News sharing on Twitter reveals emergent fragmentation of media agenda and persistent polarization10
Entropy-based text feature engineering approach for forecasting financial liquidity changes10
Learning to cluster urban areas: two competitive approaches and an empirical validation9
Social media warfare: investigating human-bot engagement in English, Japanese and German during the Russo-Ukrainian war on Twitter and Reddit9
Misinformation is not about bad facts: an analysis of the production and consumption of fringe content9
Connectivity and community structure of online and register-based social networks9
The Russian invasion of Ukraine selectively depolarized the Finnish NATO discussion on Twitter9
Explaining human mobility predictions through a pattern matching algorithm9
Large-scale digital signatures of emotional response to the COVID-19 vaccination campaign8
Forecasting patient flows with pandemic induced concept drift using explainable machine learning8
Studying social networks in the age of computational social science8
A new methodology to measure faultlines at scale leveraging digital traces8
Mapping language literacy at scale: a case study on Facebook7
Detecting political biases of named entities and hashtags on Twitter7
Linking physical violence to women’s mobility in Chile7
Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter7
When dialects collide: how socioeconomic mixing affects language use7
UTDRM: unsupervised method for training debunked-narrative retrieval models7
LEIA: Linguistic Embeddings for the Identification of Affect6
Tackling racial bias in automated online hate detection: Towards fair and accurate detection of hateful users with geometric deep learning6
Sweet tweets! Evaluating a new approach for probability-based sampling of Twitter6
On the adoption of e-moped sharing systems6
Measuring user engagement with low credibility media sources in a controversial online debate6
The right to audit and power asymmetries in algorithm auditing6
Evaluating Twitter’s algorithmic amplification of low-credibility content: an observational study6
Design and analysis of tweet-based election models for the 2021 Mexican legislative election6
Construction and analysis of corporate greenwashing index: a deep learning approach6
Characterizing partisan political narrative frameworks about COVID-19 on Twitter6
Structural gender imbalances in ballet collaboration networks6
Consensus formation on heterogeneous networks5
Spatio-temporal changes in racial segregation and diversity in large US cities from 1990 to 2020: a visual data analysis5
Has Covid-19 permanently changed online purchasing behavior?5
Garbage in garbage out? Impacts of data quality on criminal network intervention5
Detection of anomalous spatio-temporal patterns of app traffic in response to catastrophic events5
Early warning signals for stock market crashes: empirical and analytical insights utilizing nonlinear methods5
Analyzing user ideologies and shared news during the 2019 argentinian elections5
The impact of playlist characteristics on coherence in user-curated music playlists5
Unsupervised detection of coordinated information operations in the wild5
The role of transport systems in housing insecurity: a mobility-based analysis5
Charting mobility patterns in the scientific knowledge landscape5
Quantifying participation biases on social media5
Mental health concerns precede quits: shifts in the work discourse during the Covid-19 pandemic and great resignation5
Investigating the contribution of author- and publication-specific features to scholars’ h-index prediction5
Public perception of generative AI on Twitter: an empirical study based on occupation and usage4
Understanding China’s urban system evolution from web search index data4
Shift in house price estimates during COVID-19 reveals effect of crisis on collective speculation4
Downscaling spatial interaction with socioeconomic attributes4
Unveiling the silent majority: stance detection and characterization of passive users on social media using collaborative filtering and graph convolutional networks4
Profile update: the effects of identity disclosure on network connections and language4
Comparison of home detection algorithms using smartphone GPS data4
Multiple gravity laws for human mobility within cities4
Cryptocurrency co-investment network: token returns reflect investment patterns4
Journalists are most likely to receive abuse: analysing online abuse of UK public figures across sport, politics, and journalism on Twitter4
A new approach to estimate neighborhood socioeconomic status using supermarket transactions and GNNs4
Corruption red flags in public procurement: new evidence from Italian calls for tenders4
Using mobile money data and call detail records to explore the risks of urban migration in Tanzania4
Holistic approach to analysing debates on ecological sustainability over time on X4
Untangling pair synergy in the evolution of collaborative scientific impact4
Correction to: Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter3
The shock, the coping, the resilience: smartphone application use reveals Covid-19 lockdown effects on human behaviors3
Enhancing short-term crime prediction with human mobility flows and deep learning architectures3
Socioeconomic disparities in mobility behavior during the COVID-19 pandemic in developing countries3
Exposure to parks through the lens of urban mobility3
A generative model for age and income distribution3
Do poverty and wealth look the same the world over? A comparative study of 12 cities from five high-income countries using street images3
Assessing the complexity of a path search optimization method based on clustering for a transport graph3
Heaps’ law and vocabulary richness in the history of classical music harmony3
Higher-order structures of local collaboration networks are associated with individual scientific productivity3
Tainted ties: the structure and dynamics of corruption networks extracted from deferred prosecution agreements3
Bridging the digital divide: mapping Internet connectivity evolution, inequalities, and resilience in six Brazilian cities3
The presence of occupational structure in online texts based on word embedding NLP models3
Temporal network analysis using zigzag persistence3
Impact and dynamics of hate and counter speech online3
Using word embeddings to analyse audience effects and individual differences in parenting Subreddits3
Allotaxonometry and rank-turbulence divergence: a universal instrument for comparing complex systems3
CORAL: COde RepresentAtion learning with weakly-supervised transformers for analyzing data analysis3
Designing transit routes based on vehicle routing behavior determined through location-based services data3
Computational reproducibility in computational social science3
Computational approaches for cyber social threats3
Identifying latent activity behaviors and lifestyles using mobility data to describe urban dynamics3
Computational social science with confidence3
Bibliometric cartography of data science: a large-scale analysis on knowledge integration and diffusion3
Leveraging augmentation techniques for tasks with unbalancedness within the financial domain: a two-level ensemble approach3
Correction to: Brexit and bots: characterizing the behaviour of automated accounts on Twitter during the UK election2
Novel embeddings improve the prediction of risk perception2
Emotions in online rumor diffusion2
Understanding trends, patterns, and dynamics in global company acquisitions: a network perspective2
Putting human behavior predictability in context2
Scaling law of real traffic jams under varying travel demand2
Human mobility prediction with causal and spatial-constrained multi-task network2
Both sides of the story: comparing student-level data on reading performance from administrative registers to application generated data from a reading app2
Glitter or gold? Deriving structured insights from sustainability reports via large language models2
A large scale study of reader interactions with images on Wikipedia2
Addressing long-tailed distribution in judicial text for criminal motive classification: a balanced contrastive learning approach2
Exposing influence campaigns in the age of LLMs: a behavioral-based AI approach to detecting state-sponsored trolls2
Sustainability of Stack Exchange Q&A communities: the role of trust2
Robustness of topological persistence in knowledge distillation for wearable sensor data2
Comparing GPS and cell-based mobile phone data to identify activity participation during the COVID-19 pandemic2
Longitudinal modularity, a modularity for link streams2
Measuring close proximity interactions in summer camps during the COVID-19 pandemic2
Unsupervised detection of coordinated fake-follower campaigns on social media2
Language and the use of law are predictive of judge gender and seniority2
Modelling railway delay propagation as diffusion-like spreading2
The geometry of suspicious money laundering activities in financial networks2
Correction: Impact and dynamics of hate and counter speech online2
Reaching the bubble may not be enough: news media role in online political polarization2
0.047366857528687