EPJ Data Science

Papers
(The median citation count of EPJ Data Science is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-12-01 to 2025-12-01.)
ArticleCitations
Does United Kingdom parliamentary attention follow social media posts?56
Estimating work engagement from online chat tools44
The microvelocity of money in Ethereum35
Effective strategies for targeted attacks to the network of Cosa Nostra affiliates33
A hybrid stock prediction method based on periodic/non-periodic features analyses28
How floods may affect the spatial spread of respiratory pathogens: the case of Emilia-Romagna, Italy in May 202324
Drivers of hate speech in political conversations on Twitter: the case of the 2022 Italian general election23
Simulating conversations on social media with generative agent-based models23
Modeling international mobility using roaming cell phone traces during COVID-19 pandemic23
Endogenous conflict and the limits of predictive optimization23
Analyzing image-based political propaganda in referendum campaigns: from elements to strategies21
Suspended accounts align with the Internet Research Agency misinformation campaign to influence the 2016 US election20
Online advertisement in a pink-colored market20
Assessing geographic polarisation in Britain’s digital landscape through stable dynamic embedding of spatial web data20
Which sport is becoming more predictable? A cross-discipline analysis of predictability in team sports19
Origin and destination attachment: study of cultural integration on Twitter19
Quantifying polarization in online political discourse19
Disentangling degree and tie strength heterogeneity in egocentric social networks19
A path-based approach to analyzing the global liner shipping network18
Dream content discovery from social media using natural language processing17
Detecting coordinated and bot-like behavior in Twitter: the Jürgen Conings case16
Generating mobility networks with generative adversarial networks16
On the duration of face-to-face contacts16
Human mobility reshaped? Deciphering the impacts of the Covid-19 pandemic on activity patterns, spatial habits, and schedule habits16
Leveraging WiFi network logs to infer student collocation and its relationship with academic performance15
Classifying social position with social media behavioral data15
Developing a hierarchical model for unraveling conspiracy theories15
Correction: Temporal network analysis using zigzag persistence14
Identifying urban features for vulnerable road user safety in Europe14
Adaptation of student behavioural routines during Covid-19: a multimodal approach14
Companies under stress: the impact of shocks on the production network14
Computational social science is growing up: why puberty consists of embracing measurement validation, theory development, and open science practices13
Rhythm of the streets: a street classification framework based on street activity patterns13
Understanding rhythmic structures, novelty, and influence in classical music from data13
Analysis and classification of privacy-sensitive content in social media posts13
Evolution of sample-based music authorship network13
Science as exploration in a knowledge landscape: tracing hotspots or seeking opportunity?12
Academic support network reflects doctoral experience and productivity11
Multifaceted online coordinated behavior in the 2020 US presidential election11
Identifying the temporal dynamics of densification and sparsification in human contact networks11
Fair automated assessment of noncompliance in cargo ship networks10
Entropy-based text feature engineering approach for forecasting financial liquidity changes10
Critical computational social science10
The simpliciality of higher-order networks10
Open data and quantitative techniques for anthropology of road traffic10
Milgram’s experiment in the knowledge space: individual navigation strategies10
Can Google Trends predict asylum-seekers’ destination choices?10
News sharing on Twitter reveals emergent fragmentation of media agenda and persistent polarization9
Endogenous labour flow networks9
Large scale analysis of gender bias and sexism in song lyrics9
Temporal patterns of reciprocity in communication networks9
The Russian invasion of Ukraine selectively depolarized the Finnish NATO discussion on Twitter9
Misinformation is not about bad facts: an analysis of the production and consumption of fringe content9
Learning to cluster urban areas: two competitive approaches and an empirical validation8
Mapping language literacy at scale: a case study on Facebook8
Social media warfare: investigating human-bot engagement in English, Japanese and German during the Russo-Ukrainian war on Twitter and Reddit8
Studying social networks in the age of computational social science8
Large-scale digital signatures of emotional response to the COVID-19 vaccination campaign8
Connectivity and community structure of online and register-based social networks8
Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter8
Forecasting patient flows with pandemic induced concept drift using explainable machine learning7
When dialects collide: how socioeconomic mixing affects language use7
UTDRM: unsupervised method for training debunked-narrative retrieval models7
Tackling racial bias in automated online hate detection: Towards fair and accurate detection of hateful users with geometric deep learning7
Is it getting harder to make a hit? Evidence from 65 years of US music chart history7
Detecting political biases of named entities and hashtags on Twitter7
Linking physical violence to women’s mobility in Chile7
The right to audit and power asymmetries in algorithm auditing7
Explaining human mobility predictions through a pattern matching algorithm7
A new methodology to measure faultlines at scale leveraging digital traces7
Design and analysis of tweet-based election models for the 2021 Mexican legislative election7
LEIA: Linguistic Embeddings for the Identification of Affect6
Personalisation and profiling using algorithms and not-so-popular Colombian music: goal-directed mechanisms in music emotion recognition6
Evaluating Twitter’s algorithmic amplification of low-credibility content: an observational study6
Early warning signals for stock market crashes: empirical and analytical insights utilizing nonlinear methods6
Sweet tweets! Evaluating a new approach for probability-based sampling of Twitter6
The role of transport systems in housing insecurity: a mobility-based analysis6
Uncovering large inconsistencies between machine learning derived gridded settlement datasets6
On the adoption of e-moped sharing systems6
Construction and analysis of corporate greenwashing index: a deep learning approach6
Consensus formation on heterogeneous networks6
Analyzing user ideologies and shared news during the 2019 argentinian elections6
Quantifying interdisciplinary synergy in higher STEM education6
Measuring user engagement with low credibility media sources in a controversial online debate6
Structural gender imbalances in ballet collaboration networks6
Spatio-temporal changes in racial segregation and diversity in large US cities from 1990 to 2020: a visual data analysis6
Has Covid-19 permanently changed online purchasing behavior?6
Charting mobility patterns in the scientific knowledge landscape6
The impact of playlist characteristics on coherence in user-curated music playlists5
Quantifying participation biases on social media5
Profile update: the effects of identity disclosure on network connections and language5
Corruption red flags in public procurement: new evidence from Italian calls for tenders5
Multiple gravity laws for human mobility within cities5
Garbage in garbage out? Impacts of data quality on criminal network intervention5
Mental health concerns precede quits: shifts in the work discourse during the Covid-19 pandemic and great resignation5
Unsupervised detection of coordinated information operations in the wild5
Comparison of home detection algorithms using smartphone GPS data5
Detection of anomalous spatio-temporal patterns of app traffic in response to catastrophic events5
Investigating the contribution of author- and publication-specific features to scholars’ h-index prediction5
A new approach to estimate neighborhood socioeconomic status using supermarket transactions and GNNs5
Quantifying the risk of pastoral conflict in 4 central African countries5
Public perception of generative AI on Twitter: an empirical study based on occupation and usage4
Bridging the digital divide: mapping Internet connectivity evolution, inequalities, and resilience in six Brazilian cities4
Socioeconomic disparities in mobility behavior during the COVID-19 pandemic in developing countries4
Identifying latent activity behaviors and lifestyles using mobility data to describe urban dynamics4
A novel activity space approach to discover displacement patterns via mobile phone data: an analysis of the 2023 Türkiye-Syria earthquakes4
Understanding China’s urban system evolution from web search index data4
Journalists are most likely to receive abuse: analysing online abuse of UK public figures across sport, politics, and journalism on Twitter4
Unveiling the silent majority: stance detection and characterization of passive users on social media using collaborative filtering and graph convolutional networks4
Computational approaches for cyber social threats4
The shock, the coping, the resilience: smartphone application use reveals Covid-19 lockdown effects on human behaviors4
Untangling pair synergy in the evolution of collaborative scientific impact4
Holistic approach to analysing debates on ecological sustainability over time on X4
Shift in house price estimates during COVID-19 reveals effect of crisis on collective speculation4
Using mobile money data and call detail records to explore the risks of urban migration in Tanzania4
Enhancing short-term crime prediction with human mobility flows and deep learning architectures4
Computational reproducibility in computational social science4
Downscaling spatial interaction with socioeconomic attributes4
Temporal network analysis using zigzag persistence4
Cryptocurrency co-investment network: token returns reflect investment patterns4
Higher-order structures of local collaboration networks are associated with individual scientific productivity3
Designing transit routes based on vehicle routing behavior determined through location-based services data3
Safe spaces or toxic places? Content moderation and social dynamics of online eating disorder communities3
A generative model for age and income distribution3
Do poverty and wealth look the same the world over? A comparative study of 12 cities from five high-income countries using street images3
Scaling law of real traffic jams under varying travel demand3
Using word embeddings to analyse audience effects and individual differences in parenting Subreddits3
Computational social science with confidence3
Tainted ties: the structure and dynamics of corruption networks extracted from deferred prosecution agreements3
Correction to: Keep your friends close, and your enemies closer: structural properties of negative relationships on Twitter3
Allotaxonometry and rank-turbulence divergence: a universal instrument for comparing complex systems3
Leveraging augmentation techniques for tasks with unbalancedness within the financial domain: a two-level ensemble approach3
Impact and dynamics of hate and counter speech online3
Fusing content and social relationships: a multi-modal heterogeneous graph transformer approach for social bot detection3
Understanding the role of sentiment and emotion for predicting forced displacement3
Exposure to parks through the lens of urban mobility3
CORAL: COde RepresentAtion learning with weakly-supervised transformers for analyzing data analysis3
Bibliometric cartography of data science: a large-scale analysis on knowledge integration and diffusion3
Assessing the complexity of a path search optimization method based on clustering for a transport graph3
Robustness of topological persistence in knowledge distillation for wearable sensor data3
Comparing GPS and cell-based mobile phone data to identify activity participation during the COVID-19 pandemic3
Language and the use of law are predictive of judge gender and seniority2
The penumbra of open source: projects outside of centralized platforms are longer maintained, more academic and more collaborative2
Correction to: Brexit and bots: characterizing the behaviour of automated accounts on Twitter during the UK election2
Glitter or gold? Deriving structured insights from sustainability reports via large language models2
The geometry of suspicious money laundering activities in financial networks2
Sustainability of Stack Exchange Q&A communities: the role of trust2
Analyzing news engagement on Facebook: tracking ideological segregation and news quality in the Facebook URL dataset2
Academic failures and co-location social networks in campus2
Keyword expansion techniques for mining social movement data on social media2
Twitter-MusicPD: melody of minds - navigating user-level data on multiple mental health disorders and music preferences2
A large scale study of reader interactions with images on Wikipedia2
Measuring close proximity interactions in summer camps during the COVID-19 pandemic2
Human mobility prediction with causal and spatial-constrained multi-task network2
Reaching the bubble may not be enough: news media role in online political polarization2
AGECovP: identifying ageism and analyzing COVID-19 discourse on older adults in YouTube2
The geography of interest in international conversations on Twitter2
Correction: Measuring user engagement with low credibility media sources in a controversial online debate2
Understanding trends, patterns, and dynamics in global company acquisitions: a network perspective2
Addressing long-tailed distribution in judicial text for criminal motive classification: a balanced contrastive learning approach2
Novel embeddings improve the prediction of risk perception2
Longitudinal modularity, a modularity for link streams2
Modelling railway delay propagation as diffusion-like spreading2
Correction: Impact and dynamics of hate and counter speech online2
Exposing influence campaigns in the age of LLMs: a behavioral-based AI approach to detecting state-sponsored trolls2
Unsupervised detection of coordinated fake-follower campaigns on social media2
Quantifying digital habits2
Spatial distribution of solar PV deployment: an application of the region-based convolutional neural network2
0.11762404441833