GigaScience

Papers
(The TQCC of GigaScience is 10. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-09-01 to 2025-09-01.)
ArticleCitations
The curse and blessing of abundance—the evolution of drug interaction databases and their impact on drug network analysis229
DivBrowse—interactive visualization and exploratory data analysis of variant call matrices93
ntsm: an alignment-free, ultra-low-coverage, sequencing technology agnostic, intraspecies sample comparison tool for sample swap detection85
Protein–protein and protein–nucleic acid binding site prediction via interpretable hierarchical geometric deep learning76
MBGC: Multiple Bacteria Genome Compressor70
Democratizing data-independent acquisition proteomics analysis on public cloud infrastructures via the Galaxy framework66
3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources62
The probability of edge existence due to node degree: a baseline for network-based predictions61
Reducing skin microbiome exposure impacts through swine farm biosecurity59
Current status of global conservation and characterisation of wild and cultivated Brassicaceae genetic resources49
Hecatomb: an integrated software platform for viral metagenomics47
CODARFE: Unlocking the prediction of continuous environmental variables based on microbiome46
Qiber3D—an open-source software package for the quantitative analysis of networks from 3D image stacks46
Preventing dataset shift from breaking machine-learning biomarkers38
A high-quality, long-read genome assembly of the endangered ring-tailed lemur (Lemur catta)37
A decade of GigaScience: A perspective on conservation genetics35
CoVEffect: interactive system for mining the effects of SARS-CoV-2 mutations and variants based on deep learning35
FAIR data station for lightweight metadata management and validation of omics studies35
Correction to: Antibiotic resistance genes are differentially mobilized according to resistance mechanism35
Dual-Alpha: a large EEG study for dual-frequency SSVEP brain–computer interface34
Galaxy as a gateway to bioinformatics: Multi-Interface Galaxy Hands-on Training Suite (MIGHTS) for scRNA-seq32
X-ray microtomography imaging of craniofacial hard tissues in selected reptile species with different types of dentition31
The Nencki-Symfonia electroencephalography/event-related potential dataset: Multiple cognitive tasks and resting-state data collected in a sample of healthy adults31
scMAPA: Identification of cell-type–specific alternative polyadenylation in complex tissues31
Cellsnake: a user-friendly tool for single-cell RNA sequencing analysis30
A Case for estradiol: younger brains in women with earlier menarche and later menopause29
gNOMO2: a comprehensive and modular pipeline for integrated multi-omics analyses of microbiomes27
Characteristics and filtering of low-frequency artificial short deletion variations based on nanopore sequencing27
Knowledge graph–based thought: a knowledge graph–enhanced LLM framework for pan-cancer question answering26
WaveSeekerNet: accurate prediction of influenza A virus subtypes and host source using attention-based deep learning26
A virtual library for behavioral performance in standard conditions—rodent spontaneous activity in an open field during repeated testing and after treatment with drugs or brain lesions26
ricu: R’s interface to intensive care data26
Molecular mechanisms underlying hematophagia revealed by comparative analyses of leech genomes26
Studying mutation rate evolution in primates—the effects of computational pipelines and parameter choices25
Early microbial intervention reshapes phenotypes of newborn Bos taurus through metabolic regulations25
vEMstitch: an algorithm for fully automatic image stitching of volume electron microscopy25
A Decade of GigaScience: Milestones in Open Science25
Harnessing population diversity: in search of tools of the trade25
CAT Bridge: an efficient toolkit for gene–metabolite association mining from multiomics data24
xAtlas: scalable small variant calling across heterogeneous next-generation sequencing experiments24
Multiomics uncovers the epigenomic and transcriptomic response to viral and bacterial stimulation in turbot23
Hi-GDT: A Hi-C-based 3D gene domain analysis tool for analyzing local chromatin contacts in plants23
Genomic insights into endangerment and conservation of the garlic-fruit tree (Malania oleifera), a plant species with extremely small populations23
External validation of machine learning models—registered models and adaptive sample splitting23
Computational reproducibility of Jupyter notebooks from biomedical publications23
Unveiling vertebrate development dynamics in frog Xenopus laevis using micro-CT imaging23
Population modeling with machine learning can enhance measures of mental health22
Hiding in plain sight: a research parasite’s perspective on new lessons in old data21
A new haplotype-resolved turkey genome to enable turkey genetics and genomics research21
Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios21
Deterministic succession patterns in the rumen and fecal microbiome associate with host metabolic shifts in peripartum dairy cattle20
An accessible infrastructure for artificial intelligence using a Docker-based JupyterLab in Galaxy20
Spacemake: processing and analysis of large-scale spatial transcriptomics data20
Disentangling river and swamp buffalo genetic diversity: initial insights from the 1000 Buffalo Genomes Project19
PEPhub: a database, web interface, and API for editing, sharing, and validating biological sample metadata19
GADMA2: more efficient and flexible demographic inference from genetic data19
High-quality genome assembles from key Hawaiian coral species19
The Capparis spinosa var. herbacea genome provides the first genomic instrument for a diversity and evolution study of the Capparaceae family18
Retracted and Replaced: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis18
De novoscreening of disease-resistant genes from the chromosome-level genome of rare minnow using CRISPR-cas9 random mutation18
The whole-genome assembly of an endangered Salicaceae species: Chosenia arbutifolia (Pall.) A. Skv18
A near telomere-to-telomere phased genome assembly and annotation for the Australian central bearded dragon Pogona vitticeps18
spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics18
Publishing data to support the fight against human vector-borne diseases18
Lessons learned about the biology and genomics of Diaphorina citri infection with “Candidatus Liberibacter asiaticus” by integrating new and archived organ-specific transcriptome data18
DeePhage: distinguishing virulent and temperate phage-derived sequences in metavirome data with a deep learning approach17
HaploMaker: An improved algorithm for rapid haplotype assembly of genomic sequences17
Loop detection using Hi-C data with HiCExplorer17
Resequencing of a Pekin duck breeding population provides insights into the genomic response to short-term artificial selection17
Data standardization of plant–pollinator interactions17
Haplogenome assembly reveals structural variation in Eucalyptus interspecific hybrids17
A Decade of GigaScience: Women in Science: Past, Present, and Future17
High temporal resolution Nanopore sequencing dataset of SARS-CoV-2 and host cell RNAs16
Deep learning links localized digital pathology phenotypes with transcriptional subtype and patient outcome in glioblastoma16
ssMutPA: single-sample mutation-based pathway analysis approach for cancer precision medicine16
ToxCodAn-Genome: an automated pipeline for toxin-gene annotation in genome assembly of venomous lineages15
Construction and analysis of the chromosome-level haplotype-resolved genomes of two Crassostrea oyster congeners: Crassostrea angulata and Crassostrea gigas15
RWRtoolkit: multi-omic network analysis using random walks on multiplex networks in any species15
LRTK: a platform agnostic toolkit for linked-read analysis of both human genome and metagenome15
On the benefits of self-taught learning for brain decoding15
A chromosome-scale genome assembly of the pioneer plant Stylosanthes angustifolia: insights into genome evolution and drought adaptation15
TF-Prioritizer: a Java pipeline to prioritize condition-specific transcription factors15
Lifting the curse from high-dimensional data: automated projection pursuit clustering for a variety of biological data modalities15
NuCLS: A scalable crowdsourcing approach and dataset for nucleus classification and segmentation in breast cancer15
Long-read and chromosome-scale assembly of the hexaploid wheat genome achieves high resolution for research and breeding15
Telomere-to-telomere chromosome-scale genome assemblies of black and golden koi carp variants support construction of an ancient karyotype of Cypriniformes15
Disease classification for whole-blood DNA methylation: Meta-analysis, missing values imputation, and XAI15
Correction to: Habitat suitability maps for Australian flora and fauna under CMIP6 climate scenarios15
Profiling the baseline performance and limits of machine learning models for adaptive immune receptor repertoire classification14
Large-scale genomic survey with deep learning-based method reveals strain-level phage specificity determinants14
Celebrating 30 years of access to NASA Space Life Sciences data14
Chromosome-level genome and the identification of sex chromosomes in Uloborus diversus14
The telomere-to-telomere (T2T) genome provides insights into the evolution of specialized centromere sequences in sandalwood14
A chromosome-level genome assembly and annotation of the desert horned lizard, Phrynosoma platyrhinos, provides insight into chromosomal rearrangements among reptiles14
learnMSA: learning and aligning large protein families13
Message in a Bottle—Metabarcoding enables biodiversity comparisons across ecoregions13
HeteroMRI: Robust white matter abnormality classification across multi-scanner MRI data13
A novel dataset for nuclei and tissue segmentation in melanoma with baseline nuclei segmentation and tissue segmentation benchmarks13
A near-complete genome assembly of the bearded dragon Pogona vitticeps provides insights into the origin of Pogona sex chromosomes13
EAGS: efficient and adaptive Gaussian smoothing applied to high-resolved spatial transcriptomics13
simAIRR: simulation of adaptive immune repertoires with realistic receptor sequence sharing for benchmarking of immune state prediction methods13
Telomere-to-telomere genome of common bean (Phaseolus vulgaris L., YP4)13
The state of Medusozoa genomics: current evidence and future challenges13
A dataset of ant colonies’ motion trajectories in indoor and outdoor scenes to study clustering behavior12
Whole-genome sequencing of the invasive golden apple snail Pomacea canaliculata from Asia reveals rapid expansion and adaptive evolution12
CoCoPyE: feature engineering for learning and prediction of genome quality indices12
Chromosome-level genome of the venomous snail Kalloconus canariensis: a valuable model for venomics and comparative genomics12
Exploring the cellular and molecular basis of murine cardiac development through spatiotemporal transcriptome sequencing12
A near telomere-to-telomere genome assembly of the Jinhua pig: enabling more accurate genetic research12
Gapless genome assembly and epigenetic profiles reveal gene regulation of whole-genome triplication in lettuce12
Metaphor—A workflow for streamlined assembly and binning of metagenomes11
Monash DaCRA fPET-fMRI: A dataset for comparison of radiotracer administration for high temporal resolution functional FDG-PET11
A Decade of GigaScience: GigaDB and the Open Data Movement11
Living in darkness: Exploring adaptation of Proteus anguinus in 3 dimensions by X-ray imaging11
DENTIST—using long reads for closing assembly gaps at high accuracy11
An overview of the National COVID-19 Chest Imaging Database: data quality and cohort analysis11
Guidance framework to apply best practices in ecological data analysis: lessons learned from building Galaxy-Ecology11
M6Allele: a toolkit for detection of allele-specific RNA N6-methyladenosine modifications11
An in vitro whole-cell electrophysiology dataset of human cortical neurons11
A telomere-to-telomere genome assembly of koi carp (Cyprinus carpio) using long reads and Hi-C technology11
A high-quality assembly revealing the PMEL gene for the unique plumage phenotype in Liancheng ducks11
Chromosome-level reference genome of tetraploid Isoetes sinensis provides insights into evolution and adaption of lycophytes10
Deepdefense: annotation of immune systems in prokaryotes using deep learning10
Katdetectr: an R/bioconductor package utilizing unsupervised changepoint analysis for robust kataegis detection10
Near telomere-to-telomere genome assembly of Mongolian cattle: implications for population genetic variation and beef quality10
Chromosome-level genome assembly of goose provides insight into the adaptation and growth of local goose breeds10
Retraction and replacement of: Telomere-to-telomere genome and resequencing of 254 individuals reveal evolution, genomic footprints in Asian icefish, Protosalanx chinensis10
Chromosome-level genome assemblies of two littorinid marine snails indicate genetic basis of intertidal adaptation and ancient karyotype evolved from bilaterian ancestors10
Evaluation of Swin Transformer and knowledge transfer for denoising of super-resolution structured illumination microscopy data10
A high-quality pseudo-phased genome for Melaleuca quinquenervia shows allelic diversity of NLR-type resistance genes10
On the variability of dynamic functional connectivity assessment methods10
0.055814027786255