Journal of Cheminformatics

Papers
(The median citation count of Journal of Cheminformatics is 5. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
Biosynfoni: a biosynthesis-informed and interpretable lightweight molecular fingerprint148
Equivariant diffusion for structure-based de novo ligand generation with latent-conditioning97
HERGAI: an artificial intelligence tool for structure-based prediction of hERG inhibitors81
Moldina: a fast and accurate search algorithm for simultaneous docking of multiple ligands80
Generating diversity and securing completeness in algorithmic retrosynthesis78
An explainability framework for deep learning on chemical reactions exemplified by enzyme-catalysed reaction classification70
Exploring the ability of machine learning-based virtual screening models to identify the functional groups responsible for binding66
Dimensionally reduced machine learning model for predicting single component octanol–water partition coefficients63
Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder62
Transformer-based molecular optimization beyond matched molecular pairs59
Explainable uncertainty quantifications for deep learning-based molecular property prediction54
Assessing interaction recovery of predicted protein-ligand poses53
European Registry of Materials: global, unique identifiers for (undisclosed) nanomaterials53
Determining the parent and associated fragment formulae in mass spectrometry via the parent subformula graph51
APBIO: bioactive profiling of air pollutants through inferred bioactivity signatures and prediction of novel target interactions50
Paths to cheminformatics: Q&A with Ann M. Richard49
MolPrice: assessing synthetic accessibility of molecules based on market value47
Computer-aided pattern scoring (C@PS): a novel cheminformatic workflow to predict ligands with rare modes-of-action47
PMF-CPI: assessing drug selectivity with a pretrained multi-functional model for compound–protein interactions45
NPBS Atlas: a comprehensive data resource for exploring the biological sources of natural products44
Reproducible untargeted metabolomics workflow for exhaustive MS2 data acquisition of MS1 features44
AdapTor: Adaptive Topological Regression for quantitative structure–activity relationship modeling43
MLinvitroTox reloaded for high-throughput hazard-based prioritization of high-resolution mass spectrometry data39
Novel molecule design with POWGAN, a policy-optimized Wasserstein generative adversarial network39
Machine learning to predict food effects during drug development: a comprehensive review38
AutoTemplate: enhancing chemical reaction datasets for machine learning applications in organic chemistry38
VSFlow: an open-source ligand-based virtual screening tool38
AiZynthFinder 4.0: developments based on learnings from 3 years of industrial application37
Fifteen years of ChEMBL and its role in cheminformatics and drug discovery36
One chiral fingerprint to find them all36
ELECTRA-DTA: a new compound-protein binding affinity prediction model based on the contextualized sequence encoding36
Shinyscreen: mass spectrometry data inspection and quality checking utility35
Deep learning of multimodal networks with topological regularization for drug repositioning35
Analysis of the benefits of imputation models over traditional QSAR models for toxicity prediction35
Crossover operators for molecular graphs with an application to virtual drug screening35
Implementation of an open chemistry knowledge base with a Semantic Wiki34
Advancements in thermochemical predictions: a multi-output thermodynamics-informed neural network approach33
A quantum chemical dataset of interacting molecular pairs for chemical reaction studies33
The BinDiscover database: a biology-focused meta-analysis tool for 156,000 GC–TOF MS metabolome samples33
NanoBinder: a machine learning assisted nanobody binding prediction tool using Rosetta energy scores33
Bitter peptide prediction using graph neural networks32
Explaining compound activity predictions with a substructure-aware loss for graph neural networks31
Enhancing chemical reaction search through contrastive representation learning and human-in-the-loop30
Prediction model for chemical explosion consequences via multimodal feature fusion30
PyL3dMD: Python LAMMPS 3D molecular descriptors package29
GT-NMR: a novel graph transformer-based approach for accurate prediction of NMR chemical shifts28
PURE: policy-guided unbiased REpresentations for structure-constrained molecular generation28
CRAFT: a web-integrated cavity prediction tool based on flow transfer algorithm27
ProjFusNet: deep neural network for peptide precursor prediction using projection-fused protein language model and structural features27
Accurate structure-activity relationship prediction of antioxidant peptides using a multimodal deep learning framework27
The development of the generative adversarial supporting vector machine for molecular property generation27
Accelerated hit identification with target evaluation, deep learning and automated labs: prospective validation in IRAK127
Predictive modeling of visible-light azo-photoswitches’ properties using structural features27
On the difficulty of validating molecular generative models realistically: a case study on public and proprietary data26
Papyrus: a large-scale curated dataset aimed at bioactivity predictions26
Comprehensive benchmarking of computational tools for predicting toxicokinetic and physicochemical properties of chemicals25
Ionization efficiency prediction of electrospray ionization mass spectrometry analytes based on molecular fingerprints and cumulative neutral losses25
Diversifying cheminformatics25
A systematic review of deep learning chemical language models in recent era25
Structure-based machine learning screening identifies natural product candidates as potential geroprotectors25
Summarizing relationships between chemicals, genes, proteins, and diseases in PubChem using analysis of their co-occurrences in patents25
UMAP-based clustering split for rigorous evaluation of AI models for virtual screening on cancer cell lines*25
canSAR chemistry registration and standardization pipeline25
VGSC-DB: an online database of voltage-gated sodium channels24
New algorithms demonstrate untargeted detection of chemically meaningful changing units and formula assignment for HRMS data of polymeric mixtures in the open-source constellation web application24
Using test-time augmentation to investigate explainable AI: inconsistencies between method, model and human intuition24
Efficient 3D conformer generation of cyclic peptides formed by a disulfide bond23
How to crack a SMILES: automatic crosschecked chemical structure resolution across multiple services using MoleculeResolver23
Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices23
Gromacs MetaDump: a tool for extracting GROMACS simulation metadata23
PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank22
ScaffoldGVAE: scaffold generation and hopping of drug molecules via a variational autoencoder based on multi-view graph neural networks22
Infrared spectrum analysis of organic molecules with neural networks using standard reference data sets in combination with real-world data22
FlavorMiner: a machine learning platform for extracting molecular flavor profiles from structural data21
YoDe-Segmentation: automated noise-free retrieval of molecular structures from scientific publications21
Implementation of a soft grading system for chemistry in a Moodle plugin21
Scaffold Generator: a Java library implementing molecular scaffold functionalities in the Chemistry Development Kit (CDK)21
Visualising lead optimisation series using reduced graphs21
Chemical reaction network knowledge graphs: the OntoRXN ontology20
Application of deep metric learning to molecular graph similarity20
Applying atomistic neural networks to bias conformer ensembles towards bioactive-like conformations20
Reaction rebalancing: a novel approach to curating reaction databases20
Notes on molecular fragmentation and parameter settings for a dissipative particle dynamics study of a C10E4/water mixture with lamellar bilayer formation20
The specification game: rethinking the evaluation of drug response prediction for precision oncology20
Ilm-NMR-P31: an open-access 31P nuclear magnetic resonance database and data-driven prediction of 31P NMR shifts20
Subgrapher: visual fingerprinting of chemical structures20
AI-powered prediction of critical properties and boiling points: a hybrid ensemble learning and QSPR approach19
Activity cliff-aware reinforcement learning for de novo drug design19
Novel multi-objective affinity approach allows to identify pH-specific μ-opioid receptor agonists19
Automatic molecular fragmentation by evolutionary optimisation19
Deepmol: an automated machine and deep learning framework for computational chemistry18
Achieving well-informed decision-making in drug discovery: a comprehensive calibration study using neural network-based structure-activity models18
Towards a partial order graph for interactive pharmacophore exploration: extraction of pharmacophores activity delta18
DeepSA: a deep-learning driven predictor of compound synthesis accessibility18
Off-targetP ML: an open source machine learning framework for off-target panel safety assessment of small molecules18
A transformer based generative chemical language AI model for structural elucidation of organic compounds18
ChemInformatics Model Explorer (CIME): exploratory analysis of chemical model explanations18
TCMSID: a simplified integrated database for drug discovery from traditional chinese medicine17
Searching chemical databases in the pre-history of cheminformatics17
Chemical rules for optimization of chemical mutagenicity via matched molecular pairs analysis and machine learning methods17
Data mining of PubChem bioassay records reveals diverse OXPHOS inhibitory chemotypes as potential therapeutic agents against ovarian cancer17
AI-driven molecular generation of not-patented pharmaceutical compounds using world open patent data17
piscesCSM: prediction of anticancer synergistic drug combinations17
Leveraging computational tools to combat malaria: assessment and development of new therapeutics16
Geometric deep learning for molecular property predictions with chemical accuracy across chemical space16
MolNexTR: a generalized deep learning model for molecular image recognition16
AI/ML methodologies and the future-will they be successful in designing the next generation of new chemical entities?16
Correction: Enhanced Thompson sampling by roulette wheel selection for screening ultralarge combinatorial libraries16
HepatoToxicity Portal (HTP): an integrated database of drug-induced hepatotoxicity knowledgebase and graph neural network-based prediction model16
Machine learning to predict metabolic drug interactions related to cytochrome P450 isozymes16
Enhancing molecular property prediction with quantized GNN models16
Systematic benchmarking of 13 AI methods for predicting cyclic peptide membrane permeability16
Integrating synthetic accessibility with AI-based generative drug design15
VitroBert: modeling DILI by pretraining BERT on in vitro data15
TB-IECS: an accurate machine learning-based scoring function for virtual screening15
Paths to cheminformatics: Q&A with Phyo Phyo Kyaw Zin15
UnCorrupt SMILES: a novel approach to de novo design15
Tuning gradient boosting for imbalanced bioassay modelling with custom loss functions14
Human-in-the-loop active learning for goal-oriented molecule generation14
One size does not fit all: revising traditional paradigms for assessing accuracy of QSAR models used for virtual screening14
LAGNet: better electron density prediction for LCAO-based data and drug-like substances14
qHTSWaterfall: 3-dimensional visualization software for quantitative high-throughput screening (qHTS) data14
Decrypting orphan GPCR drug discovery via multitask learning14
DECIMER—hand-drawn molecule images dataset14
What makes a reaction network “chemical”?14
Benchmarking ML in ADMET predictions: the practical impact of feature representations in ligand-based models14
Nanodesigner: resolving the complex-CDR interdependency with iterative refinement14
rMSIfragment: improving MALDI-MSI lipidomics through automated in-source fragment annotation14
Advancing material property prediction: using physics-informed machine learning models for viscosity14
Automated molecular structure segmentation from documents using ChemSAM14
Predicting the critical micelle concentration of binary surfactant mixtures using machine learning13
Llamol: a dynamic multi-conditional generative transformer for de novo molecular design13
Nipah Virus Inhibitor Knowledgebase (NVIK): a combined evidence approach to prioritise small molecule inhibitors13
Suitability of large language models for extraction of high-quality chemical reaction dataset from patent literature13
PROTEOMAS: a workflow enabling harmonized proteomic meta-analysis and proteomic signature mapping13
XSMILES: interactive visualization for molecules, SMILES and XAI attribution scores13
Chemical characteristics vectors map the chemical space of natural biomes from untargeted mass spectrometry data13
A novel multitask learning algorithm for tasks with distinct chemical space: zebrafish toxicity prediction as an example13
Solvent flashcards: a visualisation tool for sustainable chemistry13
VNFlow: integration of variational autoencoders and normalizing flows for novel molecular design13
SMILES all around: structure to SMILES conversion for transition metal complexes13
PIKAChU: a Python-based informatics kit for analysing chemical units13
OWSum: algorithmic odor prediction and insight into structure-odor relationships13
ReMODE: a deep learning-based web server for target-specific drug design13
Prediction of UGT-mediated phase II metabolism via ligand- and structure-based predictive models13
Application of machine reading comprehension techniques for named entity recognition in materials science13
Ontologies4Cat: investigating the landscape of ontologies for catalysis research data management13
Enhanced Thompson sampling by roulette wheel selection for screening ultralarge combinatorial libraries12
Galaxy workflows for fragment-based virtual screening: a case study on the SARS-CoV-2 main protease12
Machine learning approaches to optimize small-molecule inhibitors for RNA targeting12
Large-scale comparison of machine learning methods for profiling prediction of kinase inhibitors12
Evaluating uncertainty-based active learning for accelerating the generalization of molecular property prediction12
InterDILI: interpretable prediction of drug-induced liver injury through permutation feature importance and attention mechanism12
Uncovering molecular determinants of potency and binding affinity in hit compounds targeting FGF14/Nav1.6 complex12
MDDI-SCL: predicting multi-type drug-drug interactions via supervised contrastive learning12
From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials12
PKSmart: an open-source computational model to predict intravenous pharmacokinetics of small molecules12
Evaluation of chirality descriptors derived from SMILES heteroencoders12
PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models12
Capsule graph networks for accurate and interpretable crystalline materials property prediction12
Pretraining graph transformers with atom-in-a-molecule quantum properties for improved ADMET modeling12
HobPre: accurate prediction of human oral bioavailability for small molecules11
kMoL: an open-source machine and federated learning library for drug discovery11
Correction: Reconstruction of lossless molecular representations from fingerprints11
E-GuARD: expert-guided augmentation for the robust detection of compounds interfering with biological assays11
Integrating QSAR modelling with reinforcement learning for Syk inhibitor discovery11
Prediction of blood–brain barrier and Caco-2 permeability through the Enalos Cloud Platform: combining contrastive learning and atom-attention message passing neural networks11
Building shape-focused pharmacophore models for effective docking screening11
DrugDiff: small molecule diffusion model with flexible guidance towards molecular properties11
Naturally-meaningful and efficient descriptors: machine learning of material properties based on robust one-shot ab initio descriptors11
Large language models open new way of AI-assisted molecule design for chemists10
A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL10
Benchmarking molecular conformer augmentation with context-enriched training: graph-based transformer versus GNN models10
Small molecule autoencoders: architecture engineering to optimize latent space utility and sustainability10
Correction to: TorsiFlex: an automatic generator of torsional conformers. Application to the twenty proteinogenic amino acids10
Double-head transformer neural network for molecular property prediction10
Generate what you can make: achieving in-house synthesizability with readily available resources in de novo drug design10
Enhancing atom mapping with multitask learning and symmetry-aware deep graph matching10
Machine learning-driven generation and screening of potential ionic liquids for cellulose dissolution10
TraceMetrix: a traceable metabolomics interactive analysis platform10
Advantages of two quantum programming platforms in quantum computing and quantum chemistry10
A molecule perturbation software library and its application to study the effects of molecular design constraints10
TUCAN: A molecular identifier and descriptor applicable to the whole periodic table from hydrogen to oganesson9
Improving the performance of models for one-step retrosynthesis through re-ranking9
Combatting over-specialization bias in growing chemical databases9
Classification of battery compounds using structure-free Mendeleev encodings9
PubChem synonym filtering process using crowdsourcing9
Contrastive explanations for machine learning predictions in chemistry9
Relative molecule self-attention transformer9
InflamNat: web-based database and predictor of anti-inflammatory natural products9
StreaMD: the toolkit for high-throughput molecular dynamics simulations9
Explaining and avoiding failure modes in goal-directed generation of small molecules9
PermuteDDS: a permutable feature fusion network for drug-drug synergy prediction9
Matched pairs demonstrate robustness against inter-assay variability9
BitterMatch: recommendation systems for matching molecules with bitter taste receptors9
COMA: efficient structure-constrained molecular generation using contractive and margin losses9
Syn-MolOpt: a synthesis planning-driven molecular optimization method using data-derived functional reaction templates9
Transfer learning across different chemical domains: virtual screening of organic materials with deep learning models pretrained on small molecule and chemical reaction data9
A comprehensive comparison of deep learning-based compound-target interaction prediction models to unveil guiding design principles9
Barlow Twins deep neural network for advanced 1D drug–target interaction prediction9
Physicochemical modelling of the retention mechanism of temperature-responsive polymeric columns for HPLC through machine learning algorithms9
Reproducible MS/MS library cleaning pipeline in matchms8
PINNED: identifying characteristics of druggable human proteins using an interpretable neural network8
Exploring QSAR models for activity-cliff prediction8
xBitterT5: an explainable transformer-based framework with multimodal inputs for identifying bitter-taste peptides8
Enhancing molecular property prediction with auxiliary learning and task-specific adaptation8
Unveiling polyphenol-protein interactions: a comprehensive computational analysis8
DLM-DTI: a dual language model for the prediction of drug-target interaction with hint-based learning8
In-silico target prediction by ensemble chemogenomic model based on multi-scale information of chemical structures and protein sequences8
Principles and requirements for nanomaterial representations to facilitate machine processing and cooperation with nanoinformatics tools8
BioisoIdentifier: an online free tool to investigate local structural replacements from PDB8
Large-scale evaluation of k-fold cross-validation ensembles for uncertainty estimation8
Improving VAE based molecular representations for compound property prediction8
GloMPO (Globally Managed Parallel Optimization): a tool for expensive, black-box optimizations, application to ReaxFF reparameterizations8
Improved estimation of intrinsic solubility of drug-like molecules through multi-task graph transformer8
DeepRNA-DTI: a deep learning approach for RNA-compound interaction prediction with binding site interpretability8
TransExION: a transformer based explainable similarity metric for comparing IONS in tandem mass spectrometry8
Advancements in hand-drawn chemical structure recognition through an enhanced DECIMER architecture7
Positional embeddings and zero-shot learning using BERT for molecular-property prediction7
Evaluating the generalizability of graph neural networks for predicting collision cross section7
Uncertain of uncertainties? A comparison of uncertainty quantification metrics for chemical data sets7
Evaluating ligand docking methods for drugging protein–protein interfaces: insights from AlphaFold2 and molecular dynamics refinement7
hERGAT: predicting hERG blockers using graph attention mechanism through atom- and molecule-level interaction analyses7
Graph neural processes for molecules: an evaluation on docking scores and strategies to improve generalization7
An end-to-end method for predicting compound-protein interactions based on simplified homogeneous graph convolutional network and pre-trained language model7
UmetaFlow: an untargeted metabolomics workflow for high-throughput data processing and analysis7
From molecules to data: the emerging impact of chemoinformatics in chemistry7
Investigation of the structure-odor relationship using a Transformer model7
Mass-Suite: a novel open-source python package for high-resolution mass spectrometry data analysis7
Multi-fidelity graph neural networks for predicting toluene/water partition coefficients7
Enhancing multi-task in vivo toxicity prediction via integrated knowledge transfer of chemical knowledge and in vitro toxicity information7
SMPR: a structure-enhanced multimodal drug‒disease prediction model for drug repositioning and cold start7
From theory to experiment: transformer-based generation enables rapid discovery of novel reactions7
Free and open-source QSAR-ready workflow for automated standardization of chemical structures in support of QSAR modeling7
A workflow for deriving chemical entities from crystallographic data and its application to the Crystallography Open Database7
Box embeddings for extending ontologies: a data-driven and interpretable approach7
An automated calculation pipeline for differential pair interaction energies with molecular force fields using the Tinker Molecular Modeling Package7
Integrating concept of pharmacophore with graph neural networks for chemical property prediction and interpretation7
“DompeKeys”: a set of novel substructure-based descriptors for efficient chemical space mapping, development and structural interpretation of machine learning models, and indexing of large databases7
Application of the digital annealer unit in optimizing chemical reaction conditions for enhanced production yields7
Predicting RP-LC retention indices of structurally unknown chemicals from mass spectrometry data6
Predicting protein network topology clusters from chemical structure using deep learning6
A look back at a pilot of the citation typing ontology6
A new workflow for the effective curation of membrane permeability data from open ADME information6
Accelerating the inference of string generation-based chemical reaction models for industrial applications6
Biomedical data analyses facilitated by open cheminformatics workflows6
Multi-modal contrastive drug synergy prediction model guided by single modality6
Improving the quality of chemical language model outcomes with atom-in-SMILES tokenization6
Conditional reduction of the loss value versus reinforcement learning for biassing a de-novo drug design generator6
Group graph: a molecular graph representation with enhanced performance, efficiency and interpretability6
0.17413282394409