Journal of Cheminformatics

Papers
(The TQCC of Journal of Cheminformatics is 11. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-10-01 to 2025-10-01.)
ArticleCitations
Predicting chemical structure using reinforcement learning with a stack-augmented conditional variational autoencoder115
Unexpected similarity between HIV-1 reverse transcriptase and tumor necrosis factor binding sites revealed by computer vision109
Transformer-based molecular optimization beyond matched molecular pairs73
Biosynfoni: a biosynthesis-informed and interpretable lightweight molecular fingerprint71
HERGAI: an artificial intelligence tool for structure-based prediction of hERG inhibitors70
Explainable uncertainty quantifications for deep learning-based molecular property prediction69
An explainability framework for deep learning on chemical reactions exemplified by enzyme-catalysed reaction classification62
Dimensionally reduced machine learning model for predicting single component octanol–water partition coefficients61
Assessing interaction recovery of predicted protein-ligand poses58
Equivariant diffusion for structure-based de novo ligand generation with latent-conditioning58
Exploring the ability of machine learning-based virtual screening models to identify the functional groups responsible for binding56
Generating diversity and securing completeness in algorithmic retrosynthesis52
Moldina: a fast and accurate search algorithm for simultaneous docking of multiple ligands52
European Registry of Materials: global, unique identifiers for (undisclosed) nanomaterials49
Determining the parent and associated fragment formulae in mass spectrometry via the parent subformula graph48
Fifteen years of ChEMBL and its role in cheminformatics and drug discovery47
APBIO: bioactive profiling of air pollutants through inferred bioactivity signatures and prediction of novel target interactions46
Computer-aided pattern scoring (C@PS): a novel cheminformatic workflow to predict ligands with rare modes-of-action46
PMF-CPI: assessing drug selectivity with a pretrained multi-functional model for compound–protein interactions44
MLinvitroTox reloaded for high-throughput hazard-based prioritization of high-resolution mass spectrometry data42
AutoTemplate: enhancing chemical reaction datasets for machine learning applications in organic chemistry42
Paths to cheminformatics: Q&A with Ann M. Richard41
VSFlow: an open-source ligand-based virtual screening tool40
One chiral fingerprint to find them all39
AdapTor: Adaptive Topological Regression for quantitative structure–activity relationship modeling38
ELECTRA-DTA: a new compound-protein binding affinity prediction model based on the contextualized sequence encoding38
Deep learning of multimodal networks with topological regularization for drug repositioning37
Reproducible untargeted metabolomics workflow for exhaustive MS2 data acquisition of MS1 features34
MolPrice: assessing synthetic accessibility of molecules based on market value34
AiZynthFinder 4.0: developments based on learnings from 3 years of industrial application34
Analysis of the benefits of imputation models over traditional QSAR models for toxicity prediction33
Bitter peptide prediction using graph neural networks33
NanoBinder: a machine learning assisted nanobody binding prediction tool using Rosetta energy scores32
Crossover operators for molecular graphs with an application to virtual drug screening31
Explaining compound activity predictions with a substructure-aware loss for graph neural networks31
Advancements in thermochemical predictions: a multi-output thermodynamics-informed neural network approach31
Shinyscreen: mass spectrometry data inspection and quality checking utility31
Implementation of an open chemistry knowledge base with a Semantic Wiki31
Chemical toxicity prediction based on semi-supervised learning and graph convolutional neural network30
Splitting chemical structure data sets for federated privacy-preserving machine learning30
The BinDiscover database: a biology-focused meta-analysis tool for 156,000 GC–TOF MS metabolome samples30
Structure-based machine learning screening identifies natural product candidates as potential geroprotectors29
Semi-automated workflow for molecular pair analysis and QSAR-assisted transformation space expansion29
GT-NMR: a novel graph transformer-based approach for accurate prediction of NMR chemical shifts28
Accelerated hit identification with target evaluation, deep learning and automated labs: prospective validation in IRAK128
Prediction model for chemical explosion consequences via multimodal feature fusion28
The development of the generative adversarial supporting vector machine for molecular property generation28
Exploration and augmentation of pharmacological space via adversarial auto-encoder model for facilitating kinase-centric drug development27
Papyrus: a large-scale curated dataset aimed at bioactivity predictions26
Predictive modeling of visible-light azo-photoswitches’ properties using structural features26
On the difficulty of validating molecular generative models realistically: a case study on public and proprietary data26
Enhancing chemical reaction search through contrastive representation learning and human-in-the-loop26
A systematic review of deep learning chemical language models in recent era24
CRAFT: a web-integrated cavity prediction tool based on flow transfer algorithm24
PyL3dMD: Python LAMMPS 3D molecular descriptors package24
Comprehensive benchmarking of computational tools for predicting toxicokinetic and physicochemical properties of chemicals24
canSAR chemistry registration and standardization pipeline24
Scaffold Generator: a Java library implementing molecular scaffold functionalities in the Chemistry Development Kit (CDK)23
UMAP-based clustering split for rigorous evaluation of AI models for virtual screening on cancer cell lines*23
New algorithms demonstrate untargeted detection of chemically meaningful changing units and formula assignment for HRMS data of polymeric mixtures in the open-source constellation web application23
FlavorMiner: a machine learning platform for extracting molecular flavor profiles from structural data23
Diversifying cheminformatics23
VGSC-DB: an online database of voltage-gated sodium channels22
Using test-time augmentation to investigate explainable AI: inconsistencies between method, model and human intuition22
Rxn-INSIGHT: fast chemical reaction analysis using bond-electron matrices22
PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank22
How to crack a SMILES: automatic crosschecked chemical structure resolution across multiple services using MoleculeResolver22
Reaction rebalancing: a novel approach to curating reaction databases22
Efficient 3D conformer generation of cyclic peptides formed by a disulfide bond22
Infrared spectrum analysis of organic molecules with neural networks using standard reference data sets in combination with real-world data22
ScaffoldGVAE: scaffold generation and hopping of drug molecules via a variational autoencoder based on multi-view graph neural networks22
Automatic molecular fragmentation by evolutionary optimisation21
YoDe-Segmentation: automated noise-free retrieval of molecular structures from scientific publications21
Implementation of a soft grading system for chemistry in a Moodle plugin21
Notes on molecular fragmentation and parameter settings for a dissipative particle dynamics study of a C10E4/water mixture with lamellar bilayer formation21
Applying atomistic neural networks to bias conformer ensembles towards bioactive-like conformations21
Chemical reaction network knowledge graphs: the OntoRXN ontology21
Visualising lead optimisation series using reduced graphs20
Ilm-NMR-P31: an open-access 31P nuclear magnetic resonance database and data-driven prediction of 31P NMR shifts20
Application of deep metric learning to molecular graph similarity20
AI-powered prediction of critical properties and boiling points: a hybrid ensemble learning and QSPR approach20
Novel multi-objective affinity approach allows to identify pH-specific μ-opioid receptor agonists18
Activity cliff-aware reinforcement learning for de novo drug design18
Off-targetP ML: an open source machine learning framework for off-target panel safety assessment of small molecules18
Subgrapher: visual fingerprinting of chemical structures18
AI-driven molecular generation of not-patented pharmaceutical compounds using world open patent data18
Data mining of PubChem bioassay records reveals diverse OXPHOS inhibitory chemotypes as potential therapeutic agents against ovarian cancer18
The specification game: rethinking the evaluation of drug response prediction for precision oncology18
A transformer based generative chemical language AI model for structural elucidation of organic compounds17
Towards a partial order graph for interactive pharmacophore exploration: extraction of pharmacophores activity delta17
TCMSID: a simplified integrated database for drug discovery from traditional chinese medicine17
ChemInformatics Model Explorer (CIME): exploratory analysis of chemical model explanations17
Achieving well-informed decision-making in drug discovery: a comprehensive calibration study using neural network-based structure-activity models16
DeepSA: a deep-learning driven predictor of compound synthesis accessibility16
Searching chemical databases in the pre-history of cheminformatics16
Deepmol: an automated machine and deep learning framework for computational chemistry16
Chemical rules for optimization of chemical mutagenicity via matched molecular pairs analysis and machine learning methods16
UnCorrupt SMILES: a novel approach to de novo design15
Leveraging computational tools to combat malaria: assessment and development of new therapeutics15
Machine learning to predict metabolic drug interactions related to cytochrome P450 isozymes15
piscesCSM: prediction of anticancer synergistic drug combinations15
AI/ML methodologies and the future-will they be successful in designing the next generation of new chemical entities?15
Integrating synthetic accessibility with AI-based generative drug design15
Systematic benchmarking of 13 AI methods for predicting cyclic peptide membrane permeability15
HepatoToxicity Portal (HTP): an integrated database of drug-induced hepatotoxicity knowledgebase and graph neural network-based prediction model15
MolNexTR: a generalized deep learning model for molecular image recognition15
TB-IECS: an accurate machine learning-based scoring function for virtual screening15
Geometric deep learning for molecular property predictions with chemical accuracy across chemical space15
Enhancing molecular property prediction with quantized GNN models15
Paths to cheminformatics: Q&A with Phyo Phyo Kyaw Zin14
DECIMER—hand-drawn molecule images dataset14
Nanodesigner: resolving the complex-CDR interdependency with iterative refinement14
VitroBert: modeling DILI by pretraining BERT on in vitro data14
What makes a reaction network “chemical”?13
One size does not fit all: revising traditional paradigms for assessing accuracy of QSAR models used for virtual screening13
rMSIfragment: improving MALDI-MSI lipidomics through automated in-source fragment annotation13
Tuning gradient boosting for imbalanced bioassay modelling with custom loss functions13
Human-in-the-loop active learning for goal-oriented molecule generation13
Decrypting orphan GPCR drug discovery via multitask learning13
qHTSWaterfall: 3-dimensional visualization software for quantitative high-throughput screening (qHTS) data12
Application of machine reading comprehension techniques for named entity recognition in materials science12
Llamol: a dynamic multi-conditional generative transformer for de novo molecular design12
SMILES all around: structure to SMILES conversion for transition metal complexes12
Benchmarking ML in ADMET predictions: the practical impact of feature representations in ligand-based models12
Advancing material property prediction: using physics-informed machine learning models for viscosity12
Solvent flashcards: a visualisation tool for sustainable chemistry12
Ontologies4Cat: investigating the landscape of ontologies for catalysis research data management12
Automated molecular structure segmentation from documents using ChemSAM12
A novel multitask learning algorithm for tasks with distinct chemical space: zebrafish toxicity prediction as an example12
The effect of noise on the predictive limit of QSAR models12
LAGNet: better electron density prediction for LCAO-based data and drug-like substances12
PROTEOMAS: a workflow enabling harmonized proteomic meta-analysis and proteomic signature mapping12
PIKAChU: a Python-based informatics kit for analysing chemical units12
Integrating QSAR modelling with reinforcement learning for Syk inhibitor discovery11
Naturally-meaningful and efficient descriptors: machine learning of material properties based on robust one-shot ab initio descriptors11
PromptSMILES: prompting for scaffold decoration and fragment linking in chemical language models11
From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials11
Large-scale comparison of machine learning methods for profiling prediction of kinase inhibitors11
XSMILES: interactive visualization for molecules, SMILES and XAI attribution scores11
Machine learning approaches to optimize small-molecule inhibitors for RNA targeting11
Evaluation of chirality descriptors derived from SMILES heteroencoders11
Large language models open new way of AI-assisted molecule design for chemists11
Deep scaffold hopping with multimodal transformer neural networks11
kMoL: an open-source machine and federated learning library for drug discovery11
Chemical characteristics vectors map the chemical space of natural biomes from untargeted mass spectrometry data11
PKSmart: an open-source computational model to predict intravenous pharmacokinetics of small molecules11
Galaxy workflows for fragment-based virtual screening: a case study on the SARS-CoV-2 main protease11
Suitability of large language models for extraction of high-quality chemical reaction dataset from patent literature11
OWSum: algorithmic odor prediction and insight into structure-odor relationships11
Building shape-focused pharmacophore models for effective docking screening11
Prediction of blood–brain barrier and Caco-2 permeability through the Enalos Cloud Platform: combining contrastive learning and atom-attention message passing neural networks11
HobPre: accurate prediction of human oral bioavailability for small molecules11
Correction: Reconstruction of lossless molecular representations from fingerprints11
InterDILI: interpretable prediction of drug-induced liver injury through permutation feature importance and attention mechanism11
Pretraining graph transformers with atom-in-a-molecule quantum properties for improved ADMET modeling11
ReMODE: a deep learning-based web server for target-specific drug design11
Evaluating uncertainty-based active learning for accelerating the generalization of molecular property prediction11
MDDI-SCL: predicting multi-type drug-drug interactions via supervised contrastive learning11
E-GuARD: expert-guided augmentation for the robust detection of compounds interfering with biological assays11
0.13881206512451