We use cookies to distinguish you from other users and to provide you with a better experience on our websites. Close this message to accept cookies or find out how to manage your cookie settings. Learn more about our Privacy Notice... [opens in a new tab]

Mathematical Analysis of PDB-to-SMILES-to-PDB Transformations: Structural Optimization, Coordinate Invariance, and Molecular Interaction Fidelity

06 March 2025, Version 1
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

The bidirectional transformation of molecular structures between the Protein Data Bank (PDB) format and the Simplified Molecular Input Line Entry System (SMILES) is widely used to rectify structural distortions in molecular geometry. This process inherently minimizes energy, optimizes bond lengths, angles, and torsional constraints while introducing a global coordinate shift. In this paper, we provide a mathematical breakdown of PDB-to-SMILES and SMILES-to-PDB transformations and formally prove that while these transformations rectify the molecular distortions, the resulting coordinate shift does not alter molecular interaction properties. We will prove this using molecular mechanics force fields, rigid-body transformations, and quantum mechanical energy functionals, confirming the preservation of key interaction terms governing molecular behaviour.

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.