-
Notifications
You must be signed in to change notification settings - Fork 24
Extract data on harmonic bonds #73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
More data from ff14SB
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## master #73 +/- ##
=======================================
Coverage 95.32% 95.32%
=======================================
Files 14 14
Lines 2033 2033
=======================================
Hits 1938 1938
Misses 95 95 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
This is the rather complex topic of atom types and classes, see http://docs.openmm.org/latest/userguide/application/06_creating_ffs.html#atomtypes for example. Carbons in similar chemical environments, such as generic tetrahedral carbons (CT), are given the same atom type and consequently the same Lennard-Jones parameters.
Not sure.
As I recall OpenMM allows both orders in its parsing. What is the intended use case of having the bond data in BioStructures? I ask because the values here are associated specifically with this force field, compared to the bonding topology in #66 which is a property of the amino acids themselves. Maybe we could have the parsing machinery but not check the values into source for data specific to a force field. |
Thanks for the explanation! I agree with your reservations about whether we want all this. I'd like the bond lengths and angles to support structure generation, but I don't have a use for the spring constants. Do you regard the lengths and angles as reasonably "intrinsic," or do they too make sense only in the context of a specific force field? If they are reasonably intrinsic, would it make sense to drop the spring constants but keep the rest? |
Ah I see, yes the bond lengths and angles would be useful for structure building. These are probably standard enough to use from Amber and have in source. I always liked the https://github.com/clauswilke/PeptideBuilder library for this task, might be worth seeing where they source their bonds and angles. |
There isn't an immediately obvious attribution: https://github.com/clauswilke/PeptideBuilder/blob/6d38a167b9992c27adc86f64370f7083303ce877/PeptideBuilder/Geometry.py#L68-L91 Since you mentioned an interest in structure generation, here's the docstring of the function I'm writing now (warning: in progress, subject to change):
Currently that's intended to go in a lab package, but if you think it's of general interest I could put it here instead. It's a different interface than PeptideBuilder (I don't want to hand-specify the dihedral angles), so maybe it's not of widespread interest. |
Are you happy having lengths in nanometers or would you prefer Angstroms? Angles in radians or degrees? |
My preference is Angstrom for compat with our |
This extracts more data from ff14SB.
I confess I don't understand the atom notations: what are, e.g.,
CX
,CT
,2C
,C*
, etc? I'm guessing some of these are particular carbons, which accounts for why there doesn't seem to be aN-Cα
bond listed. Is there special significance to theprotein-
part of the label? EDIT: oh, wait, the type-tag appears in the residue data...I still wouldn't mind understanding the designation scheme, but that resolves the immediate concern. Are there special rules at the N- and C-termini?Also, is there a rule for key-ordering? In most cases it seems alphabetical, but then you get down to
("protein-H1", "protein-2C")
and that goes out the window. It would be nice to know if there is a rule so one doesn't have to try both orders.