Fix loading of ligands when three letter code matches NCAA #480

roccomoretti · 2025-07-25T16:03:51Z

@LAnAlchemist noticed that when a ligand params file provided with -extra_res_fa has a three letter code which matches an NCAA three letter code from the database, that ligand ResidueType is never selected on PDB read-in, even if it's a much better match for the names in the PDB.

The reason for this is that the PDB reader residue typer prioritizes patched polymeric terminus types (those with TERMINUS properties) for residues at the ends of chains, discarding the ligand types as a possibility before even encountering the name-based selection.

This PR adjusts how the typer selects residues. Instead of having chain-terminal residues preferring terminus properties, actually look at the connection points, and look for residues which have/don't have the UPPER & LOWER connection points. (This is really what "is_lower_terminus" and "is_upper_terminus" in PoseFromSFRBuilder signifies: is this residue polymerically connected to the adjacent residue.)

woodsh17

Are the error messages popping up in integration tests rna_denovo_dna_bridge, swm_dna_bridge, posttranslationalmod_io expected from this change? The tests look like they still end the same though.

roccomoretti · 2025-07-30T22:05:20Z

Are the error messages popping up in integration tests rna_denovo_dna_bridge, swm_dna_bridge, posttranslationalmod_io expected from this change? The tests look like they still end the same though.

That's not expected - it's from patch definitions which weren't consulted before but are now (but which don't actually contribute to the results). I've submitted fixes for the patches which were causing issues.

I've also adjusted the simple_metrics_b_factor test to accommodate for poor handling of the BMA glycan residue. That's probably something which should be fixed more generally, though. (Though people using glycans are likely to be using an approach which avoids the issue -- the same one I've adjusted the test to use.)

lyskov

Code LGTM!

roccomoretti added 4 commits July 24, 2025 17:18

Initial approach

4a7568a

Add discouraged connects, need to enable patches to alter connections

92aa1d4

Adjust filtering to treat all connections equally.

730246c

Better match the original logic

87c8c2e

roccomoretti requested review from lyskov and woodsh17 July 25, 2025 16:03

roccomoretti added ready_for_review This PR is ready to be reviewed and merged. 90 standard tests labels Jul 25, 2025

woodsh17 reviewed Jul 30, 2025

View reviewed changes

roccomoretti added 3 commits July 30, 2025 15:14

Beauty

b40ca74

Adjust patches which look to be poorly formed.

ea61d9d

Adjust simple_metrics_b_factor for glycan codes

fc72db1

lyskov approved these changes Jul 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix loading of ligands when three letter code matches NCAA #480

Fix loading of ligands when three letter code matches NCAA #480

roccomoretti commented Jul 25, 2025

Uh oh!

woodsh17 left a comment

Uh oh!

roccomoretti commented Jul 30, 2025

Uh oh!

lyskov left a comment •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix loading of ligands when three letter code matches NCAA #480

Are you sure you want to change the base?

Fix loading of ligands when three letter code matches NCAA #480

Conversation

roccomoretti commented Jul 25, 2025

Uh oh!

woodsh17 left a comment

Choose a reason for hiding this comment

Uh oh!

roccomoretti commented Jul 30, 2025

Uh oh!

lyskov left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lyskov left a comment •

edited

Loading