vinay-ebi
Contributor

Summary

This PR updates the GeneNameDescProjection_conf pipeline configuration so that gene names and descriptions from canonical mouse (mus_musculus) are projected onto all mouse strains.


Changes

  • Extended gn_config:
    • Removed the mouse strains from the anti-species list
    • The Myomorpha taxonomy covers all the mouse strains, so they are now included as projection targets
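The interaction between the taxon and the anti-species list described above can be sketched as follows. This is only an illustration: the `ProjectionEntry` class, the `select_targets` helper, the toy `LINEAGE` table, and the full strain production names are assumptions made for the example and are not taken from the actual pipeline configuration.

```python
from dataclasses import dataclass, field


# Hypothetical, simplified stand-in for one projection entry in gn_config.
# The field names are illustrative and do not mirror the real config keys.
@dataclass
class ProjectionEntry:
    source: str
    taxon: str
    antispecies: set = field(default_factory=set)


# Assumed production names for three of the strains named in this PR.
STRAINS = {
    "mus_musculus_129s1svimj",
    "mus_musculus_balbcj",
    "mus_musculus_c57bl6nj",
}

# Toy lineage table; a real pipeline would resolve taxa from a taxonomy source.
LINEAGE = {
    "mus_musculus": {"Myomorpha", "Rodentia"},
    **{strain: {"Myomorpha", "Rodentia"} for strain in STRAINS},
    "danio_rerio": {"Cypriniformes"},
}


def select_targets(entry, lineage=LINEAGE):
    """Return species inside the taxon, minus the source and the anti-species."""
    return sorted(
        species
        for species, taxa in lineage.items()
        if entry.taxon in taxa
        and species != entry.source
        and species not in entry.antispecies
    )


# Before this PR (as modelled here): strains are listed as anti-species.
before = ProjectionEntry("mus_musculus", "Myomorpha", antispecies=set(STRAINS))
# After this PR: strains are no longer excluded, so the taxon picks them up.
after = ProjectionEntry("mus_musculus", "Myomorpha")

print(select_targets(before))
print(select_targets(after))
```

Under these assumptions, the first call selects no strains and the second selects all three, which is the behaviour change this PR is aiming for.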

Rationale

  • Provides consistent gene naming and descriptions across mouse reference and strain assemblies.

Example

For a gene annotated in mus_musculus with:

  • Name: Pten (from MGI)
  • Description: phosphatase and tensin homolog

It will now be projected automatically into all configured strains (129s1svimj, balbcj, c57bl6nj, etc.).

@vinay-ebi vinay-ebi changed the base branch from release/115 to release/116 September 23, 2025 07:39
@vinay-ebi vinay-ebi requested a review from nwillhoft September 23, 2025 07:39
@nwillhoft
Contributor

Thanks Vinay. Please could I check with the addition of danio rerio to the antispecies - are we wanting to prevent projections onto zebrafish?

@vinay-ebi
Contributor Author

> Thanks Vinay. Please could I check with the addition of danio rerio to the antispecies - are we wanting to prevent projections onto zebrafish?

Yes. Zebrafish genes have their own ZFIN IDs, and we don't want to override them with HGNC IDs. We fixed this in release 115 with these changes, but the changes were never pushed.

Release 114:
+--------------------+-------------+-----------------------+--------------------+
| ensembl_gene_id    | gene_symbol | gene_symbol_source_db | gene_symbol_source |
+--------------------+-------------+-----------------------+--------------------+
| ENSDARG00000000563 | TTN         | HGNC                  | HGNC:12403         |
+--------------------+-------------+-----------------------+--------------------+

Release 115:
+--------------------+-------------+-----------------------+---------------------+
| ensembl_gene_id    | gene_symbol | gene_symbol_source_db | gene_symbol_source  |
+--------------------+-------------+-----------------------+---------------------+
| ENSDARG00000000563 | ttn.1       | ZFIN_ID               | ZDB-GENE-030616-413 |
+--------------------+-------------+-----------------------+---------------------+
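The difference between the two releases can be illustrated with a small sketch. It assumes, purely for illustration, that a species on the anti-species list keeps its existing symbol and that any other target takes the projected one. The `Symbol` type and `resolve_symbol` function are hypothetical and do not come from the pipeline code; the values are those shown in the tables above.

```python
from typing import NamedTuple


class Symbol(NamedTuple):
    name: str
    source_db: str
    source_id: str


def resolve_symbol(species, existing, projected, antispecies):
    """Hypothetical rule: anti-species keep their own symbol, others are projected."""
    if species in antispecies:
        return existing
    return projected


# Values taken from the release 114 and release 115 rows for ENSDARG00000000563.
zfin = Symbol("ttn.1", "ZFIN_ID", "ZDB-GENE-030616-413")
hgnc = Symbol("TTN", "HGNC", "HGNC:12403")

# Without danio_rerio in the anti-species list, the HGNC symbol wins (release 114).
print(resolve_symbol("danio_rerio", zfin, hgnc, antispecies=set()))
# With danio_rerio excluded, the ZFIN symbol is preserved (release 115).
print(resolve_symbol("danio_rerio", zfin, hgnc, antispecies={"danio_rerio"}))
```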

Contributor

@nwillhoft nwillhoft left a comment

All looks good for projecting onto mouse strains, and removing projection onto zebrafish.
