Skip to content

chopardda/text_to_HPO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Text to HPO Pipeline

Welcome to the Text to HPO Pipeline project! This tool is designed to map free text descriptions of phenotypic abnormalities to Human Phenotype Ontology (HPO) terms, providing a standardized way to capture and analyze phenotypic data.

⚠️ Warning: This project is currently under development. The repository will be fully functional and publicly available soon. Please check back later for updates.

Overview

The text to HPO Pipeline is a Python-based tool that enables the conversion of unstructured clinical descriptions into structured, standardized HPO terms.

Installation

Install metamap from https://github.com/AnthonyMRios/pymetamap.

Install pymetamap from https://github.com/AnthonyMRios/pymetamap.

Config

To run main.py, the text_hpo_mapping/config/config.yaml file needs to be adapted:

clinical_data_path contains the path to the input data. A fake example is provided for illustration.

base_dir must contain the base_dir of the metamap install (Note the other metamap settings might need to be adapted as well)

api_key must be set to the user's private UMLS api key. Instruction on where to find the API key can be found here: https://documentation.uts.nlm.nih.gov/rest/authentication.html

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages