A curated list of open source OMOP (Observational Medical Outcomes Partnership) tools and resources.
The OMOP Common Data Model (CDM) is a standardized data model for observational healthcare data. It enables large-scale analytics across different healthcare systems and data sources. This list focuses on open source tools that work with the OMOP CDM. It is not exhaustive: the OHDSI organisation has over 350 repositories, so this list covers the most interesting ones plus tools from the wider community.
- Common Data Model - The CDM itself
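To illustrate why a shared data model helps with analytics across data sources, here is a minimal sketch using Python's built-in `sqlite3`. It assumes a heavily cut-down subset of two CDM tables (`person` and `condition_occurrence`, with most columns omitted), and the helper names, site names, rows, and concept IDs are all made up for illustration. It is not how any tool in this list is implemented.

```python
import sqlite3

# Illustrative concept ID only; look up real IDs in the OMOP vocabularies.
EXAMPLE_CONDITION_CONCEPT_ID = 201826


def build_cdm_subset(person_rows, condition_rows):
    """Create an in-memory database with a tiny subset of two CDM tables.

    Hypothetical helper: the real CDM defines many more tables and columns.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE person (
            person_id INTEGER PRIMARY KEY,
            gender_concept_id INTEGER NOT NULL,
            year_of_birth INTEGER NOT NULL
        );
        CREATE TABLE condition_occurrence (
            condition_occurrence_id INTEGER PRIMARY KEY,
            person_id INTEGER NOT NULL REFERENCES person(person_id),
            condition_concept_id INTEGER NOT NULL,
            condition_start_date TEXT NOT NULL
        );
        """
    )
    conn.executemany("INSERT INTO person VALUES (?, ?, ?)", person_rows)
    conn.executemany(
        "INSERT INTO condition_occurrence VALUES (?, ?, ?, ?)", condition_rows
    )
    return conn


def count_persons_with_condition(conn, condition_concept_id):
    """Count distinct persons with at least one record of the given concept."""
    row = conn.execute(
        "SELECT COUNT(DISTINCT person_id) FROM condition_occurrence "
        "WHERE condition_concept_id = ?",
        (condition_concept_id,),
    ).fetchone()
    return row[0]


# Two made-up "sites" that share the same table layout.
site_a = build_cdm_subset(
    [(1, 8507, 1980), (2, 8532, 1975)],
    [
        (10, 1, EXAMPLE_CONDITION_CONCEPT_ID, "2020-01-05"),
        (11, 1, EXAMPLE_CONDITION_CONCEPT_ID, "2021-03-01"),
        (12, 2, 4000000, "2019-01-01"),
    ],
)
site_b = build_cdm_subset(
    [(1, 8532, 1990), (2, 8507, 1960), (3, 8507, 1955)],
    [
        (20, 2, EXAMPLE_CONDITION_CONCEPT_ID, "2018-07-14"),
        (21, 3, EXAMPLE_CONDITION_CONCEPT_ID, "2022-11-30"),
    ],
)

# The same query runs unchanged against both sites.
counts = {
    name: count_persons_with_condition(conn, EXAMPLE_CONDITION_CONCEPT_ID)
    for name, conn in (("site_a", site_a), ("site_b", site_b))
}
print(counts)
```

Because both databases use the same table and column names, the counting function needs no site-specific logic.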
Tools for transforming data into OMOP CDM:
- Rabbit-in-a-Blender - Full ETL tool
- Carrot - Toolset for ETL
- Data2Evidence - End to end ETL tool
- OMOP on FHIR - FHIR built on top of OMOP
- WhiteRabbit - Data profiling tool for OMOP CDM mapping
- Usagi - Vocabulary mapping tool
- Carrot Mapper - Community mapping tool
- ATLAS - Web-based application for cohort definition and analysis
- Broadsea - Deploys the core OHDSI stack
- OMOP2OBO - Mapping to Open Biomedical Ontologies
- Bouzyges - Semantic mapping with LLM
- kotobuki - Update outdated mappings in Usagi
- OMOCL - Declarative mapping language for openEHR to OMOP
- Genomics England Mappings - NHS datasets mapping
- DICOM2OMOP - DICOM mapping to OMOP
- ETL-Synthea - ETL for Synthea synthetic data
- Carrot Transform - Data transform to CDM
- convert-pheno - Conversion of datamodels for phenotypic data
- oxford-omop-data-mapper - Transformation of NHS datasets
- i2o-transform - PCORnet to OMOP
- fhir-omop-ig - FHIR implementation
- DataQualityDashboard - Data quality assessment reports
- ACHILLES - Data quality assessment and characterization
- Data-Quality-Analysis - Data assessment tool
- CdmInspection - R Quality control
- BlendedICU - ETL for ICU data
- Janssen CDM - ETL inc. CPRD
- MIMIC-OMOP - MIMIC-III ETL
- Eos - openEHR ETL
- IMI_SOPHIA_DMS_OMOP - Maastricht ETL
- ETL-Synthea - Synthea to OMOP ETL
- FinnGen/ETL - FinnGen ETL
Tools for analyzing and visualizing OMOP data:
- OmopSketch - Feasibility tool
- HADES - Large scale analytics packages
- Bunny - Cohort discovery tool
- ATLAS Data Sources - REST API for OMOP data access
- CohortDiagnostics - Cohort characterization and diagnostics
- CohortGenerator - Cohort definition and generation
- Characterization - Patient and cohort characterization
- Cyclops - Large scale regularized regressions
- CohortMethod - Population-level effect estimation
- Cohort360 - Cohort builder / visualiser
- eds-scikit - Python lib for analysing OMOP
- LinkR - Web app for vis / analysing OMOP
- recruit - Clinical trial recruitment system with FHIR + OMOP
- Strategus - Coordinating / executing analytics
- CohortConstructor - Create study cohorts
AI/ML tools built for OMOP data:
- omop-learn - Python ML with OMOP
- Lettuce - LLM for matching drug names to OMOP concepts
- PatientLevelPrediction - Patient-level prediction models
- DeepPatientLevelPrediction - Deep learning for patient-level prediction
- CohortMethod - Population-level effect estimation
- omcp - MCP server for OMOP
- onto-llm-mapping - Ontology mapping with LLM + RAG
Developer tools and infrastructure components:
- omop-lite - OMOP CDM database creator
- CDMConnector - Tidyverse connector
- DatabaseConnector - Database connection management
- dsOMOP - OMOP support for DataSHIELD
- SqlRender - SQL translation and rendering
- FeatureExtraction - Feature extraction from OMOP data
- omopcept - R package for getting OMOP concepts
- omock - Generate mock OMOP CDM data
- OHDSIonAzure - Deploy OHDSI on Azure
- sqlalchemy_omopcdm - SQLAlchemy models for OMOP CDM
- omop-cdm - Databricks 5.3 CDM
- dzd-omop-cdm-python-models - Python data class lib for OMOP CDM
- OMOPCommonDataModel.jl - Julia implementation of the OMOP CDM
- pyomop - Useful OMOP "swiss army knife" in Python
- Vocabulary-v5.0 - Build/releases of vocabularies
- Athena - Web application for distributing vocabularies
- CVB - Custom vocabulary builder
- Eunomia - Package to access sample datasets
- EunomiaDatasets - Sample CDM
- UK-OMOP - Directory of UK OMOP data sets
- Barts Health - 10k synthetic CDM
- Synthea - 1k, 100k, 2.8m
- MIMIC-IV - 100 patients
See contributing.md
This list is open source and available under the MIT License.