A comprehensive system for smart data collection and processing in cryo-electron microscopy, designed to optimize acquisition workflows through intelligent decision-making and real-time data analysis.
| Source | https://github.com/DiamondLightSource/smartem-decisions |
|---|---|
| Docker | docker run ghcr.io/DiamondLightSource/smartem-backend:latest |
| Documentation | https://DiamondLightSource.github.io/smartem-decisions |
| Releases | https://github.com/DiamondLightSource/smartem-decisions/releases |
| Project Board | https://github.com/orgs/DiamondLightSource/projects/33/views/1 |
| Test Datasets | https://gitlab.diamond.ac.uk/scisoft/cryoem/smartem-decisions-test-datasets |
The project is organised into the following packages:

- `smartem_common`: Shared schemas, types, and utilities used across all components
- `smartem_backend`: Core backend service with HTTP API, database operations, and message queue processing
- `smartem_agent`: Data collection agent that monitors EPU output and communicates with the backend
- `athena_api`: Athena HTTP API server integration
A quick check that the package is importable:

```python
from smartem_backend._version import __version__

print(f"Hello smartem_backend {__version__}")
```

The overall system architecture:

```mermaid
flowchart TD
%% Equipment and Data Collection Layer
subgraph facility["Diamond Light Source Facility"]
microscope["Cryo-EM Microscope"]
athena_hw["Athena Hardware"]
epu["EPU Software<br/>(ThermoFisher)"]
gpfs[("GPFS Storage")]
end
%% Agent Layer
subgraph agent["smartem_agent (Windows)"]
fs_watcher["File System Watcher"]
fs_parser["EPU Data Parser"]
sse_client["SSE Client"]
end
%% Backend Services Layer
subgraph backend["smartem_backend (Kubernetes)"]
api_server["FastAPI Server<br/>SSE + HTTP API"]
consumer["RabbitMQ Consumer"]
conn_mgr["Connection Manager"]
end
%% Infrastructure Layer
subgraph infra["Infrastructure (Kubernetes)"]
db[("PostgreSQL<br/>Sessions & Instructions")]
mq[("RabbitMQ<br/>Event Streaming")]
log["Logging<br/>(Graylog)"]
end
%% Additional Packages
common["smartem_common<br/>(Shared Schemas)"]
athena_api["athena_api<br/>(API Client)"]
%% Data Flow Connections
microscope --> epu
athena_hw --> microscope
epu --> gpfs
gpfs --> fs_watcher
fs_watcher --> fs_parser
fs_parser --> sse_client
%% External Processing → RabbitMQ → Backend
subgraph external["External Processing Systems"]
ml_pipeline["ML Pipeline<br/>(Quality Predictions)"]
image_proc["Image Processing<br/>(Motion/CTF/Particles)"]
end
ml_pipeline -.->|"Model Predictions<br/>Parameter Updates"| mq
image_proc -.->|"Processing Results<br/>Quality Metrics"| mq
%% Backend Communication
sse_client <-.->|"SSE Streams &<br/>HTTP ACKs"| api_server
%% Backend Internal
api_server --> db
api_server --> mq
api_server --> log
consumer --> db
consumer --> mq
conn_mgr --> db
%% External Integration
api_server --> athena_api
athena_api -.->|"Control Commands<br/>(reorder, skip)"| athena_hw
%% Package Dependencies
backend -.-> common
agent -.-> common
%% Styling
classDef facility fill:#e8f5e8,stroke:#28a745,stroke-width:2px
classDef agent fill:#fff5e6,stroke:#e67e22,stroke-width:2px
classDef backend fill:#e6f3ff,stroke:#0066cc,stroke-width:2px
classDef infra fill:#f5f5f5,stroke:#666,stroke-width:2px
classDef packages fill:#f9f9f9,stroke:#999,stroke-width:1px
classDef external fill:#f8f9fa,stroke:#6c757d,stroke-width:2px
class facility facility
class agent agent
class backend backend
class infra infra
class common,athena_api packages
class external,ml_pipeline,image_proc external
```
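
The agent-to-backend link shown above pairs a server-sent-events (SSE) stream for instruction delivery with plain HTTP acknowledgements. The sketch below illustrates that pattern only; the endpoint paths, payload fields, and acknowledgement route are assumptions made for illustration and are not the actual `smartem_backend` API (see `tools/sse_client_example.py` for the real client).

```python
# Minimal sketch of the SSE-instructions / HTTP-ACK pattern from the diagram above.
# Endpoint paths and payload fields are assumptions for illustration only.
import json

import requests

BASE_URL = "http://localhost:8000"  # assumed backend address
STREAM_URL = f"{BASE_URL}/agent/demo-agent/instructions/stream"  # hypothetical SSE endpoint
ACK_URL = f"{BASE_URL}/agent/demo-agent/instructions/{{id}}/ack"  # hypothetical ACK endpoint


def listen_for_instructions() -> None:
    """Consume server-sent events and acknowledge each instruction over HTTP."""
    with requests.get(STREAM_URL, stream=True, timeout=(5, None)) as response:
        response.raise_for_status()
        for line in response.iter_lines(decode_unicode=True):
            if not line or not line.startswith("data:"):
                continue  # skip keep-alives and non-data SSE fields
            instruction = json.loads(line[len("data:"):].strip())
            print("received instruction:", instruction)
            # Acknowledge receipt so the backend can track delivery
            requests.post(ACK_URL.format(id=instruction["id"]), timeout=5)


if __name__ == "__main__":
    listen_for_instructions()
```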

```bash
# venv and requirements
python -m venv .venv
source .venv/bin/activate
pip install -e .[dev]  # or .[backend] for production
```

```bash
# Start services with verbosity controls:
python -m smartem_backend.api_server            # HTTP API server
python -m smartem_backend.consumer -v           # Message queue consumer with INFO logging
python -m smartem_agent watch /path/to/data -v  # File watcher with INFO logging
```
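
The `watch` command above runs the agent's file-system watcher from the architecture diagram. As a generic illustration of the directory-watching idea (not necessarily how `smartem_agent` implements it), here is a minimal sketch using the `watchdog` library:

```python
# Generic sketch of watching an EPU output directory for new files.
# Uses the watchdog library; this is not smartem_agent's actual implementation.
import time

from watchdog.events import FileSystemEventHandler
from watchdog.observers import Observer


class EpuOutputHandler(FileSystemEventHandler):
    """React to files appearing under the acquisition directory."""

    def on_created(self, event):
        if not event.is_directory:
            print(f"new file: {event.src_path}")  # a real agent would parse and forward this


if __name__ == "__main__":
    observer = Observer()
    observer.schedule(EpuOutputHandler(), path="/path/to/data", recursive=True)
    observer.start()
    try:
        while True:
            time.sleep(1)
    except KeyboardInterrupt:
        observer.stop()
    observer.join()
```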

```bash
# For testing the file watcher with simulated EPU data:
python tools/fsrecorder/fsrecorder.py replay recording.tar.gz /path/to/data --fast
```

```bash
# For testing external message processing:
python tools/external_message_simulator.py list-messages  # See available message types
python tools/external_message_simulator.py workflow-simulation --gridsquare-id "DEV_001"
```
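
These simulated messages arrive on RabbitMQ, where the `smartem_backend` consumer picks them up (see the architecture diagram above). For orientation, here is a minimal sketch of that publishing step using the `pika` client; the queue name and payload fields are illustrative assumptions rather than the project's actual message schema, so prefer the simulator above for generating valid messages.

```python
# Sketch only: publish a quality-metrics message to RabbitMQ with pika.
# The queue name and payload shape are assumptions for illustration,
# not the schema used by smartem_backend.
import json

import pika

connection = pika.BlockingConnection(pika.ConnectionParameters(host="localhost"))
channel = connection.channel()
channel.queue_declare(queue="smartem_external_results", durable=True)  # assumed queue name

message = {
    "type": "quality_metrics",  # hypothetical message type
    "gridsquare_id": "DEV_001",
    "motion_correction": {"total_drift": 1.2},
    "ctf": {"estimated_resolution": 3.4},
}

channel.basic_publish(
    exchange="",
    routing_key="smartem_external_results",
    body=json.dumps(message).encode(),
    properties=pika.BasicProperties(delivery_mode=2),  # persist the message
)
connection.close()
```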

```bash
# For testing agent-backend communication:
python tools/sse_client_example.py  # Test SSE instruction delivery
```

After installation with `pip install -e .[all]`, the following command-line utilities are available for system administration and maintenance:
- `smartem.agent-cleanup` - Data lifecycle and cleanup operations for agent communication data

  ```bash
  # Check current database usage and statistics
  smartem.agent-cleanup --operation=stats

  # Run full cleanup with scientific retention policy (7-year audit trail)
  smartem.agent-cleanup --operation=cleanup --policy=scientific

  # Run cleanup with development retention policy (shorter retention)
  smartem.agent-cleanup --operation=cleanup --policy=development

  # Clean only stale connections
  smartem.agent-cleanup --operation=connections

  # Preview what would be cleaned without making changes
  smartem.agent-cleanup --operation=cleanup --dry-run
  ```

- `smartem.register-prediction-model` - Register new machine learning prediction models

  ```bash
  smartem.register-prediction-model "ResNet-50" "Deep learning model for cryo-EM image quality assessment"
  ```

- `smartem.init-model-weight` - Initialize model weights for specific grids

  ```bash
  smartem.init-model-weight --help  # View available options
  ```

- `smartem.random-model-predictions` - Generate test predictions for development and testing

  ```bash
  smartem.random-model-predictions --help  # View available options
  ```

- `smartem.random-prior-updates` - Simulate prior updates for testing workflows

  ```bash
  smartem.random-prior-updates --help  # View available options
  ```

- `smartem_agent_tools` - Agent development and debugging utilities

- `smartem-mcp` - Model Context Protocol (MCP) integration tools
The `smartem.agent-cleanup` utility supports two built-in retention policies (see the sketch after this list):

- Scientific Compliance (default): Conservative retention suitable for scientific research
  - Connections: 48 hours
  - Instructions: 365 days
  - Completed sessions: 2 years
  - Acknowledgements: 7 years (regulatory compliance)

- Development: Shorter retention for development environments
  - Connections: 4 hours
  - Instructions: 7 days
  - Completed sessions: 14 days
  - Acknowledgements: 30 days
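
As a rough illustration of how those retention windows translate into deletion cutoffs, the sketch below encodes the two policies as plain timedeltas; it mirrors the numbers listed above but is not the project's actual implementation.

```python
# Sketch: the documented retention windows expressed as cleanup cutoffs.
# Mirrors the numbers above; not the actual smartem_backend implementation.
from dataclasses import asdict, dataclass
from datetime import datetime, timedelta, timezone


@dataclass(frozen=True)
class RetentionPolicy:
    connections: timedelta
    instructions: timedelta
    completed_sessions: timedelta
    acknowledgements: timedelta


SCIENTIFIC = RetentionPolicy(
    connections=timedelta(hours=48),
    instructions=timedelta(days=365),
    completed_sessions=timedelta(days=2 * 365),
    acknowledgements=timedelta(days=7 * 365),
)

DEVELOPMENT = RetentionPolicy(
    connections=timedelta(hours=4),
    instructions=timedelta(days=7),
    completed_sessions=timedelta(days=14),
    acknowledgements=timedelta(days=30),
)


def cleanup_cutoffs(policy: RetentionPolicy) -> dict:
    """Return, per record type, the timestamp before which records may be deleted."""
    now = datetime.now(timezone.utc)
    return {name: now - window for name, window in asdict(policy).items()}


if __name__ == "__main__":
    for name, cutoff in cleanup_cutoffs(SCIENTIFIC).items():
        print(f"{name}: delete records older than {cutoff:%Y-%m-%d %H:%M} UTC")
```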
All utilities include comprehensive `--help` documentation with detailed usage instructions.
For detailed documentation including:
- Logging Configuration
- Running Backend Services
- Running Agent Services
- EPU Data Structure Explanations
- Kubernetes Deployment
- Development Tools
- And more...
See https://DiamondLightSource.github.io/smartem-decisions for complete documentation.