Spike Sorting Pipeline

Internal Akrami Lab pipeline for spike sorting Neuropixels and other high-density recordings using SpikeInterface and Kilosort 4.

Quick Start

0. Clone Repository

# On HPC, clone to your home directory
git clone git@github.com:LIMLabSWC/spikesorting-pipeline.git
cd spikesorting-pipeline

1. Setup Environment

Create the environment with micromamba from the provided environment file:

# On HPC
micromamba create -n si_ks4_env -f si_ks4_env.yml
micromamba activate si_ks4_env

2. Run Interactive Pipeline

# Get a compute node
srun --partition=gpu --gres=gpu:1 --cpus-per-task=8 --mem=50G --pty bash -i

# Run the script
python spikeinterface_script_interactive.py

3. View Results

python view_sorting_results.py /path/to/sorting/results

4. Manual Curation (Optional)

# Install Phy
micromamba create -n phy2 -y python=3.11 cython dask h5py joblib matplotlib numpy pillow pip pyopengl pyqt pyqtwebengine pytest python qtconsole requests responses scikit-learn scipy traitlets
micromamba activate phy2
pip install git+https://github.com/cortex-lab/phy.git

# Launch GUI
cd /path/to/sorting/results
phy template-gui params.py

What's Included

Interactive script (spikeinterface_script_interactive.py) for step-by-step testing
Batch processing (spikeinterface_batch.py) for large datasets
Results viewer (view_sorting_results.py) for analysis and plots
Environment file (si_ks4_env.yml) with all dependencies
SLURM script (run_spikeinterface.sh) for HPC submission

Output Structure

output_path/
├── sorting/           # Kilosort 4 output
├── postprocessing/    # Waveforms and quality metrics
│   └── quality_metrics.csv
└── plots/            # Visualization plots
    ├── unit_waveforms.png
    ├── raster_plot.png
    └── ...
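
After a run, a quick way to confirm that the layout above was produced is to check for the listed entries. The sketch below is not part of the repository; the helper name missing_outputs is hypothetical, and it only checks the entries the tree names explicitly.

```python
from pathlib import Path

# Entries copied from the tree above. Individual plot files are left out
# because the tree elides some of them ("...").
EXPECTED = ("sorting", "postprocessing/quality_metrics.csv", "plots")


def missing_outputs(output_path):
    """Return the expected entries that do not exist under output_path."""
    root = Path(output_path)
    return [rel for rel in EXPECTED if not (root / rel).exists()]
```

For example, missing_outputs("/path/to/output_dir") returns an empty list when all three entries exist.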

Detailed Documentation

Environment Setup

The pipeline requires a Python environment with SpikeInterface, Kilosort 4, and related dependencies.

Installation:

micromamba create -n si_ks4_env -f si_ks4_env.yml
micromamba activate si_ks4_env

Key dependencies:

SpikeInterface (latest)
Kilosort 4 (via SpikeInterface)
CUDA toolkit (for GPU acceleration)
Other dependencies for Open Ephys and SpikeGLX reading

Interactive Development on HPC

The interactive pipeline is designed for step-by-step testing and debugging of spike sorting workflows. Running it on the HPC is recommended, because the compute nodes provide the GPU that Kilosort 4 uses for acceleration.

Setup for HPC Development

Install the Remote-SSH extension in VS Code.

Configure SSH access by adding your HPC login to ~/.ssh/config (replace vplattner with your own username):

Host bastion
    Hostname ssh.swc.ucl.ac.uk
    User vplattner

Host hpc2
    Hostname hpc-gw2
    User vplattner
    ProxyJump bastion

Clone the repository:

git clone git@github.com:LIMLabSWC/spikesorting-pipeline.git
cd spikesorting-pipeline

Install the environment on HPC (the environment file si_ks4_env.yml is in the repository root):

micromamba create -n si_ks4_env -f si_ks4_env.yml

Request a compute node for GPU-accelerated processing:

# Typical command for testing:
srun --partition=gpu --gres=gpu:1 --cpus-per-task=8 --mem=50G --pty bash -i

# For production runs, adjust resources as needed:
# srun --partition=gpu --gres=gpu:1 --cpus-per-task=16 --mem=100G --pty bash -i

Activate the environment in your compute node session:
```
micromamba activate si_ks4_env
```
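
Before starting a long run, it can help to confirm that the session actually sees a GPU. The following is a minimal sketch, assuming PyTorch is installed in si_ks4_env (Kilosort 4 is built on PyTorch); the function name cuda_status is hypothetical.

```python
def cuda_status():
    """Return a short description of GPU availability in this session."""
    try:
        import torch  # imported lazily so the check degrades gracefully
    except ImportError:
        return "PyTorch is not importable; is si_ks4_env activated?"
    if not torch.cuda.is_available():
        return "PyTorch is installed but sees no CUDA device"
    return f"CUDA device available: {torch.cuda.get_device_name(0)}"


print(cuda_status())
```

If this reports no CUDA device on a compute node, the srun request probably did not include --gres=gpu:1.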

Running the Interactive Script

Connect via VS Code:
- Press F1 → "Remote-SSH: Connect to Host..."
- Choose hpc2 (or your configured host)
- Enter your password
- Wait for the connection to establish
Open the repository:
- Once connected, open this repository folder from the HPC filesystem
- Navigate to your clone, for example /nfs/nhome/live/vplattner/spikesorting-pipeline
Configure the Python interpreter:
- Press Ctrl+Shift+P → "Python: Select Interpreter"
- Choose the interpreter of the si_ks4_env environment, for example /nfs/nhome/live/vplattner/micromamba/envs/si_ks4_env/bin/python
Run the interactive script:
- Open spikeinterface_script_interactive.py
- Run cells individually using Shift+Enter or the "Run Cell" button
- The script will execute on the compute node with GPU acceleration

Note: Make sure your compute node session remains active while working in VS Code. If the session expires, you'll need to request a new compute node and reconnect.

Local Development (Alternative)

For local development (requires GPU and CUDA setup):

# Install environment locally
micromamba create -n si_ks4_env -f si_ks4_env.yml
micromamba activate si_ks4_env

# Run the script
python spikeinterface_script_interactive.py

Note: The interactive script is currently configured with hardcoded paths for a specific dataset. You'll need to modify the path variables in the script for your own data.
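
One way to avoid editing the script for every dataset is to read the paths from environment variables and fall back to a default. This is only a sketch of that idea: the variable names SPIKESORT_DATA_PATH and SPIKESORT_OUTPUT_PATH and the helper resolve_path are hypothetical, and the path variables in the actual script may be named differently.

```python
import os
from pathlib import Path


def resolve_path(env_var, default):
    """Use the environment variable if it is set, else fall back to default."""
    return Path(os.environ.get(env_var, default)).expanduser()


# Hypothetical variable names and placeholder defaults.
data_path = resolve_path("SPIKESORT_DATA_PATH", "/path/to/rawdata")
output_path = resolve_path("SPIKESORT_OUTPUT_PATH", "/path/to/output_dir")
```

With this in place, exporting SPIKESORT_DATA_PATH in the shell before launching the script selects a different dataset without touching the code.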

Manual Curation with Phy

Goal: Interactive visualization and manual spike sorting curation using Phy.

Phy is an open-source Python library providing a graphical user interface for visualization and manual curation of large-scale electrophysiological data. It's optimized for high-density multielectrode arrays containing hundreds to thousands of recording sites (mostly Neuropixels probes).

Installation

On HPC (recommended):

# Create new conda environment for Phy
micromamba create -n phy2 -y python=3.11 cython dask h5py joblib matplotlib numpy pillow pip pyopengl pyqt pyqtwebengine pytest python qtconsole requests responses scikit-learn scipy traitlets

# Activate environment
micromamba activate phy2

# Install Phy development version
pip install git+https://github.com/cortex-lab/phy.git

# Optional: Install klusta/klustakwik2 for Kwik GUI
pip install klusta klustakwik2

Alternative installation using environment file:

# If the above method has issues, try the automatic install
# (environment.yml is the environment file from the Phy repository, not part of this repository)
micromamba env create -f environment.yml
micromamba activate phy2
pip install git+https://github.com/cortex-lab/phy.git

Usage

Launch Phy Template GUI (recommended for Kilosort outputs):

# Navigate to your sorting output directory
cd /path/to/sorting/results

# Launch the template GUI
phy template-gui params.py

Launch from Python script:

Create a launch.py file in your data directory:

from phy.apps.template import template_gui
template_gui("params.py")

Phy Features

Template GUI: Optimized for datasets sorted with Kilosort and Spyking Circus
Kwik GUI: Legacy interface for datasets sorted with klusta and klustakwik2
Interactive visualization: Large-scale electrophysiological data
Manual curation: Refine spike sorting results
High-density support: Hundreds to thousands of recording sites

Hardware Requirements

Storage: SSD recommended for performance
Graphics: Recent graphics and OpenGL drivers
No specific GPU requirements for the GUI itself

Troubleshooting

Common issues:

PyQt5.QtWebEngineWidget error: Run pip install PyQtWebEngine
Mac M-series chips: Not officially supported, may require workarounds
Upgrading from phy 1: Don't install phy 1 and phy 2 in the same environment

For more help, see the Phy documentation and the issue tracker of the cortex-lab/phy repository.

Batch Processing on HPC (SLURM)

Goal: Submit large dataset jobs to the GPU queue.

Uses pre-installed HPC modules (see run_spikeinterface.sh).
No local Conda env needed.

Example:

sbatch run_spikeinterface.sh \
    /path/to/rawdata/.../recording1 \
    /path/to/output_dir \
    --show_preprocessing

Arguments:

Input recording folder
Output folder
(Optional) flags, e.g. --show_preprocessing
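
To submit several recordings, the same command can be generated once per recording. The sketch below only builds the argument lists in the order given above; the helper name sbatch_command and the choice of one output subfolder per recording are assumptions, not behaviour of run_spikeinterface.sh.

```python
import shlex
from pathlib import PurePosixPath


def sbatch_command(recording, output_root, flags=()):
    """Build the sbatch argument list for one recording folder."""
    rec = PurePosixPath(recording)
    out = PurePosixPath(output_root) / rec.name  # assumed: one subfolder per recording
    return ["sbatch", "run_spikeinterface.sh", str(rec), str(out), *flags]


recordings = ["/path/to/rawdata/recording1", "/path/to/rawdata/recording2"]
for rec in recordings:
    cmd = sbatch_command(rec, "/path/to/output_dir", ["--show_preprocessing"])
    print(shlex.join(cmd))  # pass cmd to subprocess.run(cmd, check=True) to submit
```

Printing the commands first makes it easy to review them before anything is sent to the queue.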

Results Viewing and Analysis

Goal: Inspect sorting results with summary statistics and plots.

python view_sorting_results.py /path/to/sorting/results

Features:

Summary statistics (units, spikes, firing rates)
Quality metrics (SNR, ISI violations, presence ratio)
Plots:
- Waveforms
- Raster plots
- Metrics distributions
- Unit locations
- Autocorrelograms

Output structure:

output_path/
├── sorting/
├── postprocessing/
│   └── quality_metrics.csv
└── plots/
    ├── unit_waveforms.png
    ├── raster_plot.png
    ├── quality_metrics_distributions.png
    ├── unit_locations.png
    └── autocorrelograms.png
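
The quality_metrics.csv file can also be inspected without the viewer. The sketch below lists units that pass a few thresholds. It assumes the column names snr, isi_violations_ratio and presence_ratio (the names SpikeInterface normally uses) and that the first column holds the unit id; the threshold values are illustrative only and are not the pipeline's curation criteria.

```python
import csv


def _to_float(text):
    """Convert a CSV cell to float; empty or invalid cells become NaN."""
    try:
        return float(text)
    except (TypeError, ValueError):
        return float("nan")


def passing_units(csv_path, min_snr=5.0, max_isi_ratio=0.5, min_presence=0.9):
    """Return ids of units that satisfy all three (illustrative) thresholds."""
    keep = []
    with open(csv_path, newline="") as handle:
        reader = csv.DictReader(handle)
        id_column = reader.fieldnames[0]  # assumed: first column is the unit id
        for row in reader:
            snr = _to_float(row.get("snr"))
            isi = _to_float(row.get("isi_violations_ratio"))
            presence = _to_float(row.get("presence_ratio"))
            # Comparisons with NaN are False, so incomplete rows are skipped.
            if snr >= min_snr and isi <= max_isi_ratio and presence >= min_presence:
                keep.append(row[id_column])
    return keep
```

For example, passing_units("output_path/postprocessing/quality_metrics.csv") returns the unit ids as strings.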

Data Format Support

Open Ephys:

TTLs in separate events file
Geometry from settings.xml

SpikeGLX:

TTLs as extra channel
Geometry manual or from metadata
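
When a script has to handle both formats, the recording folder itself usually gives enough hints. The following sketch is a heuristic, not the pipeline's own detection logic: it treats a settings.xml file as a sign of Open Ephys and *.ap.bin or *.ap.meta files as a sign of SpikeGLX. The function name guess_format is hypothetical.

```python
from pathlib import Path


def guess_format(recording_dir):
    """Guess the acquisition format from files found under recording_dir."""
    root = Path(recording_dir)
    if any(root.rglob("settings.xml")):
        return "openephys"
    if any(root.rglob("*.ap.bin")) or any(root.rglob("*.ap.meta")):
        return "spikeglx"
    return "unknown"
```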

Pipeline Features

Preprocessing:
- Phase shift correction
- Bandpass filtering (300-6000 Hz)
- Common reference (global median)
Spike sorting:
- Kilosort 4 algorithm
- GPU-accelerated processing
- Automatic channel grouping
Post-processing:
- Waveform extraction
- Quality metrics computation
- Automatic curation (empty units, excess spikes)
Quality metrics:
- Signal-to-noise ratio (SNR)
- ISI violations ratio
- Presence ratio
- Firing rate
- Number of spikes
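
The preprocessing steps listed above map onto functions in spikeinterface.preprocessing. The sketch below shows one way to chain them in the listed order, with the filter band and reference settings taken from the list; it is an illustration, not a copy of the pipeline's code, and the function name preprocess is hypothetical.

```python
# Values taken from the feature list above.
BANDPASS_HZ = {"freq_min": 300.0, "freq_max": 6000.0}
REFERENCE = {"reference": "global", "operator": "median"}


def preprocess(recording):
    """Apply phase shift, bandpass filter and common median reference."""
    import spikeinterface.preprocessing as spre  # lazy import; needs si_ks4_env

    # phase_shift relies on the "inter_sample_shift" channel property,
    # which the Neuropixels readers normally provide.
    shifted = spre.phase_shift(recording)
    filtered = spre.bandpass_filter(shifted, **BANDPASS_HZ)
    return spre.common_reference(filtered, **REFERENCE)
```

All three steps are lazy in SpikeInterface, so calling preprocess returns quickly and the actual filtering happens when traces are read.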

Kilosort GUI Not Required

This pipeline runs Kilosort 4 through SpikeInterface's sorter wrapper, so the Kilosort GUI is never needed (Kilosort 4 is a Python package and does not depend on MATLAB). The singularity_image parameter controls where the sorter executes:

singularity_image=False: Runs natively (requires local Kilosort installation)
singularity_image=True: Runs in Singularity container (requires Singularity)
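
As an illustration of the two modes, a call through SpikeInterface's run_sorter might look like the sketch below. The helper names sorter_call_kwargs and run_kilosort4 are hypothetical, and the folder keyword is an assumption about the installed SpikeInterface version (some older releases call it output_folder).

```python
def sorter_call_kwargs(folder, use_singularity=False):
    """Keyword arguments for run_sorter, mirroring the two options above."""
    return {
        "sorter_name": "kilosort4",
        "folder": str(folder),  # called output_folder in some older releases
        "singularity_image": bool(use_singularity),
    }


def run_kilosort4(recording, folder, use_singularity=False):
    """Run Kilosort 4 natively (default) or inside a Singularity container."""
    from spikeinterface.sorters import run_sorter  # lazy import; needs si_ks4_env

    return run_sorter(recording=recording, **sorter_call_kwargs(folder, use_singularity))
```

Calling run_kilosort4(recording, "output_path/sorting") corresponds to the native mode; passing use_singularity=True switches to the container mode.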

Known Issues and Limitations

Short recordings: Quality metrics may be unreliable for recordings shorter than roughly 2-5 minutes
Memory usage: Large datasets may require significant RAM
GPU memory: Kilosort 4 requires sufficient GPU memory for the dataset size
Path dependencies: Interactive script has hardcoded paths that need modification

Troubleshooting

Common issues:

Import errors: Ensure the correct Python environment is activated
GPU errors: Check CUDA installation and GPU memory availability
Path errors: Verify data paths in the interactive script
Memory errors: Reduce dataset size or increase allocated memory
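
For import errors, a quick first check is whether the key packages are visible to the active interpreter at all. A minimal sketch, assuming the import names spikeinterface, kilosort and torch; the helper name missing_modules is hypothetical.

```python
from importlib.util import find_spec

# Assumed import names of the main dependencies.
REQUIRED = ("spikeinterface", "kilosort", "torch")


def missing_modules(names=REQUIRED):
    """Return the module names that the current interpreter cannot find."""
    return [name for name in names if find_spec(name) is None]


print("missing:", missing_modules() or "none")
```

If any module is reported missing, the wrong environment is probably active, or VS Code is using a different interpreter than the one in si_ks4_env.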

Getting help:

Check the SpikeInterface documentation
Review the Kilosort 4 documentation
Contact the Akrami Lab for pipeline-specific issues
