This project presents an implementation of One-Shot Learning with Bayesian Program Learning (BPL), a machine learning approach that enables models to recognize patterns or concepts from just one example. In traditional machine learning, models typically require extensive training data to learn and generalize. However, BPL allows models to learn from a single example by constructing probabilistic programs that represent the relationships between variables and uncertain states.
In this project, we apply One-Shot Learning with BPL to handwritten character classification. By leveraging these probabilistic program representations, our system learns to recognize new characters from just one example, enabling applications in handwriting recognition and machine learning-based writing tools.
- Data Preparation: Handwritten character images are organized into labeled runs.
- Program Learning: For each example, a probabilistic program representing the character's structure is learned.
- Classification: New characters are classified by comparing them to previously learned programs using the Modified Hausdorff Distance (MHD); see the sketch below.
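For reference, the MHD between two binarized character images can be computed as in the minimal NumPy sketch below. The helper `image_to_points` and the assumption that ink pixels are nonzero are illustrative choices, not necessarily what `main.py` does.

```python
import numpy as np

def modified_hausdorff_distance(a, b):
    """Modified Hausdorff Distance (Dubuisson & Jain, 1994) between two
    point sets given as (N, 2) and (M, 2) arrays of pixel coordinates."""
    # Pairwise Euclidean distances between every point in a and every point in b.
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    # Mean nearest-neighbour distance in each direction; MHD is the larger of the two.
    return max(d.min(axis=1).mean(), d.min(axis=0).mean())

def image_to_points(img):
    """Convert a binary character image (2D array, ink pixels nonzero) to (row, col) coordinates."""
    return np.argwhere(img)
```

In this nearest-match setup, a test character receives the label of the training example whose representation yields the smallest distance.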
- One-shot learning: Classify new characters using only a single example.
- Probabilistic Program Induction: Uses BPL to learn and classify handwritten characters.
- Straightforward data preparation: datasets follow a specific folder structure (described below) so runs can be processed automatically.
- Python 3.8 or higher
- Python packages listed in `requirements.txt` (install via pip)

Install the dependencies with `pip install -r requirements.txt`.

The project expects a specific directory structure for input data:
📂 Project Root/
├── 📁 all_runs/ # Directory containing *all* your data runs
│ ├── run01/ # Individual data run 1
│ ├── run02/ # Individual data run 2
│ └── ... # Each run has its own label file (`class_labels.txt`)
│
├── 📄 README.md # This file
└── 📘 main.py # Main implementation script
- Store your handwritten character images in the `all_runs/` directory, organized into run subfolders (`run01/`, `run02/`, etc.). Add more character images to create additional runs.
- Ensure each run subfolder (e.g., `run01/`) contains a file named `class_labels.txt`.
- Each line of `class_labels.txt` describes one test/training pair in the form `image_test_path image_train_path` (see the parsing sketch after the example below).
Here is a concrete example of what the class_labels.txt file should look like for one run:
run01/images/image_test_01.png run01/images/image_train_01.png
run01/images/image_test_02.png run01/images/image_train_02.png
run01/images/image_test_03.png run01/images/image_train_03.png
...
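For reference, a file in this format can be consumed with a few lines of Python, as in the sketch below. The function name `load_pairs` and the assumption that the listed paths are relative to `all_runs/` are illustrative, not necessarily how `main.py` reads them.

```python
from pathlib import Path

def load_pairs(run_dir):
    """Read (test_image_path, train_image_path) pairs from a run's class_labels.txt."""
    pairs = []
    for line in (Path(run_dir) / "class_labels.txt").read_text().splitlines():
        if line.strip():                          # skip blank lines
            test_path, train_path = line.split()  # two whitespace-separated paths per line
            pairs.append((test_path, train_path))
    return pairs

# Example: list every run under all_runs/ with its number of test/train pairs.
for run_dir in sorted(Path("all_runs").glob("run*")):
    print(run_dir.name, len(load_pairs(run_dir)), "pairs")
```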
To run the classifier, use:
`python main.py [number_of_runs]`

- The script reads data from `all_runs/` and performs classification.
- It reports the error rate for each run and concludes with the average across all runs.
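For example, assuming `all_runs/` contains 20 run folders as in the sample output below, the full evaluation would be launched with:

```
python main.py 20
```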
- Input: organized data in `all_runs/` with `class_labels.txt` files.
- Output:
  - Error rates for each run
  - Final average error rate across all runs
Example Output:
Running one-shot handwritten character classifier
[INFO] Run 01: Error rate X.X%
[INFO] Run 02: Error rate X.X%
...
[INFO] Run 20: Error rate Y.Y%
[RESULT] Average error rate across all n runs: Z.Z%

This project builds on the Bayesian Program Learning (BPL) framework from academic research:
Lake, B. M., Salakhutdinov, R., and Tenenbaum, J. B. (2015).
Human-level concept learning through probabilistic program induction.
Science, 350(6266), 1332-1338.
This is an adaptation based on the MATLAB code from Brenden Lake's BPL repository.
This project is licensed under the MIT License.
For details, please refer to the `LICENSE` file included in this project's root directory.