Technical overview#

Annolid supports multiple “paths” depending on your goal (manual annotation, semi-automatic tracking, or training a custom model), but most workflows follow the same loop:

  1. Prepare videos (optional downsampling/cropping for speed).

  2. Label a starting frame (or a small set of frames) with polygons and/or keypoints (a minimal LabelMe JSON sketch follows below).

  3. Run AI for segmentation/tracking/keypoints (choose a backend that fits your task).

  4. Review and correct the results, then re-run from the first incorrect frame if needed.

  5. Export (LabelMe JSON, CSV summaries, COCO/YOLO datasets).

  6. Analyze (place preference, motion/freezing metrics, event time budgets, reports).

Figure: Overview of the Annolid workflow.
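
Each labeled frame is saved as a LabelMe-style JSON file next to the frame image. The snippet below is a minimal sketch of such a file for one polygon; the label, coordinates, and file names are illustrative placeholders, and the exact fields Annolid writes may differ slightly.

```python
import json

# Minimal LabelMe-style record for one labeled frame (illustrative values only).
frame_annotation = {
    "version": "5.2.1",          # LabelMe format version; actual value depends on your install
    "flags": {},
    "shapes": [
        {
            "label": "mouse",                     # instance/category name you assign
            "points": [[120.0, 85.0],             # polygon vertices as [x, y] pixel coordinates
                       [180.0, 90.0],
                       [175.0, 160.0],
                       [115.0, 150.0]],
            "group_id": None,
            "shape_type": "polygon",
            "flags": {},
        }
    ],
    "imagePath": "video_frame_000000.png",
    "imageData": None,            # may embed the image as base64; often left empty
    "imageHeight": 480,
    "imageWidth": 640,
}

with open("video_frame_000000.json", "w") as f:
    json.dump(frame_annotation, f, indent=2)
```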

Supported workflows (today)#

1) “Track from one frame” (no training required)#

For many videos, you can get high-quality tracking without training:

  • Create polygons (manually or with the AI-Polygon tool).

  • Use a video-object-segmentation tracker (e.g., Cutie / EfficientTAM-style backends) to propagate masks through the video.

  • Export results to CSV for analysis (an example analysis sketch follows this list).
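
As a sketch of the analysis step, the exported per-frame CSV can be loaded with pandas. The file name and column names used here (`frame_number`, `instance_name`, `cx`, `cy`) are hypothetical placeholders; check the header of the CSV Annolid actually writes for your video.

```python
import numpy as np
import pandas as pd

# Load the tracking CSV exported by Annolid (path and column names are placeholders).
df = pd.read_csv("my_video_tracking.csv")

# Example analysis: total distance traveled per tracked instance,
# using hypothetical centroid columns cx/cy in pixel coordinates.
distances = {}
for name, group in df.sort_values("frame_number").groupby("instance_name"):
    dx = np.diff(group["cx"].to_numpy())
    dy = np.diff(group["cy"].to_numpy())
    distances[name] = float(np.sum(np.sqrt(dx**2 + dy**2)))  # path length in pixels

print(distances)
```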

2) Prompt-assisted labeling (fewer clicks)#

To speed up annotation:

  • Point prompt: click an object → SAM-family model returns a mask → Annolid converts it to a polygon (a mask-to-polygon sketch follows this list).

  • Text prompt: type labels (e.g., mouse, feeder) → Grounding DINO proposes boxes → SAM refines masks → polygons.
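
Under the hood, the mask-to-polygon conversion works roughly like the sketch below: extract the outer contour of the binary mask with OpenCV and simplify it into a polygon. This is a generic illustration rather than Annolid's exact implementation.

```python
import cv2
import numpy as np

def mask_to_polygon(mask: np.ndarray, epsilon_frac: float = 0.002) -> list[list[float]]:
    """Convert a binary mask (H, W) into a simplified polygon [[x, y], ...]."""
    mask_u8 = (mask > 0).astype(np.uint8)
    contours, _ = cv2.findContours(mask_u8, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return []
    # Keep the largest connected region and simplify its outline.
    contour = max(contours, key=cv2.contourArea)
    epsilon = epsilon_frac * cv2.arcLength(contour, True)
    approx = cv2.approxPolyDP(contour, epsilon, True)
    return [[float(x), float(y)] for x, y in approx.reshape(-1, 2)]

# Example: a filled circle becomes a roughly circular polygon outline.
demo = np.zeros((480, 640), dtype=np.uint8)
cv2.circle(demo, (320, 240), 100, 255, thickness=-1)
print(mask_to_polygon(demo)[:5])
```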

3) Train domain-specific models (optional)#

When you need a model tuned to your camera, arena, or species:

  • Convert LabelMe JSON to YOLO format and train a segmentation/pose model (a minimal training sketch follows this list).

  • Train Mask R-CNN via Detectron2 (optional; best run on a GPU, e.g., in Colab).
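
After conversion, training a YOLO segmentation model typically looks like the sketch below (shown with the Ultralytics package as one possible trainer; the dataset YAML path, base weights, and hyperparameters are placeholders to adjust for your project).

```python
from ultralytics import YOLO

# Start from pretrained segmentation weights and fine-tune on the exported dataset.
model = YOLO("yolov8n-seg.pt")

# "data.yaml" is the dataset description produced by the LabelMe -> YOLO conversion
# (image paths, class names); epochs and imgsz are illustrative values.
model.train(data="datasets/my_project/data.yaml", epochs=100, imgsz=640)

# Run inference on a new video or image after training.
results = model.predict("videos/new_session.mp4", save=True)
```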

Practical performance notes#

Performance depends strongly on video resolution, model choice, and hardware (CPU vs CUDA/MPS GPU). In general:

  • Downsampling/cropping videos speeds up both interactive work and batch processing (see the sketch below).

  • Foundation-model tracking helps with occlusions and reduces the number of labeled frames needed.
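
As one way to do the downsampling, the sketch below halves a video's resolution with OpenCV; the scale factor and output codec are choices to tune, and cropping could be added by slicing each frame before writing. Audio is not preserved in this sketch.

```python
import cv2

def downsample_video(src_path: str, dst_path: str, scale: float = 0.5) -> None:
    """Write a spatially downsampled copy of a video (illustrative, no audio handling)."""
    cap = cv2.VideoCapture(src_path)
    fps = cap.get(cv2.CAP_PROP_FPS)
    width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH) * scale)
    height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT) * scale)
    writer = cv2.VideoWriter(dst_path, cv2.VideoWriter_fourcc(*"mp4v"), fps, (width, height))
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        writer.write(cv2.resize(frame, (width, height)))
    cap.release()
    writer.release()

# Paths are placeholders for your own raw and prepped video locations.
downsample_video("raw/session01.mp4", "prepped/session01_half.mp4", scale=0.5)
```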