Technical overview#
At a high level Annolid’s workflow consists of the following steps:
- Labeling of frames (annotation) & COCO formatting 
- Training (fine-tuning) and inference (local worstation or Google Colab) 
- Post-processing and analysis 

Accessibility and efficiency#
We work hard toward making Annolid as accessible as possible to anyone trying to use it and are striving to make all the code be also runnable on Google Colab.
- Options for training on Google Colab as well as on a local workstation 
- Fast training with quality- and speed-optimized options - Model training : - 200 labeled images 
- < 2 hours for 3000 iterations on Google Colab 
- 30 min on NVidia 1080Ti 
 
- Inference (applying trained model to behavior videos) - Mask R-CNN: ~7 FPS 
- YOLACT: ~30 FPS 
 
 
- Transfer learning based on existing models trained on COCO dataset improves performance 
- Capacities for autolabeling and human-in-the-loop iterative model training 
