Technical overview#
At a high level Annolid’s workflow consists of the following steps:
Labeling of frames (annotation) & COCO formatting
Training (fine-tuning) and inference (local worstation or Google Colab)
Post-processing and analysis
Accessibility and efficiency#
We work hard toward making Annolid as accessible as possible to anyone trying to use it and are striving to make all the code be also runnable on Google Colab.
Options for training on Google Colab as well as on a local workstation
Fast training with quality- and speed-optimized options
Model training :
200 labeled images
< 2 hours for 3000 iterations on Google Colab
30 min on NVidia 1080Ti
Inference (applying trained model to behavior videos)
Mask R-CNN: ~7 FPS
YOLACT: ~30 FPS
Transfer learning based on existing models trained on COCO dataset improves performance
Capacities for autolabeling and human-in-the-loop iterative model training