BoxMOT

Overview

BoxMOT is a Python package providing a modular architecture for multi-object tracking (MOT). It supports integration with various segmentation, object detection, and pose estimation models, enabling users to easily swap different SOTA tracking algorithms. The key value proposition lies in its pluggable architecture, universal model support, and benchmark-ready local evaluation pipelines for datasets like MOT17, MOT20, and DanceTrack. Performance modes include motion-only for lightweight CPU-efficient tracking and motion + appearance, combining motion cues with appearance embeddings (CLIPReID, LightMBN, OSNet) to maximize identity consistency and accuracy. It supports reusable detections and embeddings, which can be saved and reused for evaluations, eliminating redundant preprocessing. BoxMOT utilizes a command-line interface (CLI) for simplified syntax, allowing users to track objects, evaluate performance, tune hyperparameters, generate tracking data and export models.

Common tasks

Multi-Object Tracking Object Detection Pose Estimation Segmentation

FAQ

View all

What object detection models are compatible with BoxMOT?

BoxMOT supports any object detection model that outputs bounding boxes, including YOLOv8, YOLOX, and FasterRCNN.

What ReID models can be used with BoxMOT?

BoxMOT supports various ReID models, including OSNet, CLIPReID, and LightMBN.

How do I choose the right tracker for my application?

The choice of tracker depends on your specific needs. DeepOCSORT, BoTSORT, ByteTrack, StrongSORT, OCSort, HybridSORT, BoostTrack and SF-SORT provide different trade-offs between speed, accuracy, and computational resources.

Can I use BoxMOT for real-time tracking?

Yes, BoxMOT supports real-time tracking using the motion-only performance mode for CPU-efficient high-FPS performance.

FAQ+