Can DeepSORT Realtime Run on Mobile Devices?
Short Answer: Yes, but only with significant optimizations and trade-offs, since mobile devices lack desktop-class GPUs.
Challenges for Mobile Deployment
- GPU Dependency:
  - DeepSORT relies on Re-ID models (CNNs) for appearance features, which are optimized for GPU parallelism.
  - Mobile CPUs struggle with real-time inference due to limited compute power.
- Memory Constraints:
  - `nn_budget` stores appearance features in GPU VRAM; mobile SoCs share memory between CPU and GPU, risking bandwidth bottlenecks.
- Thermal/Power Limits:
  - Sustained high compute (e.g., YOLO detection + DeepSORT tracking) can throttle performance or drain batteries quickly.
Solutions to Run DeepSORT on Mobile
1. Optimize Models for Edge Devices
- Replace the Re-ID model: Use lightweight alternatives such as:
  - MobileNetV3 (TensorFlow Lite)
  - OSNet (a Re-ID architecture designed for efficiency)
  - Quantized models (INT8 precision to reduce compute; see the conversion sketch after this list)
- Framework: Deploy via:
  - TensorFlow Lite
  - PyTorch Mobile
  - ONNX Runtime (with hardware acceleration)
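The INT8 conversion step could look like the following TensorFlow Lite sketch. This is a minimal illustration, not DeepSORT's own tooling: the SavedModel path, the 128x64 crop shape, and the random calibration data are placeholders you would replace with your actual Re-ID model and real person crops.

```python
import numpy as np
import tensorflow as tf

# Placeholder path: a lightweight Re-ID embedder exported as a SavedModel.
converter = tf.lite.TFLiteConverter.from_saved_model("reid_mobilenetv3_savedmodel")

def representative_dataset():
    # Yield ~100 preprocessed crops so the converter can calibrate INT8 ranges.
    # Use real person crops here; random data is only a stand-in.
    for _ in range(100):
        yield [np.random.rand(1, 128, 64, 3).astype(np.float32)]

converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.uint8   # full-integer I/O for mobile delegates
converter.inference_output_type = tf.uint8

with open("reid_int8.tflite", "wb") as f:
    f.write(converter.convert())
```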
2. Reduce Computational Load
- Lower `nn_budget` (e.g., `20` instead of `100`) to limit feature storage; see the configuration sketch after this list.
- Decrease input resolution (e.g., `320x320` instead of `640x640`).
- Fall back to CPU-friendly trackers (e.g., plain SORT without Re-ID) if more frequent ID switches are acceptable.
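A minimal configuration sketch using the deep-sort-realtime package (parameter names follow its `DeepSort` constructor; the detection list and dummy frame are placeholders for your detector and camera pipeline):

```python
import numpy as np
from deep_sort_realtime.deepsort_tracker import DeepSort

tracker = DeepSort(
    max_age=30,
    nn_budget=20,            # keep only the 20 most recent features per track
    max_cosine_distance=0.3,
    embedder="mobilenet",    # built-in lightweight embedder
    embedder_gpu=False,      # run the embedder on CPU
    half=False,              # FP16 is a GPU optimization; disable on CPU
)

# Placeholder inputs: one detection as ([left, top, w, h], confidence, class)
# and a dummy 320x320 BGR frame standing in for a camera image.
detections = [([10, 20, 50, 120], 0.87, "person")]
frame = np.zeros((320, 320, 3), dtype=np.uint8)

tracks = tracker.update_tracks(detections, frame=frame)
for t in tracks:
    if t.is_confirmed():
        print(t.track_id, t.to_ltrb())
```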
3. Hardware Acceleration
- Mobile accelerators (e.g., Qualcomm Adreno GPUs, the Apple Neural Engine), reached via:
  - Core ML (iOS)
  - Android NNAPI (see the sketch after this list for selecting these backends via ONNX Runtime)
- Edge TPUs (Google Coral) for quantized models.
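One way to reach these backends from a single codebase is through ONNX Runtime execution providers. A sketch, assuming the quantized Re-ID model has been exported to ONNX (the model path and the 128x64 crop shape are placeholders; provider availability depends on the platform-specific onnxruntime build you install):

```python
import numpy as np
import onnxruntime as ort

# Prefer a hardware-accelerated provider when available, with CPU fallback.
preferred = [
    "NnapiExecutionProvider",   # Android NNAPI
    "CoreMLExecutionProvider",  # Apple devices (Neural Engine / GPU)
    "CPUExecutionProvider",     # always-available fallback
]
providers = [p for p in preferred if p in ort.get_available_providers()]

session = ort.InferenceSession("reid_int8.onnx", providers=providers)

# Placeholder Re-ID input: one 128x64 RGB person crop in NCHW layout.
crop = np.random.rand(1, 3, 128, 64).astype(np.float32)
(embedding,) = session.run(None, {session.get_inputs()[0].name: crop})
print(embedding.shape)
```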
4. Cloud Offloading
- Offload Re-ID feature extraction to a server (trading network latency for on-device compute); a client-side sketch follows.
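A rough client-side sketch (the `/embed` endpoint, its JSON response shape, and the server URL are all hypothetical; any RPC mechanism would work):

```python
import cv2
import numpy as np
import requests

def remote_embed(crops, url="http://tracking-server.local/embed"):
    """Send JPEG-encoded person crops to a hypothetical Re-ID service
    and return one embedding vector per crop."""
    files = []
    for i, crop in enumerate(crops):
        ok, buf = cv2.imencode(".jpg", crop)
        if ok:
            files.append(("crops", (f"{i}.jpg", buf.tobytes(), "image/jpeg")))
    # Tight timeout: a stale appearance feature is worse than none.
    resp = requests.post(url, files=files, timeout=0.5)
    resp.raise_for_status()
    return [np.asarray(e, dtype=np.float32) for e in resp.json()["embeddings"]]
```

If the request times out, the tracker can fall back to motion-only (IoU) matching for that frame, so tracking degrades gracefully instead of stalling.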