Dense crowd tracking
5 dataset on human head with bounding box annotations
- Data_S-HEAD
- Head Detection (CCTV)
- Human head detection OpenVM + C270
- JHU-CROWD++
- SCUT-HEAD
- first attempt with jhu-crowd++ on yolov10
- second attempt with all 5 datasets merged on yolov10
- yolo models trained on jhu-crowd++ find the model here at kaggle.
- yolov10 from scratch and pretrained with jhu-crowd++ find the model here at kaggle.
1024×1024 frame → Split into 256×256 overlapping tiles.
Layer 1: Dilation rate=2 (captures head edges).
Layer 2: Dilation rate=4 (groups nearby heads).