This page has the most up-to-date information for our challenges. For detailed information on a method, please click the method name. To sort by a specific metric, click on the header in the table. For further questions, please contact us at jrdb@cs.stanford.edu.

Additional Information Used

  • Individual Image: Method uses individual images from each camera
  • Stitched Image: Method uses stitched images combined from the individual cameras
  • Pointcloud: Method uses 3D pointcloud data
  • Online Tracking: Method does frame-by-frame processing with no lookahead
  • Offline Tracking: Method does not process frames strictly in order and may use future frames
  • Public Detections: Method uses publicly available detections
  • Private Detections: Method uses its own private detections
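
The difference between online and offline processing described above can be illustrated with a toy sketch. This is not a tracker and not any submitted method: the function names `online_smooth` and `offline_smooth` and the choice of a simple mean are assumptions made purely for illustration. The online version emits a result for each frame using only frames seen so far; the offline version also reads the next frame.

```python
from typing import Iterable, Iterator, List


def online_smooth(values: Iterable[float]) -> Iterator[float]:
    """Causal running mean: the output for frame t uses frames 0..t only."""
    total = 0.0
    for count, value in enumerate(values, start=1):
        total += value
        yield total / count  # emitted before any later frame is read


def offline_smooth(values: List[float]) -> List[float]:
    """Centered 3-frame mean: the output for frame t also uses frame t+1."""
    smoothed = []
    for t in range(len(values)):
        window = values[max(0, t - 1): t + 2]  # includes one future frame
        smoothed.append(sum(window) / len(window))
    return smoothed


frames = [1.0, 2.0, 6.0]  # hypothetical per-frame measurements
online_result = list(online_smooth(frames))
offline_result = offline_smooth(frames)
```

Because `online_smooth` is a generator over an iterable, it can run on a live stream, whereas `offline_smooth` needs the whole sequence in memory before it can produce its first output.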

2D Detection Leaderboard

Name | AP ↑ | Runtime ↓ | CPU/GPU | Reference
T_HJ | 68.10 | 0.1 s | 1 GPU (Titan X) | Anonymous Submission
MMPAT_CVPR21 | 67.88 | 0.07 s | 1 GPU (Titan X) | Y. He, W. Yu, J. Han, X. Wei, X. Hong and Y. Gong. Know Your Surroundings: Panoramic Multi-Object Tracking by Multimodality Collaboration. In CVPRW, 2021.
Team_HJ | 67.38 | 0.07 s | 1 GPU (Titan X) | Anonymous Submission
TEAM_Hojun | 65.99 | 0.04 s | 4 GPU (GTX Titan X) | Anonymous Submission
TEST_KKANG | 59.72 | 0.038 s | 1 GPU (Titan X) | Anonymous Submission
Faster R-CNN | 52.17 | 0.038 s | 1 GPU (Titan X) | S. Ren, K. He, R. Girshick and J. Sun. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. In NeurIPS, 2015.
RetinaNet | 50.38 | 0.056 s | 1 GPU (Titan X) | T. Lin, P. Goyal, R. Girshick, K. He and P. Dollár. Focal Loss for Dense Object Detection. In ICCV, 2017.
DETR | 48.66 | 0.35 s | 1 GPU (GTX 1060) | N. Carion, F. Massa, G. Synnaeve, N. Usunier, A. Kirillov and S. Zagoruyko. End-to-End Object Detection with Transformers. In ECCV, 2020.
YOLOv3 | 41.73 | 0.051 s | 1 GPU (Titan X) | J. Redmon and A. Farhadi. YOLOv3: An Incremental Improvement. In arXiv, 2018.
jihoo_S2 | 0.37 | 0.038 s | 1 GPU (Titan X) | Anonymous Submission
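
For readers working from a static copy of this page, where clicking a column header is not available, the same sorting can be sketched in a few lines of Python. The `Entry` type and `sort_entries` function are hypothetical names introduced here; the rows are copied from the 2D detection leaderboard, and the sort direction follows the arrows in the header (AP higher is better, runtime lower is better).

```python
from typing import List, NamedTuple


class Entry(NamedTuple):
    name: str
    ap: float         # AP, higher is better
    runtime_s: float  # runtime in seconds, lower is better


# A few rows copied from the 2D detection leaderboard.
ENTRIES = [
    Entry("Faster R-CNN", 52.17, 0.038),
    Entry("RetinaNet", 50.38, 0.056),
    Entry("DETR", 48.66, 0.35),
    Entry("YOLOv3", 41.73, 0.051),
]

# Whether a larger value ranks first for each metric.
HIGHER_IS_BETTER = {"ap": True, "runtime_s": False}


def sort_entries(entries: List[Entry], metric: str) -> List[Entry]:
    """Return entries ordered best-first for the given metric."""
    return sorted(
        entries,
        key=lambda entry: getattr(entry, metric),
        reverse=HIGHER_IS_BETTER[metric],
    )


by_ap = [entry.name for entry in sort_entries(ENTRIES, "ap")]
by_runtime = [entry.name for entry in sort_entries(ENTRIES, "runtime_s")]
```

Python's `sorted` is stable, so entries with equal values keep their original relative order.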

3D Detection Leaderboard

Name | AP ↑ | Runtime ↓ | CPU/GPU | Reference
Person-MinkUNet | 76.42 | 0.059 s | 1 GPU (TITAN RTX) | D. Jia and B. Leibe. Person-MinkUNet: 3D Person Detection with LiDAR Point Cloud. In CVPRW, 2021.
Team_MJM | 69.20 | 0.04 s | 1 GPU (GTX 1080Ti) | Anonymous Submission
TANet++ | 63.92 | 0.28 s | 1 GPU (Titan Tesla K40c) | C. Ma. TANet++: Triple Attention Network with Filtered Pointcloud on 3D Detection. arXiv preprint arXiv:2106.15366, 2021.
Team_minjunmin | 57.26 | 0.04 s | 1 GPU (GTX 1080Ti) | Anonymous Submission
TANet | 54.94 | 0.28 s | 1 GPU (Titan Tesla K40c) | Z. Liu, X. Zhao, T. Huang, R. Hu, Y. Zhou and X. Bai. TANet: Robust 3D Object Detection from Point Clouds with Triple Attention. In AAAI, 2020.
TANet_on_JRDB | 42.78 | 0.019 s | 1 GPU (TITAN V) | Anonymous Submission
F-PointNet | 38.21 | 0.17 s | 1 GPU (Titan X) | C. Qi, W. Liu, C. Wu, H. Su and L. Guibas. Frustum PointNets for 3D Object Detection from RGB-D Data. In CVPR, 2018.
abc | 0 | 0.026 s | 1 GPU (GTX 1080Ti) | Anonymous Submission