VNL-STES: A Benchmark Dataset and Model for Spatiotemporal Event Spotting in Volleyball Analytics
- Authors
- Nguyen, Hoang Quoc; Jamsrandorj, Ankhzaya; Chao, Vanyi; Oo, Yin May; Robbani, Muhammad Amrulloh; Mun, Kyung Ryoul; Kim, Jinwook
- Issue Date
- 2025-06-12
- Publisher
- IEEE Computer Society
- Citation
- 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.5852 - 5861
- Abstract
- Volleyball video analytics require precisely detecting both the timing and location of key events. We introduce a novel task: Precise Spatiotemporal Event Spotting, which seeks to accurately determine when and where important events occur within a video. To this end, we created the Volleyball Nations League (VNL) Dataset, including 8 full games, 1,028 rally videos, and 6,137 annotated events with both temporal and spatial localization. Our best model, the Spatiotemporal Event Spotter (STES), outperforms the current state-of-the-art (SOTA) in temporal action spotting by 9.86 mean Temporal Average Precision (mTAP) and achieves a notable 80.21 mAP for spatial localization, accurately pinpointing event locations within a 2–6 pixel range. To the best of our knowledge, this is the first work addressing Precise Spatiotemporal Event Spotting in volleyball, establishing a strong baseline for future research in this domain. The code and data for this paper are available publicly at: https://hoangqnguyen.github.io/stes/
- URI
- https://pubs.kist.re.kr/handle/201004/153881
- DOI
- 10.1109/CVPRW67362.2025.00584
- Appears in Collections:
- KIST Conference Paper > 2025
- Export
- RIS (EndNote)
- XLS (Excel)
- XML
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.