Jinhwan and Yoonki won 1st place out of 23 teams in the Grounded Video Question Answering (GVQA) track at the ICCV 2025 Perception Test Challenge. The framework introduces a three-stage pipeline (Reasoning, Grounding, Tracking) and a novel “trigger moment” derived from a CORTEX prompt for robust anchoring. This method achieved a HOTA score of 0.4968, substantially surpassing the previous year’s winning score of 0.2704. Congratulations!


