Great work! Thank you for sharing!
This work is for generating action proposals.
In your paper you mention that, for temporal action detection on ActivityNet-1.3, you adopt the top-1 video-level classification results generated by the method of [Zhao, Y., et al., CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2017] and use the confidence scores of your proposals to retrieve detection results. I did not understand how this works. Could you please answer the following questions:
- The paper by Zhao, Y., et al. (CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2017) describes different methods for different tasks. Which method did you adopt? Is it SSN (Structured Segment Networks)?
- BSN generates at least 100 proposals. Do you use all of them for action classification?
- Suppose you choose only the top-k proposals (e.g., k = 2, 3, 4, or 5) output by BSN for action classification, and you perform video-level action classification for each selected proposal. Does that mean a proposal that starts on frame m and ends on frame n receives only one action label?
- Would it be possible to release your detection demo? That would be awesome!
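To make the questions concrete, here is a minimal Python sketch of how I currently read that sentence in the paper. It is only my guess, not your actual pipeline: the function name `proposals_to_detections`, the `(start, end, confidence)` tuple layout, and the optional `top_k` cut-off are all hypothetical. It assumes every proposal in a video simply inherits the single top-1 video-level label and keeps its own proposal confidence as the detection score.

```python
from typing import List, Optional, Tuple

# Hypothetical layouts, not taken from the BSN code base.
Proposal = Tuple[float, float, float]        # (start, end, proposal confidence)
Detection = Tuple[float, float, str, float]  # (start, end, label, score)


def proposals_to_detections(
    proposals: List[Proposal],
    video_label: str,
    top_k: Optional[int] = None,
) -> List[Detection]:
    """Attach one video-level label to every proposal (my assumption).

    Proposals are ranked by their own confidence; if top_k is given,
    only the k highest-scoring proposals are kept.
    """
    ranked = sorted(proposals, key=lambda p: p[2], reverse=True)
    if top_k is not None:
        ranked = ranked[:top_k]
    # Assumption: the detection score is the proposal confidence alone,
    # with no fusion with the classifier's score.
    return [(start, end, video_label, conf) for start, end, conf in ranked]


if __name__ == "__main__":
    example = [(0.0, 5.0, 0.3), (2.0, 8.0, 0.9), (10.0, 12.0, 0.6)]
    for det in proposals_to_detections(example, "Long jump", top_k=2):
        print(det)
```

If the real pipeline differs, for example by classifying each proposal separately or by multiplying the proposal confidence with the classification score, that is exactly the part I would like to understand.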