EPIC-SOUNDS is a large scale dataset of audio annotations capturing temporal extents and class labels within the audio stream of the egocentric videos from EPIC-KITCHENS-100. EPIC-SOUNDS includes 78.4k categorised and 39.2k non-categorised segments of audible events and actions, distributed across 44 classes.
Source: Epic-Sounds: A Large-scale Dataset of Actions That SoundPaper | Code | Results | Date | Stars |
---|