A large video dataset captured by UAVs in a variety of complex real-world scenes, with multiple representations, suitable for multi-task learning.