The McQueen dataset contains 15k visual conversations and over 80k queries, each associated with a fully specified rewrite. In addition, image box annotations are provided for the entities appearing in each rewrite.
Source: McQueen: a Benchmark for Multimodal Conversational Query Rewrite