WNLI
4 papers with code • 0 benchmarks • 0 datasets
Most implemented papers
A Hybrid Neural Network Model for Commonsense Reasoning
A hybrid neural network (HNN) consists of two component models, a masked language model and a semantic similarity model, which share a BERT-based contextual encoder but use different model-specific input and output layers.
A Surprisingly Robust Trick for Winograd Schema Challenge
The Winograd Schema Challenge (WSC) dataset WSC273 and its natural language inference counterpart WNLI are popular benchmarks for natural language understanding and commonsense reasoning.
WikiCREM: A Large Unsupervised Corpus for Coreference Resolution
We use a language-model-based approach for pronoun resolution in combination with our WikiCREM dataset.
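The language-model-based approach to pronoun resolution can be sketched as candidate scoring: substitute each candidate antecedent for the pronoun and keep the substitution the language model finds most probable. The scorer below is a stand-in (a real system would use an LM's sequence log-probability), and all names are illustrative, not the paper's implementation:

```python
import re
from typing import Callable, List

def resolve_pronoun(sentence: str, pronoun: str, candidates: List[str],
                    score: Callable[[str], float]) -> str:
    """Replace the pronoun with each candidate antecedent and return the
    candidate whose substituted sentence the scorer deems most likely."""
    # Substitute only the first whole-word occurrence of the pronoun.
    pattern = r"\b" + re.escape(pronoun) + r"\b"
    substituted = {c: re.sub(pattern, c, sentence, count=1) for c in candidates}
    return max(candidates, key=lambda c: score(substituted[c]))

# Toy scorer standing in for a language model's log-probability
# (illustrative only: it simply prefers shorter sentences).
toy_score = lambda s: -float(len(s))

best = resolve_pronoun(
    "The trophy doesn't fit in the suitcase because it is too big.",
    "it", ["the trophy", "the suitcase"], toy_score)
```

In practice the scorer would be an LM that assigns a probability to each substituted sentence, so the choice reflects which antecedent makes the sentence most plausible.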
Time Travel in LLMs: Tracing Data Contamination in Large Language Models
To estimate contamination of individual instances, we employ "guided instruction": a prompt consisting of the dataset name, partition type, and a random-length initial segment of a reference instance, asking the LLM to complete it.