Natural Language Processing

trustable and focussed LLM generated content

2 papers with code • 0 benchmarks • 0 datasets

ensure that LLM step-by-step generation stays truthful and focussed to the user's goal

Benchmarks

Add a Result

These leaderboards are used to track progress in trustable and focussed LLM generated content

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Subtasks

Game Design

Most implemented papers

Most implemented Social Latest No code

Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles

ptarau/recursors • 24 Jun 2023

We automate deep step-by step reasoning in an LLM dialog thread by recursively exploring alternatives (OR-nodes) and expanding details (AND-nodes) up to a given depth.

Paper
Code

Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts

joycenerd/p4d • • 12 Sep 2023

In this work, we propose Prompting4Debugging (P4D) as a debugging and red-teaming tool that automatically finds problematic prompts for diffusion models to test the reliability of a deployed safety mechanism.