trustable and focussed LLM generated content
2 papers with code • 0 benchmarks • 0 datasets
ensure that LLM step-by-step generation stays truthful and focussed to the user's goal
Benchmarks
These leaderboards are used to track progress in trustable and focussed LLM generated content
No evaluation results yet. Help compare methods by
submitting
evaluation metrics.
Most implemented papers
Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles
We automate deep step-by step reasoning in an LLM dialog thread by recursively exploring alternatives (OR-nodes) and expanding details (AND-nodes) up to a given depth.
Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
In this work, we propose Prompting4Debugging (P4D) as a debugging and red-teaming tool that automatically finds problematic prompts for diffusion models to test the reliability of a deployed safety mechanism.