Epistory
Terug naar overzicht
Improving instruction hierarchy in frontier LLMs
OpenAI Blog··ongeveer 1 maand geleden

Improving instruction hierarchy in frontier LLMs

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.
Lees origineel artikel

Gerelateerde artikelen