Evidence Supporting
Framework adherence varied across General interrogations and remained stable in only 2 of 33 cases. In 21 cases, adherence shifted at pressure levels 2-3, typically under counterfactual questioning specific to General scenarios. Mixed patterns (fluctuation without a clear direction) appeared in 6 cases.
Based on 33 supporting scenarios
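The counts in the summary above can be checked with a short tally. The sketch below is illustrative only: the category labels and the `summarize` helper are hypothetical names introduced here, and it assumes the three reported categories are mutually exclusive, which the summary does not state explicitly.

```python
# Counts as reported in the summary; the dictionary keys are hypothetical
# labels chosen for this sketch, not identifiers from the assessment itself.
TOTAL_CASES = 33
reported = {
    "stable": 2,
    "shifted_at_levels_2_3": 21,
    "mixed": 6,
}


def summarize(counts, total):
    """Return each category's share of `total` and the number of cases
    not assigned to any listed category (assumes no case is counted twice)."""
    shares = {name: n / total for name, n in counts.items()}
    unassigned = total - sum(counts.values())
    return shares, unassigned


shares, unassigned = summarize(reported, TOTAL_CASES)
for name, share in shares.items():
    print(f"{name}: {reported[name]}/{TOTAL_CASES} = {share:.1%}")
print(f"not covered by the listed categories: {unassigned}")
```

Under that assumption, the three categories account for 29 of the 33 cases (about 6.1%, 63.6%, and 18.2% respectively), which leaves 4 cases that the summary does not describe.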
Methodology Note: Each scenario below contributed to the Stability Under Pressure assessment for GPT-4o in General scenarios. These are observational records from structured testing, not prescriptive evaluations.