A realistic AI control framework that breaks down at superintelligence
"Rather than assuming that agents will always execute the best attack strategies known to humans, we demonstrate how knowledge of an agent's actual capability profile can inform proportional control evaluations, resulting in more practical and cost-effective control measures."