AI Safety&Security
Code Agent can be an End-to-end System Hacker: Benchmarking Real-world Threats of Computer-use Agent