OpenAI's new GPT-5.4 clobbers humans on pro-level work in tests - by 83% ...
"In ChatGPT, GPT‑5.4 Thinking can now provide an upfront plan of its thinking, so you can adjust course mid-response while it’s working, and arrive at a final output that’s more closely aligned with ...
A benchmark called OSWorld-Verified, designed to monitor AI's ability to navigate desktop environments, found that GPT 5.4 scored 75%, up from 47.3% with its GPT 5.2 model. That also beats the average ...