OpenAI’s frontier model may not have astounded when it arrived earlier this year, but research indicates it’s now much better ...
OpenAI’s GPT 5.1 Codex Max runs 24-hour workflows, handles multifile refactors, reaches 80% accuracy, and uses 30% fewer ...
So, bottom line, if OpenAI can substantially reduce the cost of API calls and still deliver AI value, as it seems to have ...
OpenAI characterizes GPT-5.1-Codex-Max as the company’s first coding model explicitly trained to operate across multiple ...
The new model, which is based on the GPT-5.1 architecture, was trained using real-world software engineering tasks like creating pull requests, code reviews, website building, and answering technical ...
The SWE-Bench Verified evaluation is basically a test of AI processing accuracy. It measures how well the AI solves a set of coding problems. According to OpenAI, GPT-5.1-Codex-Max "reaches the same ...
In a new benchmark named Vibe Code Bench, OpenAI’s GPT-5.1 achieved the highest level of accuracy in completing a series of software engineering tasks, narrowly beating rival Anthropic’s Claude 4.5 ...
Acqui-hires feel like they’re here to stay: The team behind Alex Codes, a popular tool that lets developers use AI models within Apple’s development suite Xcode, is joining OpenAI. In a post on X, ...
Amazon has directed its software engineers to cease using third-party AI coding tools like OpenAI's Codex and Anthropic's ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results