AI News

Latest AI tool releases, research breakthroughs, and industry news.

Older

OpenAI o1 System Card

This report outlines the safety work carried out prior to releasing OpenAI o1 and o1-mini, including external red teaming and frontier risk evaluations according to our Preparedness Framework.

OpenAI Blog · Dec 5 · research

Morgan Stanley is shaping the future of financial services

Morgan Stanley uses AI evals to shape the future of financial services

OpenAI Blog · Dec 4 · research

Advancing red teaming with people and AI

OpenAI Blog · Nov 21 · research

Data-driven beauty and creativity with ChatGPT

Data-driven beauty: How The Estée Lauder Companies unlocks insights with ChatGPT

OpenAI Blog · Nov 12 · research

Introducing SimpleQA

SimpleQA is a factuality benchmark that measures the ability of language models to answer short, fact-seeking questions.

OpenAI Blog · Oct 30 · research

Simplifying, stabilizing, and scaling continuous-time consistency models

We’ve simplified, stabilized, and scaled continuous-time consistency models, achieving comparable sample quality to leading diffusion models, while using only two sampling steps.

OpenAI Blog · Oct 23 · research

OpenAI and the Lenfest Institute AI Collaborative and Fellowship program

OpenAI Blog · Oct 21 · research

Evaluating fairness in ChatGPT

We've analyzed how ChatGPT responds to users based on their name, using AI research assistants to protect privacy.

OpenAI Blog · Oct 15 · research

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.

OpenAI Blog · Oct 10 · research

Creating agent and human collaboration with GPT-4o

Altera uses GPT-4o to build a new area of human collaboration

OpenAI Blog · Oct 1 · research

Using GPT-4 to improve teaching and learning in Brazil

Improving teaching and learning in Brazil

OpenAI Blog · Sep 16 · research

Learning to reason with LLMs

OpenAI Blog · Sep 12 · research

Decoding genetics with OpenAI o1

Geneticist Catherine Brownstein demonstrates how OpenAI o1 can speed up the process of diagnosing rare medical challenges.

OpenAI Blog · Sep 11 · research

Answering quantum physics questions with OpenAI o1

Quantum physicist Mario Krenn uses OpenAI o1 to help answer life's biggest questions.

OpenAI Blog · Sep 11 · research

Disrupting a covert Iranian influence operation

OpenAI Blog · Aug 16 · research

Introducing SWE-bench Verified

We’re releasing a human-validated subset of SWE-bench that more reliably evaluates AI models’ ability to solve real-world software issues.

OpenAI Blog · Aug 13 · research

Improving Model Safety Behavior with Rule-Based Rewards

We've developed and applied a new method leveraging Rule-Based Rewards (RBRs) that aligns models to behave safely without extensive human data collection.

OpenAI Blog · Jul 24 · research

Prover-Verifier Games improve legibility of language model outputs

Discover how prover-verifier games improve the legibility of language model outputs, making AI solutions clearer, easier to verify, and more trustworthy for both humans and machines.

OpenAI Blog · Jul 17 · research

OpenAI and Los Alamos National Laboratory announce research partnership

OpenAI and Los Alamos National Laboratory are working to develop safety evaluations to assess and measure biological capabilities and risks associated with frontier models.

OpenAI Blog · Jul 9 · research

Finding GPT-4’s mistakes with GPT-4

CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF.

OpenAI Blog · Jun 27 · research
