Governance of superintelligence
Now is a good time to start thinking about the governance of superintelligence—future AI systems dramatically more capable than even AGI.
Ensuring that AI systems are built, deployed, and used safely is critical to our mission.
OpenAI researchers collaborated with Georgetown University’s Center for Security and Emerging Technology and the Stanford Internet Observatory to investigate how large language models might be misused for disinformation purposes. The collaboration included...
In reinforcement learning from human feedback, it is common to optimize against a reward model trained to predict human preferences. Because the reward model is an imperfect proxy, optimizing its value too much can hinder ground truth performance, in...
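A toy numerical sketch of this failure mode (our illustration, not the paper's experimental setup): candidates are scalar actions, a hypothetical gold reward peaks and then declines, and a noisy monotone proxy stands in for the learned reward model. As best-of-n optimization pressure grows, the proxy keeps rewarding larger actions while gold performance first rises and then falls.

```python
# Toy illustration of reward model overoptimization (assumed setup, not the
# paper's): gold reward peaks at x = 5; the proxy is monotone in x plus noise.
import numpy as np

rng = np.random.default_rng(0)

def gold_reward(x):
    # Hypothetical ground-truth preference: improves up to x = 5, then declines.
    return x - 0.1 * x**2

def proxy_reward(x):
    # Imperfect proxy for the gold reward: keeps increasing in x, plus label noise.
    return x + rng.normal(0.0, 1.0, size=np.shape(x))

for n in [1, 4, 16, 64, 256, 1024]:
    # Best-of-n optimization: draw n candidates, keep the one the proxy scores highest.
    picks = []
    for _ in range(2000):
        xs = rng.normal(0.0, 3.0, size=n)
        picks.append(xs[np.argmax(proxy_reward(xs))])
    picks = np.asarray(picks)
    print(f"n={n:4d}  proxy-selected x: {picks.mean():5.2f}   "
          f"gold reward: {gold_reward(picks).mean():5.2f}")
```

Running this, mean gold reward rises with n at first and then degrades as selection pushes into the region where the proxy and the ground truth diverge, the Goodhart-style pattern the paper studies quantitatively.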
In order to share the magic of DALL·E 2 with a broad audience, we needed to reduce the risks associated with powerful image generation models. To this end, we put various guardrails in place to prevent generated images from violating our content policy.
OpenAI is developing a research program to assess the economic impacts of code generation models and is inviting collaboration with external researchers. Rapid advances in the capabilities of large language models (LLMs) trained on code have made it...
OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires expertise in public policy as well as technology. So, we’re delighted to announce that Congressman Will...
On October 14, 2020, researchers from OpenAI, the Stanford Institute for Human-Centered Artificial Intelligence, and other universities convened to discuss open research questions surrounding GPT‑3, the largest publicly disclosed dense language model at...
We’ve contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, Mila, Schwartz Reisman Institute for Technology and Society, Center for Advanced Study in the Behavioral Sciences, and...
We’ve written a policy research paper identifying four strategies that can be used today to improve the likelihood of long-term industry cooperation on safety norms in AI: communicating risks and benefits, technical collaboration, increased transparency,...
We’ve written a paper arguing that long-term AI safety research needs social scientists to ensure AI alignment algorithms succeed when actual humans are involved. Properly aligning advanced AI systems with human values requires resolving many uncertainties...
We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled data or...
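A minimal sketch of the core decomposition idea (our illustration with a stand-in task; weak_solver, decompose, and combine are hypothetical names, not OpenAI's implementation): a limited solver that only handles tiny inputs is amplified by recursively splitting a task and combining the sub-answers, producing answers beyond its direct reach.

```python
# Sketch of one amplification step, assuming a stand-in task (summing a list).

def weak_solver(task):
    # Directly solvable only for tiny tasks (the "human-scale" base case).
    assert len(task) <= 2
    return sum(task)

def decompose(task):
    # Split one task into two simpler sub-tasks.
    mid = len(task) // 2
    return task[:mid], task[mid:]

def combine(a, b):
    # Merge sub-answers into an answer for the original task.
    return a + b

def amplify(task):
    # Amplified solver: recursively decompose until the weak solver applies.
    if len(task) <= 2:
        return weak_solver(task)
    left, right = decompose(task)
    return combine(amplify(left), amplify(right))

print(amplify(list(range(10))))  # 45, beyond the weak solver's direct reach
```

In the full technique, a model is then trained to imitate this amplified system, and the amplify-then-distill loop iterates so capability grows without ever needing labeled data or a reward function for the top-level task.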