Policy page 24

OpenAI News October 19, 2022 07:00

Scaling laws for reward model overoptimization

In reinforcement learning from human feedback, it is common to optimize against a reward model trained to predict human preferences. Because the reward model is an imperfect proxy, optimizing its value too much can hinder ground truth performance, in...

Policy

OpenAI News June 28, 2022 07:00

DALL·E 2 pre-training mitigations

In order to share the magic of DALL·E 2 with a broad audience, we needed to reduce the risks associated with powerful image generation models. To this end, we put various guardrails in place to prevent generated images from violating our content policy.

Models Policy Infrastructure

OpenAI News March 03, 2022 08:00

A research agenda for assessing the economic impacts of code generation models

OpenAI is developing a research program to assess the economic impacts of code generation models and is inviting collaboration with external researchers. Rapid advances in the capabilities of large language models (LLMs) trained on code have made it...

Models Policy

OpenAI Models Policy

OpenAI News May 03, 2021 07:00

Will Hurd joins OpenAI’s board of directors

OpenAI is committed to developing general-purpose artificial intelligence that benefits all humanity, and we believe that achieving our goal requires expertise in public policy as well as technology. So, we’re delighted to announce that Congressman Will...

Models Policy

OpenAI Models Policy

OpenAI News February 04, 2021 08:00

Understanding the capabilities, limitations, and societal impact of large language models

On October 14th, 2020, researchers from OpenAI, the Stanford Institute for Human-Centered Artificial Intelligence, and other universities convened to discuss open research questions surrounding GPT‑3, the largest publicly-disclosed dense language model at...

Models Policy

OpenAI Models Policy

OpenAI News April 16, 2020 07:00

Improving verifiability in AI development

We’ve contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, Mila, Schwartz Reisman Institute for Technology and Society, Center for Advanced Study in the Behavioral Sciences, and...

Policy

OpenAI News July 10, 2019 07:00

Why responsible AI development needs cooperation on safety

We’ve written a policy research paper identifying four strategies that can be used today to improve the likelihood of long-term industry cooperation on safety norms in AI: communicating risks and benefits, technical collaboration, increased transparency,...

Policy

OpenAI News February 19, 2019 08:00

AI safety needs social scientists

We’ve written a paper arguing that long-term AI safety research needs social scientists to ensure AI alignment algorithms succeed when actual humans are involved. Properly aligning advanced AI systems with human values requires resolving many uncertainties...

Models Policy

OpenAI Models Policy

OpenAI News October 22, 2018 07:00

Learning complex goals with iterated amplification

We’re proposing an AI safety technique called iterated amplification that lets us specify complicated behaviors and goals that are beyond human scale, by demonstrating how to decompose a task into simpler sub-tasks, rather than by providing labeled data or...

Policy

OpenAI News July 26, 2018 07:00

Variational option discovery algorithms

We explore methods for option discovery based on variational inference and make two algorithmic contributions. First: we highlight a tight connection between variational option discovery methods and variational autoencoders, and introduce Variational...

Policy Infrastructure

OpenAI News June 17, 2018 07:00

Learning policy representations in multiagent systems

Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems. Prior work in agent modeling has largely been task-specific and driven by hand-engineering domain-specific prior knowledge. We propose a general...

Policy

OpenAI News May 03, 2018 07:00

AI safety via debate

We’re proposing an AI safety technique which trains agents to debate topics with one another, using a human to judge who wins.

Agents Policy