Models page 145

OpenAI News February 20, 2018 08:00

OpenAI supporters

We’re excited to welcome new donors to OpenAI.

Models

OpenAI Models

OpenAI News January 31, 2018 08:00

Requests for Research 2.0

We’re releasing a new batch of seven unsolved problems which have come up in the course of our research at OpenAI.

Models

OpenAI Models

OpenAI News October 17, 2017 07:00

Domain randomization and generative models for robotic grasping

Models

OpenAI News August 18, 2017 07:00

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives equal performance. ACKTR is a more sample-efficient reinforcement...

Models

OpenAI Models

OpenAI News July 20, 2017 07:00

Proximal Policy Optimization

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to implement and tune. PPO has become the default...

Models Policy

OpenAI Models Policy

OpenAI News June 13, 2017 07:00

Learning from human preferences

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration...

Models

OpenAI News May 24, 2017 07:00

OpenAI Baselines: DQN

We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over upcoming months; today’s release includes DQN and three of its variants.

Models

OpenAI Models

OpenAI News May 15, 2017 07:00

Roboschool

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

Models

OpenAI Models

OpenAI News March 16, 2017 07:00

Learning to communicate

In this post we’ll outline new OpenAI research in which agents develop their own language.

Models Agents

OpenAI Models Agents

OpenAI News March 12, 2017 08:00

Prediction and control with temporal segment models

Models

OpenAI News February 24, 2017 08:00

Attacking machine learning with adversarial examples

Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they’re like optical illusions for machines. In this post we’ll show how adversarial examples work across different...

Models

OpenAI News January 30, 2017 08:00

Team update

The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.

Models

OpenAI Models