OpenAI News page 78

OpenAI News February 08, 2017 08:00

Adversarial attacks on neural network policies

Machine learning classifiers are known to be vulnerable to inputs maliciously constructed by adversaries to force misclassification. Such adversarial examples have been extensively studied in the context of computer vision applications. In this work, we...

Policy Infrastructure

OpenAI News January 30, 2017 08:00

Team update

The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine learning on robots.

Models

OpenAI Models

OpenAI News January 19, 2017 08:00

PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood and other modifications

PixelCNNs are a recently proposed class of powerful generative models with tractable likelihood. Here we discuss our implementation of PixelCNNs which we make available at this https URL⁠(opens in a new window). Our implementation contains a number of...

Models Infrastructure

OpenAI News December 21, 2016 08:00

Faulty reward functions in the wild

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

OpenAI News December 05, 2016 08:00

Universe

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.

Infrastructure

OpenAI News November 15, 2016 08:00

#Exploration: A study of count-based exploration for deep reinforcement learning

Count-based exploration algorithms are known to perform near-optimally when used in conjunction with tabular reinforcement learning (RL) methods for solving small discrete Markov decision processes (MDPs). It is generally thought that count-based methods...

OpenAI News November 15, 2016 08:00

OpenAI and Microsoft

We’re working with Microsoft to start running most of our large-scale experiments on Azure.

Models

OpenAI Models Microsoft

OpenAI News November 14, 2016 08:00

On the quantitative analysis of decoder-based generative models

The past several years have seen remarkable progress in generative models which produce convincing samples of images and other modalities. A shared component of many powerful generative models is a decoder network, a parametric deep neural net that defines...

Models

OpenAI News November 11, 2016 08:00

A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models

Generative adversarial networks (GANs) are a recently proposed class of generative models in which a generator is trained to optimize a cost function that is being simultaneously learned by a discriminator. While the idea of learning cost functions is...

Models Infrastructure

OpenAI News November 09, 2016 08:00

RL²: Fast reinforcement learning via slow reinforcement learning

Deep reinforcement learning (deep RL) has been successful in learning sophisticated behaviors automatically; however, the learning process requires a huge number of trials. In contrast, animals can learn new tasks in just a few trials, benefiting from their...

OpenAI News November 08, 2016 08:00

Variational lossy autoencoder

Representation learning seeks to expose certain aspects of observed data in a learned representation that's amenable to downstream tasks like classification. For instance, a good representation for 2D images might be one that describes only global structure...

Models

OpenAI News November 02, 2016 07:00

Extensions and limitations of the neural GPU

The Neural GPU is a recent model that can learn algorithms such as multi-digit binary addition and binary multiplication in a way that generalizes to inputs of arbitrary length. We show that there are two simple ways of improving the performance of the...

Infrastructure