OpenAI News page 68

OpenAI News December 03, 2019 08:00

Procgen Benchmark

We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement learning agent learns generalizable skills.

OpenAI News November 21, 2019 08:00

Safety Gym

We’re releasing Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints while training.

Agents Infrastructure

OpenAI News November 21, 2019 08:00

Benchmarking safe exploration in deep reinforcement learning

Reinforcement learning (RL) agents need to explore their environments in order to learn optimal policies by trial and error. In many environments, safety is a critical concern and certain errors are unacceptable: for example, robotics systems that interact...

Agents Infrastructure

OpenAI News November 05, 2019 08:00

GPT-2: 1.5B release

As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection of outputs of GPT-2 models. While there have been larger language models released...

Models

OpenAI News October 15, 2019 07:00

Solving Rubik’s Cube with a robot hand

We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in simulation, using the same reinforcement learning code as OpenAI Five paired with a new technique called Automatic...

Models Infrastructure

OpenAI Models Infrastructure

OpenAI News October 11, 2019 07:00

OpenAI Scholars 2020: Applications open

We are now accepting applications for our third class of OpenAI Scholars.

Models

OpenAI Models

OpenAI News September 19, 2019 07:00

Fine-tuning GPT-2 from human preferences

We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences did not always match our own. Specifically, for summarization...

Models

OpenAI News September 17, 2019 07:00

Emergent tool use from multi-agent interaction

We’ve observed agents discovering progressively more complex tool use while playing a simple game of hide-and-seek. Through training in our new simulated hide-and-seek environment, agents build a series of six distinct strategies and counterstrategies, some...

Agents Infrastructure

OpenAI News August 22, 2019 07:00

Testing robustness against unforeseen adversaries

We’ve developed a method to assess whether a neural network classifier can reliably defend against adversarial attacks not seen during training. Our method yields a new metric, UAR (Unforeseen Attack Robustness), which evaluates the robustness of a single...

Infrastructure

OpenAI News August 20, 2019 07:00

GPT-2: 6-month follow-up

We’re releasing the 774 million parameter GPT-2 language model after the release of our small 124M model in February, staged release of our medium 355M model in May, and subsequent research with partners and the AI community into the model’s potential for...

Models

OpenAI News August 01, 2019 07:00

Learning Day

At OpenAI, each Thursday is Learning Day: a day where employees have the option to self-study technical skills that will make them better at their job but which aren’t being learned from daily work.

Models

OpenAI Models

OpenAI News July 22, 2019 07:00

Microsoft invests in and partners with OpenAI to support us building beneficial AGI

Microsoft is investing $1 billion in OpenAI to support us building artificial general intelligence (AGI) with widely distributed economic benefits. We’re partnering to develop a hardware and software platform within Microsoft Azure which will scale to AGI....

Models

OpenAI Models Microsoft Microsoft Azure