📰 OpenAI News

Articles from OpenAI News · 906 articles · Updated every 3 hours

OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Competitive self-play
We’ve found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball, without
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Meta-learning for wrestling
We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent, and also show that
OpenAI News 8y ago
Nonlinear computation in deep linear networks
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Learning to model other minds
We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit
OpenAI News 8y ago
Learning with opponent-learning awareness
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
OpenAI Baselines: ACKTR & A2C
We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
More on Dota 2
Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compu
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Dota 2
We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from scratch by
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Gathering human feedback
RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlyin
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Better exploration with parameter noise
We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple t
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Proximal Policy Optimization
We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art a
OpenAI News 👁️ Computer Vision ⚡ AI Lesson 8y ago
Robust adversarial inputs
We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that
OpenAI News 8y ago
Hindsight Experience Replay
OpenAI News 8y ago
Teacher–student curriculum learning
OpenAI News ⚡ AI Lesson 8y ago
Faster physics in Python
We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Learning from human preferences
One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting th
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Learning to cooperate, compete, and communicate
Multiagent environments where agents compete for resources are stepping stones on the path to AGI. Multiagent environments have two useful properties: first, th
OpenAI News 8y ago
UCB exploration via Q-ensembles
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
OpenAI Baselines: DQN
We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll r
OpenAI News 🧠 Large Language Models ⚡ AI Lesson 8y ago
Robots that learn
We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.