|
@
OpenAI
|
|
OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We're hiring: openai.com/jobs
|
|
|
447
Tweetovi
|
0
Pratim
|
403.549
Osobe koje vas prate
|
| Tweetovi |
|
OpenAI
@OpenAI
|
30. sij |
|
We're standardizing OpenAI's deep learning framework on PyTorch to increase our research productivity at scale on GPUs (and have just released a PyTorch version of Spinning Up in Deep RL): openai.com/blog/openai-py… pic.twitter.com/lgvqDdWDoB
|
||
|
|
||
|
OpenAI
@OpenAI
|
13. pro |
|
We're releasing "Dota 2 with Large Scale Deep Reinforcement Learning", a scientific paper analyzing our findings from our 3-year Dota project: openai.com/projects/five/
One highlight — we trained a new agent, Rerun, which has a 98% win rate vs the version that beat @OGEsports. pic.twitter.com/1kWvXwBHHp
|
||
|
|
||
|
OpenAI
@OpenAI
|
5. pro |
|
A surprising deep learning mystery:
Contrary to conventional wisdom, performance of unregularized CNNs, ResNets, and transformers is non-monotonic: improves, then gets worse, then improves again with increasing model size, data size, or training time.
openai.com/blog/deep-doub… pic.twitter.com/Zdox9dbIBv
|
||
|
|
||
|
OpenAI
@OpenAI
|
3. pro |
|
We're releasing Procgen Benchmark, 16 procedurally-generated environments for measuring how quickly a reinforcement learning agent learns generalizable skills.
This has become the standard research platform used by the OpenAI RL team: openai.com/blog/procgen-b… pic.twitter.com/OhECCCAeY3
|
||
|
|
||
|
OpenAI
@OpenAI
|
21. stu |
|
We're releasing Safety Gym, environments and tools to evaluate reinforcement learning with safety constraints: openai.com/blog/safety-gy…
Aims to ultimately help agents satisfy real-world safety requirements while training (eg not driving off a cliff, not writing abusive content). pic.twitter.com/VTwS4KoFS1
|
||
|
|
||
|
OpenAI
@OpenAI
|
7. stu |
|
We've analyzed compute used in major AI results for the past decades and identified two eras in AI:
1) Prior to 2012 - AI results closely tracked Moore's Law, w/ compute doubling every two years.
2) Post-2012 - compute has been doubling every 3.4 months openai.com/blog/ai-and-co… pic.twitter.com/ILN5MRrWYH
|
||
|
|
||
|
OpenAI
@OpenAI
|
5. stu |
|
We're releasing the 1.5billion parameter GPT-2 model as part of our staged release publication strategy.
- GPT-2 output detection model: github.com/openai/gpt-2-o…
- Research from partners on potential malicious uses: d4mucfpksywv.cloudfront.net/papers/GPT_2_R…
- More details: openai.com/blog/gpt-2-1-5… pic.twitter.com/O3k28rrE5l
|
||
|
|
||
|
OpenAI
@OpenAI
|
16. lis |
|
In case you missed it, here’s the unedited solve of the Rubik’s cube: youtube.com/watch?v=kVmp0u…
|
||
|
|
||
|
OpenAI
@OpenAI
|
16. lis |
|
Human hands let us solve a wide variety of tasks. Even so, solving a Rubik's Cube one-handed isn't easy for humans. We're excited to continue to develop new AI technology and ultimately ensure that these systems benefit all of humanity. youtube.com/watch?v=x4O8po…
|
||
|
|
||
|
OpenAI
@OpenAI
|
16. lis |
|
"Solving the Rubik's Cube with a Robot Hand" took many human hands over the past 2.5 years — meet our Robotics team! (PS they're hiring: openai.com/jobs/!) pic.twitter.com/jeCUEDtYY3
|
||
|
|
||
|
OpenAI
@OpenAI
|
15. lis |
|
We’re all used to robots that fail when their environment changes unpredictably. Our robotic system is adaptable enough to handle unexpected situations not seen during training, such as being prodded by a stuffed giraffe: pic.twitter.com/wBoh1nt9Kv
|
||
|
|
||
|
OpenAI
@OpenAI
|
15. lis |
|
We've trained an AI system to solve the Rubik's Cube with a human-like robot hand.
This is an unprecedented level of dexterity for a robot, and is hard even for humans to do.
The system trains in an imperfect simulation and quickly adapts to reality: openai.com/blog/solving-r… pic.twitter.com/8lGhU2pPck
|
||
|
|
||
|
OpenAI
@OpenAI
|
11. lis |
|
Now accepting applications for our 3rd class of OpenAI Scholars: a 4 month full-time program for individuals from underrepresented groups to study deep learning and produce an open-source project. Mentors include @mcleavey, @karlcobbe, @AlecRad: openai.com/blog/openai-sc…
|
||
|
|
||
|
OpenAI
@OpenAI
|
19. ruj |
|
Wondering why the hiders did not cage in the seekers instead of building their own fort? In one environment variant where hiders have to protect glowing orbs, that's exactly what they learned to do! pic.twitter.com/yifS7rI4eR
|
||
|
|
||
|
OpenAI
@OpenAI
|
19. ruj |
|
We've fine-tuned GPT-2 using human feedback for tasks such as summarizing articles, matching the preferences of human labelers (if not always our own). We're hoping this brings safety methods closer to machines learning values by talking with humans. openai.com/blog/fine-tuni…
|
||
|
|
||
|
OpenAI
@OpenAI
|
17. ruj |
|
And seekers learn that if they run at a wall with a ramp at the right angle, they can launch themselves upward. pic.twitter.com/SJv9SzctEp
|
||
|
|
||
|
OpenAI
@OpenAI
|
17. ruj |
|
Unexpected and surprising behaviors included box surfing, where seekers learn to bring a box to a locked ramp in order to jump on top of the box and then “surf” it to the hider’s shelter. pic.twitter.com/v0kGfCYZna
|
||
|
|
||
|
OpenAI
@OpenAI
|
17. ruj |
|
We've observed AIs discovering complex tool use while competing in a simple game of hide-and-seek. They develop a series of six distinct strategies and counterstrategies, ultimately using tools in the environment to break our simulated physics: openai.com/blog/emergent-…
|
||
|
|
||
|
OpenAI
@OpenAI
|
27. kol |
|
.@sama recently chatted with @kevin_scott in his latest podcast, "Behind the Tech," and shared his thoughts on AI progress, building a mission-driven company, and the future of transformative technologies: microsoft.com/en-us/behind-t…
|
||
|
|
||
|
OpenAI
@OpenAI
|
22. kol |
|
We're releasing a new method to test for model robustness against adversaries not seen during training, and open-sourcing a new metric, UAR (Unforeseen Attack Robustness), which measures how robust a model is to an unanticipated attack: openai.com/blog/testing-r… pic.twitter.com/8yJdd6oD5T
|
||
|
|
||