Phil Winder Nov 30
Reinforcement Learning: Industrial Applications of Intelligent Agents. I'm pleased to announce the release of my new book! Buy it now on Amazon, Safari, or from your local retailer. More info at
Odemakinde Elisha Nov 25
Welcome Team Venus - A team of enthusiasts who are working on based projects with and are also building up their professional portfolio.
Rabeeh Abbasi Dec 2
Resolving Cross-site Scripting Attacks through Genetic Algorithm and Reinforcement Learning
Amii 4h
Congrats to Amii and alumni and for their contributions on using to create a high-performing flight controller for stratospheric balloons – another real-world win for this rapidly growing field of !
SeanRedmond 13h
Vitaq AI uses innovative technology to find answers in the vast space of test automation. It is the same branch that Google DeepMind used for its breakthrough in protein folding, opening the road to the creation of new life-saving medications.
Mathias Åsberg 🍏 Nov 26
Squeezed everything possible out of the PPO algorithm to solve LunarLander-v2 in 133 episodes (4th best). Now switching to my favorite reinforcement learning algorithm: twin-delayed deep deterministic policy gradient (TD3).
Odemakinde Elisha Nov 27
Replying to @Elishatofunmi
This is beginner-friendly, and it also exposes you to everything you need to do to get started in .
Odemakinde Elisha Nov 27
Replying to @Elishatofunmi
The environment comes with real-time game updates and real-time tracking. Watch how Team B ended up defeating Team A in 3 minutes. Game formation for each team: 4-3-3. Source: video: (2/2)
Sammy Deprez Nov 26
Fintech Brand Names For Sale Nov 24
andrea panizza Nov 29
Love this alignment chart for Policy Gradient methods!
Kirk Borne Nov 30
Abhijit Gosavi Dec 2
is going to be online this year, as expected, but nonetheless promises to be as exciting as usual. I’m looking forward to presenting work on bounded .
Ravi Tej @NeurIPS 2020 Dec 2
Excited to share that this work got accepted to . TL;DR: Our method computes policy gradients more accurately while using fewer samples and with negligible computational overhead. Joint work with Mohammad,
Immersive Limit Dec 2
ML-Agents content is back!! We are excited to jump in. Join in getting set up with the latest version (release 10).
WiML Nov 29
4/5 We would also like to highlight Meire’s paper on “Generalization of Reinforcement Learners with Working and Episodic Memory”, which represents an interesting study on the use of memory in agents:
UCLFinComp Dec 2
“The practical implication is that naive application of RL to complex real life problems can result in the same types of policy errors that humans make.” Causal Campbell-Goodhart's law and
Analog Aldo Nov 27
Excited to develop with the great team methods for constructing of human experts
MONTREAL.AI Nov 26
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? Kurin et al.:
Reinforcement Learning Turkiye Nov 25
Blogs! Hello to all our dear community members and fellows. We are open to publishing any of your reinforcement learning, robotics, and deep learning/ML related blogs, works, or projects on our Medium publication. Just send a DM. (Turkish version below.)