{"id":1793,"date":"2023-06-20T14:37:52","date_gmt":"2023-06-20T09:07:52","guid":{"rendered":"https:\/\/www.analyticsvidhya.com\/datahack-summit-2023\/?page_id=1793"},"modified":"2023-07-19T19:05:49","modified_gmt":"2023-07-19T13:35:49","slug":"edge-of-ai-with-reinforcement-learning","status":"publish","type":"page","link":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/","title":{"rendered":"Edge of AI with Reinforcement Learning"},"content":{"rendered":"<p style=\"text-align: left;\"><span data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;In 2016, Lee Sedol, the world's most recognized Go player was asked what he took from his\\nexperience of playing against AlphaGo, one of DeepMind's many Reinforcement learning (RL)\\nbased initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said - \\&quot;I have\\ngrown through the experience and understood the reason I play this game\\&quot;. Experiencing\\nprofound humane emotions against a faceless, artificially driven, inanimate opponent, creative in\\ndimensions different than what humans can perceive has ubiquitously changed the way we\\ninteract and exchange information. RL and in general AI field have grown exponentially since\\nthen. It's evident that RL has a lot of potential to solve some of the most difficult and nagging\\nproblems in the world. It can and will impact people from all walks of life, from business\\nrevenue, government policy to climate change.\\n\\nIn this talk, you will learn reinforcement learning starting from basic algorithms to applications\\nin various domains such as,\\n1. AlphaGo\/AlphaStar - Board and Video Games\\n2. AlphaFold - New protien structures\\n3. Energy and Sorting Optimization\\n4. Chip Designs in VLSI\\n5. RLHF - ChatGPT, Conversational Systems\\n\\nThis would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to\\nsee the fascinating impact that it can and will leave on the world.&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:1021,&quot;3&quot;:{&quot;1&quot;:0},&quot;5&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;6&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;7&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;8&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;9&quot;:1,&quot;10&quot;:1,&quot;11&quot;:3,&quot;12&quot;:0}\">In 2016, Lee Sedol, the world&#8217;s most recognized Go player was asked what he took from his <\/span><span data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;In 2016, Lee Sedol, the world's most recognized Go player was asked what he took from his\\nexperience of playing against AlphaGo, one of DeepMind's many Reinforcement learning (RL)\\nbased initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said - \\&quot;I have\\ngrown through the experience and understood the reason I play this game\\&quot;. Experiencing\\nprofound humane emotions against a faceless, artificially driven, inanimate opponent, creative in\\ndimensions different than what humans can perceive has ubiquitously changed the way we\\ninteract and exchange information. RL and in general AI field have grown exponentially since\\nthen. It's evident that RL has a lot of potential to solve some of the most difficult and nagging\\nproblems in the world. It can and will impact people from all walks of life, from business\\nrevenue, government policy to climate change.\\n\\nIn this talk, you will learn reinforcement learning starting from basic algorithms to applications\\nin various domains such as,\\n1. AlphaGo\/AlphaStar - Board and Video Games\\n2. AlphaFold - New protien structures\\n3. Energy and Sorting Optimization\\n4. Chip Designs in VLSI\\n5. RLHF - ChatGPT, Conversational Systems\\n\\nThis would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to\\nsee the fascinating impact that it can and will leave on the world.&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:1021,&quot;3&quot;:{&quot;1&quot;:0},&quot;5&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;6&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;7&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;8&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;9&quot;:1,&quot;10&quot;:1,&quot;11&quot;:3,&quot;12&quot;:0}\">experience of playing against AlphaGo, one of DeepMind&#8217;s many Reinforcement learning (RL) based initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said &#8211; &#8220;I have grown through the experience and understood the reason I play this game&#8221;. Experiencing profound humane emotions against a faceless, artificially driven, inanimate opponent, creative in dimensions different than what humans can perceive has ubiquitously changed the way we interact and exchange information. RL and in general AI field have grown exponentially since then. It&#8217;s evident that RL has a lot of potential to solve some of the most difficult and nagging problems in the world. It can and will impact people from all walks of life, from business revenue, government policy to climate change.<\/span><\/p>\n<p>In this talk, you will learn reinforcement learning starting from basic algorithms to applications<br \/>\nin various domains such as,<\/p>\n<ol>\n<li><span data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;In 2016, Lee Sedol, the world's most recognized Go player was asked what he took from his\\nexperience of playing against AlphaGo, one of DeepMind's many Reinforcement learning (RL)\\nbased initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said - \\&quot;I have\\ngrown through the experience and understood the reason I play this game\\&quot;. Experiencing\\nprofound humane emotions against a faceless, artificially driven, inanimate opponent, creative in\\ndimensions different than what humans can perceive has ubiquitously changed the way we\\ninteract and exchange information. RL and in general AI field have grown exponentially since\\nthen. It's evident that RL has a lot of potential to solve some of the most difficult and nagging\\nproblems in the world. It can and will impact people from all walks of life, from business\\nrevenue, government policy to climate change.\\n\\nIn this talk, you will learn reinforcement learning starting from basic algorithms to applications\\nin various domains such as,\\n1. AlphaGo\/AlphaStar - Board and Video Games\\n2. AlphaFold - New protien structures\\n3. Energy and Sorting Optimization\\n4. Chip Designs in VLSI\\n5. RLHF - ChatGPT, Conversational Systems\\n\\nThis would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to\\nsee the fascinating impact that it can and will leave on the world.&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:1021,&quot;3&quot;:{&quot;1&quot;:0},&quot;5&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;6&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;7&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;8&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;9&quot;:1,&quot;10&quot;:1,&quot;11&quot;:3,&quot;12&quot;:0}\">AlphaGo\/AlphaStar &#8211; Board and Video Games<br \/>\n<\/span><\/li>\n<li><span data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;In 2016, Lee Sedol, the world's most recognized Go player was asked what he took from his\\nexperience of playing against AlphaGo, one of DeepMind's many Reinforcement learning (RL)\\nbased initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said - \\&quot;I have\\ngrown through the experience and understood the reason I play this game\\&quot;. Experiencing\\nprofound humane emotions against a faceless, artificially driven, inanimate opponent, creative in\\ndimensions different than what humans can perceive has ubiquitously changed the way we\\ninteract and exchange information. RL and in general AI field have grown exponentially since\\nthen. It's evident that RL has a lot of potential to solve some of the most difficult and nagging\\nproblems in the world. It can and will impact people from all walks of life, from business\\nrevenue, government policy to climate change.\\n\\nIn this talk, you will learn reinforcement learning starting from basic algorithms to applications\\nin various domains such as,\\n1. AlphaGo\/AlphaStar - Board and Video Games\\n2. AlphaFold - New protien structures\\n3. Energy and Sorting Optimization\\n4. Chip Designs in VLSI\\n5. RLHF - ChatGPT, Conversational Systems\\n\\nThis would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to\\nsee the fascinating impact that it can and will leave on the world.&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:1021,&quot;3&quot;:{&quot;1&quot;:0},&quot;5&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;6&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;7&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;8&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;9&quot;:1,&quot;10&quot;:1,&quot;11&quot;:3,&quot;12&quot;:0}\">AlphaFold &#8211; New protien structures<br \/>\n<\/span><\/li>\n<li><span data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;In 2016, Lee Sedol, the world's most recognized Go player was asked what he took from his\\nexperience of playing against AlphaGo, one of DeepMind's many Reinforcement learning (RL)\\nbased initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said - \\&quot;I have\\ngrown through the experience and understood the reason I play this game\\&quot;. Experiencing\\nprofound humane emotions against a faceless, artificially driven, inanimate opponent, creative in\\ndimensions different than what humans can perceive has ubiquitously changed the way we\\ninteract and exchange information. RL and in general AI field have grown exponentially since\\nthen. It's evident that RL has a lot of potential to solve some of the most difficult and nagging\\nproblems in the world. It can and will impact people from all walks of life, from business\\nrevenue, government policy to climate change.\\n\\nIn this talk, you will learn reinforcement learning starting from basic algorithms to applications\\nin various domains such as,\\n1. AlphaGo\/AlphaStar - Board and Video Games\\n2. AlphaFold - New protien structures\\n3. Energy and Sorting Optimization\\n4. Chip Designs in VLSI\\n5. RLHF - ChatGPT, Conversational Systems\\n\\nThis would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to\\nsee the fascinating impact that it can and will leave on the world.&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:1021,&quot;3&quot;:{&quot;1&quot;:0},&quot;5&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;6&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;7&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;8&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;9&quot;:1,&quot;10&quot;:1,&quot;11&quot;:3,&quot;12&quot;:0}\">Energy and Sorting Optimization<br \/>\n<\/span><\/li>\n<li><span data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;In 2016, Lee Sedol, the world's most recognized Go player was asked what he took from his\\nexperience of playing against AlphaGo, one of DeepMind's many Reinforcement learning (RL)\\nbased initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said - \\&quot;I have\\ngrown through the experience and understood the reason I play this game\\&quot;. Experiencing\\nprofound humane emotions against a faceless, artificially driven, inanimate opponent, creative in\\ndimensions different than what humans can perceive has ubiquitously changed the way we\\ninteract and exchange information. RL and in general AI field have grown exponentially since\\nthen. It's evident that RL has a lot of potential to solve some of the most difficult and nagging\\nproblems in the world. It can and will impact people from all walks of life, from business\\nrevenue, government policy to climate change.\\n\\nIn this talk, you will learn reinforcement learning starting from basic algorithms to applications\\nin various domains such as,\\n1. AlphaGo\/AlphaStar - Board and Video Games\\n2. AlphaFold - New protien structures\\n3. Energy and Sorting Optimization\\n4. Chip Designs in VLSI\\n5. RLHF - ChatGPT, Conversational Systems\\n\\nThis would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to\\nsee the fascinating impact that it can and will leave on the world.&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:1021,&quot;3&quot;:{&quot;1&quot;:0},&quot;5&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;6&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;7&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;8&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;9&quot;:1,&quot;10&quot;:1,&quot;11&quot;:3,&quot;12&quot;:0}\">Chip Designs in VLSI<\/span><\/li>\n<li>RLHF &#8211; ChatGPT, Conversational Systems<\/li>\n<\/ol>\n<p><span data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;In 2016, Lee Sedol, the world's most recognized Go player was asked what he took from his\\nexperience of playing against AlphaGo, one of DeepMind's many Reinforcement learning (RL)\\nbased initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said - \\&quot;I have\\ngrown through the experience and understood the reason I play this game\\&quot;. Experiencing\\nprofound humane emotions against a faceless, artificially driven, inanimate opponent, creative in\\ndimensions different than what humans can perceive has ubiquitously changed the way we\\ninteract and exchange information. RL and in general AI field have grown exponentially since\\nthen. It's evident that RL has a lot of potential to solve some of the most difficult and nagging\\nproblems in the world. It can and will impact people from all walks of life, from business\\nrevenue, government policy to climate change.\\n\\nIn this talk, you will learn reinforcement learning starting from basic algorithms to applications\\nin various domains such as,\\n1. AlphaGo\/AlphaStar - Board and Video Games\\n2. AlphaFold - New protien structures\\n3. Energy and Sorting Optimization\\n4. Chip Designs in VLSI\\n5. RLHF - ChatGPT, Conversational Systems\\n\\nThis would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to\\nsee the fascinating impact that it can and will leave on the world.&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:1021,&quot;3&quot;:{&quot;1&quot;:0},&quot;5&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;6&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;7&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;8&quot;:{&quot;1&quot;:[{&quot;1&quot;:2,&quot;2&quot;:0,&quot;5&quot;:{&quot;1&quot;:2,&quot;2&quot;:0}},{&quot;1&quot;:0,&quot;2&quot;:0,&quot;3&quot;:3},{&quot;1&quot;:1,&quot;2&quot;:0,&quot;4&quot;:1}]},&quot;9&quot;:1,&quot;10&quot;:1,&quot;11&quot;:3,&quot;12&quot;:0}\">This would be a beginner-friendly talk, Join us to not just understand the RL algorithms but to<br \/>\nsee the fascinating impact that it can and will leave on the world.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In 2016, Lee Sedol, the world&#8217;s most recognized Go player was asked what he took from his experience of playing against AlphaGo, one of DeepMind&#8217;s many Reinforcement learning (RL) based initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said &#8211; &#8220;I have grown through the experience and understood the reason I play this game&#8221;. [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":1794,"parent":1126,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"session-details.php","meta":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Edge of AI with Reinforcement Learning - DataHack Summit 2023<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Edge of AI with Reinforcement Learning - DataHack Summit 2023\" \/>\n<meta property=\"og:description\" content=\"In 2016, Lee Sedol, the world&#8217;s most recognized Go player was asked what he took from his experience of playing against AlphaGo, one of DeepMind&#8217;s many Reinforcement learning (RL) based initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said &#8211; &#8220;I have grown through the experience and understood the reason I play this game&#8221;. [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/\" \/>\n<meta property=\"og:site_name\" content=\"DataHack Summit 2023\" \/>\n<meta property=\"article:modified_time\" content=\"2023-07-19T13:35:49+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-content\/uploads\/2023\/06\/s-reinforcement-learning.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"500\" \/>\n\t<meta property=\"og:image:height\" content=\"250\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/\",\"url\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/\",\"name\":\"Edge of AI with Reinforcement Learning - DataHack Summit 2023\",\"isPartOf\":{\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website\"},\"datePublished\":\"2023-06-20T09:07:52+00:00\",\"dateModified\":\"2023-07-19T13:35:49+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Session\",\"item\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Edge of AI with Reinforcement Learning\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website\",\"url\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/\",\"name\":\"DataHack Summit 2023\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.analyticsvidhya.com\/dhs-2023\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Edge of AI with Reinforcement Learning - DataHack Summit 2023","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/","og_locale":"en_US","og_type":"article","og_title":"Edge of AI with Reinforcement Learning - DataHack Summit 2023","og_description":"In 2016, Lee Sedol, the world&#8217;s most recognized Go player was asked what he took from his experience of playing against AlphaGo, one of DeepMind&#8217;s many Reinforcement learning (RL) based initiatives including AlphaZero (for Chess), AlphaStar (for StarCraft), etc. said &#8211; &#8220;I have grown through the experience and understood the reason I play this game&#8221;. [&hellip;]","og_url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/","og_site_name":"DataHack Summit 2023","article_modified_time":"2023-07-19T13:35:49+00:00","og_image":[{"width":500,"height":250,"url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-content\/uploads\/2023\/06\/s-reinforcement-learning.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/","url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/","name":"Edge of AI with Reinforcement Learning - DataHack Summit 2023","isPartOf":{"@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website"},"datePublished":"2023-06-20T09:07:52+00:00","dateModified":"2023-07-19T13:35:49+00:00","breadcrumb":{"@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/edge-of-ai-with-reinforcement-learning\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/"},{"@type":"ListItem","position":2,"name":"Session","item":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/session\/"},{"@type":"ListItem","position":3,"name":"Edge of AI with Reinforcement Learning"}]},{"@type":"WebSite","@id":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/#website","url":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/","name":"DataHack Summit 2023","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/1793"}],"collection":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/comments?post=1793"}],"version-history":[{"count":5,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/1793\/revisions"}],"predecessor-version":[{"id":2369,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/1793\/revisions\/2369"}],"up":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/pages\/1126"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/media\/1794"}],"wp:attachment":[{"href":"https:\/\/www.analyticsvidhya.com\/dhs-2023\/wp-json\/wp\/v2\/media?parent=1793"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}