{"id":24910,"date":"2020-08-25T06:00:00","date_gmt":"2020-08-25T13:00:00","guid":{"rendered":"https:\/\/insidebigdata.com\/?p=24910"},"modified":"2020-08-26T09:52:08","modified_gmt":"2020-08-26T16:52:08","slug":"book-review-deep-reinforcement-learning-hands-on","status":"publish","type":"post","link":"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/","title":{"rendered":"Book Review: Deep Reinforcement Learning Hands-On"},"content":{"rendered":"\n<div class=\"wp-block-image\"><figure class=\"alignright size-large is-resized\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2020\/08\/Packt_DeepReinforcementLearningHandsOn.jpg\" alt=\"\" class=\"wp-image-24911\" width=\"236\" height=\"290\" srcset=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2020\/08\/Packt_DeepReinforcementLearningHandsOn.jpg 300w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2020\/08\/Packt_DeepReinforcementLearningHandsOn-244x300.jpg 244w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2020\/08\/Packt_DeepReinforcementLearningHandsOn-122x150.jpg 122w\" sizes=\"(max-width: 236px) 100vw, 236px\" \/><\/figure><\/div>\n\n\n\n<p>Reinforcement learning (RL) is a hugely popular area of deep learning, and many data scientists are exploring this AI technology to broaden their skillet to include a number of important problem domains like chatbots, robotics, discrete optimization, web automation and much more. As a result of this wide-spread interest in RL, there are many available educational resources specifically tailored to this class of deep learning &#8211; boot camps, training certificates, educational specializations, etc. But if you&#8217;re a data scientist who has been programming in Python (with object oriented features) for a while, and has some experience with other forms of deep learning using a framework like TensorFlow, then maybe this new book, &#8220;Deep Reinforcement Learning Hands-On,&#8221; by Maxim Lapan from Packt, might be a great way to kick-start yourself into becoming productive with RL. <\/p>\n\n\n\n<p><strong>Reinforcement Learning<\/strong><\/p>\n\n\n\n<p>RL development is being driven by a number of large companies and research groups, including Google, Microsoft, and Facebook. RL requires considerable investment in research as the field is growing to enable data scientists to be able to take prescribed methods and apply them to a problem domain. This is similar to how the states of computer vision and NLP were several years ago. A good portion of the AI research papers appearing on the arXiv.org pre-print server deal with RL. <\/p>\n\n\n\n<p>As a journalist, I&#8217;m seeing that the field of RL is attracting lots of attention, both from researchers and practitioners. Fortunately, this hands-on book helps readers to understand RL methods using real-life problems, and make the exciting RL field accessible to a much wider audience than just research groups or large AI companies.<\/p>\n\n\n\n<p>The emphasis of the book is to enable readers to feel that modern RL is an approachable, practical, and useful aspect of machine learning. The book will help readers understand RL methods using real-life, relatable problems. Information about recent research is widely available via arXiv.org and research blogs, but it is too specialized and grounded with mathematics for many data scientists to easily consume. If readers are new to this field, these papers might not be the best source of information, at least not early on. This book gets around such complexities by addressing the lack of practical and structured information about RL methods and approaches. <\/p>\n\n\n\n<p><strong>New in the Second Edition<\/strong><\/p>\n\n\n\n<p>The second edition is devoted to the very latest RL tools and techniques, focusing on new innovations in this emerging field. It includes six new chapters that give readers the hands-on ability to code intelligent learning agents to perform a range of practical tasks. New topics include robotics and multiple agent learning. The chapter on trust regions has been updated with the new SAC algorithm. The author has also developed a library for easily interfacing the OpenAI environment which was used all through the first edition but is now properly documented.<\/p>\n\n\n\n<p>After a clear definition of RL and associated nomenclature, the Gym library and its classes are explained. Gym is a rich library used widely by RL practitioners and the book&#8217;s code depends on it. A straight-forward and impressive RL application using Gym is given and explained carefully.<\/p>\n\n\n\n<p>Fun topics include &#8211; using the TextWorld environment from Microsoft Research to solve text-based interactive fiction games, solving discrete optimization problems (showcased using Rubik\u2019s Cube), and learning to apply RL methods to the robotics domain.<\/p>\n\n\n\n<p>The book then dives into an important aspect of RL methods: advanced exploration. Several modern exploration techniques are described, implemented, and compared. Readers will also discover basic methods applied to a multi-agent environment.<\/p>\n\n\n\n<p>All examples have been updated for PyTorch 1.3. A good introduction to PyTorch is given in Chapter 3. PyTorch Ignite has also been introduced to make the RL coding more concise.<\/p>\n\n\n\n<p>To demonstrate the breadth of coverage of the subject, here are the chapters included in the book:<\/p>\n\n\n\n<ul><li>Chapter 1 &#8211; What is Reinforcement Learning?<\/li><li>Chapter 2 &#8211; OpenAI Gym<\/li><li>Chapter 3 &#8211; Deep Learning with PyTorch<\/li><li>Chapter 4 &#8211; The Cross-Entropy Method<\/li><li>Chapter 5 &#8211; Tabular Learning and the Bellman Equation<\/li><li>Chapter 6 &#8211; Deep Q-Networks<\/li><li>Chapter 7 &#8211; Higher-Level RL Libraries<\/li><li>Chapter 8 &#8211; DQN Extensions<\/li><li>Chapter 9 &#8211; Ways  to Speed up RL Methods<\/li><li>Chapter 10 &#8211; Stocks Trading Using RL<\/li><li>Chapter 11 &#8211; Policy Gradients <\/li><li>Chapter 12 &#8211; The Actor-Critic Method<\/li><li>Chapter 13 &#8211; Asynchronous Advantage Actor-Critic<\/li><li>Chapter 14 &#8211; Training Chatbots with RL<\/li><li>Chapter 15 &#8211; The TextWorld Environment<\/li><li>Chapter 16 &#8211; Web Navigation<\/li><li>Chapter 17 &#8211; Continuous Action Space<\/li><li>Chapter 18 &#8211; RL in Robotics<\/li><li>Chapter 19 &#8211; Trust Regions \u2013 TRPO, PPO, and ACKTR<\/li><li>Chapter 20 &#8211; Black-Box Optimization in RL<\/li><li>Chapter 21 &#8211; Advanced Exploration<\/li><li>Chapter 22 &#8211; Beyond Model-Free <\/li><li>Chapter 23 &#8211; AlphaGo Zero<\/li><li>Chapter 24 &#8211; RL in Discrete Optimization<\/li><li>Chapter 25 &#8211; Multi-agent RL <\/li><\/ul>\n\n\n\n<p>The book is targeted toward readers with a fluency in Python. Basic deep learning approaches should be familiar to readers and some practical experience in DL will be helpful. This book is a complete introduction to deep reinforcement learning and requires no background in RL.<\/p>\n\n\n\n<p>This is a very comprehensive book covering a range of RL techniques. With nearly 800 pages, it will also weigh down your bookshelf a bit. It comes with a <a rel=\"noreferrer noopener\" href=\"https:\/\/github.com\/PacktPublishing\/Deep-Reinforcement-Learning-Hands-On-Second-Edition\" target=\"_blank\">GitHub repo<\/a> containing all the code described in the book. Great place to start building your RL skillet.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"alignleft size-large is-resized\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic.png\" alt=\"\" class=\"wp-image-21778\" width=\"108\" height=\"123\" srcset=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic.png 200w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic-131x150.png 131w\" sizes=\"(max-width: 108px) 100vw, 108px\" \/><\/figure><\/div>\n\n\n\n<p>C<em>ontributed by Daniel D. Gutierrez, Editor-in-Chief and Resident Data Scientist for insideBIGDATA. In addition to being a tech journalist, Daniel also is a consultant in data scientist, author, educator and sits on a number of advisory boards for various start-up companies.&nbsp;<\/em><\/p>\n\n\n\n<p><em>Sign up for the free insideBIGDATA&nbsp;<a href=\"http:\/\/insidebigdata.com\/newsletter\/\" target=\"_blank\" rel=\"noreferrer noopener\">newsletter<\/a>.<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>RL is a hugely popular area of deep learning, and many data scientists are exploring this AI technology to broaden their skillet to include a number of important problem domains like chatbots, robotics, discrete optimization, web automation and much more. As a result of this wide-spread interest in RL, there are many available educational resources specifically tailored to this class of deep learning &#8211; boot camps, training certificates, educational specializations, etc. But if you&#8217;re a data scientist who has been programming in Python for a while, and has some experience with other forms of deep learning using a framework like TensorFlow, then maybe this new book, &#8220;Deep Reinforcement Learning Hands-On,&#8221; by Maxim Lapan, might be a great way to kick-start yourself into becoming productive with RL. <\/p>\n","protected":false},"author":37,"featured_media":24911,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":""},"categories":[526,92,182,87,180,56,1],"tags":[437,324,264,664,95],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Book Review: Deep Reinforcement Learning Hands-On - insideBIGDATA<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Book Review: Deep Reinforcement Learning Hands-On - insideBIGDATA\" \/>\n<meta property=\"og:description\" content=\"RL is a hugely popular area of deep learning, and many data scientists are exploring this AI technology to broaden their skillet to include a number of important problem domains like chatbots, robotics, discrete optimization, web automation and much more. As a result of this wide-spread interest in RL, there are many available educational resources specifically tailored to this class of deep learning - boot camps, training certificates, educational specializations, etc. But if you&#039;re a data scientist who has been programming in Python for a while, and has some experience with other forms of deep learning using a framework like TensorFlow, then maybe this new book, &quot;Deep Reinforcement Learning Hands-On,&quot; by Maxim Lapan, might be a great way to kick-start yourself into becoming productive with RL.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/\" \/>\n<meta property=\"og:site_name\" content=\"insideBIGDATA\" \/>\n<meta property=\"article:publisher\" content=\"http:\/\/www.facebook.com\/insidebigdata\" \/>\n<meta property=\"article:published_time\" content=\"2020-08-25T13:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2020-08-26T16:52:08+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2020\/08\/Packt_DeepReinforcementLearningHandsOn.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"300\" \/>\n\t<meta property=\"og:image:height\" content=\"369\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Daniel Gutierrez\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@AMULETAnalytics\" \/>\n<meta name=\"twitter:site\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Daniel Gutierrez\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/\",\"url\":\"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/\",\"name\":\"Book Review: Deep Reinforcement Learning Hands-On - insideBIGDATA\",\"isPartOf\":{\"@id\":\"https:\/\/insidebigdata.com\/#website\"},\"datePublished\":\"2020-08-25T13:00:00+00:00\",\"dateModified\":\"2020-08-26T16:52:08+00:00\",\"author\":{\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed\"},\"breadcrumb\":{\"@id\":\"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/insidebigdata.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Book Review: Deep Reinforcement Learning Hands-On\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/insidebigdata.com\/#website\",\"url\":\"https:\/\/insidebigdata.com\/\",\"name\":\"insideBIGDATA\",\"description\":\"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/insidebigdata.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed\",\"name\":\"Daniel Gutierrez\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g\",\"caption\":\"Daniel Gutierrez\"},\"description\":\"Daniel D. Gutierrez is a Data Scientist with Los Angeles-based AMULET Analytics, a service division of AMULET Development Corp. He's been involved with data science and Big Data long before it came in vogue, so imagine his delight when the Harvard Business Review recently deemed \\\"data scientist\\\" as the sexiest profession for the 21st century. Previously, he taught computer science and database classes at UCLA Extension for over 15 years, and authored three computer industry books on database technology. He also served as technical editor, columnist and writer at a major computer industry monthly publication for 7 years. Follow his data science musings at @AMULETAnalytics.\",\"sameAs\":[\"http:\/\/www.insidebigdata.com\",\"https:\/\/twitter.com\/@AMULETAnalytics\"],\"url\":\"https:\/\/insidebigdata.com\/author\/dangutierrez\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Book Review: Deep Reinforcement Learning Hands-On - insideBIGDATA","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/","og_locale":"en_US","og_type":"article","og_title":"Book Review: Deep Reinforcement Learning Hands-On - insideBIGDATA","og_description":"RL is a hugely popular area of deep learning, and many data scientists are exploring this AI technology to broaden their skillet to include a number of important problem domains like chatbots, robotics, discrete optimization, web automation and much more. As a result of this wide-spread interest in RL, there are many available educational resources specifically tailored to this class of deep learning - boot camps, training certificates, educational specializations, etc. But if you're a data scientist who has been programming in Python for a while, and has some experience with other forms of deep learning using a framework like TensorFlow, then maybe this new book, \"Deep Reinforcement Learning Hands-On,\" by Maxim Lapan, might be a great way to kick-start yourself into becoming productive with RL.","og_url":"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/","og_site_name":"insideBIGDATA","article_publisher":"http:\/\/www.facebook.com\/insidebigdata","article_published_time":"2020-08-25T13:00:00+00:00","article_modified_time":"2020-08-26T16:52:08+00:00","og_image":[{"width":300,"height":369,"url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2020\/08\/Packt_DeepReinforcementLearningHandsOn.jpg","type":"image\/jpeg"}],"author":"Daniel Gutierrez","twitter_card":"summary_large_image","twitter_creator":"@AMULETAnalytics","twitter_site":"@insideBigData","twitter_misc":{"Written by":"Daniel Gutierrez","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/","url":"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/","name":"Book Review: Deep Reinforcement Learning Hands-On - insideBIGDATA","isPartOf":{"@id":"https:\/\/insidebigdata.com\/#website"},"datePublished":"2020-08-25T13:00:00+00:00","dateModified":"2020-08-26T16:52:08+00:00","author":{"@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed"},"breadcrumb":{"@id":"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/insidebigdata.com\/2020\/08\/25\/book-review-deep-reinforcement-learning-hands-on\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/insidebigdata.com\/"},{"@type":"ListItem","position":2,"name":"Book Review: Deep Reinforcement Learning Hands-On"}]},{"@type":"WebSite","@id":"https:\/\/insidebigdata.com\/#website","url":"https:\/\/insidebigdata.com\/","name":"insideBIGDATA","description":"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/insidebigdata.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed","name":"Daniel Gutierrez","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g","caption":"Daniel Gutierrez"},"description":"Daniel D. Gutierrez is a Data Scientist with Los Angeles-based AMULET Analytics, a service division of AMULET Development Corp. He's been involved with data science and Big Data long before it came in vogue, so imagine his delight when the Harvard Business Review recently deemed \"data scientist\" as the sexiest profession for the 21st century. Previously, he taught computer science and database classes at UCLA Extension for over 15 years, and authored three computer industry books on database technology. He also served as technical editor, columnist and writer at a major computer industry monthly publication for 7 years. Follow his data science musings at @AMULETAnalytics.","sameAs":["http:\/\/www.insidebigdata.com","https:\/\/twitter.com\/@AMULETAnalytics"],"url":"https:\/\/insidebigdata.com\/author\/dangutierrez\/"}]}},"jetpack_featured_media_url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2020\/08\/Packt_DeepReinforcementLearningHandsOn.jpg","jetpack_shortlink":"https:\/\/wp.me\/p9eA3j-6tM","jetpack-related-posts":[{"id":21994,"url":"https:\/\/insidebigdata.com\/2019\/01\/16\/best-of-arxiv-org-for-ai-machine-learning-and-deep-learning-december-2018\/","url_meta":{"origin":24910,"position":0},"title":"Best of arXiv.org for AI, Machine Learning, and Deep Learning \u2013 December 2018","date":"January 16, 2019","format":false,"excerpt":"In this recurring monthly feature, we will filter all the recent research papers appearing in the arXiv.org preprint server for subjects relating to AI, machine learning and deep learning \u2013 from disciplines including statistics, mathematics and computer science \u2013 and provide you with a useful \u201cbest of\u201d list for the\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2013\/12\/arxiv.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":21709,"url":"https:\/\/insidebigdata.com\/2018\/12\/19\/best-arxiv-org-ai-machine-learning-deep-learning-november-2018\/","url_meta":{"origin":24910,"position":1},"title":"Best of arXiv.org for AI, Machine Learning, and Deep Learning \u2013 November 2018","date":"December 19, 2018","format":false,"excerpt":"In this recurring monthly feature, we will filter all the recent research papers appearing in the arXiv.org preprint server for subjects relating to AI, machine learning and deep learning \u2013 from disciplines including statistics, mathematics and computer science \u2013 and provide you with a useful \u201cbest of\u201d list for the\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2013\/12\/arxiv.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":24345,"url":"https:\/\/insidebigdata.com\/2020\/05\/02\/new-salesforce-research-ai-simulates-millions-of-years-of-economic-data-using-rl\/","url_meta":{"origin":24910,"position":2},"title":"New Salesforce Research AI Simulates Millions of Years of Economic Data Using RL","date":"May 2, 2020","format":false,"excerpt":"Salesforce Research published a groundbreaking paper, The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies, which applies AI and reinforcement learning to create tax policy for the first time.","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2020\/04\/Salesforce_freemarket-ai-1.gif?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":33609,"url":"https:\/\/insidebigdata.com\/2023\/10\/10\/reinforcement-learning-balancing-exploration-and-exploitation\/","url_meta":{"origin":24910,"position":3},"title":"Reinforcement Learning: Balancing Exploration and Exploitation","date":"October 10, 2023","format":false,"excerpt":"In this contributed article, Anthony Chong, CEO\/Co-Founder of IKASI, discusses the three types of machine learning approaches, the benefits and requirements of each, and offer examples of how organizations are applying these tactics to address real world business challenges.","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2023\/08\/Machine_Learning_shutterstock_742653250_special.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":30972,"url":"https:\/\/insidebigdata.com\/2022\/11\/25\/d-matrix-unlocks-new-potential-with-reinforcement-learning-based-compiler-for-at-scale-digital-in-memory-compute-platforms\/","url_meta":{"origin":24910,"position":4},"title":"d-Matrix Unlocks New Potential with Reinforcement Learning based Compiler for at Scale Digital In-Memory Compute Platforms","date":"November 25, 2022","format":false,"excerpt":"d-Matrix, a leader in high-efficiency AI-compute and inference, announced a collaboration with Microsoft using its low-code reinforcement learning (RL) platform, Project Bonsai, to enable an AI-trained compiler for d-Matrix\u2019s unique digital in memory compute (DIMC) products. The user-friendly Project Bonsai platform accelerates time to value, with a product-ready solution that\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2022\/11\/d-Matrix_logo.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":24385,"url":"https:\/\/insidebigdata.com\/2020\/05\/13\/best-of-arxiv-org-for-ai-machine-learning-and-deep-learning-april-2020\/","url_meta":{"origin":24910,"position":5},"title":"Best of arXiv.org for AI, Machine Learning, and Deep Learning \u2013 April 2020","date":"May 13, 2020","format":false,"excerpt":"In this recurring monthly feature, we will filter all the recent research papers appearing in the arXiv.org preprint server for subjects relating to AI, machine learning and deep learning \u2013 from disciplines including statistics, mathematics and computer science \u2013 and provide you with a useful \u201cbest of\u201d list for the\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2013\/12\/arxiv.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/24910"}],"collection":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/users\/37"}],"replies":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/comments?post=24910"}],"version-history":[{"count":0,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/24910\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media\/24911"}],"wp:attachment":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media?parent=24910"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/categories?post=24910"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/tags?post=24910"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}