{"id":30739,"date":"2023-02-23T06:00:00","date_gmt":"2023-02-23T14:00:00","guid":{"rendered":"https:\/\/insidebigdata.com\/?p=30739"},"modified":"2023-02-24T09:15:08","modified_gmt":"2023-02-24T17:15:08","slug":"book-review-tree-based-methods-for-statistical-learning-in-r","status":"publish","type":"post","link":"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/","title":{"rendered":"Book Review: Tree-based Methods for Statistical Learning in R"},"content":{"rendered":"<div class=\"wp-block-image\">\n<figure class=\"alignright size-full is-resized\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2022\/10\/Tree-based-Methods-book.png\" alt=\"\" class=\"wp-image-30740\" width=\"194\" height=\"294\" srcset=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2022\/10\/Tree-based-Methods-book.png 300w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2022\/10\/Tree-based-Methods-book-197x300.png 197w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2022\/10\/Tree-based-Methods-book-99x150.png 99w\" sizes=\"(max-width: 194px) 100vw, 194px\" \/><\/figure><\/div>\n\n\n<p>Here&#8217;s a new title that is a &#8220;must have&#8221; for any data scientist who uses the R language. It&#8217;s a wonderful learning resource for tree-based techniques in statistical learning, one that&#8217;s become my go-to text when I find the need to do a deep dive into various ML topic areas for my work. The methods discussed represent the cornerstone for using tabular data sets for making predictions using decision trees, ensemble methods like random forest, and of course the industry&#8217;s darling gradient boosting machines (GBM). Algorithms like <a href=\"http:\/\/datascience.la\/xgboost-workshop-and-meetup-talk-with-tianqi-chen\/\" target=\"_blank\" rel=\"noreferrer noopener\">XGBoost<\/a> are king of the hill for solving problems involving tabular data. A number of timely and somewhat high-profile benchmarks show that this class of algorithm beats deep learning algorithms for many problem domains.  <\/p>\n\n\n\n<p>This book &#8220;<a href=\"https:\/\/www.routledge.com\/Tree-Based-Methods-for-Statistical-Learning-in-R\/Greenwell\/p\/book\/9780367532468\" target=\"_blank\" rel=\"noreferrer noopener\">Tree-based Methods for Statistical Learning in R<\/a>,&#8221; is by Brandon M. Greenwell, a data scientist with <a href=\"https:\/\/www.8451.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">84.51\u00b0<\/a> where he works on a diverse team to enable, empower, and enculturate statistical and machine learning best practices where applicable to help others solve real business problems. Greenwell&#8217;s book covers important topics such as: decision trees, tree-based ensembles such as random forests and gradient boosting machines. Chapter 7 on random forests, and Chapter 8 on GBMs are brimming over with information providing a strong foundation for doing real-world machine learning (along with a moderate amount of math throughout), coupled with plenty of code examples. <\/p>\n\n\n\n<p>The book is primarily aimed at researchers and practitioners who want to go beyond a fundamental understanding of tree-based methods. It could also serve as a useful supplementary text for a graduate level course on statistical\/machine learning. Some parts of the book necessarily involve more math and notation than others. For example, Chapter 3 on&nbsp;<em>conditional inference trees<\/em>&nbsp;involves a bit of linear algebra and matrix notation, but the math-oriented sections can often be skipped without sacrificing too much in the way of understanding the core concepts. The code examples should also help drive the main concepts home by connecting the math to simple coding logic.<\/p>\n\n\n\n<p>The book does assume some familiarity with the basics of machine learning, as well as the R programming language. Useful references and resources are provided in the introductory material in Chapter 1. While Greenwell tries to provide sufficient detail and background where possible, some topics receive only a cursory treatment. Whenever possible he makes an effort to point the more ambitious reader in the right direction in terms of useful references.<\/p>\n\n\n\n<p>The author developed an R package expressly for facilitating examples in the book, &#8220;<a href=\"https:\/\/cran.r-project.org\/web\/packages\/treemisc\/index.html\" target=\"_blank\" rel=\"noreferrer noopener\">treemisc<\/a>&#8221; which is available on CRAN and a <a href=\"https:\/\/github.com\/bgreenwell\/treemisc\" target=\"_blank\" rel=\"noreferrer noopener\">GitHub repo<\/a> set up by the author. The R code from the book is also <a href=\"https:\/\/bgreenwell.github.io\/treebook\/rscripts.html\">available<\/a>. I found the code in the book to be straightforward and easy to understand. There are also plenty of insightful data visualizations. NOTE: this is not a <a href=\"https:\/\/www.tidyverse.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">Tidyverse<\/a> book, opting rather to use traditional R coding practices. <\/p>\n\n\n\n<p>For background material, I thought Chapter 2 was superb in its coverage of <em>classification and regression trees<\/em> (CART), originally proposed by Leo Breiman in his 1984 seminal book on the subject. I found Chapters 7 and 8 to be the most useful. Chapter 7 does a great job of outlining and drilling down in to random forests, while Chapter 8 does the same for GBM. At the end of Chapter 8 you&#8217;ll find a brief discussion of the most popular boosting algorithms: XGBoost, LightGBM, and CatBoost. Section 8.9.4 has a very nice code example for using XGBoost. Chapter 5 on ensemble algorithms includes a useful treatment of bagging (bootstrap aggregating) and boosting. Finally, Chapter 6 is on the subject of ML interpretability, a hot topic in the industry right now. <\/p>\n\n\n\n<p><strong>So Many Packages, So Little Time<\/strong><\/p>\n\n\n\n<p>Another area in which this book excels is making the reader aware of all the great tree-based R packages are out there. I learned about a bunch of packages I never knew about. For example, Chapter 3 identifies implementations of CTree, one of the more important developments in recursive partitioning in the past two decades. I learned that it is only available in R (see the <code><a href=\"https:\/\/cran.r-project.org\/web\/packages\/party\/index.html\" target=\"_blank\" rel=\"noreferrer noopener\">party<\/a><\/code> and <code><a href=\"https:\/\/cran.r-project.org\/web\/packages\/partykit\/index.html\" target=\"_blank\" rel=\"noreferrer noopener\">partykit<\/a><\/code> packages), a good reason to have R programming in your data science arsenal. <\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"alignleft size-full is-resized\"><img decoding=\"async\" loading=\"lazy\" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic.png\" alt=\"\" class=\"wp-image-21778\" width=\"93\" height=\"107\" srcset=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic.png 200w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic-131x150.png 131w\" sizes=\"(max-width: 93px) 100vw, 93px\" \/><\/figure><\/div>\n\n\n<p>C<em>ontributed by Daniel D. Gutierrez, Editor-in-Chief and Resident Data Scientist for insideBIGDATA. In addition to being a tech journalist, Daniel also is a consultant in data science, author, educator, and sits on a number of advisory boards for various start-up companies.&nbsp;<\/em><\/p>\n\n\n\n<p><em>Sign up for the free insideBIGDATA&nbsp;<a href=\"http:\/\/inside-bigdata.com\/newsletter\/\" target=\"_blank\" rel=\"noreferrer noopener\">newsletter<\/a>.<\/em><\/p>\n\n\n\n<p><em>Join us on Twitter:&nbsp;<a href=\"https:\/\/twitter.com\/InsideBigData1\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/twitter.com\/InsideBigData1<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on LinkedIn:&nbsp;<a href=\"https:\/\/www.linkedin.com\/company\/insidebigdata\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.linkedin.com\/company\/insidebigdata\/<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on Facebook:&nbsp;<a href=\"https:\/\/www.facebook.com\/insideBIGDATANOW\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.facebook.com\/insideBIGDATANOW<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Here&#8217;s a new title that is a &#8220;must have&#8221; for any data scientist who uses the R language. It&#8217;s a wonderful learning resource for tree-based techniques in statistical learning, one that&#8217;s become my go-to text when I find the need to do a deep dive into various ML topic areas for my work. The methods [&hellip;]<\/p>\n","protected":false},"author":37,"featured_media":30740,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":""},"categories":[115,92,182,87,180,67,56,1],"tags":[1041,133,277,578,1224,95],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Book Review: Tree-based Methods for Statistical Learning in R - insideBIGDATA<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Book Review: Tree-based Methods for Statistical Learning in R - insideBIGDATA\" \/>\n<meta property=\"og:description\" content=\"Here&#8217;s a new title that is a &#8220;must have&#8221; for any data scientist who uses the R language. It&#8217;s a wonderful learning resource for tree-based techniques in statistical learning, one that&#8217;s become my go-to text when I find the need to do a deep dive into various ML topic areas for my work. The methods [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/\" \/>\n<meta property=\"og:site_name\" content=\"insideBIGDATA\" \/>\n<meta property=\"article:publisher\" content=\"http:\/\/www.facebook.com\/insidebigdata\" \/>\n<meta property=\"article:published_time\" content=\"2023-02-23T14:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-02-24T17:15:08+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2022\/10\/Tree-based-Methods-book.png\" \/>\n\t<meta property=\"og:image:width\" content=\"300\" \/>\n\t<meta property=\"og:image:height\" content=\"456\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Daniel Gutierrez\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@AMULETAnalytics\" \/>\n<meta name=\"twitter:site\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Daniel Gutierrez\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/\",\"url\":\"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/\",\"name\":\"Book Review: Tree-based Methods for Statistical Learning in R - insideBIGDATA\",\"isPartOf\":{\"@id\":\"https:\/\/insidebigdata.com\/#website\"},\"datePublished\":\"2023-02-23T14:00:00+00:00\",\"dateModified\":\"2023-02-24T17:15:08+00:00\",\"author\":{\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed\"},\"breadcrumb\":{\"@id\":\"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/insidebigdata.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Book Review: Tree-based Methods for Statistical Learning in R\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/insidebigdata.com\/#website\",\"url\":\"https:\/\/insidebigdata.com\/\",\"name\":\"insideBIGDATA\",\"description\":\"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/insidebigdata.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed\",\"name\":\"Daniel Gutierrez\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g\",\"caption\":\"Daniel Gutierrez\"},\"description\":\"Daniel D. Gutierrez is a Data Scientist with Los Angeles-based AMULET Analytics, a service division of AMULET Development Corp. He's been involved with data science and Big Data long before it came in vogue, so imagine his delight when the Harvard Business Review recently deemed \\\"data scientist\\\" as the sexiest profession for the 21st century. Previously, he taught computer science and database classes at UCLA Extension for over 15 years, and authored three computer industry books on database technology. He also served as technical editor, columnist and writer at a major computer industry monthly publication for 7 years. Follow his data science musings at @AMULETAnalytics.\",\"sameAs\":[\"http:\/\/www.insidebigdata.com\",\"https:\/\/twitter.com\/@AMULETAnalytics\"],\"url\":\"https:\/\/insidebigdata.com\/author\/dangutierrez\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Book Review: Tree-based Methods for Statistical Learning in R - insideBIGDATA","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/","og_locale":"en_US","og_type":"article","og_title":"Book Review: Tree-based Methods for Statistical Learning in R - insideBIGDATA","og_description":"Here&#8217;s a new title that is a &#8220;must have&#8221; for any data scientist who uses the R language. It&#8217;s a wonderful learning resource for tree-based techniques in statistical learning, one that&#8217;s become my go-to text when I find the need to do a deep dive into various ML topic areas for my work. The methods [&hellip;]","og_url":"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/","og_site_name":"insideBIGDATA","article_publisher":"http:\/\/www.facebook.com\/insidebigdata","article_published_time":"2023-02-23T14:00:00+00:00","article_modified_time":"2023-02-24T17:15:08+00:00","og_image":[{"width":300,"height":456,"url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2022\/10\/Tree-based-Methods-book.png","type":"image\/png"}],"author":"Daniel Gutierrez","twitter_card":"summary_large_image","twitter_creator":"@AMULETAnalytics","twitter_site":"@insideBigData","twitter_misc":{"Written by":"Daniel Gutierrez","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/","url":"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/","name":"Book Review: Tree-based Methods for Statistical Learning in R - insideBIGDATA","isPartOf":{"@id":"https:\/\/insidebigdata.com\/#website"},"datePublished":"2023-02-23T14:00:00+00:00","dateModified":"2023-02-24T17:15:08+00:00","author":{"@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed"},"breadcrumb":{"@id":"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/insidebigdata.com\/2023\/02\/23\/book-review-tree-based-methods-for-statistical-learning-in-r\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/insidebigdata.com\/"},{"@type":"ListItem","position":2,"name":"Book Review: Tree-based Methods for Statistical Learning in R"}]},{"@type":"WebSite","@id":"https:\/\/insidebigdata.com\/#website","url":"https:\/\/insidebigdata.com\/","name":"insideBIGDATA","description":"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/insidebigdata.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed","name":"Daniel Gutierrez","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g","caption":"Daniel Gutierrez"},"description":"Daniel D. Gutierrez is a Data Scientist with Los Angeles-based AMULET Analytics, a service division of AMULET Development Corp. He's been involved with data science and Big Data long before it came in vogue, so imagine his delight when the Harvard Business Review recently deemed \"data scientist\" as the sexiest profession for the 21st century. Previously, he taught computer science and database classes at UCLA Extension for over 15 years, and authored three computer industry books on database technology. He also served as technical editor, columnist and writer at a major computer industry monthly publication for 7 years. Follow his data science musings at @AMULETAnalytics.","sameAs":["http:\/\/www.insidebigdata.com","https:\/\/twitter.com\/@AMULETAnalytics"],"url":"https:\/\/insidebigdata.com\/author\/dangutierrez\/"}]}},"jetpack_featured_media_url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2022\/10\/Tree-based-Methods-book.png","jetpack_shortlink":"https:\/\/wp.me\/p9eA3j-7ZN","jetpack-related-posts":[{"id":4594,"url":"https:\/\/insidebigdata.com\/2013\/10\/06\/book-review-introduction-statistical-learning\/","url_meta":{"origin":30739,"position":0},"title":"Book Review: An Introduction to Statistical Learning","date":"October 6, 2013","format":false,"excerpt":"I'm excited to be writing this book review. It is a book for which I've been waiting a long time. An Introduction to Statistical Learning with Application in R by James, Witten, Hastie, and Tibshirani is a contemporary re-work of the classic machine learning text Elements of Statistical Learning by\u2026","rel":"","context":"In &quot;Book Review&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":14354,"url":"https:\/\/insidebigdata.com\/2016\/02\/12\/book-review-r-for-everyone-advanced-analytics-and-graphics\/","url_meta":{"origin":30739,"position":1},"title":"Book Review: R for Everyone &#8211; Advanced Analytics and Graphics","date":"February 12, 2016","format":false,"excerpt":"I'm frequently asked about educational resources for those making their entry into the data science and machine learning professions. There are plenty of good advanced books such as theoretical masterpieces. The book \"R for Everyone: Advanced Analytics and Graphics\" authored by Jared P. Lander covers the intermediate ground very well.","rel":"","context":"In &quot;Book Review&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2016\/02\/R_for_Everyone.png?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":9713,"url":"https:\/\/insidebigdata.com\/2014\/06\/11\/supervised-machine-learning\/","url_meta":{"origin":30739,"position":2},"title":"Supervised Machine Learning","date":"June 11, 2014","format":false,"excerpt":"To help our audience leverage the power of machine learning, the editors of insideBIGDATA have created this weekly article series called \u201cThe insideBIGDATA Guide to Machine Learning.\u201d This is our fifth installment, \"Supervised Machine Learning.\"","rel":"","context":"In &quot;Featured&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2014\/05\/insideBIGDATA_Guide_ML.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":15964,"url":"https:\/\/insidebigdata.com\/2016\/09\/09\/book-review-the-book-of-r-by-tilman-davies\/","url_meta":{"origin":30739,"position":3},"title":"Book Review: The Book of R by Tilman Davies","date":"September 9, 2016","format":false,"excerpt":"A fantastic new book just landed on my desk, \"The Book of R: A First Course in Programming and Statistics\" by Tilman M. Davies from No Starch Press. I've been looking for a book like this for some time - to use with the introductory data science and machine learning\u2026","rel":"","context":"In &quot;Book Review&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":18358,"url":"https:\/\/insidebigdata.com\/2017\/07\/17\/book-review-statistical-learning-sparsity-lasso-generalizations\/","url_meta":{"origin":30739,"position":4},"title":"Book Review: Statistical Learning with Sparsity &#8211; The Lasso and Generalizations","date":"July 17, 2017","format":false,"excerpt":"As a data scientist, I have a handful of books that serve as important resources for my work in the field - \"Statistical Learning with Sparsity - The Lasso and Generalizations\" by Trevor Hastie, Robert Tibshirani, and Martin Wainwright is one of them. This book earned a prominent position on\u2026","rel":"","context":"In &quot;Book Review&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":4934,"url":"https:\/\/insidebigdata.com\/2013\/10\/17\/book-review-machine-learning-hackers\/","url_meta":{"origin":30739,"position":5},"title":"Book Review: Machine Learning for Hackers","date":"October 17, 2013","format":false,"excerpt":"It isn't often that I get excited about a book in my field of data science, but I must admit that I was highly energized the first time I thumbed through a new copy of Machine Learning for Hackers by Drew Conway and John Myles White courtesy of O'Reilly Media.\u2026","rel":"","context":"In &quot;Book Review&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/30739"}],"collection":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/users\/37"}],"replies":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/comments?post=30739"}],"version-history":[{"count":0,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/30739\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media\/30740"}],"wp:attachment":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media?parent=30739"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/categories?post=30739"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/tags?post=30739"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}