{"id":31724,"date":"2023-02-27T06:00:00","date_gmt":"2023-02-27T14:00:00","guid":{"rendered":"https:\/\/insidebigdata.com\/?p=31724"},"modified":"2023-02-28T08:51:37","modified_gmt":"2023-02-28T16:51:37","slug":"data-science-101-the-data-science-process","status":"publish","type":"post","link":"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/","title":{"rendered":"Data Science 101: The Data Science Process"},"content":{"rendered":"\n<p>Welcome to insideBIGDATA&#8217;s <em>Data Science 101<\/em> channel bringing you perspectives for the topics of the day in data science, machine learning, AI and deep learning. Many of the video presentations come from my lectures for my <em>Introduction to Data Science<\/em> class I teach at UCLA Extension. In today&#8217;s slide-based video presentation I discuss <em>The Data Science Process<\/em>, an overview of the steps that data scientists use to solve problems with data science and machine learning technologies. Enjoy!<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\"  id=\"_ytid_42990\"  width=\"480\" height=\"270\"  data-origwidth=\"480\" data-origheight=\"270\" src=\"https:\/\/www.youtube.com\/embed\/Ut2WiTyPDPw?enablejsapi=1&#038;autoplay=0&#038;cc_load_policy=0&#038;cc_lang_pref=&#038;iv_load_policy=1&#038;loop=0&#038;modestbranding=0&#038;rel=1&#038;fs=1&#038;playsinline=0&#038;autohide=2&#038;theme=dark&#038;color=red&#038;controls=1&#038;\" class=\"__youtube_prefs__  epyt-is-override  no-lazyload\" title=\"YouTube player\"  allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen data-no-lazy=\"1\" data-skipgform_ajax_framebjll=\"\"><\/iframe>\n<\/div><\/figure>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"alignleft size-full is-resized\"><img decoding=\"async\" 
loading=\"lazy\" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic.png\" alt=\"\" class=\"wp-image-21778\" width=\"103\" height=\"118\" srcset=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic.png 200w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Daniel_2018_pic-131x150.png 131w\" sizes=\"(max-width: 103px) 100vw, 103px\" \/><\/figure><\/div>\n\n\n<p><em>Contributed by Daniel D. Gutierrez, Editor-in-Chief and Resident Data Scientist for insideBIGDATA. In addition to being a tech journalist who keeps a pulse on the big data ecosystem, Daniel also is an independent consultant in data science, author, and educator.<\/em><\/p>\n\n\n\n<p><em>Sign up for the free insideBIGDATA&nbsp;<a href=\"http:\/\/inside-bigdata.com\/newsletter\/\" target=\"_blank\" rel=\"noreferrer noopener\">newsletter<\/a>.<\/em><\/p>\n\n\n\n<p><em>Join us on Twitter:&nbsp;<a href=\"https:\/\/twitter.com\/InsideBigData1\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/twitter.com\/InsideBigData1<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on LinkedIn:&nbsp;<a href=\"https:\/\/www.linkedin.com\/company\/insidebigdata\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.linkedin.com\/company\/insidebigdata\/<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on Facebook:&nbsp;<a href=\"https:\/\/www.facebook.com\/insideBIGDATANOW\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.facebook.com\/insideBIGDATANOW<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Welcome to insideBIGDATA&#8217;s Data Science 101 channel bringing you perspectives for the topics of the day in data science, machine learning, AI and deep learning. Many of the video presentations come from my lectures for my Introduction to Data Science class I teach at UCLA Extension. 
In today&#8217;s slide-based video presentation I discuss The Data Science Process, an overview of the steps that data scientists use to solve problems with data science and machine learning technologies. <\/p>\n","protected":false},"author":37,"featured_media":22510,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":""},"categories":[115,182,170,87,180,67,56,97,1,85],"tags":[133,261,277,95],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Data Science 101: The Data Science Process - insideBIGDATA<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data Science 101: The Data Science Process - insideBIGDATA\" \/>\n<meta property=\"og:description\" content=\"Welcome to insideBIGDATA&#039;s Data Science 101 channel bringing you perspectives for the topics of the day in data science, machine learning, AI and deep learning. Many of the video presentations come from my lectures for my Introduction to Data Science class I teach at UCLA Extension. 
In today&#039;s slide-based video presentation I discuss The Data Science Process, an overview of the steps that data scientists use to solve problems with data science and machine learning technologies.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/\" \/>\n<meta property=\"og:site_name\" content=\"insideBIGDATA\" \/>\n<meta property=\"article:publisher\" content=\"http:\/\/www.facebook.com\/insidebigdata\" \/>\n<meta property=\"article:published_time\" content=\"2023-02-27T14:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-02-28T16:51:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2019\/04\/DataScience_shutterstock_1054542323.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"5000\" \/>\n\t<meta property=\"og:image:height\" content=\"4000\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Daniel Gutierrez\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@AMULETAnalytics\" \/>\n<meta name=\"twitter:site\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Daniel Gutierrez\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/\",\"url\":\"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/\",\"name\":\"Data Science 101: The Data Science Process - insideBIGDATA\",\"isPartOf\":{\"@id\":\"https:\/\/insidebigdata.com\/#website\"},\"datePublished\":\"2023-02-27T14:00:00+00:00\",\"dateModified\":\"2023-02-28T16:51:37+00:00\",\"author\":{\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed\"},\"breadcrumb\":{\"@id\":\"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/insidebigdata.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data Science 101: The Data Science Process\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/insidebigdata.com\/#website\",\"url\":\"https:\/\/insidebigdata.com\/\",\"name\":\"insideBIGDATA\",\"description\":\"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/insidebigdata.com\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed\",\"name\":\"Daniel Gutierrez\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g\",\"caption\":\"Daniel Gutierrez\"},\"description\":\"Daniel D. Gutierrez is a Data Scientist with Los Angeles-based AMULET Analytics, a service division of AMULET Development Corp. He's been involved with data science and Big Data long before it came in vogue, so imagine his delight when the Harvard Business Review recently deemed \\\"data scientist\\\" as the sexiest profession for the 21st century. Previously, he taught computer science and database classes at UCLA Extension for over 15 years, and authored three computer industry books on database technology. He also served as technical editor, columnist and writer at a major computer industry monthly publication for 7 years. Follow his data science musings at @AMULETAnalytics.\",\"sameAs\":[\"http:\/\/www.insidebigdata.com\",\"https:\/\/twitter.com\/@AMULETAnalytics\"],\"url\":\"https:\/\/insidebigdata.com\/author\/dangutierrez\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Data Science 101: The Data Science Process - insideBIGDATA","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/","og_locale":"en_US","og_type":"article","og_title":"Data Science 101: The Data Science Process - insideBIGDATA","og_description":"Welcome to insideBIGDATA's Data Science 101 channel bringing you perspectives for the topics of the day in data science, machine learning, AI and deep learning. Many of the video presentations come from my lectures for my Introduction to Data Science class I teach at UCLA Extension. In today's slide-based video presentation I discuss The Data Science Process, an overview of the steps that data scientists use to solve problems with data science and machine learning technologies.","og_url":"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/","og_site_name":"insideBIGDATA","article_publisher":"http:\/\/www.facebook.com\/insidebigdata","article_published_time":"2023-02-27T14:00:00+00:00","article_modified_time":"2023-02-28T16:51:37+00:00","og_image":[{"width":5000,"height":4000,"url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2019\/04\/DataScience_shutterstock_1054542323.jpg","type":"image\/jpeg"}],"author":"Daniel Gutierrez","twitter_card":"summary_large_image","twitter_creator":"@AMULETAnalytics","twitter_site":"@insideBigData","twitter_misc":{"Written by":"Daniel Gutierrez","Est. 
reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/","url":"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/","name":"Data Science 101: The Data Science Process - insideBIGDATA","isPartOf":{"@id":"https:\/\/insidebigdata.com\/#website"},"datePublished":"2023-02-27T14:00:00+00:00","dateModified":"2023-02-28T16:51:37+00:00","author":{"@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed"},"breadcrumb":{"@id":"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/insidebigdata.com\/2023\/02\/27\/data-science-101-the-data-science-process\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/insidebigdata.com\/"},{"@type":"ListItem","position":2,"name":"Data Science 101: The Data Science Process"}]},{"@type":"WebSite","@id":"https:\/\/insidebigdata.com\/#website","url":"https:\/\/insidebigdata.com\/","name":"insideBIGDATA","description":"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/insidebigdata.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2540da209c83a68f4f5922848f7376ed","name":"Daniel 
Gutierrez","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5780282e7e567e2a502233e948464542?s=96&d=mm&r=g","caption":"Daniel Gutierrez"},"description":"Daniel D. Gutierrez is a Data Scientist with Los Angeles-based AMULET Analytics, a service division of AMULET Development Corp. He's been involved with data science and Big Data long before it came in vogue, so imagine his delight when the Harvard Business Review recently deemed \"data scientist\" as the sexiest profession for the 21st century. Previously, he taught computer science and database classes at UCLA Extension for over 15 years, and authored three computer industry books on database technology. He also served as technical editor, columnist and writer at a major computer industry monthly publication for 7 years. Follow his data science musings at @AMULETAnalytics.","sameAs":["http:\/\/www.insidebigdata.com","https:\/\/twitter.com\/@AMULETAnalytics"],"url":"https:\/\/insidebigdata.com\/author\/dangutierrez\/"}]}},"jetpack_featured_media_url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2019\/04\/DataScience_shutterstock_1054542323.jpg","jetpack_shortlink":"https:\/\/wp.me\/p9eA3j-8fG","jetpack-related-posts":[{"id":32220,"url":"https:\/\/insidebigdata.com\/2023\/04\/28\/data-science-101-the-data-science-venn-diagram\/","url_meta":{"origin":31724,"position":0},"title":"Data Science 101: The Data Science Venn Diagram","date":"April 28, 2023","format":false,"excerpt":"Welcome to insideBIGDATA\u2019s\u00a0Data Science 101\u00a0channel bringing you perspectives for the topics of the day in data science, machine learning, AI and deep learning. Many of the video presentations come from my lectures for my\u00a0Introduction to Data Science\u00a0class I teach at UCLA Extension. 
In today\u2019s slide-based video presentation I discuss\u00a0The Data\u2026","rel":"","context":"In &quot;Big Data&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2019\/04\/DataScience_shutterstock_1054542323.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":12653,"url":"https:\/\/insidebigdata.com\/2015\/01\/27\/data-science-101-machine-learning-basics\/","url_meta":{"origin":31724,"position":1},"title":"Data Science 101: Machine Learning &#8211; The Basics","date":"January 27, 2015","format":false,"excerpt":"The next installment of insideBIGDATA's Data Science 101 series comes from our friends over at LinkedIn.","rel":"","context":"In &quot;Data Science&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":8698,"url":"https:\/\/insidebigdata.com\/2014\/04\/13\/data-science-101-k-means-clustering\/","url_meta":{"origin":31724,"position":2},"title":"Data Science 101: k-means Clustering","date":"April 13, 2014","format":false,"excerpt":"In this edition of insideBIGDATA's Data Science 101 series, I'm going to offer up a short instructional video describing the use of the popular unsupervised learning algorithm, k-means clustering.","rel":"","context":"In &quot;Data Science 101&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":20645,"url":"https:\/\/insidebigdata.com\/2018\/06\/30\/insidebigdata-ask-data-scientist-series\/","url_meta":{"origin":31724,"position":3},"title":"insideBIGDATA &#8220;Ask a Data Scientist&#8221; Series","date":"June 30, 2018","format":false,"excerpt":"Welcome to the series of articles sponsored by Intel \u2013 \u201cAsk a Data Scientist\u201d from insideBIGDATA's popular Data Science 101 channel. These articles constitute many of our site's most popular resources for newbie data scientists. 
The 12 articles listed below were from reader submitted questions of varying levels of technical\u2026","rel":"","context":"In &quot;Data Science&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2018\/06\/data-scientist-300x300_insidebigdata.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":12583,"url":"https:\/\/insidebigdata.com\/2015\/01\/10\/data-science-101-lessons-kaggle-competitions\/","url_meta":{"origin":31724,"position":4},"title":"Data Science 101: Lessons Learned from Kaggle Competitions","date":"January 10, 2015","format":false,"excerpt":"In the video presentation below, \"Machine learning best practices we've learned from hundreds of competitions,\" Ben Hamner, Chief Scientist at Kaggle, discusses some very intriguing insights into how find success in data science projects.","rel":"","context":"In &quot;Data Science&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":13621,"url":"https:\/\/insidebigdata.com\/2015\/08\/31\/data-science-101-an-introduction-to-scikit-learn-machine-learning-in-python\/","url_meta":{"origin":31724,"position":5},"title":"Data Science 101: An Introduction to scikit-learn &#8211; Machine Learning in Python","date":"August 31, 2015","format":false,"excerpt":"The tutorial presentation below offers an introduction to the scikit-learn package and to the central concepts of Machine Learning.","rel":"","context":"In &quot;Data 
Science&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/31724"}],"collection":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/users\/37"}],"replies":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/comments?post=31724"}],"version-history":[{"count":0,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/31724\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media\/22510"}],"wp:attachment":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media?parent=31724"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/categories?post=31724"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/tags?post=31724"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}