{"id":21318,"date":"2018-10-24T08:30:11","date_gmt":"2018-10-24T15:30:11","guid":{"rendered":"https:\/\/insidebigdata.com\/?p=21318"},"modified":"2018-10-25T08:56:59","modified_gmt":"2018-10-25T15:56:59","slug":"introduction-statistical-analysis-outlier-detection-methods","status":"publish","type":"post","link":"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/","title":{"rendered":"Introduction to Statistical Analysis and Outlier Detection Methods"},"content":{"rendered":"<p><img decoding=\"async\" loading=\"lazy\" class=\"alignright wp-image-21316\" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/10\/Noah-Data-logo.jpg\" alt=\"\" width=\"166\" height=\"166\" srcset=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/10\/Noah-Data-logo.jpg 200w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/10\/Noah-Data-logo-150x150.jpg 150w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/10\/Noah-Data-logo-110x110.jpg 110w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/10\/Noah-Data-logo-50x50.jpg 50w\" sizes=\"(max-width: 166px) 100vw, 166px\" \/>Our friends over at <a href=\"http:\/\/www.noahdatatech.com\/\" target=\"_blank\" rel=\"noopener\">Noah Data<\/a> have written a research style paper, <a href=\"https:\/\/insidebigdata.com\/white-paper\/introduction-statistical-analysis-outlier-detection-methods\/\" target=\"_blank\" rel=\"noopener\"><em>Introduction to Statistical Analysis and Outlier Detection Methods<\/em><\/a>, that discusses how statistical data can generally be classified in terms of number of variables as Univariate, Bivariate or Multivariate. Univariate data has only one variable, Bivariate data has two variables and Multivariate data has more than two variables.<\/p>\n<p>The paper addresses <em>Multivariate Outlier<\/em> detection which is a ubiquitous use case across industries. These can be used on summarized (moving average, standard deviation etc.) high frequency IoT variables to calculate outlier source of those variables. These can be used for machine learning based predictive maintenance of physical equipment assets. The same can be applied in oil &amp; gas industry for very similar use case. This topic is extremely critical to a data scientist.<\/p>\n<p>The paper was authored by Ashish Kumar, a data scientist at <a href=\"http:\/\/www.noahdatatech.com\/\" target=\"_blank\" rel=\"noopener\">Noah Data<\/a>, is an author and a data science professional with several years of experience in the field of Advanced Analytics. He has a B.Tech from IIT Madras and is a Young India Fellow, an exclusive 1-year academic program on leadership &amp; liberal arts offered to 215 young bright Indians, who show exceptional intellectual &amp; leadership ability.<\/p>\n<p>Download the paper <a href=\"https:\/\/insidebigdata.com\/white-paper\/introduction-statistical-analysis-outlier-detection-methods\/\" target=\"_blank\" rel=\"noopener\">HERE<\/a>.<\/p>\n<p>&nbsp;<\/p>\n<p><em>Sign up for the free ins<\/em><em>ideBIGDATA\u00a0<a href=\"http:\/\/insidebigdata.com\/newsletter\/\" target=\"_blank\" rel=\"noopener\">newsletter<\/a>.<\/em><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Our friends over at Noah Data have written a research style paper, &#8220;Introduction to Statistical Analysis and Outlier Detection Methods,&#8221; that discusses how statistical data can generally be classified in terms of number of variables as Univariate, Bivariate or Multivariate. Univariate data has only one variable, Bivariate data has two variables and Multivariate data has more than two variables.<\/p>\n","protected":false},"author":10513,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":""},"categories":[87,180,67,56,84,1],"tags":[277,134,96],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Introduction to Statistical Analysis and Outlier Detection Methods - insideBIGDATA<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Introduction to Statistical Analysis and Outlier Detection Methods - insideBIGDATA\" \/>\n<meta property=\"og:description\" content=\"Our friends over at Noah Data have written a research style paper, &quot;Introduction to Statistical Analysis and Outlier Detection Methods,&quot; that discusses how statistical data can generally be classified in terms of number of variables as Univariate, Bivariate or Multivariate. Univariate data has only one variable, Bivariate data has two variables and Multivariate data has more than two variables.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/\" \/>\n<meta property=\"og:site_name\" content=\"insideBIGDATA\" \/>\n<meta property=\"article:publisher\" content=\"http:\/\/www.facebook.com\/insidebigdata\" \/>\n<meta property=\"article:published_time\" content=\"2018-10-24T15:30:11+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2018-10-25T15:56:59+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/10\/Noah-Data-logo.jpg\" \/>\n<meta name=\"author\" content=\"Editorial Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:site\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Editorial Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/\",\"url\":\"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/\",\"name\":\"Introduction to Statistical Analysis and Outlier Detection Methods - insideBIGDATA\",\"isPartOf\":{\"@id\":\"https:\/\/insidebigdata.com\/#website\"},\"datePublished\":\"2018-10-24T15:30:11+00:00\",\"dateModified\":\"2018-10-25T15:56:59+00:00\",\"author\":{\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9\"},\"breadcrumb\":{\"@id\":\"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/insidebigdata.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Introduction to Statistical Analysis and Outlier Detection Methods\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/insidebigdata.com\/#website\",\"url\":\"https:\/\/insidebigdata.com\/\",\"name\":\"insideBIGDATA\",\"description\":\"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/insidebigdata.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9\",\"name\":\"Editorial Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g\",\"caption\":\"Editorial Team\"},\"sameAs\":[\"http:\/\/www.insidebigdata.com\"],\"url\":\"https:\/\/insidebigdata.com\/author\/editorial\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Introduction to Statistical Analysis and Outlier Detection Methods - insideBIGDATA","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/","og_locale":"en_US","og_type":"article","og_title":"Introduction to Statistical Analysis and Outlier Detection Methods - insideBIGDATA","og_description":"Our friends over at Noah Data have written a research style paper, \"Introduction to Statistical Analysis and Outlier Detection Methods,\" that discusses how statistical data can generally be classified in terms of number of variables as Univariate, Bivariate or Multivariate. Univariate data has only one variable, Bivariate data has two variables and Multivariate data has more than two variables.","og_url":"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/","og_site_name":"insideBIGDATA","article_publisher":"http:\/\/www.facebook.com\/insidebigdata","article_published_time":"2018-10-24T15:30:11+00:00","article_modified_time":"2018-10-25T15:56:59+00:00","og_image":[{"url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/10\/Noah-Data-logo.jpg"}],"author":"Editorial Team","twitter_card":"summary_large_image","twitter_creator":"@insideBigData","twitter_site":"@insideBigData","twitter_misc":{"Written by":"Editorial Team","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/","url":"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/","name":"Introduction to Statistical Analysis and Outlier Detection Methods - insideBIGDATA","isPartOf":{"@id":"https:\/\/insidebigdata.com\/#website"},"datePublished":"2018-10-24T15:30:11+00:00","dateModified":"2018-10-25T15:56:59+00:00","author":{"@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9"},"breadcrumb":{"@id":"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/insidebigdata.com\/2018\/10\/24\/introduction-statistical-analysis-outlier-detection-methods\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/insidebigdata.com\/"},{"@type":"ListItem","position":2,"name":"Introduction to Statistical Analysis and Outlier Detection Methods"}]},{"@type":"WebSite","@id":"https:\/\/insidebigdata.com\/#website","url":"https:\/\/insidebigdata.com\/","name":"insideBIGDATA","description":"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/insidebigdata.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9","name":"Editorial Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g","caption":"Editorial Team"},"sameAs":["http:\/\/www.insidebigdata.com"],"url":"https:\/\/insidebigdata.com\/author\/editorial\/"}]}},"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p9eA3j-5xQ","jetpack-related-posts":[{"id":12871,"url":"https:\/\/insidebigdata.com\/2015\/03\/13\/sumo-logic-unveils-outlier-detection-and-predictive-analytics\/","url_meta":{"origin":21318,"position":0},"title":"Sumo Logic Unveils Outlier Detection and Predictive Analytics","date":"March 13, 2015","format":false,"excerpt":"Sumo Logic, a leading secure and purpose-built cloud-based machine data analytics service, announced the availability of Outlier Detection and Predictive Analytics capabilities that augment its rich and proven machine learning and Anomaly Detection engine with statistical analysis and projection models.","rel":"","context":"In &quot;Analytics&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":11405,"url":"https:\/\/insidebigdata.com\/2014\/09\/18\/classes-predictive-analytics\/","url_meta":{"origin":21318,"position":1},"title":"Classes of Predictive Analytics","date":"September 18, 2014","format":false,"excerpt":"This article is the third in\u00a0an editorial\u00a0series\u00a0that will review how predictive analytics helps your organization predict with confidence what will happen next so that you can make smarter decisions and improve business outcomes..\u00a0 It is important to adopt a predictive analytics solution that meets the specific needs of different users\u2026","rel":"","context":"In &quot;Analytics&quot;","img":{"alt_text":"TIBCO_PA_clusters","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2014\/09\/TIBCO_PA_clusters.jpg?resize=350%2C200","width":350,"height":200},"classes":[]},{"id":12481,"url":"https:\/\/insidebigdata.com\/2014\/12\/10\/ask-data-scientist-confounding-variables\/","url_meta":{"origin":21318,"position":2},"title":"Ask a Data Scientist: Confounding Variables","date":"December 10, 2014","format":false,"excerpt":"Welcome back to our series of articles sponsored by Intel \u2013 \u201cAsk a Data Scientist.\u201d This week\u2019s question is from a reader who asks for an explanation of confounding variables and why they're important in data science projects.","rel":"","context":"In &quot;Ask a Data Scientist&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2018\/06\/data-scientist-300x300_insidebigdata.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":9648,"url":"https:\/\/insidebigdata.com\/2014\/06\/05\/data-munging-exploratory-data-analysis-feature-engineering\/","url_meta":{"origin":21318,"position":3},"title":"Data Munging, Exploratory Data Analysis, and Feature Engineering","date":"June 5, 2014","format":false,"excerpt":"To help our audience leverage the power of machine learning, the editors of insideBIGDATA have created this weekly article series called \u201cThe insideBIGDATA Guide to Machine Learning.\u201d This is our fourth installment, \"Data Munging, Exploratory Data Analysis, and Feature Engineering.\"","rel":"","context":"In &quot;Featured&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2014\/05\/inisde-big-data-guide-to-machine-learning.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":12294,"url":"https:\/\/insidebigdata.com\/2014\/10\/29\/ask-data-scientist-handling-missing-data\/","url_meta":{"origin":21318,"position":4},"title":"Ask a Data Scientist: Handling Missing Data","date":"October 29, 2014","format":false,"excerpt":"Welcome back to our series of articles sponsored by Intel \u2013 \u201cAsk a Data Scientist.\u201d This week\u2019s question is from a reader who seeks a discussion of missing data handling methods such as imputation.","rel":"","context":"In &quot;Ask a Data Scientist&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":17106,"url":"https:\/\/insidebigdata.com\/2017\/02\/06\/making-big-data-requires-effective-training-data-science\/","url_meta":{"origin":21318,"position":5},"title":"Making the Most of Big Data Requires Effective Training in Data Science","date":"February 6, 2017","format":false,"excerpt":"In this special guest feature, Devavrat Shah, professor in MIT\u2019s Department of Electrical Engineering and Computer Science, discusses the type of training data scientists need in order to glean the most value from big data.","rel":"","context":"In &quot;Big Data&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/21318"}],"collection":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/users\/10513"}],"replies":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/comments?post=21318"}],"version-history":[{"count":0,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/21318\/revisions"}],"wp:attachment":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media?parent=21318"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/categories?post=21318"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/tags?post=21318"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}