{"id":32625,"date":"2023-06-15T03:00:00","date_gmt":"2023-06-15T10:00:00","guid":{"rendered":"https:\/\/insidebigdata.com\/?p=32625"},"modified":"2023-06-13T18:40:03","modified_gmt":"2023-06-14T01:40:03","slug":"the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption","status":"publish","type":"post","link":"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/","title":{"rendered":"The Problem with \u2018Dirty Data\u2019 &#8212; How Data Quality Can Impact Life Science AI Adoption"},"content":{"rendered":"\n<p>Where AI models are concerned, you get out what you put in. You can\u2019t expect to input poor-quality data and generate high-quality results. But all too often, that\u2019s exactly what\u2019s happening in life science. Successful AI models fail to deliver their full potential because the data they\u2019re based on isn\u2019t of sufficient quality. The challenge to effective AI adoption in life science doesn\u2019t lie with AI itself but with life science datasets.<\/p>\n\n\n\n<p><strong>Life science data: unclean, unstructured, and highly regulated<\/strong><\/p>\n\n\n\n<p>Life science companies sit on vast quantities of data. The \u2018data deluge\u2019 has swamped all industries, but none more so than life science \u2013 where data floods in from patients, payers, and healthcare professionals via countless streams. For example, the patient&#8217;s voice has been increasingly amplified in recent years. While this is undoubtedly an excellent thing for patients, life science teams face a challenge keeping pace with the number of online channels where opinions are shared and information can be mined. \u201c<em>There is a lot of data to be harnessed, and top life sciences companies have noticed,\u201d <\/em><a href=\"https:\/\/ca.nttdata.com\/en\/blog\/2022\/september\/how-critical-is-data-analytics-to-the-life-sciences-industry#:~:text=The%20rise%20of%20big%20data,exabytes%20over%20the%20past%20decade.\" target=\"_blank\" rel=\"noreferrer noopener\"><em>reports NTT Data<\/em><\/a><em>. \u201cWith rapid reductions in costs of genome sequencing, the amount of genomic data has skyrocketed to over 40 exabytes over the past decade.<\/em>\u201d<\/p>\n\n\n\n<p>Quantity does not always equate to quality, and rarely is an enterprise\u2019s entire data lake necessary to build an effective AI model. Instead, companies need to adopt a data-centric approach, thereby shifting away from large volumes of information to smaller samples with higher-quality data sets for training.<\/p>\n\n\n\n<p><strong>Data access and compliance<\/strong><\/p>\n\n\n\n<p>Data quantity is only one potential roadblock preventing the construction of high-quality life science datasets. Many industry data sources are subject to regulations such as the European <a href=\"https:\/\/gdpr-info.eu\/\" target=\"_blank\" rel=\"noreferrer noopener\">GDPR<\/a> or <a href=\"https:\/\/oag.ca.gov\/privacy\/ccpa\" target=\"_blank\" rel=\"noreferrer noopener\">CCPA<\/a>, among other regional laws, and may not be shared with other vendors or used to train AI models. Data access can be a real issue within highly-regulated industries such as life science, where regulatory requirements can change from region to region. \u201c<em>While most companies are embracing new technologies to deliver enhanced patient outcomes,\u201d notes <\/em><a href=\"https:\/\/www2.deloitte.com\/content\/dam\/Deloitte\/ch\/Documents\/life-sciences-health-care\/ch-en-lshc-challenge-of-compliance.pdf\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Deloitte<\/em><\/a><em>, \u201cthe ambiguity of regulations related to converging and emerging technologies results in a myriad of compliance challenges.<\/em>\u201d<\/p>\n\n\n\n<p>When building life science AI models, it\u2019s not uncommon to find that potentially valuable datasets are ringfenced by compliance issues, leading to models built on incomplete data.<\/p>\n\n\n\n<p><strong>Dirty data<\/strong><\/p>\n\n\n\n<p>Life science companies have access to a lot of data \u2013 in some cases, too much \u2013 and a lot of the more useful information is subject to strict regulatory processes and is effectively beyond reach. And to make matters worse, a significant proportion of life science data is \u2018dirty\u2019 \u2013 inaccurate, incomplete, or inconsistent \u2013 and not immediately usable.<\/p>\n\n\n\n<p>Life science data is often unstructured, coming in the form of typed MSL reports and field team observations that can vary drastically in length, format, and even language. Many healthcare organizations have fully migrated to electronic medical records (EMRs), but some have only partially migrated, while others are yet to begin the transition. These disparate and often-inconsistent data streams mean that life science data sets must often be cleaned before they are used to train effective AI models.<\/p>\n\n\n\n<p><strong>Dealing with data bias<\/strong><\/p>\n\n\n\n<p>The appeal of data-based decision-making is rooted in objectivity \u2013 that data tells the truth, and choices based on data will be correct. But bias can still play a role. Machine learning models are influenced by both the diversity of datasets and the way the model is trained. Therefore, if the datasets contain biased data, the model may exhibit the same bias in its decision-making. \u201c<em>AI can help identify and reduce the impact of human biases,\u201d reports <\/em><a href=\"https:\/\/hbr.org\/2019\/10\/what-do-we-do-about-the-biases-in-ai\" target=\"_blank\" rel=\"noreferrer noopener\"><em>HBR<\/em><\/a><em>. \u201cBut it can also make the problem worse by baking in and deploying biases at scale in sensitive application areas<\/em>.\u201d&nbsp;<\/p>\n\n\n\n<p>How can machine learning models overcome biased data? Last year, a group of <a href=\"https:\/\/news.mit.edu\/2022\/machine-learning-biased-data-0221\" target=\"_blank\" rel=\"noreferrer noopener\">researchers at MIT<\/a> discovered that how a model is trained can influence whether it is able to overcome a biased dataset. The authors of the study noted that it is possible to overcome dataset bias by taking care of dataset design. \u201cWe need to stop thinking that if you just collect a ton of raw data, that is going to get you somewhere,\u201d said research scientist and study author Xavier Boix.<\/p>\n\n\n\n<p><strong>Effective AI adoption in life science<\/strong><\/p>\n\n\n\n<p>Thus far, AI adoption in life science has been a mixed bag. In many cases, projects have gone awry not because the technology is immature but because the data it\u2019s based on is unclean, unstructured, or ringfenced by regulations. According to research from <a href=\"https:\/\/www2.deloitte.com\/us\/en\/insights\/industry\/life-sciences\/ai-and-pharma.html\" target=\"_blank\" rel=\"noreferrer noopener\">Deloitte<\/a>, \u201c<em>As AI moves from a \u201cnice to have\u201d to a \u201cmust have,\u201d companies and their leaders should build a vision and strategy to leverage AI, then put in place the building blocks needed to scale its use.\u201d<\/em><\/p>\n\n\n\n<p>Attempting to implement an AI model before the data is ready wastes time and resources. Data challenges leading to poor or biased models can impact the industry\u2019s confidence in the potential of AI to deliver business value. To succeed in training and deploying AI models, life science companies need to develop a clear data strategy and spend sufficient time cleaning and harmonizing their data.<\/p>\n\n\n\n<p><strong>About the Author<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"alignleft size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"150\" height=\"150\" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2023\/06\/Jason-Smith-Headshot-.jpeg\" alt=\"\" class=\"wp-image-32626\" srcset=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2023\/06\/Jason-Smith-Headshot-.jpeg 150w, https:\/\/insidebigdata.com\/wp-content\/uploads\/2023\/06\/Jason-Smith-Headshot--300x300.jpeg 300w\" sizes=\"(max-width: 150px) 100vw, 150px\" \/><\/figure><\/div>\n\n\n<p><em>Jason Smith is the Chief Technology Officer, AI &amp; Analytics at <a href=\"https:\/\/within3.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Within3<\/a>. He uses AI to understand the value of data and deliver products that empower our customers to make impactful decisions. Jason began his career at IBM and ATI Research while studying computer science at Harvard University, US. He is a leading-edge technologist and executive with over 20 years of industry experience.<\/em><\/p>\n\n\n\n<p><em>Sign up for the free insideBIGDATA&nbsp;<a href=\"http:\/\/inside-bigdata.com\/newsletter\/\" target=\"_blank\" rel=\"noreferrer noopener\">newsletter<\/a>.<\/em><\/p>\n\n\n\n<p><em>Join us on Twitter:&nbsp;<a href=\"https:\/\/twitter.com\/InsideBigData1\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/twitter.com\/InsideBigData1<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on LinkedIn:&nbsp;<a href=\"https:\/\/www.linkedin.com\/company\/insidebigdata\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.linkedin.com\/company\/insidebigdata\/<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on Facebook:&nbsp;<a href=\"https:\/\/www.facebook.com\/insideBIGDATANOW\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.facebook.com\/insideBIGDATANOW<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Jason Smith, Chief Technology Officer, AI &#038; Analytics at Within3, highlights how many life science data sets contain unclean, unstructured, or highly-regulated data that reduces the effectiveness of AI models. Life science companies must first clean and harmonize their data for effective AI adoption. <\/p>\n","protected":false},"author":10513,"featured_media":27298,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":""},"categories":[115,182,180,122,67,268,56,97],"tags":[990,281,96],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>The Problem with \u2018Dirty Data\u2019 - How Data Quality Can Impact Life Science AI Adoption - insideBIGDATA<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"The Problem with \u2018Dirty Data\u2019 - How Data Quality Can Impact Life Science AI Adoption - insideBIGDATA\" \/>\n<meta property=\"og:description\" content=\"Jason Smith, Chief Technology Officer, AI &amp; Analytics at Within3, highlights how many life science data sets contain unclean, unstructured, or highly-regulated data that reduces the effectiveness of AI models. Life science companies must first clean and harmonize their data for effective AI adoption.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/\" \/>\n<meta property=\"og:site_name\" content=\"insideBIGDATA\" \/>\n<meta property=\"article:publisher\" content=\"http:\/\/www.facebook.com\/insidebigdata\" \/>\n<meta property=\"article:published_time\" content=\"2023-06-15T10:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-06-14T01:40:03+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2021\/10\/data_quality_shutterstock_243064750.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"300\" \/>\n\t<meta property=\"og:image:height\" content=\"283\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Editorial Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:site\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Editorial Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/\",\"url\":\"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/\",\"name\":\"The Problem with \u2018Dirty Data\u2019 - How Data Quality Can Impact Life Science AI Adoption - insideBIGDATA\",\"isPartOf\":{\"@id\":\"https:\/\/insidebigdata.com\/#website\"},\"datePublished\":\"2023-06-15T10:00:00+00:00\",\"dateModified\":\"2023-06-14T01:40:03+00:00\",\"author\":{\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9\"},\"breadcrumb\":{\"@id\":\"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/insidebigdata.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"The Problem with \u2018Dirty Data\u2019 &#8212; How Data Quality Can Impact Life Science AI Adoption\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/insidebigdata.com\/#website\",\"url\":\"https:\/\/insidebigdata.com\/\",\"name\":\"insideBIGDATA\",\"description\":\"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/insidebigdata.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9\",\"name\":\"Editorial Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g\",\"caption\":\"Editorial Team\"},\"sameAs\":[\"http:\/\/www.insidebigdata.com\"],\"url\":\"https:\/\/insidebigdata.com\/author\/editorial\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"The Problem with \u2018Dirty Data\u2019 - How Data Quality Can Impact Life Science AI Adoption - insideBIGDATA","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/","og_locale":"en_US","og_type":"article","og_title":"The Problem with \u2018Dirty Data\u2019 - How Data Quality Can Impact Life Science AI Adoption - insideBIGDATA","og_description":"Jason Smith, Chief Technology Officer, AI & Analytics at Within3, highlights how many life science data sets contain unclean, unstructured, or highly-regulated data that reduces the effectiveness of AI models. Life science companies must first clean and harmonize their data for effective AI adoption.","og_url":"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/","og_site_name":"insideBIGDATA","article_publisher":"http:\/\/www.facebook.com\/insidebigdata","article_published_time":"2023-06-15T10:00:00+00:00","article_modified_time":"2023-06-14T01:40:03+00:00","og_image":[{"width":300,"height":283,"url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2021\/10\/data_quality_shutterstock_243064750.jpg","type":"image\/jpeg"}],"author":"Editorial Team","twitter_card":"summary_large_image","twitter_creator":"@insideBigData","twitter_site":"@insideBigData","twitter_misc":{"Written by":"Editorial Team","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/","url":"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/","name":"The Problem with \u2018Dirty Data\u2019 - How Data Quality Can Impact Life Science AI Adoption - insideBIGDATA","isPartOf":{"@id":"https:\/\/insidebigdata.com\/#website"},"datePublished":"2023-06-15T10:00:00+00:00","dateModified":"2023-06-14T01:40:03+00:00","author":{"@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9"},"breadcrumb":{"@id":"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/insidebigdata.com\/2023\/06\/15\/the-problem-with-dirty-data-how-data-quality-can-impact-life-science-ai-adoption\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/insidebigdata.com\/"},{"@type":"ListItem","position":2,"name":"The Problem with \u2018Dirty Data\u2019 &#8212; How Data Quality Can Impact Life Science AI Adoption"}]},{"@type":"WebSite","@id":"https:\/\/insidebigdata.com\/#website","url":"https:\/\/insidebigdata.com\/","name":"insideBIGDATA","description":"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/insidebigdata.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9","name":"Editorial Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g","caption":"Editorial Team"},"sameAs":["http:\/\/www.insidebigdata.com"],"url":"https:\/\/insidebigdata.com\/author\/editorial\/"}]}},"jetpack_featured_media_url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2021\/10\/data_quality_shutterstock_243064750.jpg","jetpack_shortlink":"https:\/\/wp.me\/p9eA3j-8ud","jetpack-related-posts":[{"id":23485,"url":"https:\/\/insidebigdata.com\/2019\/10\/26\/four-big-factors-shaping-the-future-of-data-science\/","url_meta":{"origin":32625,"position":0},"title":"Four Big Factors Shaping the Future of Data Science","date":"October 26, 2019","format":false,"excerpt":"In this special guest feature, Ryohei Fujimaki, Ph.D., Founder and CEO of dotData, discusses how AI and ML are having a profound impact on enterprise digital transformation becoming crucial as a competitive advantage and even for survival. As the field grows, four trends emerge, shaping data science in the next\u2026","rel":"","context":"In &quot;Data Science&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":32790,"url":"https:\/\/insidebigdata.com\/2023\/07\/07\/top-10-insidebigdata-articles-for-june-2023\/","url_meta":{"origin":32625,"position":1},"title":"TOP 10 insideBIGDATA Articles for June 2023","date":"July 7, 2023","format":false,"excerpt":"In this continuing regular feature, we give all our valued readers a monthly heads-up for the top 10 most viewed articles appearing on insideBIGDATA. Over the past several months, we\u2019ve heard from many of our followers that this feature will enable them to catch up with important news and features\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2023\/07\/Top10-column-banner_special.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":29887,"url":"https:\/\/insidebigdata.com\/2022\/07\/20\/pecan-ai-announces-one-click-data-science-model-deployment-integration-with-core-business-systems-and-automated-live-model-monitoring\/","url_meta":{"origin":32625,"position":2},"title":"Pecan AI Announces One-Click Data Science Model Deployment, Integration with Core Business Systems, and Automated Live Model Monitoring","date":"July 20, 2022","format":false,"excerpt":"Pecan AI, a leader in AI-based predictive analytics for BI analysts and business teams, announced the addition of one-click model deployment and integration with common CRMs, marketing automation tools and other core business systems. Pecan\u2019s customers can now take immediate actions based on the highly accurate predictions for future churn,\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":22919,"url":"https:\/\/insidebigdata.com\/2019\/07\/14\/the-harvard-data-science-initiative-and-the-mit-press-launch-the-harvard-data-science-review\/","url_meta":{"origin":32625,"position":3},"title":"The Harvard Data Science Initiative and The MIT Press Launch the HARVARD DATA SCIENCE REVIEW","date":"July 14, 2019","format":false,"excerpt":"The Harvard Data Science Initiative (HDSI) and the MIT Press are pleased to announce the launch of the Harvard Data Science Review (HDSR). The multimedia platform will feature leading global thinkers in the burgeoning field of data science, making research, educational resources, and commentary accessible to academics, professionals, and the\u2026","rel":"","context":"In &quot;Big Data&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":25318,"url":"https:\/\/insidebigdata.com\/2020\/12\/03\/new-survey-of-data-science-pros-finds-that-ai-explainability-is-their-top-concern\/","url_meta":{"origin":32625,"position":4},"title":"New Survey of Data Science Pros Finds that AI Explainability is their Top Concern","date":"December 3, 2020","format":false,"excerpt":"In late October 2020, venture capital firm Wing conducted a survey, \"Chief Data Scientist Survey,\" of 320 of the senior-most data scientists at both global corporations and venture-backed startups, in advance of its annual Wing Data Science Summit. AI explainability came out on top as the leading concern.","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2018\/09\/artificial-intelligence-3382507_640.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":5602,"url":"https:\/\/insidebigdata.com\/2013\/11\/07\/leveraging-high-quality-data-discover-de-risk-drugs-target-new-pathologies\/","url_meta":{"origin":32625,"position":5},"title":"Leveraging High Quality Data to Discover Drugs","date":"November 7, 2013","format":false,"excerpt":"The IP & Science Business of Thomson Reuters has announced a strategic initiative with NuMedii. The companies will leverage high-quality data, knowledge and predictive technologies to identify therapeutic candidates with the greatest probability for clinical success.","rel":"","context":"In &quot;Academic&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/32625"}],"collection":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/users\/10513"}],"replies":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/comments?post=32625"}],"version-history":[{"count":0,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/32625\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media\/27298"}],"wp:attachment":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media?parent=32625"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/categories?post=32625"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/tags?post=32625"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}