{"id":33663,"date":"2023-10-16T02:59:00","date_gmt":"2023-10-16T09:59:00","guid":{"rendered":"https:\/\/insidebigdata.com\/?p=33663"},"modified":"2023-10-16T12:44:28","modified_gmt":"2023-10-16T19:44:28","slug":"ethical-web-data-collection-initiative-launches-certification-program","status":"publish","type":"post","link":"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/","title":{"rendered":"Ethical Web Data Collection Initiative Launches Certification Program"},"content":{"rendered":"\n<p>The <a href=\"https:\/\/ethicalwebdata.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Ethical Web Data Collection Initiative<\/a> (EWDCI) is an industry-led consortium of web data collectors focused on strengthening public trust, promoting ethical guidelines, and helping businesses and their customers make informed data extraction choices. The association aims to raise the bar for ethics in the process widely known as \u201cdata scraping\u201d with the goal of\u00a0enhancing trust\u2014a key component of a free, fair, and open Internet.\u00a0This international, industry-led, and member-driven consortium is announcing an accreditation program developed to bring greater accountability and build consumer confidence in the data collection industry.<\/p>\n\n\n\n<p>The EWDCI accreditation program was recently announced wherein eligible companies can receive an\u00a0<em>EWDCI Certified<\/em>\u00a0designation. All companies that receive the EWDCI Certified designation are showing the world that they adhere to these agreed-upon principles and the highest degree of ethics when collecting public web data, while also further advancing the industry\u2019s best practices and accountability.<\/p>\n\n\n\n<p>Companies may apply to become EWDCI Certified. We encourage companies who collect and manage web data to\u00a0join the consortium\u2014and, most importantly, join the conversation to further develop these principles.\u00a0The inaugural group of web data aggregators that have earned EWDCI accreditation includes Coresignal, Oxylabs, ProxyEmpire, Rayobyte, Smartproxy, and Zyte.<\/p>\n\n\n\n<p>The EWDCI Certified designation isn\u2019t so much the result of our work but rather the culmination of the first stage of a longer process. The web data collection industry is still young, but it\u2019s growing very quickly. As more data-hungry AI tools fall into corporate and private hands, there is a\u00a0limited opportunity to shape how data-collection practices are developed and perceived.\u00a0This is why the EWDCI is dedicated to defining positive and beneficial uses of the important abilities and potential of data collection and aggregation at scale.<\/p>\n\n\n\n<p>The EWDCI is now focused on furthering the consortium\u2019s mission and scope of practice through the acquisition of public commentary on various topics, which include:<\/p>\n\n\n\n<ul>\n<li>How scraped data can be used to ethically train large language models (LLMs) and generative AI models<\/li>\n\n\n\n<li>Government access to data and due process<\/li>\n\n\n\n<li>Balance between scrapers and target websites<\/li>\n\n\n\n<li>Privacy compliance when scraping personal data<\/li>\n\n\n\n<li>Preventing tactics that undermine consent and consumer choice<\/li>\n\n\n\n<li>Anti-stalkerware efforts<\/li>\n<\/ul>\n\n\n\n<p><em>\u201cThe EWDCI seal is a crucial stamp of approval, but it\u2019s also a way to build industry-led influence with a clear goal of making the free and open Internet a better and safer place,\u201d said Christian Dawson, Executive Director of the i2Coalition.<\/em><\/p>\n\n\n\n<p><em>Sign up for the free insideBIGDATA&nbsp;<a href=\"http:\/\/inside-bigdata.com\/newsletter\/\" target=\"_blank\" rel=\"noreferrer noopener\">newsletter<\/a>.<\/em><\/p>\n\n\n\n<p><em>Join us on Twitter:&nbsp;<a href=\"https:\/\/twitter.com\/InsideBigData1\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/twitter.com\/InsideBigData1<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on LinkedIn:&nbsp;<a href=\"https:\/\/www.linkedin.com\/company\/insidebigdata\/\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.linkedin.com\/company\/insidebigdata\/<\/a><\/em><\/p>\n\n\n\n<p><em>Join us on Facebook:&nbsp;<a href=\"https:\/\/www.facebook.com\/insideBIGDATANOW\" target=\"_blank\" rel=\"noreferrer noopener\">https:\/\/www.facebook.com\/insideBIGDATANOW<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>The Ethical Web Data Collection Initiative (EWDCI) is an industry-led consortium of web data collectors focused on strengthening public trust, promoting ethical guidelines, and helping businesses and their customers make informed data extraction choices. The association aims to raise the bar for ethics in the process widely known as \u201cdata scraping\u201d with the goal of\u00a0enhancing trust\u2014a key component of a free, fair, and open Internet.\u00a0<\/p>\n","protected":false},"author":10513,"featured_media":33238,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":""},"categories":[115,182,180,268,56,1],"tags":[280,133,721,96],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Ethical Web Data Collection Initiative Launches Certification Program - insideBIGDATA<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Ethical Web Data Collection Initiative Launches Certification Program - insideBIGDATA\" \/>\n<meta property=\"og:description\" content=\"The Ethical Web Data Collection Initiative (EWDCI) is an industry-led consortium of web data collectors focused on strengthening public trust, promoting ethical guidelines, and helping businesses and their customers make informed data extraction choices. The association aims to raise the bar for ethics in the process widely known as \u201cdata scraping\u201d with the goal of\u00a0enhancing trust\u2014a key component of a free, fair, and open Internet.\u00a0\" \/>\n<meta property=\"og:url\" content=\"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/\" \/>\n<meta property=\"og:site_name\" content=\"insideBIGDATA\" \/>\n<meta property=\"article:publisher\" content=\"http:\/\/www.facebook.com\/insidebigdata\" \/>\n<meta property=\"article:published_time\" content=\"2023-10-16T09:59:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-10-16T19:44:28+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2023\/08\/Data_shutterstock_1055190668_special.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1100\" \/>\n\t<meta property=\"og:image:height\" content=\"550\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Editorial Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:site\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Editorial Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/\",\"url\":\"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/\",\"name\":\"Ethical Web Data Collection Initiative Launches Certification Program - insideBIGDATA\",\"isPartOf\":{\"@id\":\"https:\/\/insidebigdata.com\/#website\"},\"datePublished\":\"2023-10-16T09:59:00+00:00\",\"dateModified\":\"2023-10-16T19:44:28+00:00\",\"author\":{\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9\"},\"breadcrumb\":{\"@id\":\"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/insidebigdata.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Ethical Web Data Collection Initiative Launches Certification Program\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/insidebigdata.com\/#website\",\"url\":\"https:\/\/insidebigdata.com\/\",\"name\":\"insideBIGDATA\",\"description\":\"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/insidebigdata.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9\",\"name\":\"Editorial Team\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g\",\"caption\":\"Editorial Team\"},\"sameAs\":[\"http:\/\/www.insidebigdata.com\"],\"url\":\"https:\/\/insidebigdata.com\/author\/editorial\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Ethical Web Data Collection Initiative Launches Certification Program - insideBIGDATA","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/","og_locale":"en_US","og_type":"article","og_title":"Ethical Web Data Collection Initiative Launches Certification Program - insideBIGDATA","og_description":"The Ethical Web Data Collection Initiative (EWDCI) is an industry-led consortium of web data collectors focused on strengthening public trust, promoting ethical guidelines, and helping businesses and their customers make informed data extraction choices. The association aims to raise the bar for ethics in the process widely known as \u201cdata scraping\u201d with the goal of\u00a0enhancing trust\u2014a key component of a free, fair, and open Internet.\u00a0","og_url":"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/","og_site_name":"insideBIGDATA","article_publisher":"http:\/\/www.facebook.com\/insidebigdata","article_published_time":"2023-10-16T09:59:00+00:00","article_modified_time":"2023-10-16T19:44:28+00:00","og_image":[{"width":1100,"height":550,"url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2023\/08\/Data_shutterstock_1055190668_special.jpg","type":"image\/jpeg"}],"author":"Editorial Team","twitter_card":"summary_large_image","twitter_creator":"@insideBigData","twitter_site":"@insideBigData","twitter_misc":{"Written by":"Editorial Team","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/","url":"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/","name":"Ethical Web Data Collection Initiative Launches Certification Program - insideBIGDATA","isPartOf":{"@id":"https:\/\/insidebigdata.com\/#website"},"datePublished":"2023-10-16T09:59:00+00:00","dateModified":"2023-10-16T19:44:28+00:00","author":{"@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9"},"breadcrumb":{"@id":"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/insidebigdata.com\/2023\/10\/16\/ethical-web-data-collection-initiative-launches-certification-program\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/insidebigdata.com\/"},{"@type":"ListItem","position":2,"name":"Ethical Web Data Collection Initiative Launches Certification Program"}]},{"@type":"WebSite","@id":"https:\/\/insidebigdata.com\/#website","url":"https:\/\/insidebigdata.com\/","name":"insideBIGDATA","description":"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/insidebigdata.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/2949e412c144601cdbcc803bd234e1b9","name":"Editorial Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/insidebigdata.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e137ce7ea40e38bd4d25bb7860cfe3e4?s=96&d=mm&r=g","caption":"Editorial Team"},"sameAs":["http:\/\/www.insidebigdata.com"],"url":"https:\/\/insidebigdata.com\/author\/editorial\/"}]}},"jetpack_featured_media_url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2023\/08\/Data_shutterstock_1055190668_special.jpg","jetpack_shortlink":"https:\/\/wp.me\/p9eA3j-8KX","jetpack-related-posts":[{"id":25712,"url":"https:\/\/insidebigdata.com\/2021\/03\/04\/interview-luminati-ceo-or-lenchner\/","url_meta":{"origin":33663,"position":0},"title":"Interview: Luminati CEO, Or Lenchner","date":"March 4, 2021","format":false,"excerpt":"I recently caught up with Or Lenchner, CEO at Luminati, to discuss his company's Data Collector product, an automated data collection tool, allowing customers to collect the most accurate data at scale quickly, easily, and without getting blocked. The Data Collector integrates and automates all stages of the data collection\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2021\/03\/Or-18086-color-.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":2789,"url":"https:\/\/insidebigdata.com\/2013\/04\/18\/big-data-renaissance-in-north-carolina\/","url_meta":{"origin":33663,"position":1},"title":"Big Data Renaissance in North Carolina","date":"April 18, 2013","format":false,"excerpt":"In Chapel Hill, there are big doings in Big Data these days. A new collaboration, known as the National Consortium for Data Science (NCDS) has been launched at RENCI, (Renaissance Computing Institute) at the University of North Carolina. The consortium has ambitious goals according to a recent RENCI press release,\u2026","rel":"","context":"In &quot;Academic&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":26739,"url":"https:\/\/insidebigdata.com\/2021\/07\/22\/multi-billion-dollar-businesses-benefit-from-web-scraping-can-yours\/","url_meta":{"origin":33663,"position":2},"title":"Multi-Billion Dollar Businesses Benefit From Web Scraping. Can Yours?","date":"July 22, 2021","format":false,"excerpt":"In this contributed article, Andrius Palionis,VP of Enterprise Solutions at Oxylabs, discusses how businesses of all sizes can benefit from web scraping. Billion-dollar businesses got to where they are today by leading the industry in technological innovation. That\u2019s because data continues to increase in importance and literally \u201cfuels\u201d the digital\u2026","rel":"","context":"In &quot;Analytics&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":23526,"url":"https:\/\/insidebigdata.com\/2019\/11\/07\/the-beginners-guide-to-web-data-integration\/","url_meta":{"origin":33663,"position":3},"title":"The Beginner&#8217;s Guide To Web Data Integration","date":"November 7, 2019","format":false,"excerpt":"In this contributed article, well-known tech journalist Luke Fitzpatrick believes that understanding web data integration is crucial in today\u2019s environment because it gives business owners an opportunity to take advantage of the immense volume of data that\u2019s available and gain key insights that would otherwise be impossible.","rel":"","context":"In &quot;Analytics&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2019\/11\/data_integration_shutterstock_600496559.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":22035,"url":"https:\/\/insidebigdata.com\/2019\/01\/26\/what-is-web-scraping\/","url_meta":{"origin":33663,"position":4},"title":"What is Web Scraping?","date":"January 26, 2019","format":false,"excerpt":"In this contributed article, Hoda Raissi, COO of ParseHub, introduces web scraping and its importance to researchers and to various industries. She also shares her insights on what to look out for when choosing a web scraping tool, and how to make sure it will provide you the data you\u2026","rel":"","context":"In &quot;Big Data&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":7463,"url":"https:\/\/insidebigdata.com\/2014\/02\/19\/challenges-solutions-genomics-age-big-data\/","url_meta":{"origin":33663,"position":5},"title":"Challenges and Solutions for Genomics in the Age of Big Data","date":"February 19, 2014","format":false,"excerpt":"Leading researchers in data science and genomics are recommending strategies to help genomic scientists better manage, share, analyze and archive massive research and clinical data sets in an effort to ensure that the big data explosion results in better health outcomes and faster research discoveries.","rel":"","context":"In &quot;Big Data Hardware&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"_links":{"self":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/33663"}],"collection":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/users\/10513"}],"replies":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/comments?post=33663"}],"version-history":[{"count":0,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/33663\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media\/33238"}],"wp:attachment":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media?parent=33663"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/categories?post=33663"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/tags?post=33663"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}