{"id":20034,"date":"2018-03-12T06:30:36","date_gmt":"2018-03-12T13:30:36","guid":{"rendered":"https:\/\/insidebigdata.com\/?p=20034"},"modified":"2018-03-05T08:25:24","modified_gmt":"2018-03-05T16:25:24","slug":"data-platform-deep-learning-future","status":"publish","type":"post","link":"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/","title":{"rendered":"Five Data Platform Considerations When Thinking About Your Deep Learning Future"},"content":{"rendered":"<p><em>This guest article from DDN Storage covers five data platform considerations to take into account when exploring the possibilities of deep learning.\u00a0<\/em><\/p>\n<div id=\"attachment_20035\" style=\"width: 288px\" class=\"wp-caption alignleft\"><a href=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/03\/deep-learning.jpg\"><img aria-describedby=\"caption-attachment-20035\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-20035 \" src=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/03\/deep-learning-300x201.jpg\" alt=\"deep learning\" width=\"278\" height=\"186\" \/><\/a><p id=\"caption-attachment-20035\" class=\"wp-caption-text\">With the current maturation of Artificial Intelligence applications and Deep Learning algorithms, many organizations are spinning up initiatives to figure out how they will extract competitive differentiation from their data.<\/p><\/div>\n<p>With the current maturation of Artificial Intelligence applications and Deep Learning algorithms, many organizations are spinning up initiatives to figure out how they will extract competitive differentiation from their data.\u00a0 In fact, many companies have been collecting data over the last 5-10 years with the knowledge that they will probably need it someday, but without the plan for how. We are now on the cusp of widespread adoption of Deep Learning to finally monetize all this data.<\/p>\n<p>Regardless of how the data is acquired, it is at the foundation of these nascent programs \u2013 and so data platforms should be evaluated carefully at the outset to ensure future plans are successful even if based on existing architectures.\u00a0 This requires forward thinking \u2013 gauging how a Deep Learning program will be deployed in production when current processing requirements and data sources may just be a fraction of the size they will be in production instances.\u00a0 Without making these plans now, organizations are at risk at falling behind the competition right when key breakthroughs are anticipated.\u00a0 To have to re-architect the entire Deep Learning infrastructure at the time of deployment could put companies well behind competitors that planned for the future.<\/p>\n<p>To ensure ultimate success, there are five key areas that businesses and research organizations should consider when creating and developing their Deep Learning data platform to ensure better answers, faster time to value, and capability for rapid scaling:<\/p>\n<ol>\n<li><strong>Saturate your AI Platform<\/strong><\/li>\n<\/ol>\n<p>The up-front investment in GPU-enabled Deep Learning compute systems may be taken for granted, but the backing storage systems are central to maximizing answers per day.\u00a0 The correct storage platform will ensure that GPU cycles don\u2019t remain idle due to applications waiting for the storage to respond.\u00a0 The impact to the storage system is vastly different depending upon the application behavior: GPU-enabled in-memory databases have lower start-up times when more quickly populated from the data warehousing area. GPU-accelerated analytics demand large thread counts &#8211; each with low-latency access to small pieces of data. Image-based deep learning for classification, object detection and segmentation benefit from high streaming bandwidth, random access, and, in most cases, fast memory mapped calls. In a similar vein, recurrent networks for text\/speech analysis also benefit from high performance random small file or small I\/O access.<\/p>\n<p>Typical AI compute systems house between four and eight GPUs along with high-end networking, often with multiple Infiniband ports for hundreds of Gbps (Gigabits per second) of low-latency bandwidth via RDMA (Remote Direct Memory Access) I\/O protocol.\u00a0 This means that any storage system under consideration should also leverage RDMA-capable networks such as Infiniband, which require no work to be done by CPUs, caches, or context switches vastly reducing latency and enabling far faster message transfer rates and eliminating application wait times.<\/p>\n<ol start=\"2\">\n<li><strong>Build massive ingest capability to cope with future scaling of data feeds.<\/strong><\/li>\n<\/ol>\n<p>Gathering data into a central repository will be a critical factor in creating a source that the Deep Learning model can run against once it is ready for production. Collecting data into this repository will require the ability quickly ingest information from a wide variety of sources.\u00a0 Ingest for storage systems means write performance and coping with large concurrent streams from distributed sources at huge scale. Fruitful AI implementations are not only a means to gain insight from data, but also can gather increasingly more data to aid in the continuous refinement of any model. Chosen storage systems must have highly balanced I\/O, performing writes just as fast as reads. Data sources developed to augment and improve acquisition need to satisfy all data gathering demands, while concurrently serving machine learning compute platforms.<\/p>\n<ol start=\"3\">\n<li><strong>Flexible and fast access to data <\/strong><\/li>\n<\/ol>\n<p>Flexibility covers multiple factors when it comes to AI storage platforms.\u00a0 In the end, ingesting, transforming, splitting, and otherwise manipulating large datasets is equally import to Deep Learning as pushing that data through neural network applications. Flexibility for organizations entering AI also implies good performance regardless of the choice of data formats. Considered storage platforms should support both support strong memory-mapped file performance and fast small-file access, useful when moving between all kinds of structured and unstructured data.<\/p>\n<p>[clickToTweet tweet=&#8221;DDN Storage &#8211; Delivering performance to the AI app is what matters, not how fast the storage can push out data.\u00a0 &#8221; quote=&#8221;DDN Storage &#8211; Delivering performance to the AI app is what matters, not how fast the storage can push out data.\u00a0 &#8220;]<\/p>\n<p>As an AI-enabled data center moves from initial prototyping and testing towards production and scale, a flexible data platform should provide the means to scale in any one of multiple areas: performance, capacity, ingest capability, Flash-HDD ratio and responsiveness for data scientists. Such flexibility also implies expansion of a namespace without disruption, eliminating data copies and complexity during growth phases.<\/p>\n<ol start=\"4\">\n<li><strong>Start Small, but Scale Simply and Economically<\/strong><\/li>\n<\/ol>\n<p>Scalability is measurable in terms of not only performance, but also manageability and economics. Successful AI program should be designed to start with a few TBs (terabytes) of data, but easily ramp to multiple PBs (petabytes) without architecting the environment.<\/p>\n<p>One way to scale economically is to optimize the use of storage media depending on workload.\u00a0 While Flash should always be the media for live AI training data, it can become unfeasible to hold hundreds of TBs or PBs of data all on Flash, but many alternatives just don\u2019t work at scale.\u00a0 Hybrid models often suffer limitations around data management and data movement and loosely coupled architectures that combine all-flash arrays with separate HDD-based data lakes present complicated environments for managing hot data efficiently.<\/p>\n<blockquote><p>One way to scale economically is to optimize the use of storage media depending on workload.<\/p><\/blockquote>\n<p>AI platform architects should consider tightly integrated, scale-out hybrid architectures designed specifically for AI. Start small with a flash deployment and then choose your scaling strategy according to demand; either scale with flash only, or combine with deeply integrated HDD pools. The integration and data movement techniques are key here, make sure to select solutions with the utmost transparency to users.<\/p>\n<ol start=\"5\">\n<li><strong>Partner with a vendor who understands the whole environment, not just storage.<\/strong><\/li>\n<\/ol>\n<p>Delivering performance to the AI application is what matters, not how fast the storage can push out data.\u00a0 The chosen storage platform vendor must recognize that integration and support services span the whole environment, beyond just storage, deliver results faster. Given the sheer processing power of AI compute platforms \u2013 each system akin to a mini-Super Computer \u2014 the vendor must deliver high-performance solutions for the most demanding data-at-scale workflows and partner closely with you as your AI requirements evolve.<\/p>\n<p><em>This guest article comes from <a href=\"http:\/\/www.ddn.com\/\" target=\"_blank\" rel=\"noopener\">DDN Storage,<\/a> a provider of high performance, high capacity big data storage systems, processing solutions and services to data-intensive, global organizations.<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>With the current maturation of Artificial Intelligence applications and Deep Learning algorithms, many organizations are spinning up initiatives to figure out how they will extract competitive differentiation from their data.\u00a0This guest article comes from DDN Storage, a provider of high performance, high capacity big data storage systems, processing solutions and services to data-intensive, global organizations.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"footnotes":""},"categories":[115,181,205,87,180,61],"tags":[370,601,264,95],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Deep Learning: 5 Data Platform Considerations When Considering AI Possibilities<\/title>\n<meta name=\"description\" content=\"This guest article from DDN Storage explores data platform considerations to take into account with exploring the possibilities of deep learning.\u00a0\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Deep Learning: 5 Data Platform Considerations When Considering AI Possibilities\" \/>\n<meta property=\"og:description\" content=\"This guest article from DDN Storage explores data platform considerations to take into account with exploring the possibilities of deep learning.\u00a0\" \/>\n<meta property=\"og:url\" content=\"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/\" \/>\n<meta property=\"og:site_name\" content=\"insideBIGDATA\" \/>\n<meta property=\"article:publisher\" content=\"http:\/\/www.facebook.com\/insidebigdata\" \/>\n<meta property=\"article:published_time\" content=\"2018-03-12T13:30:36+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2018-03-05T16:25:24+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/03\/deep-learning-300x201.jpg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:site\" content=\"@insideBigData\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/\",\"url\":\"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/\",\"name\":\"Deep Learning: 5 Data Platform Considerations When Considering AI Possibilities\",\"isPartOf\":{\"@id\":\"https:\/\/insidebigdata.com\/#website\"},\"datePublished\":\"2018-03-12T13:30:36+00:00\",\"dateModified\":\"2018-03-05T16:25:24+00:00\",\"author\":{\"@id\":\"\"},\"description\":\"This guest article from DDN Storage explores data platform considerations to take into account with exploring the possibilities of deep learning.\u00a0\",\"breadcrumb\":{\"@id\":\"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/insidebigdata.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Five Data Platform Considerations When Thinking About Your Deep Learning Future\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/insidebigdata.com\/#website\",\"url\":\"https:\/\/insidebigdata.com\/\",\"name\":\"insideBIGDATA\",\"description\":\"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/insidebigdata.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"\",\"url\":\"https:\/\/insidebigdata.com\/author\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Deep Learning: 5 Data Platform Considerations When Considering AI Possibilities","description":"This guest article from DDN Storage explores data platform considerations to take into account with exploring the possibilities of deep learning.\u00a0","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/","og_locale":"en_US","og_type":"article","og_title":"Deep Learning: 5 Data Platform Considerations When Considering AI Possibilities","og_description":"This guest article from DDN Storage explores data platform considerations to take into account with exploring the possibilities of deep learning.\u00a0","og_url":"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/","og_site_name":"insideBIGDATA","article_publisher":"http:\/\/www.facebook.com\/insidebigdata","article_published_time":"2018-03-12T13:30:36+00:00","article_modified_time":"2018-03-05T16:25:24+00:00","og_image":[{"url":"https:\/\/insidebigdata.com\/wp-content\/uploads\/2018\/03\/deep-learning-300x201.jpg"}],"twitter_card":"summary_large_image","twitter_creator":"@insideBigData","twitter_site":"@insideBigData","twitter_misc":{"Written by":"","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/","url":"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/","name":"Deep Learning: 5 Data Platform Considerations When Considering AI Possibilities","isPartOf":{"@id":"https:\/\/insidebigdata.com\/#website"},"datePublished":"2018-03-12T13:30:36+00:00","dateModified":"2018-03-05T16:25:24+00:00","author":{"@id":""},"description":"This guest article from DDN Storage explores data platform considerations to take into account with exploring the possibilities of deep learning.\u00a0","breadcrumb":{"@id":"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/insidebigdata.com\/2018\/03\/12\/data-platform-deep-learning-future\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/insidebigdata.com\/"},{"@type":"ListItem","position":2,"name":"Five Data Platform Considerations When Thinking About Your Deep Learning Future"}]},{"@type":"WebSite","@id":"https:\/\/insidebigdata.com\/#website","url":"https:\/\/insidebigdata.com\/","name":"insideBIGDATA","description":"Your Source for AI, Data Science, Deep Learning &amp; Machine Learning Strategies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/insidebigdata.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"","url":"https:\/\/insidebigdata.com\/author\/"}]}},"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p9eA3j-5d8","jetpack-related-posts":[{"id":20165,"url":"https:\/\/insidebigdata.com\/2018\/03\/29\/ddn-storage-announces-groundbreaking-33gb-s-performance-nvidia-dgx-servers-accelerate-machine-learning-ai-initiatives\/","url_meta":{"origin":20034,"position":0},"title":"DDN Storage Announces Groundbreaking 33GB\/s Performance to NVIDIA DGX Servers to Accelerate Machine Learning and AI Initiatives","date":"March 29, 2018","format":false,"excerpt":"DataDirect Networks (DDN\u00ae) today announced its\u00a0 EXAScaler DGX solution, a unique solution that delivers leading-edge performance using a new optimized, accelerated client integrating tightly and seamlessly with the NVIDIA DGX Architecture. Using the EXAScaler\u00ae ES14KX\u00ae high-performance all-flash array, the new solution smashed existing records by demonstrating a massive 33GB\/s of\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":21612,"url":"https:\/\/insidebigdata.com\/2018\/12\/05\/insidebigdata-guide-data-platforms-artificial-intelligence-deep-learning-part-4\/","url_meta":{"origin":20034,"position":1},"title":"insideBIGDATA Guide to Data Platforms for Artificial Intelligence and Deep Learning \u2013 Part 4","date":"December 5, 2018","format":false,"excerpt":"With AI and DL, storage is cornerstone to handling the deluge of data constantly generated in today\u2019s hyperconnected world. It is a vehicle that captures and shares data to create business value. In this technology guide, insideBIGDATA Guide to Data Platforms for Artificial Intelligence and Deep Learning, we\u2019ll see how\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2018\/11\/TensorFlow_benchmark.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":22340,"url":"https:\/\/insidebigdata.com\/2019\/03\/21\/transformative-solutions-for-accelerating-ai-analytics-and-deep-learning-at-nvidia-gtc19\/","url_meta":{"origin":20034,"position":2},"title":"Transformative Solutions for Accelerating AI, Analytics and Deep Learning at NVIDIA #GTC19","date":"March 21, 2019","format":false,"excerpt":"One pivotal message received by attendees of this week's NVIDIA GPU Technology Conference (GTC) in Silicon Valley is the importance of game-changing storage solutions and applications that empower users to accomplish their most challenging AI objectives.","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":23348,"url":"https:\/\/insidebigdata.com\/2019\/10\/01\/insidebigdata-guide-to-optimized-storage-for-ai-and-deep-learning-workloads-part-3\/","url_meta":{"origin":20034,"position":3},"title":"insideBIGDATA Guide to Optimized Storage for AI and Deep Learning Workloads \u2013 Part 3","date":"October 1, 2019","format":false,"excerpt":"Artificial Intelligence (AI) and Deep Learning (DL) represent some of the most demanding workloads in modern computing history as they present unique challenges to compute, storage and network resources. In this technology guide, insideBIGDATA Guide to Optimized Storage for AI and Deep Learning Workloads, we\u2019ll see how traditional file storage\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2019\/09\/IBD_DDNSRCover_2019-09-10_8-36-01.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":21638,"url":"https:\/\/insidebigdata.com\/2018\/12\/12\/insidebigdata-guide-data-platforms-artificial-intelligence-deep-learning-part-5\/","url_meta":{"origin":20034,"position":4},"title":"insideBIGDATA Guide to Data Platforms for Artificial Intelligence and Deep Learning \u2013 Part 5","date":"December 12, 2018","format":false,"excerpt":"With AI and DL, storage is cornerstone to handling the deluge of data constantly generated in today\u2019s hyperconnected world. It is a vehicle that captures and shares data to create business value. In this technology guide, insideBIGDATA Guide to Data Platforms for Artificial Intelligence and Deep Learning, we\u2019ll see how\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2018\/12\/Life-Sciences-use-case-image.png?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]},{"id":33385,"url":"https:\/\/insidebigdata.com\/2023\/09\/14\/ddn-storage-solutions-deliver-700-gains-in-ai-and-machine-learning-for-image-segmentation-and-natural-language-processing\/","url_meta":{"origin":20034,"position":5},"title":"DDN Storage Solutions Deliver 700% Gains in AI and Machine Learning for Image Segmentation and Natural Language Processing","date":"September 14, 2023","format":false,"excerpt":"DDN\u00ae, a leader in artificial intelligence (AI) and multi-cloud data management solutions, announced impressive performance results of its AI storage platform for the inaugural AI storage benchmarks released this week by MLCommons Association. The MLPerfTM\u00a0Storage v0.5 benchmark results confirm DDN storage solutions as the gold standard for AI and machine\u2026","rel":"","context":"In &quot;AI Deep Learning&quot;","img":{"alt_text":"","src":"https:\/\/i0.wp.com\/insidebigdata.com\/wp-content\/uploads\/2023\/08\/Data_shutterstock_1055190668_special.jpg?resize=350%2C200&ssl=1","width":350,"height":200},"classes":[]}],"_links":{"self":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/20034"}],"collection":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/comments?post=20034"}],"version-history":[{"count":0,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/posts\/20034\/revisions"}],"wp:attachment":[{"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/media?parent=20034"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/categories?post=20034"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/insidebigdata.com\/wp-json\/wp\/v2\/tags?post=20034"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}