The Internet of Things (IoT) and Analytics at The Edge

The Internet of Things (IoT) promises to change everything by enabling “smart” environments (homes, cities, hospitals, schools, stores, etc.) and smart products (cars, trucks, airplanes, trains, wind turbines, lawnmowers, etc.). I recently wrote about the importance of moving beyond “connected” to “smart” in a blog titled “Internet of Things: Connected Does Not Equal Smart”. That article discusses the importance of moving beyond merely collecting the data to leveraging this new wealth of IoT data to improve the decisions that these smart environments and products need to make, helping them to self-monitor, self-diagnose and, eventually, self-direct.

One of the key concepts in enabling this transition from connected to smart is the ability to perform “analytics at the edge.” Shawn Rogers, Chief Research Officer at Dell Statistica, made the following point in an Information Management article titled “Will the Citizen Data Scientist Inherit the World?”:

“Organizations are fast coming to the realization that IoT implementations are only going to become more vast and more pervasive, and that as that happens, the traditional analytic model of pulling all data in to a centralized source such as a data warehouse or analytic sandbox is going to make less and less sense.

So, most of the conversations I’m having around IoT analytics today revolve around looking at how companies can flip that model on its head and figure out ways to push the analytics out to the edge. If you can run analytics at the edge, you not only can eliminate the time, bandwidth and expense required to transport the data, but you make it possible to take immediate action in response to the insight. You speed up and simplify the analytic process in a way that’s never been done before.”

So I asked Shawn and his boss John Thompson, General Manager of Advanced Analytics at Dell, to help me understand what exactly they mean by “analytics at the edge.” It really boils down to these questions:

  • Are we really developing analytics at the edge?
  • If not, then what sorts of analytics are we performing at the edge?
  • Where are the analytic models actually being built?
  • And finally, what the heck does “at the edge” really mean?

So let’s start with that last question: what does “at the edge” really mean?

Question #1: What Is “At The Edge”?
“At the edge” refers to the multitude of devices or sensors that are scattered across a network or embedded throughout a product (car, jet engine, CT scanner), each generating data about its own operations and performance.

For example, the current Airbus A350 model has close to 6,000 sensors and generates 2.5 Tb of data per day, while an even newer model, expected to be available in 2020, will capture more than triple that amount! It is becoming more and more common for everyday products to have hundreds if not thousands of embedded sensors that generate readings every couple of seconds on the operations and performance of that particular product (see Figure 1).

Figure 1: Sensors at the Edge
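
To put the aircraft numbers in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes that “2.5 Tb” means 2.5 terabytes (10^12 bytes each) and that the data is spread evenly over 24 hours and across all 6,000 sensors. Neither assumption comes from the source, so treat the output as an order-of-magnitude estimate only.

```python
# Back-of-the-envelope data rates for the aircraft example.
# Assumption: "2.5 Tb" is read as 2.5 terabytes (decimal, 10**12 bytes).
# Assumption: data is produced evenly over the day and across sensors.
SENSORS = 6_000
BYTES_PER_DAY = 2.5 * 10**12
SECONDS_PER_DAY = 24 * 60 * 60


def average_rates(bytes_per_day: float, sensors: int) -> tuple[float, float]:
    """Return (aggregate bytes/s, per-sensor bytes/s), averaged over one day."""
    aggregate = bytes_per_day / SECONDS_PER_DAY
    return aggregate, aggregate / sensors


aggregate, per_sensor = average_rates(BYTES_PER_DAY, SENSORS)
print(f"aggregate:  {aggregate / 1e6:.1f} MB/s")   # about 28.9 MB/s
print(f"per sensor: {per_sensor / 1e3:.1f} kB/s")  # about 4.8 kB/s
```

Under these assumptions, each sensor produces only a few kilobytes per second on average. It is the aggregate stream of tens of megabytes per second, sustained all day, that is expensive to transport.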

But collecting these huge volumes of real-time data does nothing, by itself, to create business advantage. It is what you do with that data that drives the business value, which brings us to…

Question #2: Are We Really Developing Analytics “At The Edge”?
Are we really “performing analytics” (collecting the data, storing the data, preparing the data, running analytic algorithms, validating the analytic goodness of fit and then acting on the results) at the edges, or are we just “executing the analytic models” at the edges? It’s one thing to “execute the analytic models” (e.g., scores, rules, recommendations) at the edges, but something entirely different to actually “perform analytics” at the edges.

Per Shawn and John, “We can deliver analytic models to any end point. We can execute the analytic models in any environment – large or small. We can execute all the steps in performing analytics in a wide range of environments, but there are limits at the edge. The limits are on the robustness of the environment (i.e. cannot deliver an executable to an environment that does not have the memory or processing power to store it or execute it. We cannot change the laws of physics…;-).)”

Question #3: What Sorts Of Analytics Are We Performing At The Edge?
In our airplane example with 6,000 sensors on the plane generating over 2.5 Tb of data per day, how are we performing the analytics at the edge?

Per John and Shawn, if the jet engine has a place to house a Java Virtual Machine (JVM) and an analytic model (e.g., a lightweight rules-based model), then we can execute the model on the engine itself. If the engine instead streams its data to a network, we can execute the analytic model on a gateway or intermediate server (see Figure 2).

Figure 2: Executing Analytic Models at The Edge
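
To make “executing a lightweight rules-based model” concrete, here is a minimal sketch. The source describes deploying models inside a JVM; Python is used here only for brevity. The rule names, sensor fields and thresholds are all hypothetical, chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Rule:
    name: str
    condition: Callable[[dict], bool]  # evaluated against one sensor reading
    action: str


# Hypothetical rules; field names and thresholds are made up for illustration.
RULES = [
    Rule("overheat", lambda r: r["exhaust_temp_c"] > 900, "throttle_back"),
    Rule("vibration", lambda r: r["vibration_mm_s"] > 7.0, "flag_for_inspection"),
]


def execute_model(reading: dict, rules: list[Rule] = RULES) -> list[str]:
    """Return the actions triggered by a single reading, in rule order."""
    return [rule.action for rule in rules if rule.condition(reading)]


readings = [
    {"exhaust_temp_c": 850, "vibration_mm_s": 2.1},
    {"exhaust_temp_c": 950, "vibration_mm_s": 8.3},
]
for reading in readings:
    print(execute_model(reading))
```

Nothing is learned or refitted here: the device only evaluates rules that were decided elsewhere. That is the difference between executing a model and performing analytics.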

Think of the network as having concentric rings. Each ring can have many servers. Each server can do either job: executing an analytic model or building the analytic models. Now think of many networks with concentric rings that interlock at various intersections. Analytics can be at any or all levels, including at the core, in a data center or in the cloud.
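
The concentric-ring picture, combined with the resource limits Shawn and John describe, suggests a simple placement rule: run the model on the ring closest to the data that has room for it. The sketch below is one hypothetical way to express that. The `Node` type, the ring numbering (0 = the device itself) and the memory figures are assumptions, not part of any Dell product.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Node:
    name: str
    ring: int            # assumption: 0 = device itself, larger = closer to the core
    free_memory_mb: int


def place_model(model_size_mb: int, nodes: list[Node]) -> Optional[Node]:
    """Pick the lowest-ring node with enough free memory, or None if none fits."""
    candidates = [n for n in nodes if n.free_memory_mb >= model_size_mb]
    return min(candidates, key=lambda n: n.ring, default=None)


# Hypothetical topology for the jet-engine example.
nodes = [
    Node("engine-sensor-hub", ring=0, free_memory_mb=16),
    Node("aircraft-gateway", ring=1, free_memory_mb=512),
    Node("data-center", ring=2, free_memory_mb=65_536),
]

print(place_model(8, nodes).name)    # engine-sensor-hub
print(place_model(200, nodes).name)  # aircraft-gateway
```

A real deployment would also weigh CPU, latency and connectivity. Memory alone is used here to keep the sketch short.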

Per Shawn, “By working in tandem with Dell Boomi, we’ve given users the ability to deploy JVM’s with the analytic models on any edge device or gateway anywhere on the network or device. This edge scoring capability enables organizations to address nearly any IoT analytics use case by executing the analytic models at the edge of the network where data is being created.”

Question #4: Where Are The Analytic Models Actually Being Built?
Okay, so we “execute” the pre-built models at the edge, but we actually build (test, refine, test, refine) the analytic models by bringing the detailed sensor data back to a central data and analytics environment (a.k.a. the Data Lake). Figure 3, courtesy of Joel Dodd of Pivotal, shows the data flow and the supporting analytics execution.

Figure 3: “At the Edge” Analytic Model Execution
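
The split between building centrally and executing at the edge can be sketched as two small functions that share only a serialized model. This is an illustrative assumption about how such a hand-off might look, not the mechanism used by Dell or Pivotal. The “model” is a simple mean ± k standard deviations band, and the readings are invented.

```python
import json
import statistics


def build_model(history: list[float], k: float = 3.0) -> str:
    """Central step: fit a mean +/- k*stdev band on history and serialize it."""
    mean = statistics.fmean(history)
    stdev = statistics.stdev(history)
    return json.dumps({"low": mean - k * stdev, "high": mean + k * stdev})


def score_at_edge(model_json: str, reading: float) -> bool:
    """Edge step: return True if the reading falls outside the learned band."""
    model = json.loads(model_json)
    return not (model["low"] <= reading <= model["high"])


# Invented historical readings standing in for data-lake history.
history = [70.1, 69.8, 70.4, 70.0, 69.9, 70.2, 70.3, 69.7]
model_json = build_model(history)   # runs centrally, needs the full history

print(score_at_edge(model_json, 70.1))  # False: within the band
print(score_at_edge(model_json, 75.0))  # True: anomalous reading
```

The edge function needs only two numbers and a comparison, while the build step needs the full history. That asymmetry is why the detailed data still flows back to the data lake.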

A final point: even if you are doing all the sensor/IoT analysis at the edges, you are likely still going to want to bring the raw IoT data back into the data lake, both for more extensive analysis and to house the detailed IoT history. For example, we have major economic cycles every 4 to 7 years. You might want to quantify the impact of these economic changes on your network demand and performance. Covering two full cycles would eventually require 8 to 14 years of data. And that’s why you are going to want a data lake as the foundation of the transition from a “connected” IoT world to a “smart” IoT world.

The post The Internet of Things (IoT) and Analytics at The Edge appeared first on InFocus.

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business”, is responsible for setting the strategy and defining the Big Data service line offerings and capabilities for the EMC Global Services organization. As part of Bill’s CTO charter, he is responsible for working with organizations to help them identify where and how to start their big data journeys. He has written several white papers, is an avid blogger and is a frequent speaker on the use of Big Data and advanced analytics to power organizations’ key business initiatives. He also teaches the “Big Data MBA” at the University of San Francisco School of Management.

Bill has nearly three decades of experience in data warehousing, BI and analytics. Bill authored EMC’s Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data Warehouse Institute’s faculty as the head of the analytic applications curriculum.

Previously, Bill was the Vice President of Advertiser Analytics at Yahoo and the Vice President of Analytic Applications at Business Objects.
