Welcome!

Machine Learning Authors: Elizabeth White, Yeshim Deniz, Rene Buest, Nate Vickery, Pat Romanski

Related Topics: @ThingsExpo, Machine Learning , @BigDataExpo

@ThingsExpo: Blog Post

Are You Thinking About Big Data When Doing IoT? – You Should Be | @ThingsExpo #ML #IoT #M2M #BigData

Based on all estimates by industry analysts and current trends, the IoT is growing at an incredible rate and is here to stay

Are You Thinking About Big Data When Doing IoT? - You Should Be

There is no denying the Internet of Things (IoT) is a hot topic. Gartner positions IoT as being at the peak of the ‘hype cycle.' From a size perspective, these ‘Things' can be anything, from a small sensor to a large appliance, and everything in between. The data transmitted by these devices, for the most part, tends to be small - tiny packets of information destined for consumption and analysis, bringing value to the business.

Is there hype? Yes. As with any new technology, there is always a level of hype involved. Are the data packets involved small? For the most part, yes (there are always exceptions). While both may be true, The Internet of Things is growing at breakneck speed. No matter which analyst you read, the growth predictions are staggering. Gartner predicts that we will hit over 20 billion (with a B) devices by 2020. IHS predicts even larger numbers, with 30 billion by 2020, and over 75 billion devices by 2025. No matter what, that's a lot of devices, and no matter how small the packets, multiplied by the number of devices, that's a lot of data.

It's not the things, it's the data
What I find interesting is that many times the focus of discussion when talking IoT are the devices, the sensors, the hardware itself. The latest Fitbit or smartwatch. The new smart appliances, the new smart or self-driving cars (which are amalgamations of many ‘things'). Yes, those technologies are interesting (okay, fascinating, I will admit, my inner geek loves getting down into the actual technologies), but when we are looking at the world of IoT, we should take a step back, look at the big picture. What value are these devices providing?

What I am about to say may sound like heresy to many. IoT is not about the devices. The devices are not the end goal. The devices are tools, mechanisms, conduits, conduits of information. They provide (and consume) information. Massive amounts of information. A former colleague of mine for years was always fond of saying, ‘Ed, It's all about the data.' In the burgeoning world of IoT that statement identifies the true business value of IoT. Information.

Watching out for potholes
Recently, Ford announced they were testing a pothole detector and alert system for cars. Living in New England, let me tell you, potholes are the bane of a car driver's existence. Many a car ends up in the repair shop during pothole season. Given that, the concept is intriguing. The manufacturer has cameras mounted on the vehicles. The cameras scan the roadway around the vehicle looking for signs of potholes. Image recognition allows it to make this determination. If a pothole is detected, the system will allow the car to avoid hitting the pothole, and thus potential damage to the vehicle.

Now some would say, ‘what does that have to do with big data?' The system is self-contained within the vehicle. To be useful, the system needs to react in near real-time to the situation. It doesn't have time to send all the data back to the cloud for analysis to determine if there is a pothole. Also, what if it loses network connection? All valid points. Let's take a step back, and look at the bigger picture.

  • How does the system recognize a pothole? Image recognition. What does image recognition need? Lots of data about what potholes look like. Machine learning algorithms help it determine if its seeing a pothole, and those algorithms need data to do that.
  • What will be the source of those pothole images? Wouldn't it be useful if images of any potholes the system encounters become part of the source data for the image recognition system to improve its detection? Wouldn't it be useful to provide that back to a central location to improve the algorithms and detection software, which could then be sent back to all the other vehicles to improve their capability?
  • What about all the cars without the system? Wouldn't it be nice if the pothole locations were flagged to the various GPS applications people use so they are aware of the pothole and its location?
  • What about the local public works department? Wouldn't it be nice if they were automatically notified about the new pothole identified so it could be repaired?

Ingestion considerations
Given the importance of the data to the success of any IoT implementation, ingesting that information is critical to the successful implementation.

  • Data Quality - In the world of data, quality has always been an important consideration. Data cleansing and scrubbing is standard practice already in many organizations. It has become critical for IoT implementations. Ingesting dirty data into even the best IoT implementation will bring it to a grinding halt.
  • Data Volume - As I have mentioned already, many times the data packets for an individual device/sensor are small. That being said, multiplied by the sheer number of devices, the volume can quickly overwhelm a network or storage environment if not planned for appropriately. These considerations also must take into account location
  • Data Timeliness - Besides volume, new and timely data is also a consideration. In the pothole example, if the last update was weeks ago, how valid is the location anymore?
  • Data Pedigree - Where did the data come from? Is it a valid source? The pedigree is less important when using internal systems, as the source is well known, but IoT systems, by their nature, frequently will be getting their data from devices and sources outside the normal perimeter. This requires extra effort to ensure you trust the information being consumed.

No technology negates the need for good design and planning
Based on all estimates by industry analysts and current trends, the Internet of Things is growing at an incredible rate and is here to stay. There is a big radar blip of data outside your data center that is not going anywhere. That data provides great value, but also many challenges that need to be taken into consideration. If you are doing IoT and are not looking at Big Data, you are missing an opportunity and business value. As many of my readers have heard me say frequently, no technology negates the need for good design and planning. The Internet of Things and the accompanying Big Data demands it if you are to be successful.

More Stories By Ed Featherston

Ed Featherston is VP, Principal Architect at Cloud Technology Partners. He brings 35 years of technology experience in designing, building, and implementing large complex solutions. He has significant expertise in systems integration, Internet/intranet, and cloud technologies. He has delivered projects in various industries, including financial services, pharmacy, government and retail.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@CloudExpo Stories
"With Digital Experience Monitoring what used to be a simple visit to a web page has exploded into app on phones, data from social media feeds, competitive benchmarking - these are all components that are only available because of some type of digital asset," explained Leo Vasiliou, Director of Web Performance Engineering at Catchpoint Systems, in this SYS-CON.tv interview at DevOps Summit at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
21st International Cloud Expo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Me...
SYS-CON Events announced today that DXWorldExpo has been named “Global Sponsor” of SYS-CON's 21st International Cloud Expo, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Digital Transformation is the key issue driving the global enterprise IT business. Digital Transformation is most prominent among Global 2000 enterprises and government institutions.
SYS-CON Events announced today that Datera, that offers a radically new data management architecture, has been named "Exhibitor" of SYS-CON's 21st International Cloud Expo ®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Datera is transforming the traditional datacenter model through modern cloud simplicity. The technology industry is at another major inflection point. The rise of mobile, the Internet of Things, data storage and Big...
Kubernetes is an open source system for automating deployment, scaling, and management of containerized applications. Kubernetes was originally built by Google, leveraging years of experience with managing container workloads, and is now a Cloud Native Compute Foundation (CNCF) project. Kubernetes has been widely adopted by the community, supported on all major public and private cloud providers, and is gaining rapid adoption in enterprises. However, Kubernetes may seem intimidating and complex ...
"Outscale was founded in 2010, is based in France, is a strategic partner to Dassault Systémes and has done quite a bit of work with divisions of Dassault," explained Jackie Funk, Digital Marketing exec at Outscale, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"We focus on SAP workloads because they are among the most powerful but somewhat challenging workloads out there to take into public cloud," explained Swen Conrad, CEO of Ocean9, Inc., in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"We are still a relatively small software house and we are focusing on certain industries like FinTech, med tech, energy and utilities. We help our customers with their digital transformation," noted Piotr Stawinski, Founder and CEO of EARP Integration, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"I think DevOps is now a rambunctious teenager – it’s starting to get a mind of its own, wanting to get its own things but it still needs some adult supervision," explained Thomas Hooker, VP of marketing at CollabNet, in this SYS-CON.tv interview at DevOps Summit at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"We've been engaging with a lot of customers including Panasonic, we've been involved with Cisco and now we're working with the U.S. government - the Department of Homeland Security," explained Peter Jung, Chief Product Officer at Pulzze Systems, in this SYS-CON.tv interview at @ThingsExpo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"We're here to tell the world about our cloud-scale infrastructure that we have at Juniper combined with the world-class security that we put into the cloud," explained Lisa Guess, VP of Systems Engineering at Juniper Networks, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
Your homes and cars can be automated and self-serviced. Why can't your storage? From simply asking questions to analyze and troubleshoot your infrastructure, to provisioning storage with snapshots, recovery and replication, your wildest sci-fi dream has come true. In his session at @DevOpsSummit at 20th Cloud Expo, Dan Florea, Director of Product Management at Tintri, provided a ChatOps demo where you can talk to your storage and manage it from anywhere, through Slack and similar services with...
As enterprise cloud becomes the norm, businesses and government programs must address compounded regulatory compliance related to data privacy and information protection. The most recent, Controlled Unclassified Information and the EU’s GDPR have board level implications and companies still struggle with demonstrating due diligence. Developers and DevOps leaders, as part of the pre-planning process and the associated supply chain, could benefit from updating their code libraries and design by in...
"Peak 10 is a hybrid infrastructure provider across the nation. We are in the thick of things when it comes to hybrid IT," explained Michael Fuhrman, Chief Technology Officer at Peak 10, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
SYS-CON Events announced today that Calligo, an innovative cloud service provider offering mid-sized companies the highest levels of data privacy and security, has been named "Bronze Sponsor" of SYS-CON's 21st International Cloud Expo ®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Calligo offers unparalleled application performance guarantees, commercial flexibility and a personalised support service from its globally located cloud plat...
"We are an IT services solution provider and we sell software to support those solutions. Our focus and key areas are around security, enterprise monitoring, and continuous delivery optimization," noted John Balsavage, President of A&I Solutions, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"We were founded in 2003 and the way we were founded was about good backup and good disaster recovery for our clients, and for the last 20 years we've been pretty consistent with that," noted Marc Malafronte, Territory Manager at StorageCraft, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
There is a huge demand for responsive, real-time mobile and web experiences, but current architectural patterns do not easily accommodate applications that respond to events in real time. Common solutions using message queues or HTTP long-polling quickly lead to resiliency, scalability and development velocity challenges. In his session at 21st Cloud Expo, Ryland Degnan, a Senior Software Engineer on the Netflix Edge Platform team, will discuss how by leveraging a reactive stream-based protocol,...
"We are focused on SAP running in the clouds, to make this super easy because we believe in the tremendous value of those powerful worlds - SAP and the cloud," explained Frank Stienhans, CTO of Ocean9, Inc., in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
"DivvyCloud as a company set out to help customers automate solutions to the most common cloud problems," noted Jeremy Snyder, VP of Business Development at DivvyCloud, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.