Welcome!

Machine Learning Authors: Liz McMillan, Pat Romanski, Yeshim Deniz, Elizabeth White, Leon Adato

Related Topics: @ThingsExpo, Machine Learning , @BigDataExpo

@ThingsExpo: Blog Post

Are You Thinking About Big Data When Doing IoT? – You Should Be | @ThingsExpo #ML #IoT #M2M #BigData

Based on all estimates by industry analysts and current trends, the IoT is growing at an incredible rate and is here to stay

Are You Thinking About Big Data When Doing IoT? - You Should Be

There is no denying the Internet of Things (IoT) is a hot topic. Gartner positions IoT as being at the peak of the ‘hype cycle.' From a size perspective, these ‘Things' can be anything, from a small sensor to a large appliance, and everything in between. The data transmitted by these devices, for the most part, tends to be small - tiny packets of information destined for consumption and analysis, bringing value to the business.

Is there hype? Yes. As with any new technology, there is always a level of hype involved. Are the data packets involved small? For the most part, yes (there are always exceptions). While both may be true, The Internet of Things is growing at breakneck speed. No matter which analyst you read, the growth predictions are staggering. Gartner predicts that we will hit over 20 billion (with a B) devices by 2020. IHS predicts even larger numbers, with 30 billion by 2020, and over 75 billion devices by 2025. No matter what, that's a lot of devices, and no matter how small the packets, multiplied by the number of devices, that's a lot of data.

It's not the things, it's the data
What I find interesting is that many times the focus of discussion when talking IoT are the devices, the sensors, the hardware itself. The latest Fitbit or smartwatch. The new smart appliances, the new smart or self-driving cars (which are amalgamations of many ‘things'). Yes, those technologies are interesting (okay, fascinating, I will admit, my inner geek loves getting down into the actual technologies), but when we are looking at the world of IoT, we should take a step back, look at the big picture. What value are these devices providing?

What I am about to say may sound like heresy to many. IoT is not about the devices. The devices are not the end goal. The devices are tools, mechanisms, conduits, conduits of information. They provide (and consume) information. Massive amounts of information. A former colleague of mine for years was always fond of saying, ‘Ed, It's all about the data.' In the burgeoning world of IoT that statement identifies the true business value of IoT. Information.

Watching out for potholes
Recently, Ford announced they were testing a pothole detector and alert system for cars. Living in New England, let me tell you, potholes are the bane of a car driver's existence. Many a car ends up in the repair shop during pothole season. Given that, the concept is intriguing. The manufacturer has cameras mounted on the vehicles. The cameras scan the roadway around the vehicle looking for signs of potholes. Image recognition allows it to make this determination. If a pothole is detected, the system will allow the car to avoid hitting the pothole, and thus potential damage to the vehicle.

Now some would say, ‘what does that have to do with big data?' The system is self-contained within the vehicle. To be useful, the system needs to react in near real-time to the situation. It doesn't have time to send all the data back to the cloud for analysis to determine if there is a pothole. Also, what if it loses network connection? All valid points. Let's take a step back, and look at the bigger picture.

  • How does the system recognize a pothole? Image recognition. What does image recognition need? Lots of data about what potholes look like. Machine learning algorithms help it determine if its seeing a pothole, and those algorithms need data to do that.
  • What will be the source of those pothole images? Wouldn't it be useful if images of any potholes the system encounters become part of the source data for the image recognition system to improve its detection? Wouldn't it be useful to provide that back to a central location to improve the algorithms and detection software, which could then be sent back to all the other vehicles to improve their capability?
  • What about all the cars without the system? Wouldn't it be nice if the pothole locations were flagged to the various GPS applications people use so they are aware of the pothole and its location?
  • What about the local public works department? Wouldn't it be nice if they were automatically notified about the new pothole identified so it could be repaired?

Ingestion considerations
Given the importance of the data to the success of any IoT implementation, ingesting that information is critical to the successful implementation.

  • Data Quality - In the world of data, quality has always been an important consideration. Data cleansing and scrubbing is standard practice already in many organizations. It has become critical for IoT implementations. Ingesting dirty data into even the best IoT implementation will bring it to a grinding halt.
  • Data Volume - As I have mentioned already, many times the data packets for an individual device/sensor are small. That being said, multiplied by the sheer number of devices, the volume can quickly overwhelm a network or storage environment if not planned for appropriately. These considerations also must take into account location
  • Data Timeliness - Besides volume, new and timely data is also a consideration. In the pothole example, if the last update was weeks ago, how valid is the location anymore?
  • Data Pedigree - Where did the data come from? Is it a valid source? The pedigree is less important when using internal systems, as the source is well known, but IoT systems, by their nature, frequently will be getting their data from devices and sources outside the normal perimeter. This requires extra effort to ensure you trust the information being consumed.

No technology negates the need for good design and planning
Based on all estimates by industry analysts and current trends, the Internet of Things is growing at an incredible rate and is here to stay. There is a big radar blip of data outside your data center that is not going anywhere. That data provides great value, but also many challenges that need to be taken into consideration. If you are doing IoT and are not looking at Big Data, you are missing an opportunity and business value. As many of my readers have heard me say frequently, no technology negates the need for good design and planning. The Internet of Things and the accompanying Big Data demands it if you are to be successful.

More Stories By Ed Featherston

Ed Featherston is VP, Principal Architect at Cloud Technology Partners. He brings 35 years of technology experience in designing, building, and implementing large complex solutions. He has significant expertise in systems integration, Internet/intranet, and cloud technologies. He has delivered projects in various industries, including financial services, pharmacy, government and retail.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@CloudExpo Stories
In his session at 20th Cloud Expo, Mike Johnston, an infrastructure engineer at Supergiant.io, discussed how to use Kubernetes to set up a SaaS infrastructure for your business. Mike Johnston is an infrastructure engineer at Supergiant.io with over 12 years of experience designing, deploying, and maintaining server and workstation infrastructure at all scales. He has experience with brick and mortar data centers as well as cloud providers like Digital Ocean, Amazon Web Services, and Rackspace. H...
The question before companies today is not whether to become intelligent, it’s a question of how and how fast. The key is to adopt and deploy an intelligent application strategy while simultaneously preparing to scale that intelligence. In her session at 21st Cloud Expo, Sangeeta Chakraborty, Chief Customer Officer at Ayasdi, will provide a tactical framework to become a truly intelligent enterprise, including how to identify the right applications for AI, how to build a Center of Excellence to ...
You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technologies.
As businesses adopt functionalities in cloud computing, it’s imperative that IT operations consistently ensure cloud systems work correctly – all of the time, and to their best capabilities. In his session at @BigDataExpo, Bernd Harzog, CEO and founder of OpsDataStore, presented an industry answer to the common question, “Are you running IT operations as efficiently and as cost effectively as you need to?” He then expounded on the industry issues he frequently came up against as an analyst, and ...
SYS-CON Events announced today that Massive Networks will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Massive Networks mission is simple. To help your business operate seamlessly with fast, reliable, and secure internet and network solutions. Improve your customer's experience with outstanding connections to your cloud.
DevOps is under attack because developers don’t want to mess with infrastructure. They will happily own their code into production, but want to use platforms instead of raw automation. That’s changing the landscape that we understand as DevOps with both architecture concepts (CloudNative) and process redefinition (SRE). Rob Hirschfeld’s recent work in Kubernetes operations has led to the conclusion that containers and related platforms have changed the way we should be thinking about DevOps and...
SYS-CON Events announced today that Datera, that offers a radically new data management architecture, has been named "Exhibitor" of SYS-CON's 21st International Cloud Expo ®, which will take place on Oct 31 - Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Datera is transforming the traditional datacenter model through modern cloud simplicity. The technology industry is at another major inflection point. The rise of mobile, the Internet of Things, data storage and Big...
Everything run by electricity will eventually be connected to the Internet. Get ahead of the Internet of Things revolution and join Akvelon expert and IoT industry leader, Sergey Grebnov, in his session at @ThingsExpo, for an educational dive into the world of managing your home, workplace and all the devices they contain with the power of machine-based AI and intelligent Bot services for a completely streamlined experience.
Because IoT devices are deployed in mission-critical environments more than ever before, it’s increasingly imperative they be truly smart. IoT sensors simply stockpiling data isn’t useful. IoT must be artificially and naturally intelligent in order to provide more value In his session at @ThingsExpo, John Crupi, Vice President and Engineering System Architect at Greenwave Systems, will discuss how IoT artificial intelligence (AI) can be carried out via edge analytics and machine learning techn...
FinTechs use the cloud to operate at the speed and scale of digital financial activity, but are often hindered by the complexity of managing security and compliance in the cloud. In his session at 20th Cloud Expo, Sesh Murthy, co-founder and CTO of Cloud Raxak, showed how proactive and automated cloud security enables FinTechs to leverage the cloud to achieve their business goals. Through business-driven cloud security, FinTechs can speed time-to-market, diminish risk and costs, maintain continu...
Existing Big Data solutions are mainly focused on the discovery and analysis of data. The solutions are scalable and highly available but tedious when swapping in and swapping out occurs in disarray and thrashing takes place. The resolution for thrashing through machine learning algorithms and support nomenclature is through simple techniques. Organizations that have been collecting large customer data are increasingly seeing the need to use the data for swapping in and out and thrashing occurs ...
SYS-CON Events announced today that CA Technologies has been named "Platinum Sponsor" of SYS-CON's 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. CA Technologies helps customers succeed in a future where every business - from apparel to energy - is being rewritten by software. From planning to development to management to security, CA creates software that fuels transformation for companies in the applic...
As many know, the first generation of Cloud Management Platform (CMP) solutions were designed for managing virtual infrastructure (IaaS) and traditional applications. But that’s no longer enough to satisfy evolving and complex business requirements. In his session at 21st Cloud Expo, Scott Davis, Embotics CTO, will explore how next-generation CMPs ensure organizations can manage cloud-native and microservice-based application architectures, while also facilitating agile DevOps methodology. He wi...
From 2013, NTT Communications has been providing cPaaS service, SkyWay. Its customer’s expectations for leveraging WebRTC technology are not only typical real-time communication use cases such as Web conference, remote education, but also IoT use cases such as remote camera monitoring, smart-glass, and robotic. Because of this, NTT Communications has numerous IoT business use-cases that its customers are developing on top of PaaS. WebRTC will lead IoT businesses to be more innovative and address...
Blockchain is a shared, secure record of exchange that establishes trust, accountability and transparency across business networks. Supported by the Linux Foundation's open source, open-standards based Hyperledger Project, Blockchain has the potential to improve regulatory compliance, reduce cost as well as advance trade. Are you curious about how Blockchain is built for business? In her session at 21st Cloud Expo, René Bostic, Technical VP of the IBM Cloud Unit in North America, will discuss th...
While some vendors scramble to create and sell you a fancy solution for monitoring your spanking new Amazon Lambdas, hear how you can do it on the cheap using just built-in Java APIs yourself. By exploiting a little-known fact that Lambdas aren’t exactly single-threaded, you can effectively identify hot spots in your serverless code. In his session at @DevOpsSummit at 21st Cloud Expo, Dave Martin, Product owner at CA Technologies, will give a live demonstration and code walkthrough, showing how ...
SYS-CON Events announced today that CA Technologies has been named “Platinum Sponsor” of SYS-CON's 21st International Cloud Expo®, which will take place October 31-November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. CA Technologies helps customers succeed in a future where every business – from apparel to energy – is being rewritten by software. From planning to development to management to security, CA creates software that fuels transformation for companies in the applic...
Cloud adoption is often driven by a desire to increase efficiency, boost agility and save money. All too often, however, the reality involves unpredictable cost spikes and lack of oversight due to resource limitations. In his session at 20th Cloud Expo, Joe Kinsella, CTO and Founder of CloudHealth Technologies, tackled the question: “How do you build a fully optimized cloud?” He will examine: Why TCO is critical to achieving cloud success – and why attendees should be thinking holistically ab...
Internet of @ThingsExpo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 21st Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devic...
As more and more companies are making the shift from on-premises to public cloud, the standard approach to DevOps is evolving. From encryption, compliance and regulations like GDPR, security in the cloud has become a hot topic. Many DevOps-focused companies have hired dedicated staff to fulfill these requirements, often creating further siloes, complexity and cost. This session aims to highlight existing DevOps cultural approaches, tooling and how security can be wrapped in every facet of the bu...