With Confidence Through the Holiday Season: Manage Success in Production

The top goal for eCommerce sites is to ensure high conversion rates as this converts into business

In our last two articles, we discussed what we learned from last year's holiday season as well as things you can do in the preparation phase for this year's upcoming event. In this blog we show you the dashboards and data points you need throughout the holiday season to make it a success.

The top goal for eCommerce sites is to ensure high conversion rates, as conversions translate into business. IT's responsibility is to ensure that consumers can use the eCommerce site in an "enjoyable" way. But there is much more to this than measuring the uptime or response time of your services. The dashboards shown here are taken from eCommerce sites that use them to monitor the health of their applications and infrastructure as well as end user satisfaction and conversion rate.

#1: Infrastructure and Application Health

Dashboards need to show the impact of system health on applications, services and processes. If none of these are impacted, dealing with high CPU, memory, etc. has lower priority.

Applications ultimately run on an IT infrastructure, whether its machines are physical, virtualized, or running in the cloud. Ensuring a healthy infrastructure is a key requirement for IT. But it is more important to know whether there is an immediate impact on the hosted applications, services and processes. Before upgrading any hardware, reconfiguring your routing tables, or bouncing your application, it is important to understand whether the infrastructure issue actually impacts the application and the end user. Just because you run at 95% CPU doesn't mean it's a problem. Maybe your developers just built a perfect system that consumes all available resources in an optimal manner.

You need a dashboard that alerts on system monitoring issues but also takes into account the applications, services, and processes running on those systems. Are these impacted by the resource shortage or not? Once you know what is actually impacted, the answer dictates your action.
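The idea of ranking infrastructure alerts by their application impact can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `HostStatus` record, the `impacted_apps` field and the 90% CPU threshold are hypothetical and do not come from any particular monitoring product.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class HostStatus:
    name: str
    cpu_pct: float
    # Hypothetical field: applications on this host that currently
    # show degraded response times or elevated failure rates.
    impacted_apps: List[str] = field(default_factory=list)


def prioritize(hosts: List[HostStatus], cpu_threshold: float = 90.0) -> List[Tuple[str, str]]:
    """Return (priority, host name) pairs for hosts at or above the CPU threshold.

    A busy host is "high" priority only if an application on it is impacted;
    otherwise the alert is kept but ranked "low".
    """
    alerts = []
    for host in hosts:
        if host.cpu_pct < cpu_threshold:
            continue  # no resource alert for this host
        priority = "high" if host.impacted_apps else "low"
        alerts.append((priority, host.name))
    # High-priority alerts first, then by host name for a stable order.
    alerts.sort(key=lambda alert: (alert[0] != "high", alert[1]))
    return alerts


if __name__ == "__main__":
    hosts = [
        HostStatus("web-1", 95.0),
        HostStatus("web-2", 97.0, ["checkout"]),
        HostStatus("db-1", 40.0),
    ]
    for priority, name in prioritize(hosts):
        print(priority, name)
```

With this ordering, a host running at 95% CPU without any impacted application still shows up, but below hosts where users are actually affected.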

#2: Application Performance

What are the key performance indicators per application? Are end users impacted by bad response times or failures? Is it the app or the underlying infrastructure?

The second dashboard you need focuses on the application, its performance, and impact on the end user. It answers the following critical questions:

  1. How much traffic is currently on the page? Is it still climbing? Is it outside the norm?
  2. Do we have an unusually high failure rate, e.g., failed credit card transactions, abandoned carts?
  3. What is the overall response time and is it violating my baseline?
  4. Is the application impacted by unhealthy app or web servers, e.g., high GC?
  5. Are the hosts (physical, virtual or in the cloud) running into CPU, Memory or I/O limits?
  6. Are end users currently impacted when accessing the app? Are they leaving because of bad user experience?
  7. What is the current conversion rate? Are we making money?
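Two of the questions above, whether response time violates the baseline and what the current conversion rate is, come down to simple arithmetic. The sketch below assumes a baseline defined as the historical mean plus three standard deviations. That definition is an assumption made for illustration; real monitoring tools may compute baselines differently. The function names are hypothetical.

```python
from statistics import mean, stdev
from typing import Sequence


def violates_baseline(history: Sequence[float], current: float, tolerance: float = 3.0) -> bool:
    """True if `current` lies more than `tolerance` sample standard
    deviations above the mean of the historical values."""
    if len(history) < 2:
        return False  # not enough data to form a baseline
    upper_bound = mean(history) + tolerance * stdev(history)
    return current > upper_bound


def conversion_rate(orders: int, sessions: int) -> float:
    """Fraction of sessions that ended in an order."""
    return orders / sessions if sessions else 0.0


if __name__ == "__main__":
    response_times_ms = [200, 210, 190, 205, 195]
    print(violates_baseline(response_times_ms, 230))
    print(conversion_rate(orders=50, sessions=2000))
```

The same `violates_baseline` check can be applied to traffic volume or failure rate, as long as higher values are the ones that indicate trouble.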

#3: Regional Availability and User Experience

How is user experience in our target markets? Any regional availability or performance problems?

The first two dashboards in this blog analyzed performance from within our datacenter. The dashboards above and below focus on performance as perceived from the outside, that is, from the real end user's perspective. You need to know if your app is not reachable from a specific region or when the conversion rate drops even though your servers are doing fine. These two dashboards answer the following important questions for you:

  1. Is my site reachable from my key regional markets?
  2. If I am not reachable: How long did the outage last and did it impact Users?
  3. How many users do we have per region and what was their User Experience?
  4. How does the traffic per region develop over time?
  5. How is our conversion rate over time and how many orders do we actually get in?

How are the conversion rate and the number of orders evolving over time? If we have a drop in conversions, is it related to a regional problem or to general system health issues in our data center?
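Deciding whether a drop is regional or global requires the same metric broken down per region. A minimal sketch, assuming availability checks and user sessions are available as `(region, success)` pairs; the data layout, function names and the 0.9 threshold are hypothetical:

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def rate_by_region(events: Iterable[Tuple[str, bool]]) -> Dict[str, float]:
    """Compute the success fraction per region.

    `events` can hold availability checks (success = site reachable)
    or sessions (success = session converted into an order).
    """
    totals: Dict[str, int] = defaultdict(int)
    successes: Dict[str, int] = defaultdict(int)
    for region, success in events:
        totals[region] += 1
        if success:
            successes[region] += 1
    return {region: successes[region] / totals[region] for region in totals}


def regions_below(rates: Dict[str, float], threshold: float) -> List[str]:
    """Return the regions whose rate is below the threshold, sorted by name."""
    return sorted(region for region, rate in rates.items() if rate < threshold)


if __name__ == "__main__":
    checks = [("EU", True), ("EU", False), ("US", True), ("US", True)]
    availability = rate_by_region(checks)
    print(availability)
    print(regions_below(availability, 0.9))
```

If only some regions fall below the threshold, the problem is more likely regional. If all of them do, general system health in the data center is the more likely cause.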

#4: Real User Experience on the Conversion Funnel

Learning how users move through the conversion funnel, where they drop off, and how response time and end user experience (APDEX) impact the conversion funnel.

You need a dedicated dashboard for all important actions along your conversion funnel. That includes landing pages and actions such as search, product details, add to cart and checkout. The dashboard helps you to understand:

  1. How many users do you have on each conversion funnel step?
  2. Do they encounter problems during a particular action and is that the reason for a drop?
  3. How fast is each step and does it have an impact on end user experience?
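Both numbers behind these questions are easy to compute once per-step data is available. The sketch below uses the standard Apdex formula (satisfied samples plus half of the tolerating samples, divided by all samples, with "tolerating" meaning up to four times the target time T). The step names, user counts and the 2-second target are made-up illustration values.

```python
from typing import List, Optional, Sequence, Tuple


def apdex(response_times: Sequence[float], target: float) -> Optional[float]:
    """Apdex score for a list of response times and a target time T.

    Satisfied: time <= T. Tolerating: T < time <= 4T. Frustrated: the rest.
    """
    if not response_times:
        return None  # no samples, no score
    satisfied = sum(1 for t in response_times if t <= target)
    tolerating = sum(1 for t in response_times if target < t <= 4 * target)
    return (satisfied + tolerating / 2) / len(response_times)


def funnel_dropoff(steps: Sequence[Tuple[str, int]]) -> List[Tuple[str, float]]:
    """For each step after the first, return the fraction of users
    lost compared to the previous step."""
    dropoffs = []
    for (_, previous_users), (name, users) in zip(steps, steps[1:]):
        lost = (previous_users - users) / previous_users if previous_users else 0.0
        dropoffs.append((name, lost))
    return dropoffs


if __name__ == "__main__":
    funnel = [("landing", 1000), ("search", 600), ("add to cart", 150), ("checkout", 75)]
    for step, lost in funnel_dropoff(funnel):
        print(f"{step}: {lost:.0%} of users lost")
    print(apdex([1.0, 2.0, 3.0, 10.0], target=2.0))
```

Putting the Apdex score of each step next to its drop-off rate makes it visible whether a step with an unusually high loss is also a slow one.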

#5: Third Party Monitoring dashboard

How fast is static content delivered by Akamai & Co? Are there spikes or outages that impact my end users?

Most eCommerce sites rely on third-party content, which impacts not only the feature set of the site but also its performance and, with that, the end user experience. Third Party Monitoring requires a view from two different angles: the third parties that are embedded directly in your website or mobile app, and the external services you call from your backend.

These are the questions your Third Party Monitoring dashboard has to answer:

  1. Are the resources delivered via CDN fast or do we have regional problems?
  2. Is the integrated social media (Facebook, LinkedIn, Xing...) slow?
  3. Are the backend services seeing failed requests to the integrated third parties?
  4. Is the performance of the third party good?

How fast and reliable are third party services such as Facebook or Google APIs? Do they impact the failure rate of my application?
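A per-provider summary of speed and reliability is one way to back such a dashboard. The following sketch assumes that each third-party call has been recorded as a `(provider, duration_ms, ok)` tuple. This record format and the provider names are hypothetical.

```python
from collections import defaultdict
from statistics import median
from typing import Dict, Iterable, List, Tuple


def third_party_summary(calls: Iterable[Tuple[str, float, bool]]) -> Dict[str, Dict[str, float]]:
    """Aggregate median duration and failure rate per third-party provider."""
    durations: Dict[str, List[float]] = defaultdict(list)
    failures: Dict[str, int] = defaultdict(int)
    for provider, duration_ms, ok in calls:
        durations[provider].append(duration_ms)
        if not ok:
            failures[provider] += 1
    return {
        provider: {
            "median_ms": median(samples),
            "failure_rate": failures[provider] / len(samples),
        }
        for provider, samples in durations.items()
    }


if __name__ == "__main__":
    calls = [
        ("cdn", 40, True),
        ("cdn", 60, True),
        ("social", 900, False),
        ("social", 300, True),
    ]
    for provider, stats in third_party_summary(calls).items():
        print(provider, stats)
```

The median is used here rather than the mean so that a single very slow call does not dominate the picture; a percentile such as the 95th would be a reasonable alternative for spotting spikes.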

#6: Desktop Web vs. Mobile Web vs. Mobile App dashboard

Get to know your users: what devices do they use and does that impact user experience?

Your potential customers can use desktops, tablets or smartphones to access your site. They will have anything from fast WiFi to slow dial-up connections. All of this impacts user experience. In order to analyze performance and optimize your site for these browsers, devices and connection speeds, you need a dashboard that tells you:

  1. How many users are accessing my portal via Mobile App or Mobile Browser?
  2. What are the top browsers used? Do we need specific optimized pages for older browsers?
  3. Do we need to optimize for lower bandwidths, e.g., use better image compression?
  4. Is there a difference between the Key Performance Indicators (KPI) depending on the different types of devices, browsers, mobile native vs. mobile web?

When disaster strikes: Collaborate with R&D

It is likely that you will have smaller hiccups throughout the holiday season. To avoid lengthy and painful war room situations, it is important to level up your monitoring system and provide the data your engineering team needs to speed up error resolution. Here is a list of capabilities that will speed up triage and error resolution:

  1. Capture all actions of each visitor
  2. Collect crashes, JavaScript errors, and iOS/Android exceptions from your mobile app
  3. Provide method level visibility on the server side including context information such as method arguments and return values
  4. Provide the ability to take memory heap dumps and give access to all requested application performance metrics, e.g., connection pool, thread count, heap sizes, ...
  5. Use tools to capture this data that developers already use and that also allow sharing data from different environments.
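To illustrate what method-level visibility with context information means, here is a minimal Python sketch that records arguments, return value, errors and duration for each call of a decorated function. It is not how an APM agent is implemented; the `capture` decorator and the in-memory `captured` list are hypothetical stand-ins for whatever instrumentation and storage a real tool provides.

```python
import functools
import time
from typing import Any, Callable, Dict, List

# Hypothetical in-memory sink; a real tool would ship these records elsewhere.
captured: List[Dict[str, Any]] = []


def capture(func: Callable) -> Callable:
    """Record arguments, result or error, and duration of every call."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        record: Dict[str, Any] = {
            "method": func.__name__,
            "args": args,
            "kwargs": kwargs,
        }
        start = time.perf_counter()
        try:
            result = func(*args, **kwargs)
            record["result"] = result
            return result
        except Exception as exc:
            record["error"] = repr(exc)
            raise  # the caller still sees the original exception
        finally:
            record["duration_s"] = time.perf_counter() - start
            captured.append(record)

    return wrapper


@capture
def add_to_cart(sku: str, quantity: int) -> Dict[str, Any]:
    """Example business method used only for this illustration."""
    if quantity <= 0:
        raise ValueError("quantity must be positive")
    return {"sku": sku, "quantity": quantity}


if __name__ == "__main__":
    add_to_cart("A1", 2)
    try:
        add_to_cart("A1", 0)
    except ValueError:
        pass
    for record in captured:
        print(record)
```

A record of a failed call contains the exact arguments that triggered the error, which is the kind of context that otherwise has to be reconstructed from log files.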

This level of detail is what developers need to understand exactly what went wrong, instead of digging through gigabytes of log files.

Crash information from mobile native apps, broken down by mobile device and version, makes it easy to fix specific problems.

Having these types of dashboards makes it easy to monitor the success of the holiday season, to react to problems, and to prevent larger damage by executing the right actions. Make sure you do not waste your time on problems that are not real or that you cannot fix, e.g., a complaint from an individual user or a regional outage of an ISP. Focus on those problems that impact a large number of users and that you can fix. This will keep your conversion rate high and business flowing.

For further reading, check out our other recent blogs such as "DevOps Survival Guide: 2013 Online Holiday Shopping Season" and "With Confidence into the Holiday Season: Verifying Readiness in Test / Pre-Production".

More Stories By Klaus Enzenhofer

Klaus Enzenhofer has several years of experience and expertise in the field of Web Performance Optimization and User Experience Management. He works as Technical Strategist in the Center of Excellence Team at dynaTrace Software. In this role he influences the development of the dynaTrace Application Performance Management Solution and the Web Performance Optimization Tool dynaTrace AJAX Edition. He mainly gathered his experience in web and performance by developing and running large-scale web portals at Tiscover GmbH.
