|By Klaus Enzenhofer||
|October 8, 2013 12:45 PM EDT||
In our last two articles, we discussed what we have learned from last year's holiday season as well as things that we can do in the preparation phase for this year's upcoming event. In this blog we show you those dashboards and data points you need throughout the holiday season to make it a success.
The top goal for eCommerce sites is to ensure high conversion rates as this converts into business. IT's responsibility is to ensure that consumers can use the eCommerce site in an "enjoyable" way. But there is much more than measuring the UpTime or Response Time of your services. The dashboards shown are taken from other eCommerce sites used to monitor the health of their application, infrastructure as well as end user satisfaction and conversion rate.
#1: Infrastructure and Application Health
Dashboards need to show the system health impact on applications, services and processes. If no systems are impacted it has less priority to deal with high CPU, Memory, ...
Applications ultimately run on an IT Infrastructure; whether these machines are "physical", virtualized, or running in the cloud. Ensuring a healthy infrastructure is the key requirement for IT. But it is more important to know whether there is an immediate impact on the hosted applications, services and processes. Before upgrading any hardware, reconfiguring your routing tables, or bouncing your application, it is important to understand whether it actually impacts the application and the end user. Just because you run on 95% of CPU doesn't mean it's a problem - maybe your developers just built a perfect system that consumes all resources available in an optimum manner.
You need a dashboard that alerts on system monitoring issues but also take into account the applications, services, and processes running on them. Are these impacted by the resource shortage or not? That answer dictates your action if you know what is actually impacted.
#2: Application Performance
What are they key performance indicators per application? Are end users impacted by bad response times or failures? Is it the App or the underlying infrastructure?
The second dashboard you need focuses on the application, its performance, and impact on the end user. It answers the following critical questions:
- How much traffic is currently on the page? Is it still climbing? Is it outside the norm?
- Do we have an unusual high failure rate, e.g., failed credit card transactions, abandoned carts?
- What is the overall response time and is it violating my baseline?
- Is the application impacted by unhealthy app or web servers, e.g., high GC
- Are the hosts (physical, virtual or in the cloud) running into CPU, Memory or I/O limits?
- Are end users currently impacted when accessing the app? Are they leaving because of bad user experience?
- What is the current conversion rate or are we making money?
#3: Regional Availability and User Experience
How is user experience in our target markets? Any regional availability or performance problems?
The first two dashboards in this blog analyzed performance from within our datacenter. The dashboards above and below now focus on performance perceived from the outside - meaning - from the real end user perspective. You need to know if your app is not reachable from a specific region or when conversion rate drops even though your servers are doing fine. These two dashboards answer the following important questions for you:
- Is my site reachable from my key regional markets?
- If I am not reachable: How long did the outage last and did it impact Users?
- How many users do we have per region and what was their User Experience?
- How does the traffic per region develop over time?
- How is our conversion rate over time and how many orders do we actually get in?
How is conversion rate and number of orders evolving over time? If we have a drop in conversions - is it related to a regional problem or is it related to general system health issues in our data center?
#4: Real User Experience on the Conversion Funnel
Learning how users move through the conversion funnel, where they drop off and how response time and end user experience (APDEX) impacts the conversion funnel
You need a dedicated dashboard for all important actions along your conversion funnel. That includes landing pages and actions such as search, product details, add to cart and checkout. The dashboard helps you to understand:
- How many users you have on each conversion funnel step?
- Do they encounter problems during a particular action and is that the reason for a drop?
- How fast is each step and does it have an impact on end user experience?
#5: Third Party Monitoring dashboard
How fast is static content delivered by Akamai & Co? Are there spikes or outages that impact my end users?
Most eCommerce sites rely on third-party content which not only impacts the feature set of the site but also performance and with that end user experience. Third Party Monitoring requires a view from two different angles: The third parties that are directly included into your website or mobile app and the external services you call from your backend.
These are the questions your Third Party Monitoring dashboard has to answer:
- Are the resources delivered via CDN fast or do we have regional problems?
- Is the integrated social media (Facebook, LinkedIn, Xing...) slow?
- Are the backend services facing bad requests to the integrate third parties?
- Is the performance of the third party good?
How fast and reliable are third party services such as facebook or Google API? Does it impact the failure rate of my application?
#6: Desktop Web vs. Mobile Web vs. Mobile App dashboard
Get to know your users: what devices to they use and does that impact user experience?
Your potential customers can use desktops, tablets or smart phones to access your site. They will either have fast WiFi or slow dial-up speed. All of this impacts user experience. In order to analyze performance and optimize your site for these types of browsers, devices and connection speed you need a dashboard that tells you:
- How many users are accessing my portal via Mobile App or Mobile Browser?
- What are the top browsers used? Do we need specific optimized pages for older browsers?
- Do we need to optimize for lower bandwidths, e.g: use better image compression?
- Is there a difference between the Key Performance Indicators (KPI) depending on the different types of devices, browsers, mobile native vs. mobile web?
When disaster strikes: Collaborate with R&D
It is likely that you have smaller hiccups throughout the holiday season. To avoid lengthy and painful war room situations it is important to level-up your monitoring system and provide data your engineering team needs to speed up error resolution. Here is a list of capabilities that will speed up triage and error resolution:
- Capture all actions of each visitor
- Provide method level visibility on the server side including context information such as method arguments and return values
- Provide the ability for memory heap dumps and access to all requested application performance metrics, e.g: connection pool, thread count, heap sizes, ...
- Use tools to capture this data that developers already use and that also allow sharing data from different environments.
This level of detail is what developers need to understand what exactly went wrong instead of digging through giga bytes of log files
Crash information from mobile native apps by mobile device and version makes it easy to fix specific problems
Having these types of dashboards make it easy to monitor the success of the holiday season and also easy to react on problems and prevent larger damage by executing the right actions. Make sure you do not waste your time with problems that are not real, e.g., an individual user complains or trying to find a problem related to a regional outage of an ISP. Focus on those problems that impact a large number of users and that you can fix. This will make sure you keep conversion rate high and business flowing.
For further reading check out our other recent blogs such as DevOps Survival Guide: 2013 Online Holiday Shopping Season and With Confidence into the Holiday Season: Verifying Readiness in Test / Pre-Production
There are many considerations when moving applications from on-premise to cloud. It is critical to understand the benefits and also challenges of this migration. A successful migration will result in lower Total Cost of Ownership, yet offer the same or higher level of robustness. In his session at 15th Cloud Expo, Michael Meiner, an Engineering Director at Oracle, Corporation, will analyze a range of cloud offerings (IaaS, PaaS, SaaS) and discuss the benefits/challenges of migrating to each of...
Mar. 2, 2015 11:30 PM EST Reads: 687
Cloud data governance was previously an avoided function when cloud deployments were relatively small. With the rapid adoption in public cloud – both rogue and sanctioned, it’s not uncommon to find regulated data dumped into public cloud and unprotected. This is why enterprises and cloud providers alike need to embrace a cloud data governance function and map policies, processes and technology controls accordingly. In her session at 15th Cloud Expo, Evelyn de Souza, Data Privacy and Compliance...
Mar. 2, 2015 11:30 PM EST Reads: 674
The Workspace-as-a-Service (WaaS) market will grow to $6.4B by 2018. In his session at 16th Cloud Expo, Seth Bostock, CEO of IndependenceIT, will begin by walking the audience through the evolution of Workspace as-a-Service, where it is now vs. where it going. To look beyond the desktop we must understand exactly what WaaS is, who the users are, and where it is going in the future. IT departments, ISVs and service providers must look to workflow and automation capabilities to adapt to growing ...
Mar. 2, 2015 11:00 PM EST Reads: 551
Platform-as-a-Service (PaaS) is a technology designed to make DevOps easier and allow developers to focus on application development. The PaaS takes care of provisioning, scaling, HA, and other cloud management aspects. Apache Stratos is a PaaS codebase developed in Apache and designed to create a highly productive developer environment while also supporting powerful deployment options. Integration with the Docker platform, CoreOS Linux distribution, and Kubernetes container management system ...
Mar. 2, 2015 10:45 PM EST Reads: 672
As organizations shift toward IT-as-a-service models, the need for managing and protecting data residing across physical, virtual, and now cloud environments grows with it. CommVault can ensure protection &E-Discovery of your data – whether in a private cloud, a Service Provider delivered public cloud, or a hybrid cloud environment – across the heterogeneous enterprise. In his session at 16th Cloud Expo, Randy De Meno, Chief Technologist - Windows Products and Microsoft Partnerships, will disc...
Mar. 2, 2015 09:45 PM EST Reads: 565
Hadoop as a Service (as offered by handful of niche vendors now) is a cloud computing solution that makes medium and large-scale data processing accessible, easy, fast and inexpensive. In his session at Big Data Expo, Kumar Ramamurthy, Vice President and Chief Technologist, EIM & Big Data, at Virtusa, will discuss how this is achieved by eliminating the operational challenges of running Hadoop, so one can focus on business growth. The fragmented Hadoop distribution world and various PaaS soluti...
Mar. 2, 2015 09:30 PM EST Reads: 565
Containers and microservices have become topics of intense interest throughout the cloud developer and enterprise IT communities. Accordingly, attendees at the upcoming 16th Cloud Expo at the Javits Center in New York June 9-11 will find fresh new content in a new track called PaaS | Containers & Microservices Containers are not being considered for the first time by the cloud community, but a current era of re-consideration has pushed them to the top of the cloud agenda. With the launch ...
Mar. 2, 2015 07:15 PM EST Reads: 799
VictorOps is making on-call suck less with the only collaborative alert management platform on the market. With easy on-call scheduling management, a real-time incident timeline that gives you contextual relevance around your alerts and powerful reporting features that make post-mortems more effective, VictorOps helps your IT/DevOps team solve problems faster.
Mar. 2, 2015 05:00 PM EST Reads: 1,305
Skeuomorphism usually means retaining existing design cues in something new that doesn’t actually need them. However, the concept of skeuomorphism can be thought of as relating more broadly to applying existing patterns to new technologies that, in fact, cry out for new approaches. In his session at DevOps Summit, Gordon Haff, Senior Cloud Strategy Marketing and Evangelism Manager at Red Hat, will discuss why containers should be paired with new architectural practices such as microservices ra...
Mar. 2, 2015 04:00 PM EST Reads: 1,517
Roberto Medrano, Executive Vice President at SOA Software, had reached 30,000 page views on his home page - http://RobertoMedrano.SYS-CON.com/ - on the SYS-CON family of online magazines, which includes Cloud Computing Journal, Internet of Things Journal, Big Data Journal, and SOA World Magazine. He is a recognized executive in the information technology fields of SOA, internet security, governance, and compliance. He has extensive experience with both start-ups and large companies, having been ...
Mar. 2, 2015 04:00 PM EST Reads: 1,357
HP and Aruba Networks on Monday announced a definitive agreement for HP to acquire Aruba, a provider of next-generation network access solutions for the mobile enterprise, for $24.67 per share in cash. The equity value of the transaction is approximately $3.0 billion, and net of cash and debt approximately $2.7 billion. Both companies' boards of directors have approved the deal. "Enterprises are facing a mobile-first world and are looking for solutions that help them transition legacy investme...
Mar. 2, 2015 04:00 PM EST Reads: 733
The industrial software market has treated data with the mentality of “collect everything now, worry about how to use it later.” We now find ourselves buried in data, with the pervasive connectivity of the (Industrial) Internet of Things only piling on more numbers. There’s too much data and not enough information. In his session at @ThingsExpo, Bob Gates, Global Marketing Director, GE’s Intelligent Platforms business, to discuss how realizing the power of IoT, software developers are now focu...
Mar. 2, 2015 03:15 PM EST Reads: 1,462
Operational Hadoop and the Lambda Architecture for Streaming Data Apache Hadoop is emerging as a distributed platform for handling large and fast incoming streams of data. Predictive maintenance, supply chain optimization, and Internet-of-Things analysis are examples where Hadoop provides the scalable storage, processing, and analytics platform to gain meaningful insights from granular data that is typically only valuable from a large-scale, aggregate view. One architecture useful for capturing...
Mar. 2, 2015 02:00 PM EST Reads: 1,432
SYS-CON Events announced today that Vitria Technology, Inc. will exhibit at SYS-CON’s @ThingsExpo, which will take place on June 9-11, 2015, at the Javits Center in New York City, NY. Vitria will showcase the company’s new IoT Analytics Platform through live demonstrations at booth #330. Vitria’s IoT Analytics Platform, fully integrated and powered by an operational intelligence engine, enables customers to rapidly build and operationalize advanced analytics to deliver timely business outcomes ...
Mar. 2, 2015 01:45 PM EST Reads: 1,301
DevOps is about increasing efficiency, but nothing is more inefficient than building the same application twice. However, this is a routine occurrence with enterprise applications that need both a rich desktop web interface and strong mobile support. With recent technological advances from Isomorphic Software and others, it is now feasible to create a rich desktop and tuned mobile experience with a single codebase, without compromising performance or usability.
Mar. 2, 2015 01:15 PM EST Reads: 1,220
SYS-CON Events announced today Arista Networks will exhibit at SYS-CON's DevOps Summit 2015 New York, which will take place June 9-11, 2015, at the Javits Center in New York City, NY. Arista Networks was founded to deliver software-driven cloud networking solutions for large data center and computing environments. Arista’s award-winning 10/40/100GbE switches redefine scalability, robustness, and price-performance, with over 3,000 customers and more than three million cloud networking ports depl...
Mar. 2, 2015 01:00 PM EST Reads: 1,603
The speed of software changes in growing and large scale rapid-paced DevOps environments presents a challenge for continuous testing. Many organizations struggle to get this right. Practices that work for small scale continuous testing may not be sufficient as the requirements grow. In his session at DevOps Summit, Marc Hornbeek, Sr. Solutions Architect of DevOps continuous test solutions at Spirent Communications, will explain the best practices of continuous testing at high scale, which is r...
Mar. 2, 2015 01:00 PM EST Reads: 1,289
Thanks to Docker, it becomes very easy to leverage containers to build, ship, and run any Linux application on any kind of infrastructure. Docker is particularly helpful for microservice architectures because their successful implementation relies on a fast, efficient deployment mechanism – which is precisely one of the features of Docker. Microservice architectures are therefore becoming more popular, and are increasingly seen as an interesting option even for smaller projects, instead of bein...
Mar. 2, 2015 12:00 PM EST Reads: 2,643
The explosion of connected devices / sensors is creating an ever-expanding set of new and valuable data. In parallel the emerging capability of Big Data technologies to store, access, analyze, and react to this data is producing changes in business models under the umbrella of the Internet of Things (IoT). In particular within the Insurance industry, IoT appears positioned to enable deep changes by altering relationships between insurers, distributors, and the insured. In his session at @Things...
Mar. 2, 2015 12:00 PM EST Reads: 1,362
Security can create serious friction for DevOps processes. We've come up with an approach to alleviate the friction and provide security value to DevOps teams. In her session at DevOps Summit, Shannon Lietz, Senior Manager of DevSecOps at Intuit, will discuss how DevSecOps got started and how it has evolved. Shannon Lietz has over two decades of experience pursuing next generation security solutions. She is currently the DevSecOps Leader for Intuit where she is responsible for setting and driv...
Mar. 2, 2015 12:00 PM EST Reads: 2,421