Welcome!

AJAX & REA Authors: Elizabeth White, Liz McMillan, Trevor Parsons, Yeshim Deniz, Plutora Blog

Related Topics: SOA & WOA, Java, XML, AJAX & REA, Web 2.0, Big Data Journal

SOA & WOA: Article

With Confidence Through the Holiday Season: Manage Success in Production

The top goal for eCommerce sites is to ensure high conversion rates as this converts into business

In our last two articles, we discussed what we have learned from last year's holiday season as well as things that we can do in the preparation phase for this year's upcoming event. In this blog we show you those dashboards and data points you need throughout the holiday season to make it a success.

The top goal for eCommerce sites is to ensure high conversion rates as this converts into business. IT's responsibility is to ensure that consumers can use the eCommerce site in an "enjoyable" way. But there is much more than measuring the UpTime or Response Time of your services. The dashboards shown are taken from other eCommerce sites used to monitor the health of their application, infrastructure as well as end user satisfaction and conversion rate.

#1: Infrastructure and Application Health

Dashboards need to show the system health impact on applications, services and processes. If no systems are impacted it has less priority to deal with high CPU, Memory, ...

Applications ultimately run on an IT Infrastructure; whether these machines are "physical", virtualized, or running in the cloud. Ensuring a healthy infrastructure is the key requirement for IT. But it is more important to know whether there is an immediate impact on the hosted applications, services and processes. Before upgrading any hardware, reconfiguring your routing tables, or bouncing your application, it is important to understand whether it actually impacts the application and the end user. Just because you run on 95% of CPU doesn't mean it's a problem - maybe your developers just built a perfect system that consumes all resources available in an optimum manner.

You need a dashboard that alerts on system monitoring issues but also take into account the applications, services, and processes running on them. Are these impacted by the resource shortage or not? That answer dictates your action if you know what is actually impacted.

#2: Application Performance

What are they key performance indicators per application? Are end users impacted by bad response times or failures? Is it the App or the underlying infrastructure?

The second dashboard you need focuses on the application, its performance, and impact on the end user. It answers the following critical questions:

  1. How much traffic is currently on the page? Is it still climbing? Is it outside the norm?
  2. Do we have an unusual high failure rate, e.g., failed credit card transactions, abandoned carts?
  3. What is the overall response time and is it violating my baseline?
  4. Is the application impacted by unhealthy app or web servers, e.g., high GC
  5. Are the hosts (physical, virtual or in the cloud) running into CPU, Memory or I/O limits?
  6. Are end users currently impacted when accessing the app? Are they leaving because of bad user experience?
  7. What is the current conversion rate or are we making money?

#3: Regional Availability and User Experience

How is user experience in our target markets? Any regional availability or performance problems?

The first two dashboards in this blog analyzed performance from within our datacenter. The dashboards above and below now focus on performance perceived from the outside - meaning - from the real end user perspective. You need to know if your app is not reachable from a specific region or when conversion rate drops even though your servers are doing fine. These two dashboards answer the following important questions for you:

  1. Is my site reachable from my key regional markets?
  2. If I am not reachable: How long did the outage last and did it impact Users?
  3. How many users do we have per region and what was their User Experience?
  4. How does the traffic per region develop over time?
  5. How is our conversion rate over time and how many orders do we actually get in?

How is conversion rate and number of orders evolving over time? If we have a drop in conversions - is it related to a regional problem or is it related to general system health issues in our data center?

#4: Real User Experience on the Conversion Funnel

Learning how users move through the conversion funnel, where they drop off and how response time and end user experience (APDEX) impacts the conversion funnel

You need a dedicated dashboard for all important actions along your conversion funnel. That includes landing pages and actions such as search, product details, add to cart and checkout. The dashboard helps you to understand:

  1. How many users you have on each conversion funnel step?
  2. Do they encounter problems during a particular action and is that the reason for a drop?
  3. How fast is each step and does it have an impact on end user experience?

#5: Third Party Monitoring dashboard

How fast is static content delivered by Akamai & Co? Are there spikes or outages that impact my end users?

Most eCommerce sites rely on third-party content which not only impacts the feature set of the site but also performance and with that end user experience. Third Party Monitoring requires a view from two different angles: The third parties that are directly included into your website or mobile app and the external services you call from your backend.

These are the questions your Third Party Monitoring dashboard has to answer:

  1. Are the resources delivered via CDN fast or do we have regional problems?
  2. Is the integrated social media (Facebook, LinkedIn, Xing...) slow?
  3. Are the backend services facing bad requests to the integrate third parties?
  4. Is the performance of the third party good?

How fast and reliable are third party services such as facebook or Google API? Does it impact the failure rate of my application?

#6: Desktop Web vs. Mobile Web vs. Mobile App dashboard

Get to know your users: what devices to they use and does that impact user experience?

Your potential customers can use desktops, tablets or smart phones to access your site. They will either have fast WiFi or slow dial-up speed. All of this impacts user experience. In order to analyze performance and optimize your site for these types of browsers, devices and connection speed you need a dashboard that tells you:

  1. How many users are accessing my portal via Mobile App or Mobile Browser?
  2. What are the top browsers used? Do we need specific optimized pages for older browsers?
  3. Do we need to optimize for lower bandwidths, e.g: use better image compression?
  4. Is there a difference between the Key Performance Indicators (KPI) depending on the different types of devices, browsers, mobile native vs. mobile web?

When disaster strikes: Collaborate with R&D
It is likely that you have smaller hiccups throughout the holiday season. To avoid lengthy and painful war room situations it is important to level-up your monitoring system and provide data your engineering team needs to speed up error resolution. Here is a list of capabilities that will speed up triage and error resolution:

  1. Capture all actions of each visitor
  2. Collect Crashes, JavaScript Errors, iOS/Andorid Exceptions from your mobile app
  3. Provide method level visibility on the server side including context information such as method arguments and return values
  4. Provide the ability for memory heap dumps and access to all requested application performance metrics, e.g: connection pool, thread count, heap sizes, ...
  5. Use tools to capture this data that developers already use and that also allow sharing data from different environments.

This level of detail is what developers need to understand what exactly went wrong instead of digging through giga bytes of log files

Crash information from mobile native apps by mobile device and version makes it easy to fix specific problems

Conclusion
Having these types of dashboards make it easy to monitor the success of the holiday season and also easy to react on problems and prevent larger damage by executing the right actions. Make sure you do not waste your time with problems that are not real, e.g., an individual user complains or trying to find a problem related to a regional outage of an ISP. Focus on those problems that impact a large number of users and that you can fix. This will make sure you keep conversion rate high and business flowing.

For further reading check out our other recent blogs such as DevOps Survival Guide: 2013 Online Holiday Shopping Season and With Confidence into the Holiday Season: Verifying Readiness in Test / Pre-Production

More Stories By Klaus Enzenhofer

Klaus Enzenhofer has several years of experience and expertise in the field of Web Performance Optimization and User Experience Management. He works as Technical Strategist in the Center of Excellence Team at dynaTrace Software. In this role he influences the development of the dynaTrace Application Performance Management Solution and the Web Performance Optimization Tool dynaTrace AJAX Edition. He mainly gathered his experience in web and performance by developing and running large-scale web portals at Tiscover GmbH.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@CloudExpo Stories
15th Cloud Expo, which took place Nov. 4-6, 2014, at the Santa Clara Convention Center in Santa Clara, CA, expanded the conference content of @ThingsExpo, Big Data Expo, and DevOps Summit to include two developer events. IBM held a Bluemix Developer Playground on November 5 and ElasticBox held a Hackathon on November 6. Both events took place on the expo floor. The Bluemix Developer Playground, for developers of all levels, highlighted the ease of use of Bluemix, its services and functionalit...
"ElasticBox is an enterprise company that makes it very easy for developers and IT ops to collaborate to develop, build and deploy applications on any cloud - private, public or hybrid," stated Monish Sharma, VP of Customer Success at ElasticBox, in this SYS-CON.tv interview at DevOps Summit, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
"For the past 4 years we have been working mainly to export. For the last 3 or 4 years the main market was Russia. In the past year we have been working to expand our footprint in Europe and the United States," explained Andris Gailitis, CEO of DEAC, in this SYS-CON.tv interview at Cloud Expo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
The Internet of Things will greatly expand the opportunities for data collection and new business models driven off of that data. In her session at @ThingsExpo, Esmeralda Swartz, CMO of MetraTech, discussed how for this to be effective you not only need to have infrastructure and operational models capable of utilizing this new phenomenon, but increasingly service providers will need to convince a skeptical public to participate. Get ready to show them the money!
The Industrial Internet revolution is now underway, enabled by connected machines and billions of devices that communicate and collaborate. The massive amounts of Big Data requiring real-time analysis is flooding legacy IT systems and giving way to cloud environments that can handle the unpredictable workloads. Yet many barriers remain until we can fully realize the opportunities and benefits from the convergence of machines and devices with Big Data and the cloud, including interoperability, ...
At 15th Cloud Expo, Shrikant Pattathil, Executive Vice President at Harbinger Systems, demos a video delivery platform that helps you do interactive videos. He discusses how Harbinger is accomplishing it in the cloud world, the problems they faced and the choices they made to get around these problems.
“DevOps is really about the business. The business is under pressure today, competitively in the marketplace to respond to the expectations of the customer. The business is driving IT and the problem is that IT isn't responding fast enough," explained Mark Levy, Senior Product Marketing Manager at Serena Software, in this SYS-CON.tv interview at DevOps Summit, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
“The year of the cloud – we have no idea when it's really happening but we think it's happening now. For those technology providers like Zentera that are helping enterprises move to the cloud - it's been fun to watch," noted Mike Loftus, VP Product Management and Marketing at Zentera Systems, in this SYS-CON.tv interview at Cloud Expo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
The 3rd International Internet of @ThingsExpo, co-located with the 16th International Cloud Expo - to be held June 9-11, 2015, at the Javits Center in New York City, NY - announces that its Call for Papers is now open. The Internet of Things (IoT) is the biggest idea since the creation of the Worldwide Web more than 20 years ago.
Want to enable self-service provisioning of application environments in minutes that mirror production? Can you automatically provide rich data with code-level detail back to the developers when issues occur in production? In his session at DevOps Summit, David Tesar, Microsoft Technical Evangelist on Microsoft Azure and DevOps, will discuss how to accomplish this and more utilizing technologies such as Microsoft Azure, Visual Studio online, and Application Insights in this demo-heavy session.
Entuity®, a provider of enterprise-class network management solutions, today announced that it solidifies its position as a market leader through global enterprise customer acquisitions and a refined channel strategy. In 2014, Entuity increased new license revenues in EMEA by over 75 percent, and LATAM by over 125 percent as customers embraced Entuity for its highly automated solution and unified architecture. Entuity’s refined channel strategy focuses on even deeper strategic alignment with ke...
We are all here because we are sold on the transformative promise of The Cloud. But what good is all of this ephemeral, on-demand infrastructure if your usage doesn't actually improve the agility and speed of your business? How must Operations adapt in order to avoid stifling your Cloud initiative? In his session at DevOps Summit, Damon Edwards, co-founder and managing partner of the DTO Solutions, will highlight the successful organizational, process, and tooling patterns of high-performing c...
The 4th International DevOps Summit, co-located with16th International Cloud Expo – being held June 9-11, 2015, at the Javits Center in New York City, NY – announces that its Call for Papers is now open. Born out of proven success in agile development, cloud computing, and process automation, DevOps is a macro trend you cannot afford to miss. From showcase success stories from early adopters and web-scale businesses, DevOps is expanding to organizations of all sizes, including the world's large...
Cloud Expo 2014 TV commercials will feature @ThingsExpo, which was launched in June, 2014 at New York City's Javits Center as the largest 'Internet of Things' event in the world.
“We help people build clusters, in the classical sense of the cluster. We help people put a full stack on top of every single one of those machines. We do the full bare metal install," explained Greg Bruno, Vice President of Engineering and co-founder of StackIQ, in this SYS-CON.tv interview at 15th Cloud Expo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
In this demo at 15th Cloud Expo, John Meza, Product Engineer at Esri, showed how Esri products hook into Hadoop cluster to allow you to do spatial analysis on the spatial data within your cluster, and he demonstrated rendering from a data center with ArcGIS Pro, a new product that has a brand new rendering engine.
"Blue Box has been around for 10-11 years, and last year we launched Blue Box Cloud. We like the term 'Private Cloud as a Service' because we think that embodies what we are launching as a product - it's a managed hosted private cloud," explained Giles Frith, Vice President of Customer Operations at Blue Box, in this SYS-CON.tv interview at DevOps Summit, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
"People are a lot more knowledgeable about APIs now. There are two types of people who work with APIs - IT people who want to use APIs for something internal and the product managers who want to do something outside APIs for people to connect to them," explained Roberto Medrano, Executive Vice President at SOA Software, in this SYS-CON.tv interview at Cloud Expo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
The Software Defined Data Center (SDDC), which enables organizations to seamlessly run in a hybrid cloud model (public + private cloud), is here to stay. IDC estimates that the software-defined networking market will be valued at $3.7 billion by 2016. Security is a key component and benefit of the SDDC, and offers an opportunity to build security 'from the ground up' and weave it into the environment from day one. In his session at 16th Cloud Expo, Reuven Harrison, CTO and Co-Founder of Tufin,...
SYS-CON Media announced that Splunk, a provider of the leading software platform for real-time Operational Intelligence, has launched an ad campaign on Big Data Journal. Splunk software and cloud services enable organizations to search, monitor, analyze and visualize machine-generated big data coming from websites, applications, servers, networks, sensors and mobile devices. The ads focus on delivering ROI - how improved uptime delivered $6M in annual ROI, improving customer operations by minin...