Data and Economics 101 | @CloudExpo #IoT #AI #M2M #BigData #Analytics

Economics is the science that deals with the production, distribution, and consumption of commodities

As more organizations try to determine where best to deploy their limited budgets to support data and analytics initiatives, they realize a need to ascertain the financial value of their data and analytics, which means basic economic concepts are coming into play. While many of you probably took an economics class in college not too long ago, some more "seasoned" readers may be rusty.

The starting point for this topic was a blog that I wrote several months ago titled "Determining the Economic Value of Data," and this key observation that started the conversation:

Data is an unusual currency. Most currencies exhibit a one-to-one transactional relationship. For example, the quantifiable value of a dollar is considered to be finite - it can only be used to buy one item or service at a time, or a person can only do one paid job at a time. But measuring the value of data is not constrained by those transactional limitations. In fact, data currency exhibits a network effect, where data can be used at the same time across multiple use cases thereby increasing its value to the organization. This makes data a powerful currency in which to invest.
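
To make that contrast concrete, here is a minimal sketch (in Python, with entirely hypothetical use cases and dollar figures, invented for illustration) of how a dollar's one-at-a-time value differs from a data set whose value accumulates across concurrent use cases:

```python
# Illustrative only: hypothetical per-use-case values for a single data set.
# A dollar can fund one purchase at a time; a data set can serve many uses at once.

dollar_value = 1.00  # spent once, on one item or service

# Hypothetical annual value the same customer data set contributes to each use case
use_case_values = {
    "fraud_detection": 250_000,
    "customer_retention": 400_000,
    "cross_sell_targeting": 150_000,
}

# The data set's value is (roughly) additive across concurrent use cases
total_data_value = sum(use_case_values.values())
print(f"Value across {len(use_case_values)} concurrent use cases: ${total_data_value:,}")
```

The same asset appears in every line of the sum; nothing is "used up" when a second or third use case draws on it, which is the network effect at work.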

So to better understand how economics can help determine the value of an organization's data and analytics, I sought the help of an old friend who is passionate about applying economics in business. Vince Sumpter (Twitter: @vsumpter) helped deepen my understanding of some core concepts of economics, and helped me consider where and how these concepts play out in a business world that is looking for ways to determine the financial (or economic) value of its data and analytics.

The economic concepts that seem to have the most bearing on determining the economic value of data (and the resulting analytics), and that this blog will cover, include:

  • Scarcity
  • Postponement Theory
  • Efficiency
  • Multiplier Effect
  • Price Elasticity
  • Capital

It is our hope that this blog fuels some creative thinking and debate as we contemplate how organizations need to apply basic economic concepts to these unusual digital assets: data and analytics.

Defining Economics
I found the following two definitions of "economics" the most useful:

  • Economics is the science that deals with the production, distribution, and consumption of commodities. Economics is generally understood to concern behavior that, given the scarcity of means, arises to achieve certain ends[1].
  • Economics is a broad term referring to the scientific study of human action, particularly as it relates to human choice and the utilization of scarce resources[2].

I pulled together what I felt were some of the key phrases to come up with the following definition of the "economics of data" for the purposes of this blog:

Economics of Data:  The science of human choice and behaviors as they relate to the production, distribution and consumption of scarce data and analytic resources.

Economics is governed by the law of supply and demand, which dictates the interaction between the supply of a resource and the demand for that resource. The law of supply and demand defines the effect that the availability of a product or service, and the demand for it, have on price. Generally, low supply and high demand increase the price; conversely, the greater the supply and the lower the demand, the further the price tends to fall.
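
As a quick refresher, here is a minimal sketch with hypothetical linear supply and demand curves (all parameters are invented for illustration); the equilibrium price is simply where the quantity supplied equals the quantity demanded:

```python
# Hypothetical linear curves: demand falls with price, supply rises with price.
# Q_demand = a - b*P,  Q_supply = c + d*P;  equilibrium where they intersect.

a, b = 100.0, 2.0   # demand intercept and slope (illustrative numbers)
c, d = 10.0, 1.0    # supply intercept and slope

equilibrium_price = (a - c) / (b + d)           # solve a - b*P = c + d*P
equilibrium_quantity = a - b * equilibrium_price

print(f"Equilibrium price: {equilibrium_price:.2f}, quantity: {equilibrium_quantity:.2f}")
# Shrinking supply (lowering c) raises the equilibrium price; expanding it lowers the price.
```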

Now, we will explore the most relevant economic concepts in the context of the Economics of Data.

Scarcity
Scarcity refers to limitations: insufficient resources, goods, or abilities to achieve the desired ends. Figuring out ways to make the best use of scarce resources, or to find alternatives, is fundamental to economics.

Figure 1: Scarcity Ramifications

Scarcity is probably the heart of the economics discussion and ties directly to the law of supply and demand. Organizations do not have unlimited financial, human, or time resources; consequently, as we discussed previously, they must prioritize their data and analytic resources against their best opportunities. At its heart, scarcity forces organizations to do two things that they do not do well: prioritize and focus (see "Big Data Success: Prioritize 'Important' Over 'Urgent'").

Scarcity plays out in the inability, or the unwillingness, of the organization to share all of its data across all of its business units. For some business units, scarcity drives their value to the organization; that is, he who owns the data owns the power. This short-sighted mentality manifests itself across organizations as data silos and IT "Shadow Spend." For example, if you are a financial services organization trying to predict your customers' lifetime value, having analytics that optimize individual business units (checking, savings, retirement, credit cards, mortgage, car loans, wealth management) without optimizing the larger business objective (predicting customer lifetime value) could easily lead to suboptimal or even wrong decisions about which customers to prioritize, with what offers, at what times, through what channels.

Scarcity has the biggest impact on the prioritization and optimization of scarce data and analytic resources, including:

  • Are your IT resources focused on capturing or acquiring the most important data in support of the organization's key business initiatives?
  • Are your data science resources focused on the development of the top priority analytics?
  • Does your technical and cultural environment support and even reward the capture, refinement, and re-use of the analytic results across multiple business units?

Consequently, the ability to prioritize (see "Prioritization Matrix: Aligning Business and IT On The Big Data Journey") and to carefully balance the laws of supply and demand is critical to ensuring that your data and analytics resources are directed at the "optimal" projects.
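
One simple way to act on that prioritization is sketched below; this is only an illustration of the value-versus-feasibility idea behind the matrix, with invented use cases and scores, not the methodology from the referenced blog:

```python
# Hypothetical use-case scores on the two prioritization-matrix axes:
# business value and implementation feasibility, each rated 1-10.
use_cases = [
    ("customer lifetime value", 9, 6),
    ("fraud detection",         8, 8),
    ("campaign attribution",    5, 9),
]

# Rank by the product of the two scores; scarce resources go to the top of the list.
ranked = sorted(use_cases, key=lambda uc: uc[1] * uc[2], reverse=True)
for name, value, feasibility in ranked:
    print(f"{name}: value={value}, feasibility={feasibility}, score={value * feasibility}")
```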

Postponement Theory
Postponement is a decision to postpone a decision (which is itself a decision). It can occur when one party seeks to gain additional information about the decision, to win better terms from the other party, or both.

Postponement has the following ramifications from an economics of data perspective:

  • Case #1: Organizations may decide to postpone a decision in order to gather more data and/or build more accurate analytics that dramatically improve the probability of making a "better" decision
  • Case #2: People and organizations may postpone a decision in order to get better terms especially given certain time constraints (e.g., car dealers get very aggressive with their terms near the end of the quarter)

While Case #2 may not have an impact on the economics of your organization's data and analytics, Case #1 has a direct impact. In order to make a postponement decision, organizations need to understand the following (a simple expected-cost comparison is sketched after the list below):

  • What is the estimated effectiveness of the current decision given Type I/Type II decision risks (where a Type I error is a "False Positive" error and a Type II error is a "False Negative" error)? See "Understanding Type I and Type II Errors" for more details on Type I/Type II errors.
  • What data might be needed to improve the effectiveness of that decision?
  • How much more accurate can the decision be made given these new data sources and additional data science time?
  • What are the risks of Type I/Type II errors (the costs associated with making the wrong decision)?
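
To make the postponement calculus concrete, here is a minimal expected-cost sketch; every probability and dollar figure below is invented for illustration:

```python
# Hypothetical expected-cost comparison for "decide now" vs. "postpone and
# gather more data"; all probabilities and dollar figures are illustrative.

COST_TYPE1 = 50_000       # cost of acting on a false positive
COST_TYPE2 = 200_000      # cost of missing a true case (false negative)
COST_OF_WAITING = 10_000  # cost of the delay itself (lost time, forgone terms)

def expected_cost(p_type1: float, p_type2: float, extra: float = 0.0) -> float:
    """Expected cost of a decision given its error probabilities plus fixed costs."""
    return p_type1 * COST_TYPE1 + p_type2 * COST_TYPE2 + extra

decide_now = expected_cost(p_type1=0.10, p_type2=0.15)                  # current analytics
postpone = expected_cost(p_type1=0.05, p_type2=0.06, extra=COST_OF_WAITING)  # with new data

print(f"Decide now: ${decide_now:,.0f}   Postpone: ${postpone:,.0f}")
print("Postponing pays off" if postpone < decide_now else "Decide now")
```

If the improvement in Type I/Type II error rates outweighs the cost of waiting, postponement is the economically rational choice; otherwise, decide now.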

Efficiency
Efficiency is a relationship between ends and means. When we call a situation inefficient, we are claiming that we could achieve the desired ends with less means, or that the means employed could produce more of the ends desired.

Data and analytics play a major role in driving efficiency improvements by identifying operational deficiencies and proposing recommendations (prescriptive analytics) on how to improve operational efficiency.

The aggregation of the operational insights gained from efficiency improvements might also lead to new monetization opportunities by enabling the organization to aggregate usage patterns across all customers and business constituents. For example, from the aggregated performance data, organizations could create benchmark, share, and index calculations that customers and partners could use to measure their own efficiency and to set goals around efficiency optimization.
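
As one hypothetical illustration of that aggregation play, the sketch below computes a simple mean benchmark across customers and indexes each customer's efficiency against it (customer names and figures are invented):

```python
# Hypothetical: index each customer's operational efficiency against the
# aggregate benchmark computed from all customers' performance data.

efficiency = {            # output per unit of input, illustrative figures
    "customer_a": 0.82,
    "customer_b": 0.74,
    "customer_c": 0.91,
}

benchmark = sum(efficiency.values()) / len(efficiency)   # simple mean benchmark

for name, value in efficiency.items():
    index = 100 * value / benchmark                      # 100 = at benchmark
    print(f"{name}: efficiency index {index:.0f}")
```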

Multiplier Effect
The multiplier effect refers to the increase in final income arising from any new injection of spending. The size of the multiplier depends upon households' marginal propensity to consume (MPC) or, equivalently, their marginal propensity to save (MPS).

The Multiplier Effect is one of the most important concepts developed by J.M. Keynes to explain the determination of income and employment in an economy. The theory of the multiplier has been used to explain the cumulative upward and downward swings of the trade cycles that occur in a free-enterprise capitalist economy. When investment in an economy rises, it can have a multiple and cumulative effect on national income, output and employment.

The multiplier is, therefore, the ratio of the increment in income to the increment in investment.
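
In symbols, the standard Keynesian formulation (consistent with the multiplier sources listed below) is:

```latex
k = \frac{\Delta Y}{\Delta I} = \frac{1}{1 - \mathrm{MPC}} = \frac{1}{\mathrm{MPS}}
```

For example, if households spend 80 cents of each additional dollar of income (MPC = 0.8), then k = 1 / (1 - 0.8) = 5, so a $1 million injection of investment ultimately raises national income by roughly $5 million.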

When applied to our thinking about the Economics of Data, the multiplier effect embodies the fact that our efforts to develop a new data source, or derived analytic measure, could have that same multiplier effect if the new data/analytics were to be leveraged beyond the initial project.

For example, when CPG manufacturers worked with retailers to implement the now-ubiquitous UPC standard in the early 1980s, their primary motivation was a desire to drive more consistent pricing at the cash register. Few imagined the knock-on benefits that would accrue from a much deeper understanding of actual product movement through the supply chain, let alone the shift in the balance of power from CPG manufacturers to retailers that subsequently ensued!

Figure 2: Multiplier Effect

Price Elasticity
Price elasticity of demand is a quantitative measure of consumer behavior that indicates how the quantity demanded of a product or service changes as its price increases or decreases. Price elasticity of demand is calculated by dividing the percent change in the quantity demanded by the percent change in price.
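
In symbols:

```latex
E_d = \frac{\%\,\Delta Q_d}{\%\,\Delta P}
```

Demand is called inelastic when |E_d| < 1 and elastic when |E_d| > 1; if a 10% price increase reduces the quantity demanded by only 2%, for instance, then E_d = -0.2 and demand is inelastic, which is exactly the situation described for data science salaries below.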

In today's big data environment, the price of data science resources (i.e., their salaries) seems almost price inelastic ("inelastic" describes the situation in which the quantity demanded or supplied of a good or service is unaffected when the price of that good or service changes). That means the demand for data science resources is only slightly affected when the price of those resources increases.

This price inelasticity of data science resources can only be addressed in a few ways: train (and really certify) more data scientists, or dramatically improve the capabilities and ease of use of data science tools.

However, there is another option:  train your business users to "think like a data scientist."  The key to this process is training your business users to embrace the power of "might" in collaborating with the data science team to identify those variables and metrics that might be better predictors of performance.  We have now seen across a number of projects how coupling the creative thinking of the business users with the data scientists can yield dramatically better predictions (see forthcoming blog:  "Data Science: Identifying Variables That Might Be Better Predictors").

The "Thinking Like A Data Scientist" process will uncover a wealth of new data sources that might yield better predictors of performance.  It is then up to the data science team to employ their different data transformation, data enrichment and analytic algorithms to determine which variables and metrics are better predictors of performance.

Capital
Capital is already-produced durable goods and assets, or any non-financial asset that is used in the production of goods or services. Capital is one of the three factors of production, the others being land and labor.

Adam Smith defined capital as "that part of a man's stock which he expects to afford him revenue".  I like Adam Smith's definition because the ultimate economic goal of data and analytics is to "afford organizations revenue."  And while it may be possible to generate that revenue through the sale of data and analytics, for most organizations data and analytics as capital get converted into revenue in four ways:

  • Driving the ongoing optimization of key business processes (e.g., reducing fraud by 3% annually, increasing customer retention by 2.5% annually)
  • Reducing exposure to risk through the management of security, compliance, regulation, and governance in order to avoid security breaches, litigation, fines, and theft, and to build customer trust and loyalty while ensuring business continuity and availability
  • Uncovering new revenue opportunities through superior customer, product, and operational insights that can identify unmet customer, partner, and market needs
  • Delivering a more compelling, more prescriptive customer experience that not only increases customer satisfaction and advocacy but also increases the organization's success in recommending new products and services to the best-qualified, highest-potential customers and prospects

Probably the most important economic impact on data and analytics is the role of human capital. Economists regard expenditures on education, training, and medical care as investments in human capital. They are called human capital because people cannot be separated from their knowledge, skills, health, or values in the way they can be separated from their financial and physical assets. These human investments can raise earnings, improve health, or add to a person's good habits over a lifetime. But maybe more importantly, an organization's human capital can be transformed to "think differently" about the application of data and analytics to power the organization's business models.

Summary
As my friend Jeff Abbott said after reviewing this blog: "What did I do wrong to have to review this blog?"

While the economic concepts discussed in this blog likely do not apply to your day-to-day jobs, more and more I expect that the big data (data and analytics) conversation will center on basic economic concepts as organizations seek to ascertain the economic value of their data and analytics. Data and analytics exhibit unusual behaviors from an asset and currency perspective, and applying economic concepts to these behaviors may help organizations as they seek to prioritize and optimize their data and analytic investments.

So, sorry for bringing back bad college memories about your economics classes, but hey, no one said that big data was going to be only fun!

Sources:

http://www.econlib.org/library/Topics/HighSchool/KeyConcepts.html
http://www.econlib.org/library/Topics/HighSchool/Scarcity.html
http://www.economicsdiscussion.net/keynesian-economics/keynes-theory/keynes-theory-of-investment-multiplier-with-diagram/10363
http://www.tutor2u.net/economics/reference/multiplier-effect
http://www.investopedia.com/university/economics/economics3.asp
http://www.econlib.org/library/Topics/HighSchool/ElasticityofDemand.html
http://www.econlib.org/library/Topics/HighSchool/HumanCapital.html
[1] http://www.dictionary.com/browse/economics
[2] http://www.investopedia.com/terms/e/economics.asp?lgl=no-infinite

The post Data and Economics 101 appeared first on InFocus Blog | Dell EMC Services.

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business”, is responsible for setting the strategy and defining the Big Data service line offerings and capabilities for the EMC Global Services organization. As part of Bill’s CTO charter, he is responsible for working with organizations to help them identify where and how to start their big data journeys. He has written several white papers, is an avid blogger, and is a frequent speaker on the use of Big Data and advanced analytics to power organizations’ key business initiatives. He also teaches the “Big Data MBA” at the University of San Francisco School of Management.

Bill has nearly three decades of experience in data warehousing, BI and analytics. Bill authored EMC’s Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements, and co-authored with Ralph Kimball a series of articles on analytic applications. Bill has served on The Data Warehouse Institute’s faculty as the head of the analytic applications curriculum.

Previously, Bill was the Vice President of Advertiser Analytics at Yahoo and the Vice President of Analytic Applications at Business Objects.
