


Hadooplicability

Hadoop adoption keeps growing, and more and more organizations are using it for a variety of solutions. Some companies replace their existing data stores with Hive and HBase. Some firms use Mahout for machine learning. Others are building new applications on the Hadoop platform from the ground up. The critical question now is where Hadoop can and should be used.

Hadooplicability is a measure of Hadoop's applicability. It helps users find the areas where Hadoop can be leveraged most effectively. While Hadoopability, as defined in an earlier post on this blog, deals with suitability primarily from a technology standpoint, Hadooplicability is more about the business case for Hadoop from a user perspective. There are many ways to assess and articulate why Hadoop is good for business, such as drivers, imperatives, benefits, impacts and cost. To keep it simple, I classify the major usage scenarios and use cases into 3 categories: Bad, Innovative and Good (BIG).
  • Find bad stuff
    • Identify threats and fraud to reduce risk and minimize loss.
    • Analyze complaints and unsatisfactory comments posted on social networks.
    • E.g. fraud detection in insurance claims; trade surveillance for illegal transactions; customer churn analysis in telco to retain customers; detection of fraudulent activity for anti-money laundering; spam filtering.
  • Strengthen good things
    • Understand customers in more depth to serve better.
    • Study user behavior to improve services and CRM.
    • E.g. sentiment analysis; examination of spending habits and preferences; inspection of search attempts and clickthroughs to enhance search quality, ranking, display order and relevance.
  • Drive innovative advancement
    • Take proactive measures to gain a competitive advantage.
    • Boost the user experience at the individual level.
    • E.g. ad targeting in one-to-one marketing; recommendations on shopping items and travel plans; pattern finding in Big Data mining; point-of-sale transaction analysis to correlate multiple factors like weather, local news, major events, etc.

In a nutshell, Hadooplicability groups the applicable uses of Hadoop in a structured manner. Each category is further divided into subcategories for different usage patterns, so that more granular use cases can be handled.
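For readers who like to see a classification written down, here is a minimal Python sketch of the BIG grouping. It is only an illustration: the names `Category`, `USE_CASES` and `group_by_category` are hypothetical, and the short labels are abbreviations of the examples listed above, not an official taxonomy.

```python
from enum import Enum


class Category(Enum):
    """The three BIG categories named in this post."""
    BAD = "Find bad stuff"
    INNOVATIVE = "Drive innovative advancement"
    GOOD = "Strengthen good things"


# Use cases abbreviated from the examples in the post.
# The labels themselves are illustrative assumptions.
USE_CASES = {
    "fraud detection": Category.BAD,
    "trade surveillance": Category.BAD,
    "customer churn analysis": Category.BAD,
    "spam filtering": Category.BAD,
    "sentiment analysis": Category.GOOD,
    "search quality": Category.GOOD,
    "ad targeting": Category.INNOVATIVE,
    "recommendations": Category.INNOVATIVE,
    "point-of-sale analysis": Category.INNOVATIVE,
}


def group_by_category(use_cases):
    """Return a dict mapping each Category to a sorted list of its use-case labels."""
    grouped = {category: [] for category in Category}
    for name, category in use_cases.items():
        grouped[category].append(name)
    return {category: sorted(names) for category, names in grouped.items()}


if __name__ == "__main__":
    for category, names in group_by_category(USE_CASES).items():
        print(f"{category.value}: {', '.join(names)}")
```

Subcategories could be modeled the same way, for example by mapping each label to a (category, subcategory) pair, though the post does not prescribe any particular structure.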

For more information, please contact Tony Shan ([email protected]). ©Tony Shan. All rights reserved.



Tony Shan works as a senior consultant and advisor at a global applications and infrastructure solutions firm, helping clients realize the greatest value from their IT. Shan is a renowned thought leader and technology visionary with a number of years of field experience and guru-level expertise in cloud computing, Big Data, Hadoop, NoSQL, social, mobile, SOA, BI, technology strategy, IT roadmapping, systems design, architecture engineering, portfolio rationalization, product development, asset management, strategic planning, process standardization, and Web 2.0. He has directed the lifecycle R&D and buildout of large-scale, award-winning distributed systems on diverse platforms for Fortune 100 companies and the public sector, including IBM, Bank of America, Wells Fargo, Cisco, Honeywell, Abbott, etc.

Shan is an inventive expert with a proven track record of influential innovations such as Cloud Engineering. He has authored dozens of top-notch technical papers on next-generation technologies and over ten books that won multiple awards. He is a frequent keynote speaker and Chair/Panel/Advisor/Judge/Organizing Committee in prominent conferences/workshops, an editor/editorial advisory board member of IT research journals/books, and a founder of several user groups, forums, and centers of excellence (CoE).
