Welcome!

Machine Learning Authors: Carmen Gonzalez, Elizabeth White, William Schmarzo, Mark Ross-Smith, Dana Gardner

Related Topics: Containers Expo Blog, Java IoT, Microservices Expo, Agile Computing, @CloudExpo, Apache

Containers Expo Blog: Blog Feed Post

Object Storage Not Yet Defined

Agreed that object storage platforms scale better than file systems & NAS

The ExecEvent Object Storage Summit earlier this month continued to generate buzz on the industry, which is very exciting. Amplidata was represented – in spirit – at the Summit by our partners Intel and Quantum; due to an insane travel and show schedule this fall that kept us from attending personally.  We’re grateful for the mention in Storage Switzerland’s sponsor briefing articles. Very cool! With all the great stuff that has been happening for Amplidata lately, including the awesome performance test results by Howard Marks, we felt a bit like we were missing our own birthday party. We’ll be there next time!

The event fostered a few “What is Object Storage?” posts from, amongst others, George Crump. Jim O’Reilly also posted a very interesting article, although I’m not sure if he was at the event. If he wasn’t, he should be next time!

Both articles add to the body of knowledge that is rapidly evolving on what object storage is, and why customers should adopt it – so, every article helps. With a topic as technical as object storage, it’s easy to evangelize with a deep technical dive.  But that misses the “elegant simplicity” point.  Hence we love George’s use of the car park analogy which we ourselves often embrace.  His article was a helpful at-a-glance overview.  On a more technical level, Jim’s explanation of such concepts as immutable blobs, “the original version is the only version”, objects still look like files etc. offer more on how object storage really works. George’s analysis on how “Objects are given unique ID numbers” is what’s missing in Jim’s article. I guess, what we’re saying is “read both articles.”

But read them critically, and you will see that we’re not there yet. As you can read in Jim’s article, the paradigm has been around much longer than many of us know and we’re not complete in defining the best use cases, implementations, architectures, etc. For example, I’m not at all sure about the reduced metadata George writes about. I believe that over time, as we start using richer applications, we will be storing more metadata, not less. To me, Jim’s statement “To be an object, a blob of data needs a much more detailed descriptor record than what file systems use.” is more accurate.

Both articles also cover the “why” of Object Storage. I’m not sure I see the use of Jim’s deduplication paragraph, and I think we are missing erasure coding as an alternative to RAID in his article (replication can be expensive too!). Jim accurately mentions that block storage was I/O focused, but omits the exceptional throughput performance some of the object stores deliver. A good thing is that Jim sees the scalability, flexibility and cost-saving opportunities. Finally, I very much like his use cases: Google Picasa, Amazon S3, Genome etc. and it is very interesting to read that Jim sees potential for object storage in the Big Data analytics space.

So back to George’s take on why we need object storage. Agreed that object storage platforms scale better than file systems & NAS but, again, not so much because of the metadata. File systems have different challenges, such as the granularity of the hardware, limitations on numbers of files or the number of levels in the hierarchy. Distributed file systems tried to solve some of these issues, but object storage is just a much simpler approach. Agreed that adding NAS heads is an expensive and not so great solution!

The second topic I thought was interesting was the issue of “bit rot”. Bit rot is a real problem and will lead to data loss with traditional storage technologies, but not every object store will solve that. How I understood it is that it is the underlying data protection scheme that solves the problem of bit rot, not necessarily Object Storage. Erasure Coding detects bit rot and prevents data loss.  I don’t think you could restore the content of an object using the identifier, but maybe there is some really cool technology out there that I don’t know of. As George wrote “The storage system does not need an elaborate RAID protection algorithm nor do its administrators need to suffer through long RAID rebuild cycles”, I think he actually alludes to Erasure Coding but didn’t want to go that deep in this article.

Another interesting point in George’s article is the issue with backups. Once you go into the petabyte range, it becomes very unwieldy to backup data. He mentions the backup window, but add to that the overhead cost. George promotes using the unique IDs to make sure “that there are always copies of each object available on-site and off-site.” Again with the proper underlying protection schemes (erasure coding) you can rule out backups altogether!

I’m sure both George and Jim will appreciate the feedback – I fully agree with the benefits object storage brings to track iterations of files and the paragraph on geo dispersion, which we have termed geo-spreading. Finally, I hope to read some more of George’s thoughts about how object storage can help to monetize archived data as that, to me, is a key argument for this new but then again not so new storage paradigm. This is obviously not the end of the discussion; a lot will and needs to be said about this new paradigm. I’m looking forward to attending the next Object Storage events…

Read the original blog entry...

More Stories By Tom Leyden

Tom Leyden is VP Product Marketing at Scality. Scality was founded in 2009 by a team of entrepreneurs and technologists. The idea wasn’t storage, per se. When the Scality team talked to the initial base of potential customers, the customers wanted a system that could “route” data to and from individual users in the most scalable, efficient way possible. And so began a non-traditional approach to building a storage system that no one had imagined before. No one thought an object store could have enough performance for all the files and attachments of millions of users. No one thought a system could remain up and running through software upgrades, hardware failures, capacity expansions, and even multiple hardware generations coexisting. And no one believed you could do all this and scale to petabytes of content and billions of objects in pure software.

@CloudExpo Stories
One of the hottest areas in cloud right now is DRaaS and related offerings. In his session at 16th Cloud Expo, Dale Levesque, Disaster Recovery Product Manager with Windstream's Cloud and Data Center Marketing team, will discuss the benefits of the cloud model, which far outweigh the traditional approach, and how enterprises need to ensure that their needs are properly being met.
IoT is at the core or many Digital Transformation initiatives with the goal of re-inventing a company's business model. We all agree that collecting relevant IoT data will result in massive amounts of data needing to be stored. However, with the rapid development of IoT devices and ongoing business model transformation, we are not able to predict the volume and growth of IoT data. And with the lack of IoT history, traditional methods of IT and infrastructure planning based on the past do not app...
Up until last year, enterprises that were looking into cloud services usually undertook a long-term pilot with one of the large cloud providers, running test and dev workloads in the cloud. With cloud’s transition to mainstream adoption in 2015, and with enterprises migrating more and more workloads into the cloud and in between public and private environments, the single-provider approach must be revisited. In his session at 18th Cloud Expo, Yoav Mor, multi-cloud solution evangelist at Cloudy...
With major technology companies and startups seriously embracing IoT strategies, now is the perfect time to attend @ThingsExpo 2016 in New York. Learn what is going on, contribute to the discussions, and ensure that your enterprise is as "IoT-Ready" as it can be! Internet of @ThingsExpo, taking place June 6-8, 2017, at the Javits Center in New York City, New York, is co-located with 20th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry p...
The proper isolation of resources is essential for multi-tenant environments. The traditional approach to isolate resources is, however, rather heavyweight. In his session at 18th Cloud Expo, Igor Drobiazko, co-founder of elastic.io, drew upon his own experience with operating a Docker container-based infrastructure on a large scale and present a lightweight solution for resource isolation using microservices. He also discussed the implementation of microservices in data and application integrat...
In his General Session at DevOps Summit, Asaf Yigal, Co-Founder & VP of Product at Logz.io, will explore the value of Kibana 4 for log analysis and will give a real live, hands-on tutorial on how to set up Kibana 4 and get the most out of Apache log files. He will examine three use cases: IT operations, business intelligence, and security and compliance. This is a hands-on session that will require participants to bring their own laptops, and we will provide the rest.
In his session at 18th Cloud Expo, Sagi Brody, Chief Technology Officer at Webair Internet Development Inc., and Logan Best, Infrastructure & Network Engineer at Webair, focused on real world deployments of DDoS mitigation strategies in every layer of the network. He gave an overview of methods to prevent these attacks and best practices on how to provide protection in complex cloud platforms. He also outlined what we have found in our experience managing and running thousands of Linux and Unix ...
SYS-CON Events announced today that Dataloop.IO, an innovator in cloud IT-monitoring whose products help organizations save time and money, has been named “Bronze Sponsor” of SYS-CON's 20th International Cloud Expo®, which will take place on June 6-8, 2017, at the Javits Center in New York City, NY. Dataloop.IO is an emerging software company on the cutting edge of major IT-infrastructure trends including cloud computing and microservices. The company, founded in the UK but now based in San Fran...
Internet of @ThingsExpo, taking place June 6-8, 2017 at the Javits Center in New York City, New York, is co-located with the 20th International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. @ThingsExpo New York Call for Papers is now open.
"There's a growing demand from users for things to be faster. When you think about all the transactions or interactions users will have with your product and everything that is between those transactions and interactions - what drives us at Catchpoint Systems is the idea to measure that and to analyze it," explained Leo Vasiliou, Director of Web Performance Engineering at Catchpoint Systems, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York Ci...
The 20th International Cloud Expo has announced that its Call for Papers is open. Cloud Expo, to be held June 6-8, 2017, at the Javits Center in New York City, brings together Cloud Computing, Big Data, Internet of Things, DevOps, Containers, Microservices and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportunity. Submit your speaking proposal ...
WebRTC is the future of browser-to-browser communications, and continues to make inroads into the traditional, difficult, plug-in web communications world. The 6th WebRTC Summit continues our tradition of delivering the latest and greatest presentations within the world of WebRTC. Topics include voice calling, video chat, P2P file sharing, and use cases that have already leveraged the power and convenience of WebRTC.
20th Cloud Expo, taking place June 6-8, 2017, at the Javits Center in New York City, NY, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy.
Discover top technologies and tools all under one roof at April 24–28, 2017, at the Westin San Diego in San Diego, CA. Explore the Mobile Dev + Test and IoT Dev + Test Expo and enjoy all of these unique opportunities: The latest solutions, technologies, and tools in mobile or IoT software development and testing. Meet one-on-one with representatives from some of today's most innovative organizations
@DevOpsSummit taking place June 6-8, 2017 at Javits Center, New York City, is co-located with the 20th International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. @DevOpsSummit at Cloud Expo New York Call for Papers is now open.
SYS-CON Events announced today that Catchpoint Systems, Inc., a provider of innovative web and infrastructure monitoring solutions, has been named “Silver Sponsor” of SYS-CON's DevOps Summit at 18th Cloud Expo New York, which will take place June 7-9, 2016, at the Javits Center in New York City, NY. Catchpoint is a leading Digital Performance Analytics company that provides unparalleled insight into customer-critical services to help consistently deliver an amazing customer experience. Designed ...
DevOps is being widely accepted (if not fully adopted) as essential in enterprise IT. But as Enterprise DevOps gains maturity, expands scope, and increases velocity, the need for data-driven decisions across teams becomes more acute. DevOps teams in any modern business must wrangle the ‘digital exhaust’ from the delivery toolchain, "pervasive" and "cognitive" computing, APIs and services, mobile devices and applications, the Internet of Things, and now even blockchain. In this power panel at @...
Data is the fuel that drives the machine learning algorithmic engines and ultimately provides the business value. In his session at Cloud Expo, Ed Featherston, a director and senior enterprise architect at Collaborative Consulting, discussed the key considerations around quality, volume, timeliness, and pedigree that must be dealt with in order to properly fuel that engine.
"A lot of times people will come to us and have a very diverse set of requirements or very customized need and we'll help them to implement it in a fashion that you can't just buy off of the shelf," explained Nick Rose, CTO of Enzu, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
The WebRTC Summit New York, to be held June 6-8, 2017, at the Javits Center in New York City, NY, announces that its Call for Papers is now open. Topics include all aspects of improving IT delivery by eliminating waste through automated business models leveraging cloud technologies. WebRTC Summit is co-located with 20th International Cloud Expo and @ThingsExpo. WebRTC is the future of browser-to-browser communications, and continues to make inroads into the traditional, difficult, plug-in web co...