By Kato Mivule
Database Systems
Online retailer Amazon sent shock waves into the Database tech community in 2007 with the launching of the Amazon SimpleDB [1] that led some to question the future of Relational Databases and in particular the future of SQL. [2] Microsoft also launched launched a service similar to Amazon’s SimpleDB in 2008, signaling a great interest in the new database architecture that Amazon employed with the SimpleDB. [3]
SimpleDB is based on a Distributed System Architecture that stores data in excess on devices across the Distributed Architecture of the same Sphere.[4] In the case of Amazon’s SimpleDB, data is stored on various distributed servers in the Amazon Domain.
SimpleDB is based on Amazon’s S3 – Simple Storage Service in which users are granted unlimited data storage capacity at very inexpensive rates. Data in the S3 system is stored across a number of servers or storage devices in the Amazon scalable distributed network.[5] SimpleDB and S3 are extensions of the Cloud Computing Architecture in which computing resources, software applications, and data are shared across the web on a demand basis.
Software resources such as Applications and Data are stored on distributed servers so when a user demands use of a Word Processing Application, that instance is provided to the user via the web browser. So, the user need not keep or store applications or data on their computing devices but will depend on the reliability and availability of the internet for access. [6] [7]
Amazon defines Cloud Computing Architectures as the inherent computing base in which Applications are utilized only when as demanded or needed. Once the demanded application is no longer needed or done with, the resources being utilized are discarded. [8] It is in this type of Cloud Computing architecture that SimpleDB is built on that Amazon refers to as the Elastic Compute Cloud or EC2 because the on-demand application utilizes resources elastically. [8]
Therefore SimpleDB is mainly a distributed Database tool that stores and retrieves user data across Amazon’s distributed servers or cloud computing architecture giving it the scalability advantage. SimpleDB does not follow the traditional database relational model in which data is stored on a central server.
As InformIT notes in their description of Amazon’s SimpleDB, [1] there are a number of problems with the SimpleDB database Technology, with the most trouble being a lack of schema. [1] This lack of structure in database definitions and constraints provides problems when it comes to data manipulation. For example since data is not defined, it becomes difficult carrying out mathematical calculations on data, even as InformIT notes that data in SimpleDB is stored as a variable characters. [1] This is the greatest setback to SimpleDB that out weighs any advantages.
As InformIT notes, SimpleDB provides scalability. [1] SimpleDB therefore would be best suited for quick data retrieval mechanisms rather than as a serious data management application. As one computing blogger noted, data integrity is never warranted, database aggregate transactions are not fostered and very difficult to code, and that Relational Database Technology has greatly improved in the area of scalability to the point that web applications like Facebook and LiveJournal use employ Relational Database Technology like MySQL. [9]
The issue of Security and Privacy are also of great concern when it comes to SimpleDB. It is easy to attain some degree of privacy when data is managed of specific known servers. Yet with the SimpleDB architecture, data is stored across many devices that should one device be compromised, some portion of data is at risk.
In conclusion, Relational Database Models will stay and while Amazon’s SimpleDB might not be the next big thing, it could be employed for non critical data management for fast storage and retrieval. Say for example, if one would simply want to store web address that are none critical, SimpleDB could be utilized in such circumstances.
References
[1] “Amazon Launches New Database Service — Database — InformationWeek.” [Online]. Available: http://www.informationweek.com/news/software/soa/showArticle.jhtml?articleID=204803008. [Accessed: 10-Sep-2010].
[2] “InformIT: Introducing Amazon SimpleDB > Abandoning the Relational Model?.” [Online]. Available: http://www.informit.com/articles/article.aspx?p=1619309&seqNum=3. [Accessed: 10-Sep-2010].
[3] “Microsoft launches its alternative to Amazon’s SimpleDB | ZDNet.” [Online]. Available: http://www.zdnet.com/blog/microsoft/microsoft-launches-its-alternative-to-amazons-simpledb/1245. [Accessed: 10-Sep-2010].
[4] James Murty, “Programming Amazon Web Services: S3, EC2, SQS, FPS, and SimpleDB, O’Reilly Series, Editionillustrated”. O’Reilly Media, Inc., 2008, ISBN 0596515812, 9780596515812, page 52.
[6] “Amazon Simple Storage Service (Amazon S3).” [Online]. Available: http://aws.amazon.com/s3/. [Accessed: 10-Sep-2010].
[5] “Amazon S3 – Wikipedia, the free encyclopedia.” [Online]. Available: http://en.wikipedia.org/wiki/Amazon_S3. [Accessed: 10-Sep-2010].
[6] Gregory Wilshusen, “Information Security: Federal Guidance Needed to Address Control Issues with Implementing Cloud Computing”. DIANE Publishing, ISBN 1437935648, 9781437935646, Page 10.
[7] “Cloud computing – Wikipedia, the free encyclopedia.” [Online]. Available: http://en.wikipedia.org/wiki/Cloud_computing#cite_note-23. [Accessed: 11-Sep-2010].
[8] “Amazon Web Services Developer Community : Building GrepTheWeb in the Cloud, Part 1: Cloud Architectures.” [Online]. Available: http://developer.amazonwebservices.com/connect/entry.jspa?externalID=1632&categoryID=100. [Accessed: 11-Sep-2010].
[9] “Top 10 Reasons to Avoid the SimpleDB Hype — ryanpark.org.” [Online]. Available: http://www.ryanpark.org/2008/04/top-10-avoid-the-simpledb-hype.html. [Accessed: 11-Sep-2010].