AN EXPLORATION OF OBJECT ORIENTED DATABASE MANAGEMENT SYSTEMS By Dare Obasanjo Introduction: Why Object Oriented Database Management Systems? In today's world, Client-Server applications that rely on a database on the server as a data store while servicing requests from multiple clients are quite commonplace. Most of these applications use a Relational Database Management System (RDBMS) as their data store while using an object oriented programming language for development. This causes a certain inefficency as objects must be mapped to tuples in the database and vice versa instead of the data being stored in a way that is consistent with the programming model. The "impedance mismatch" caused by having to map objects to tables and vice versa has long been accepted as a necessary performance penalty. This paper is aimed at seeking out an alternative that avoids this penalty. The purpose of this paper is to provide answers to the following questions What is an Object Oriented Database Management System (OODBMS)? Is an OODBMS a viable alternative to an RDBMS? What are the tradeoffs and benefits of using an OODBMS over an RDBMS? What does code that interacts with an OODBMS look like? Overview of Object Oriented Database Management Systems An OODBMS is the result of combining object oriented programming principles with database management principles. Object oriented programming concepts such as encapsulation, polymorphism and inheritance are enforced as well as database management concepts such as the ACID properties (Atomicity, Consistency, Isolation and Durability) which lead to system integrity, support for an ad hoc query language and secondary storage management systems which allow for managing very large amounts of data. The Object Oriented Database Manifesto [Atk 89] specifically lists the following features as mandatory for a system to support before it can be called an OODBMS; Complex objects, Object identity, Encapsulation , Types and Classes , Class or Type Hierarchies, Overriding,overloading and late binding, Computational completeness , Extensibility, Persistence , Secondary storage management, Concurrency, Recovery and an Ad Hoc Query Facility. From the aforementioned description, an OODBMS should be able to store objects that are nearly indistinguishable from the kind of objects supported by the target programming language with as little limitation as possible. Persistent objects should belong to a class and can have one or more atomic types or other objects as attributes. The normal rules of inheritance should apply with all their benefits including polymorphism, overridding inherited methods and dynamic binding. Each object has an object identifier (OID) which used as a way of uniquely identifying a particuler object. OIDs are permanent, system generated and not based on any of the member data within the object. OIDs make storing references to other objects in the database simpler but may cause referential intergrity problems if an object is deleted while other objects still have references to its OID. An OODBMS is thus a full scale object oriented development environment as well as a database management system. Features that are common in the RDBMS world such as transactions, the ability to handle large amounts of data, indexes, deadlock detection, backup and restoration features and data recovery mechanisms also exist in the OODBMS world. A primary feature of an OODBMS is that accessing objects in the database is done in a transparent manner such that interaction with persistent objects is no different from interacting with in-memory objects. This is very different from using an RDBMSs in that there is no need to interact via a query sub-language like SQL nor is there a reason to use a Call Level Interface such as ODBC, ADO or JDBC. Database operations typically involve obtaining a database root from the the OODBMS which is usually a data structure like a graph, vector, hash table, or set and traversing it to obtain objects to create, update or delete from the database. When a client requests an object from the database, the object is transferred from the database into the application's cache where it can be used either as a transient value that is disconnected from its representation in the database (updates to the cached object do not affect the object in the database) or it can be used as a mirror of the version in the database in that updates to the object are reflected in the database and changes to object in the database require that the object is refetched from the OODBMS. Comparisons of OODBMSs to RDBMSs There are concepts in the relational database model that are similar to those in the object database model. A relation or table in a relational database can be considered to be analogous to a class in an object database. A tuple is similar to an instance of a class but is different in that it has attributes but no behaviors. A column in a tuple is similar to a class attribute except that a column can hold only primitive data types while a class attribute can hold data of any type. Finally classes have methods which are computationally complete (meaning that general purpose control and computational structures are provided [McF 99]) while relational databases typically do not have computationally complete programming capabilities although some stored procedure languages come close. Below is a list of advantages and disadvantages of using an OODBMS over an RDBMS with an object oriented programming language. Advantages 1. Composite Objects and Relationships: Objects in an OODBMS can store an arbitrary number of atomic types as well as other objects. It is thus possible to have a large class which holds many medium sized classes which themselves hold many smaller classes, ad infinitum. In a relational database this has to be done either by having one huge table with lots of null fields or via a number of smaller, normalized tables which are linked via foreign keys. Having lots of smaller tables is still a problem since a join has to be performed every time one wants to query data based on the "Has-a" relationship between the entities. Also an object is a better model of the real world entity than the relational tuples with regards to complex objects. The fact that an OODBMS is better suited to handling complex,interrelated data than an RDBMS means that an OODBMS can outperform an RDBMS by ten to a thousand times depending on the complexity of the data being handled. 2. Class Hierarchy: Data in the real world is usually has hierarchical characteristics. The ever popular Employee example used in most RDBMS texts is easier to describe in an OODBMS than in an RDBMS. An Employee can be a Manager or not, this is usually done in an RDBMS by having a type identifier field or creating another table which uses foreign keys to indicate the relationship between Managers and Employees. In an OODBMS, the Employee class is simply a parent class of the Manager class. 3. Circumventing the Need for a Query Language: A query language is not necessary for accessing data from an OODBMS unlike an RDBMS since interaction with the database is done by transparently accessing objects. It is still possible to use queries in an OODBMS however. 4. No Impedence Mismatch: In a typical application that uses an object oriented programming language and an RDBMS, a signifcant amount of time is usually spent mapping tables to objects and back. There are also various problems that can occur when the atomic types in the database do not map cleanly to the atomic types in the programming language and vice versa. This "impedance mismatch" is completely avoided when using an OODBMS. 5. No Primary Keys: The user of an RDBMS has to worry about uniquely identifying tuples by their values and making sure that no two tuples have the same primary key values to avoid error conditions. In an OODBMS, the unique identification of objects is done behind the scenes via OIDs and is completely invisible to the user. Thus there is no limitation on the values that can be stored in an object. 6. One Data Model: A data model typically should model entities and their relationships, constraints and operations that change the states of the data in the system. With an RDBMS it is not possible to model the dynamic operations or rules that change the state of the data in the system because this is beyond the scope of the database. Thus applications that use RDBMS systems usually have an Entity Relationship diagram to model the static parts of the system and a seperate model for the operations and behaviors of entities in the application. With an OODBMS there is no disconnect between the database model and the application model because the entities are just other objects in the system. An entire application can thus be comprehensively modelled in one UML diagram. Disadvantages 1. Schema Changes: In an RDBMS modifying the database schema either by creating, updating or deleting tables is typically independent of the actual application. In an OODBMS based application modifying the schema by creating, updating or modifying a persistent class typically means that changes have to be made to the other classes in the application that interact with instances of that class. This typically means that all schema changes in an OODBMS will involve a system wide recompile. Also updating all the instance objects within the database can take an extended period of time depending on the size of the database. 2. Language Dependence: An OODBMS is typically tied to a specific language via a specific API. This means that data in an OODBMS is typically only accessible from a specific language using a specific API, which is typically not the case with an RDBMS. 3. Lack of Ad-Hoc Queries: In an RDBMS, the relational nature of the data allows one to construct ad-hoc queries where new tables are created from joining existing tables then querying them. Since it is currently not possible to duplicate the semantics of joining two tables by "joining" two classes then there is a loss of flexibility with an OODBMS. Thus the queries that can be performed on the data in an OODBMS is highly dependent on the design of the system. Who is currently using an OODBMS to handle mission critical data? The following information was gleaned from the ODBMS Facts website. The Chicago Stock Exchange manages stock trades via a Versant ODBMS. Radio Computing Services is the world's largest radio software company. Its product, Selector, automates the needs of the entire radio station -- from the music library, to the newsroom, to the sales department. RCS uses the POET ODBMS because it enabled RCS to integrate and organize various elements, regardless of data types, in a single program environment. The Objectivity/DB ODBMS is used as a data repository for system component naming, satellite mission planning data, and orbital management data deployed by Motorola in The Iridium System. The ObjectStore ODBMS is used in SouthWest Airline's Home Gate to provide self service to travelers through the Internet. Ajou University Medical Center in South Korea uses InterSystems' Cachè ODBMS to support all hospital functions including mission-critical departments such as pathology, laboratory, blood bank, pharmacy, and X-ray. The Large Hadron Collider at CERN in Switzerland uses an Objectivity DB. The database is currently being tested in the hundreds of terabytes at data rates up to 35 MB/second. As of November, 2000, the Stanford Linear Accelerator Center (SLAC) stored 169 terabytes of production data using Objectivity/DB. The production data is distributed across several hundred processing nodes and over 30 on-line servers. Interacting With An OODBMS Below are Java code samples for accessing a relational database and accessing an object database. Compare the size of the code in both examples. The examples are for an instant messaging application. 1. Validating a user. Java code accessing an ObjectStore™ database import import import import COM.odi.*; COM.odi.util.query.*; COM.odi.util.*; java.util.*; try { //start database session Session session = Session.create(null, null); session.join(); //open database and start transaction Database db = Database.open("IMdatabase", ObjectStore.UPDATE); Transaction tr = Transaction.begin(ObjectStore.READONLY); //get hashtable of user objects from DB OSHashMap users = (OSHashMap) db.getRoot("IMusers"); //get password and username from user String username = getUserNameFromUser(); String passwd = getPasswordFromUser(); //get user object from database and see if it exists and whether password is correct UserObject user = (UserObject) users.get(username); if(user == null) System.out.println("Non-existent user"); else if(user.getPassword().equals(passwd)) System.out.println("Successful login"); else System.out.println("Invalid Password"); //end transaction, close database and retain terminate session tr.commit(); db.close(); session.termnate(); } //exception handling would go here ... Java JDBC code accessing an IBM's DB2 Database™ import java.sql.*; import sun.jdbc.odbc.JdbcOdbcDriver; import java.util.*; try { //Launch instance of database driver. Class.forName("COM.ibm.db2.jdbc.app.DB2Driver").newInstance(); //create database connection Connection conn = DriverManager.getConnection("jdbc:db2:IMdatabase"); //get password and username from user String username = getUserNameFromUser(); String passwd = getPasswordFromUser(); //perform SQL query Statement sqlQry = conn.createStatement(); ResultSet rset = sqlQry.executeQuery("SELECT password from user_table WHERE username='" + username +"'"); if(rset.next()){ if(rset.getString(1).equals(passwd)) System.out.println("Successful login"); else System.out.println("Invalid Password"); }else{ System.out.println("Non-existent user"); } //close database connection sqlQry.close(); conn.close(); } //exception handling would go here ... There isn't much difference in the above examples although it does seem a lot clearer to perform operations on a UserObject instead of a ResultSet when validating the user. 2. Getting the user's contact list. Java code accessing an ObjectStore™ database import import import import COM.odi.*; COM.odi.util.query.*; COM.odi.util.*; java.util.*; try { /* start session and open DB, same as in section 1a */ //get hashmap of users from the DB OSHashMap users = (OSHashMap) db.getRoot("IMusers"); //get user object from database UserObject c4l = (UserObject) users.get("Carnage4Life"); UserObject[] contactList = c4l.getContactList(); System.out.println("This are the people on Carnage4Life's contact list"); for(int i=0; i < contactList.length; i++) System.out.println(contactList[i].toString()); //toString() prints fullname, username, online status and webpage URL /* close session and close DB, same as in section 1a */ }//exception handling code Java JDBC code accessing an IBM's DB2 Database™ import java.sql.*; import sun.jdbc.odbc.JdbcOdbcDriver; import java.util.*; try { /* open DB connection, same as in section 1b */ //perform SQL query Statement sqlQry = conn.createStatement(); ResultSet rset = sqlQry.executeQuery("SELECT fname, lname, user_name, online_status, webpage FROM contact_list, user_table" + "WHERE contact_list.owner_name='Carnage4Life' and contact_list.buddy_name=user_table.user_name"); System.out.println("This are the people on Carnage4Life's contact list"); while(rset.next()) System.out.println("Full Name:" + rset.getString(1) + " " + rset.getString(2) + " User Name:" + rset.getString(3) + " OnlineStatus:" + rset.getString(4) + " HomePage URL:" + rset.getString(5)); /* close DB connection, same as in section 1b */ }//exception handling code The benefits of using an OODBMS over an RDBMS in Java slowly become obvious. Consider also that if the data from the select needs to be returned to another method then all the data from the result set has to be mapped to another object (UserObject). 3. Get all the users that are online. Java code accessing an ObjectStore™ database import import import import COM.odi.*; COM.odi.util.query.*; COM.odi.util.*; java.util.*; try{ /* same as above */ //use a OODBMS query to locate all the users whose status is 'online' Query q = new Query = (UserObject.class, "onlineStatus.equals(\"online\""); Collection users = db.getRoot("IMusers"); Set onlineUsers = q.select(users); Iterator iter = onlineUsers.iterator(); // iterate over the results while ( iter.hasNext() ) { UserObject user = (UserObject) iter.next(); // send each person some announcement sendAnnouncement(user); } /* same as above */ }//exception handling goes here Java JDBC code accessing an IBM's DB2 Database™ import java.sql.*; import sun.jdbc.odbc.JdbcOdbcDriver; import java.util.*; try{ /* same as above */ //perform SQL query Statement sqlQry = conn.createStatement(); ResultSet rset = sqlQry.executeQuery("SELECT fname, lname, user_name, online_status, webpage FROM user_table WHERE online_status='online'"); while(rset.next()){ UserObject user = new UserObject(rset.getString(1),rset.getString(2),rset.getString(3) ,rset.getString(4),rset.getString(5)); sendAnnouncement(user); } /* same as above */ }//exception handling goes here List of Object Oriented Database Management Systems Proprietary Object Store O2 Gemstone Versant Ontos DB/Explorer ODBMS Ontos Poet Objectivity/DB EyeDB Open Source Ozone Zope FramerD XL2 Description of the MultiEdit Application MultiEdit allows multiple users, potentially on different machines to edit a file simultaneously. Each user has his or her own view of the file, and each view includes its own cursor. Users may enter text into (the same or different points of) the file simultaneously. It is an essential requirement of the application that the contents of the file must always be kept consistent with the actions of users. Javadocs For The Project Are Available This project uses a Client/Server architecture to handle the difficulty of maintaining consistency between multiple clients trying to edit the same document at once. Each document is an object of class ShareableDocument stored in an Object Oriented Database which is remotely accessible via a DocumentManager which sits on the server and handles client requests. Whenever a user needs to access a document it is loaded from the database by the DocumentManager and sent to them over the network. From then on whenever an edit is performed by the user the actual key stroke and the position of the cursor is sent to the server which updates an in memory copy of the object before broadcasting the event to all users who are currently accessing the document including the user that originally performed the edit. The main drawback of the above method is that the user who is typing the document will most likely experience a lag between when a character is typed and when it shows up on the GUI which is dependent on the speed of the network. Also if there is a network outage or similar error then the user cannot edit the document. Saves are simply requests to the server to persist its in memory copy of the document which is more efficient than sending the whole document to the server. ShareableDocuments are not saved unless explicitly specified by a user or when a user closes a document. The application also allows the user to lock entire ShareableDocuments which prevents others from modifying the documents but they can still see the edits being made by the owner of the lock in real-time Description of Major Classes in System User This is a representation of a user of the system which is stored in the ObjectStore™ database. PersistableRemoteObject This is an interface that describes operations that can be performed on a remote object that is shared by multiple users. Owner An interface that describes any entity that owns (i.e. created) a PersistableRemoteObject. ShareableDocument This is a representation of a document that can be editted by multiple users at once or locked by a single user. It also is stored in the ObjectStore™ database and implements the PersistableRemoteObject interface. RemoteObjectFactory This is an interface that describes the minimum amount of services that a server that manages PersistableRemoteObjects must provide. DocumentManager This is an interface describing the methods that are remotely accessible on an object that manages ShareableDocuments. It is a subclass of the RemoteObjectFactory interface. DocumentManagerImpl This is the actual class that handles managing access to ShareableDocuments by multiple users. It implements the DocumentManager interface and is a subclass of Java.rmi.UnicastRemoteObject which allows it to be accessed remotely. DocumentContext This maintains information on a ShareableDocument that is currently being accessed by one or more users. A table of these is kept in the DocumentManagerImpl class to keep track of all the ShareableDocuments that are being shared at once. ClientView This is the GUI that the user interacts with. ClientControl This does event handling for the ClientView. ClientModel This is the underlying model for the ClientView that contains data structures, maintains state and communicates with the server. It implements the ClientListener interface. ClientListener This is an interface that describes the methods that are remotely accessible on the ClientModel. The method that allows the client receive keystroke events from server is in this interface. ClientState This is an object that represents the current state of the ClientView. LoginDialog GUI used for logging on to the server. AddUserDialog GUI used for adding a new user to the system. DocumentSelectionDialog GUI used to select which document to edit. MultiEdit State Diagram Discarded Design Decisions and Potential Future Development of MultiEdit The original design called for a Paragraph object that would be a part of a ShareableDocument and would be the atomic unit capable of being locked by a user. Unfortunately the amount of semantic analysis required during runtime to constantly reparse the document to decide what textual divisions indicated a paragraph degraded performance to an unsatisfactory extent. Also a good solution is a non-trivial task. Certain considerations were made towards locking lines if they were being accessed by other users but this was outside the original specifications. Also originally each user updated their GUI before sending the key stroke to the server, this meant that if some other client had gotten to the server just before it and updated its GUI, then their representations of the document would beout of sync. The solution was to make sure that GUIs were not updated until the keystrokes had reached the server and were broadcast to all interested parties. Finally the decision to keep a copy of the document on the server instead of simply using the server to relay updates is so that when a new user requests a document that is already being accessed by two or more users, there is an effective way to give send the document without worrying about consistency issues. Conclusion The gains from using an OODBMS while developing an application using an OO programming language are many. The savings in development time by not having to worry about seperate data models as well as the fact that there is less code to write due to the lack of impedance mismatch is very attractive. In my opinion, there is little reason to pick an RDBMS over an OODBMS system for newapplication development unless there are legacy issues that have to be dealt with. This paper is the final part of my indepedent study supervised by Dr. Sham Navathe and Wai Gen Yee. Bibliography 1. OODBMS Facts. Barry & Associates. 29 April 2001<http://www.odbmsfacts.com/> 2. Atkinson, Malcolm et al. The Object-Oriented Database Manifesto. In Proceedings of the First International Conference on Deductive and ObjectOriented Databases, pages 223-40, Kyoto, Japan, December 1989 < http://www.cs.cmu.edu/People/clamen/OODBMS/Manifesto/htManifesto/Manif esto.html > 3. McFarland, Gregory, Andres Rudmik, and David Lange. Object-Oriented Database Management Systems Revisited. 31 January 1999 .<http://www.dacs.dtic.mil/techs/oodbms2/> © 2001 Dare Obasanjo
© Copyright 2024