COVER SHEET FOR PROPOSAL TO THE NATIONAL SCIENCE FOUNDATION PROGRAM ANNOUNCEMENT/SOLICITATION NO./CLOSING DATE/if not in response to a program announcement/solicitation enter NSF 04-2 FOR NSF USE ONLY NSF PROPOSAL NUMBER NSF 04-2 FOR CONSIDERATION BY NSF ORGANIZATION UNIT(S) 0447031 (Indicate the most specific unit known, i.e. program, division, etc.) ATM - CYBERINFRASTRUCTURE DATE RECEIVED NUMBER OF COPIES DIVISION ASSIGNED FUND CODE DUNS# FILE LOCATION (Data Universal Numbering System) 178341772 EMPLOYER IDENTIFICATION NUMBER (EIN) OR TAXPAYER IDENTIFICATION NUMBER (TIN) IS THIS PROPOSAL BEING SUBMITTED TO ANOTHER FEDERAL AGENCY? YES NO IF YES, LIST ACRONYM(S) SHOW PREVIOUS AWARD NO. IF THIS IS A RENEWAL AN ACCOMPLISHMENT-BASED RENEWAL 770150580 NAME OF ORGANIZATION TO WHICH AWARD SHOULD BE MADE ADDRESS OF AWARDEE ORGANIZATION, INCLUDING 9 DIGIT ZIP CODE Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing, CA. 950399644 Monterey Bay Aquarium Research Institute AWARDEE ORGANIZATION CODE (IF KNOWN) 4000069000 NAME OF PERFORMING ORGANIZATION, IF DIFFERENT FROM ABOVE ADDRESS OF PERFORMING ORGANIZATION, IF DIFFERENT, INCLUDING 9 DIGIT ZIP CODE PERFORMING ORGANIZATION CODE (IF KNOWN) IS AWARDEE ORGANIZATION (Check All That Apply) (See GPG II.C For Definitions) TITLE OF PROPOSED PROJECT MINORITY BUSINESS IF THIS IS A PRELIMINARY PROPOSAL WOMAN-OWNED BUSINESS THEN CHECK HERE Collaborative Research: Marine Metadata Interoperability REQUESTED AMOUNT PROPOSED DURATION (1-60 MONTHS) 397,843 $ SMALL BUSINESS FOR-PROFIT ORGANIZATION 12 REQUESTED STARTING DATE SHOW RELATED PRELIMINARY PROPOSAL NO. IF APPLICABLE 10/01/04 months CHECK APPROPRIATE BOX(ES) IF THIS PROPOSAL INCLUDES ANY OF THE ITEMS LISTED BELOW BEGINNING INVESTIGATOR (GPG I.A) HUMAN SUBJECTS (GPG II.D.6) DISCLOSURE OF LOBBYING ACTIVITIES (GPG II.C) Exemption Subsection PROPRIETARY & PRIVILEGED INFORMATION (GPG I.B, II.C.1.d) INTERNATIONAL COOPERATIVE ACTIVITIES: COUNTRY/COUNTRIES INVOLVED or IRB App. Date HISTORIC PLACES (GPG II.C.2.j) (GPG II.C.2.g.(iv).(c)) SMALL GRANT FOR EXPLOR. RESEARCH (SGER) (GPG II.D.1) VERTEBRATE ANIMALS (GPG II.D.5) IACUC App. Date PI/PD DEPARTMENT HIGH RESOLUTION GRAPHICS/OTHER GRAPHICS WHERE EXACT COLOR REPRESENTATION IS REQUIRED FOR PROPER INTERPRETATION (GPG I.E.1) PI/PD POSTAL ADDRESS 7700 Sandholdt Rd. Science-Engineering PI/PD FAX NUMBER Moss Landing, CA 950399644 United States 831-775-1620 NAMES (TYPED) High Degree Yr of Degree Telephone Number Electronic Mail Address BA 1982 831-775-1956 [email protected] PhD 1987 831-775-1709 [email protected] MS 2002 831-775-1987 [email protected] PI/PD NAME John Graybeal CO-PI/PD Francisco Chavez CO-PI/PD Stephanie M Watson CO-PI/PD CO-PI/PD Page 1 of 2 Electronic Signature CERTIFICATION PAGE Certification for Authorized Organizational Representative or Individual Applicant: By signing and submitting this proposal, the individual applicant or the authorized official of the applicant institution is: (1) certifying that statements made herein are true and complete to the best of his/her knowledge; and (2) agreeing to accept the obligation to comply with NSF award terms and conditions if an award is made as a result of this application. Further, the applicant is hereby providing certifications regarding debarment and suspension, drug-free workplace, and lobbying activities (see below), as set forth in Grant Proposal Guide (GPG), NSF 04-2. Willful provision of false information in this application and its supporting documents or in reports required under an ensuing award is a criminal offense (U. S. Code, Title 18, Section 1001). In addition, if the applicant institution employs more than fifty persons, the authorized official of the applicant institution is certifying that the institution has implemented a written and enforced conflict of interest policy that is consistent with the provisions of Grant Policy Manual Section 510; that to the best of his/her knowledge, all financial disclosures required by that conflict of interest policy have been made; and that all identified conflicts of interest will have been satisfactorily managed, reduced or eliminated prior to the institution’s expenditure of any funds under the award, in accordance with the institution’s conflict of interest policy. Conflicts which cannot be satisfactorily managed, reduced or eliminated must be disclosed to NSF. Drug Free Work Place Certification By electronically signing the NSF Proposal Cover Sheet, the Authorized Organizational Representative or Individual Applicant is providing the Drug Free Work Place Certification contained in Appendix C of the Grant Proposal Guide. Debarment and Suspension Certification (If answer "yes", please provide explanation.) Is the organization or its principals presently debarred, suspended, proposed for debarment, declared ineligible, or voluntarily excluded from covered transactions by any Federal department or agency? Yes No By electronically signing the NSF Proposal Cover Sheet, the Authorized Organizational Representative or Individual Applicant is providing the Debarment and Suspension Certification contained in Appendix D of the Grant Proposal Guide. Certification Regarding Lobbying This certification is required for an award of a Federal contract, grant, or cooperative agreement exceeding $100,000 and for an award of a Federal loan or a commitment providing for the United States to insure or guarantee a loan exceeding $150,000. Certification for Contracts, Grants, Loans and Cooperative Agreements The undersigned certifies, to the best of his or her knowledge and belief, that: (1) No federal appropriated funds have been paid or will be paid, by or on behalf of the undersigned, to any person for influencing or attempting to influence an officer or employee of any agency, a Member of Congress, an officer or employee of Congress, or an employee of a Member of Congress in connection with the awarding of any federal contract, the making of any Federal grant, the making of any Federal loan, the entering into of any cooperative agreement, and the extension, continuation, renewal, amendment, or modification of any Federal contract, grant, loan, or cooperative agreement. (2) If any funds other than Federal appropriated funds have been paid or will be paid to any person for influencing or attempting to influence an officer or employee of any agency, a Member of Congress, an officer or employee of Congress, or an employee of a Member of Congress in connection with this Federal contract, grant, loan, or cooperative agreement, the undersigned shall complete and submit Standard Form-LLL, ‘‘Disclosure of Lobbying Activities,’’ in accordance with its instructions. (3) The undersigned shall require that the language of this certification be included in the award documents for all subawards at all tiers including subcontracts, subgrants, and contracts under grants, loans, and cooperative agreements and that all subrecipients shall certify and disclose accordingly. This certification is a material representation of fact upon which reliance was placed when this transaction was made or entered into. Submission of this certification is a prerequisite for making or entering into this transaction imposed by section 1352, Title 31, U.S. Code. Any person who fails to file the required certification shall be subject to a civil penalty of not less than $10,000 and not more than $100,000 for each such failure. AUTHORIZED ORGANIZATIONAL REPRESENTATIVE SIGNATURE DATE NAME Tayeko Yamada TELEPHONE NUMBER 831-775-1803 Electronic Signature ELECTRONIC MAIL ADDRESS Jul 15 2004 6:22PM FAX NUMBER [email protected] 831-775-1620 *SUBMISSION OF SOCIAL SECURITY NUMBERS IS VOLUNTARY AND WILL NOT AFFECT THE ORGANIZATION’S ELIGIBILITY FOR AN AWARD. HOWEVER, THEY ARE AN INTEGRAL PART OF THE INFORMATION SYSTEM AND ASSIST IN PROCESSING THE PROPOSAL. SSN SOLICITED UNDER NSF ACT OF 1950, AS AMENDED. Page 2 of 2 Collaborative Research: Marine Metadata Interoperability The ocean science community has a variety of ongoing, independent metadata projects, each developing some combination of project- or discipline-specific standards, tools, and ontologies. None of these efforts has engaged the wider ocean science community, nor provided a coordinated set of materials that can guide the development of distributed, integrated and interoperable ocean-data systems. We propose to implement the framework for a community-based coordination project that will build from the myriad existing efforts, encourage widespread community participation (national and international), and demonstrate the use and benefits of metadata standardization. Intellectual Merit Our goals are to engage and inform the ocean science community in the creation of interoperable, metadata-centric data systems by: a) providing guidance and reference documentation on properly using and developing metadata solutions for the ocean science community; b) encouraging community involvement in the development and evaluation of those documents, and c) using two test-bed activities to demonstrate crossplatform, cross-disciplinary, interoperable distributed data systems. A steering committee of community representatives will work to identify community needs for guidance on metadata standards. We will use an open process to evaluate and coordinate existing activities and, where necessary, improve upon existing metadata guidance critical to developing distributed data management solutions. We will openly document and evaluate our ongoing effort, using multiple mechanisms for community involvement and feedback. Our intent is to maximize broad community involvement, and to build a modular framework that encourages collaboration on community standards. Two coordinated test-bed demonstrations will leverage metadata work across at least three different types of data acquisition systems: (1) cabled or moored platforms, (2) mobile autonomous systems, and (3) remote-sensing platforms. Multiple instances of each system, from different institutions, will be included in the application of existing, modular and scalable data systems. One of these test-bed activities will build from a nationwide interoperability demonstration (www.openioos.org) that leverages OpenGIS Consortium (OGC) standards, and will be funded entirely via external funds. Breadth of Impact The proposed work will provide guidance on metadata development to the ocean science community, building from developments in computer science and other geosciences. It will support the NSF emphasis on needed interoperability between data systems, as demonstrated through ORION and several recent interoperability workshops. It will also address a top priority in the OceanUS Data Management and Communications (DMAC) plan. Outreach will be accomplished through the community websites, a special session or town meeting at AGU Fall 2004, the OOS Tech 2005 workshop, and email lists. We have a wide base of participatory support already, as evidenced by the attached letters of commitment, and will grow this involvement throughout the life of the project. Collaborative Research: Marine Metadata Interoperability Introduction Scientific communities that engage in large, multi-year, multi-disciplinary and multiinstitutional research programs require a systematic approach to data management that enables interoperability between heterogeneous data systems. The critical need for this kind of interoperability faces the ocean science community at a time when recent technological advances have allowed scientists to collect large volumes of data in real time. Coastal ocean observing systems, for example, will soon provide immense volumes of data, with the total near-term annual data flow for the U.S. Integrated Ocean Observing System roughly estimated at 2,977,100 MB [1]. The proliferation of ocean data often makes it difficult to find, understand, and use needed data. Metadata standardization is key to the new approaches required. Metadata are descriptions of data sets, and provide sufficient information to enable data search, retrieval, transfer, and evaluation [2]. Basic metadata include descriptive information such as data set name, contents, and collection data. More advanced metadata can include information such as data quality, keywords or the spatial or temporal extent of the data set, Standardized metadata that can be automatically used by computers (without human intervention or interpretation) are key to effective data management, interoperability between distributed data systems and, most importantly, integrative, inter-disciplinary science. Such standardized metadata will reduce the semantic heterogeneity and syntactic incompatibility that currently hinders effective data analysis and comparison between data sets. Consequently, standardization will enable cost-effective data management and interoperability [3]. Unfortunately, many ocean (and other) scientists, and even operational observing programs, provide insufficient metadata to enable automatic data search, retrieval, transfer or evaluation. Many factors contribute to inadequate metadata specifications, including a lack of the following: 1) ontologies (controlled vocabularies) for use in metadata field names and content; 2) software tools that easily formalize ontologies in a structured language; 3) metadata standards that employ ontologies and are specifically designed for ocean data sets and for automatic computer use; and 4) effective guidance using well-suited templates and tools for metadata development. We propose to address these factors by building upon key existing efforts in ontology and metadata standards development in the ocean data community. Our approach will be an innovative community-based effort to reference and connect existing efforts, develop metadata guidance, and implement end-to-end data system demonstrations. We will emphasize the key roles of sound metadata in data management and interoperability. Our effort joins the Monterey Bay Aquarium Research Institute (MBARI) with the San Diego Supercomputer Center (SDSC) and the Scripps Institution of Oceanography (SIO) at the University of California San Diego (UCSD), Oregon State University (OSU), Lamont-Doherty Earth Observatory (LDEO) of Columbia University, the University of Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 1 Hawaii and the Southeastern Universities Research Association (SURA). We are also collaborating with the National Oceanic and Atmospheric Administration (NOAA), Ocean.US , and Woods Hole Oceanographic Institution. Internationally, collaborations will include the British Oceanographic Data Centre and the Intergovernmental Oceanographic Commission Data and Information Exchange. For advanced ontological guidance, we are collaborating with Stanford University's Medical Informatics Program, a leading developer and consultant regarding scientific ontologies and related tools. Intellectual Merit Today, despite numerous metadata and ontology standards development efforts, and even community efforts such as the Marine Metadata Initiative [48] and the Standard Naming Exercise [49], a data system developer has little guidance about providing for metadata. This is true even though every data management system requires metadata in some form, even if only in the user's head; and even though every scientist using data relies on the underlying metadata technology and architecture. The system developer must (somehow!) quickly implement a suitable metadata architecture. As a result, often the systems' science users end up with limited ability to find and use data, or even unable to use the data at all. Our proposed work will address this problem at three levels. First, by consolidating all the relevant resources in a single location, which is endorsed and supported by leading members of the oceanographic community, we will provide system developers with the knowledge they need to effectively implement mature metadata solutions. (This will in turn increase demand for mature metadata-centric data solutions.) Second, by identifying and mediating gaps in the metadata framework (for example, in ontologies and metadata standards), we will guide the community toward an integrated metadata understanding. Third, by entraining the community as we perform this work, we will create momentum to continue improving the oceanographic metadata framework beyond the single year allocated to this proposal. (While we limited the proposal to a single year due to NSF funding limitations, we envision this project's work continuing into future years.) In the area of ontologies and metadata standards, there is a particularly compelling need for such a community-based activity. Although there are efforts to develop ontologies and metadata standards for ocean data sets, these efforts address specific issues in oceanographic metadata; they do not address the broad range of syntactic and semantic interoperability needs in the ocean data community that are necessary for interoperability between data systems. Also missing is a complete set of materials to guide metadata development -- especially in terms of ontology and metadata development and tools -- for the ocean data community. In this proposal, the goal is to make community-based progress toward developing an appropriate ontology (or suite of complementary ontologies) and identifying appopriate metadata representation standard(s) using the ontology or ontologies. A more detailed presentation of current and proposed work in the areas of ontologies and metadata standards follows. Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 2 The Need for Ontologies to Specify Metadata for Ocean Data Sets To improve data management and ensure interoperability between ocean data sets, it is necessary to develop an ontology (or ontologies) to use in descriptions of the data sets. An ontology is a specification of a conceptualization of a domain of interest [4]. An ontology can represent several levels of information [5]. From the lowest to the highest levels of information, an ontology can be categorized as a: catalog (list of terms), glossary (list of terms with definitions), thesaurus (list of terms with definitions and synonyms), and a more formal ontology (list of terms with definitions, synonyms, and other relationships between terms). Many efforts have focused on creating some type of ontology for oceanographic data sets, and demonstrate the critical need to standardize descriptive terms These efforts are not being done in a coordinated way to meet the broad needs of the ocean data community. They are specific to the needs of the individual projects or programs, which often exacerbates the challenge of interoperability between data systems. Several ontology development efforts aim to identify a simple catalog or list of terms to use in data sets and metadata, such as: NASA Strategic Evolution of ESE Data Systems (SEEDS)[6] , CLIVAR [7] ), and NASAs Global Change Master Directory [8] . Several ontology development efforts also aim to identify a higher level ontology via a glossary or list of terms with definitions; see References [9] through [18] for a detailed partial list Fewer efforts have focused on developing more formal ontologies which specify complex relationships between terms, such as the Semantic Web for Earth and Environmental Terminology (SWEET) ontologies [19]. Other higher level ontologies in the ocean data community include those associated with the ARION digital library project [20] and the Poseidon project [21], a distributed system for real-time interdisciplinary ocean field and uncertainty forecasting with adapting modeling and adaptive sampling. The above efforts (and likely others that will be identified during the work proposed) provide a sound basis for the development of a community-based ontological framework. Our goal is not to create a single replacement for these efforts, nor to integrate them all into a single super-ontology. Rather, we will first comprehensively review the status, goals, and capabilities of these efforts. From this understanding, we will seek common and effective strategies for data system developers to incorporate functional, mature metadata architectures in their systems. In some cases this may call for the adoption of a single existing ontology; in others, for adopting multiple ontologies (for example, using namespaces). Recommendations will be made, and efforts initiated as needed, to address the need for a comprehensive solution. The strong desire for a single all-encompassing ontology is years from being met, but informed approaches to integrating current work will move the state of current technology toward a unified solution. The development of such a community-based ontology framework is timely in that NSF, through its Ocean Observatories Initiative, and its partner agencies in the Integrated Ocean Observing System, are investing millions in ocean research and emphasizing ready access to ocean data and data products from a variety of data systems. Examples of these Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 3 efforts include projects in biological information systems like OBIS, geoscience data systems like PetDB, RIDGE2K, and MARGINS, and infrastructure projects like OPeNDAP. The Need for Metadata Standards for Ocean Data Sets An ontology or set of complementary ontologies improves semantic interoperability by providing the controlled vocabulary for metadata field names and content in a standard. General metadata standards, such as the Federal Geographic Data Committees Content Standard for Digital Geospatial Metadata (FGDC) [22], and the International Standards Organization (ISO) 19115 [23] standard, exist to guide structural or syntactic metadata development for geospatial data sets. While extensive, these standards do not currently incorporate, or do not represent in a way that is computationally useful for searches and retrieval, all of the metadata fields necessary to adequately describe a broad range of ocean data sets. Nor do they address the question of semantic interoperability between data sets. These omissions are problematic given the many extensive efforts in ocean observing systems that rely upon metadata as the lynchpin for sound data management and interoperability between systems. The omissions also contribute to the fact that ocean data sets are expensively acquired but rarely re-used. Even data sets cited in journals and reports often cannot be found or used easily. Scientists need to have the ability to discover, analyze, process and reprocess, and reapply data sets to advance science. With effective, controlled vocabularies and standardized metadata frameworks, all of these tasks can be much simpler for the user, and even automated. For example, many scientists and data systems store their metadata (such as cruise details and sensor descriptions) for various types of ocean data sets according to the FGDC metadata standard. The scientists are storing this information in several different sections in the FGDC standard. In this situation, the useful ocean metadata is buried in the text (available for full-text, but not field-specific, searches) and is often included in sections that are counter-intuitive for that information. It is possible to improve the FGDC standard by identifying either an appropriate profile or extension to the core standard (such as the biological [24], remote sensing [25] or shoreline [26] profiles), which can include identifying an ontology to use in the metadata content as well as any new metadata fields. However, significant time and financial resources, as well as community building, are required to undertake such efforts. Other common metadata formats that are used across disciplines include the Directory Interchange Format (DIF) [27], the Dublin Core [28], and MARC21 [29]. The DIF is a metadata format compatible (i.e., you can map the fields) to the FGDC CSDGM. It is the format used by the Global Change Master Directory and the Marine Environmental Data Inventory [http://ioc.unesco.org/medi/index.html] project. However, the field lengths in the DIF are limited and the semantic heterogeneity issue isn't addressed, which makes successful mapping between standards difficult. The Dublin Core metadata standard is a simple yet effective element set for describing a wide range of networked resources, focusing on bibliographic needs. The Dublin Core standard does not include any metadata elements for data quality, is generally not oriented for scientific data, and like Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 4 most metadata standards, does not address semantic heterogeneity. The MARC21 represents and communicates bibliographic and related information in machine-readable form. It can be represented in XML, but is not oriented to scientific data. Again, it may be possible to adapt these standards and define ontologies to meet the needs of the ocean data community, but this will take a community-based effort. Mappings between different metadata standards that scientists use, and then coding these mappings so that translations can be done automatically between standards, are other possibilities to encourage ocean metadata development. For example, crosswalk tools exist to translate from the FGDC standard to the ISO 19115 standard [30]. However, information is typically lost when translating from one standard to another. One possible solution to this challenge is to identify a superset of metadata elements from a suite of metadata standards that the community identifies as appropriate, and then use this superset as the basis for the translation to a standard of choice. Several metadata standards or conventions (based on file format) are in use for ocean data sets in addition to DIF, the Dublin Core, MARC21, and the FGDC standard with extensions or profiles described above. Some of these metadata standard efforts build upon some of the ontologies previously mentioned. For example, the Cooperative Ocean/Atmosphere Research Data Service (COARDS) Conventions [31] for the standardization of NetCDF files provides structural conventions to enhance the exchange of NetCDF files via DODS/OPENDAP. The Climate Forecast (CF) standard [3] is an addition to the COARDS convention to standardize NetCDF files. It is a standard to distinguish quantities (physical description, units, prior processing, etc.) and to locate the data in space and time and as a function of other independent variables (coordinates). CF provides basic information to help find data, such as ways to record where and how the file was produced). EPIC [32], the ARGO/Global Ocean Data Assimilation Experiment [33], the OceanSites program [34] and the World Ocean Circulation Experiment [35] all have program specific NetCDF file conventions. The National Virtual Ocean Data System has also been working on a metadata model for mooring data that is based on COARDS and CF. These efforts to standardize the structure of NetCDF files are critically important, but they do not address the semantic heterogeneity (or even completely the content) in the data set descriptions. The Ecological Modeling Language [36], SensorML [37], and Marine XML [38] are XML applications representing metadata on ecological data sets, sensors, and marine data sets, respectively. However, XML is a syntactic model that cannot compensate for: a) the fact that users may use different nestings of tags or interchange elements and attributes; and b) that users may employ different terminologies in tags or in the content. As a result, XML alone cannot enable automatic interoperability because it does not address the semantic heterogeneity issues [39]. All of these efforts have addressed parts of the metadata standardization challenge, for particular communities, particular applications, or particular data models. Identifying their inconsistencies and weaknesses that hamper interoperability, and creating consensus and concepts for more interoperable approaches, will be a significant intellectual challenge and community accomplishment. Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 5 Broader Impact The proposed project advances the standardization of metadata descriptions of data sets and, as a result, enhances data management and interoperability. Data management is one of the most critical activities in ocean research, but it also one of the most poorly accomplished [40]. Currently, there is no coherent data management and communications strategy that will permit integration of diverse ocean data sets [1]. This work will contribute to the development of such a strategy by helping to accomplish several of the metadata tasks listed in the IOOS DMAC (Data Management and Communications) plan. NSF has also held several workshops on the need for interoperability between data systems in the geosciences, including the NSF/ONR Workshop on Data Assimilation in Ocean Research and the NSFs First Interoperability Design Challenge Workshop. Further, oceanographic data is expensive to acquire but is rarely re-used. For example, ship time for large blue-water vessels can cost $100,000 per day. Individual samples can costs hundreds to thousands of dollars simply for analysis. Such data acquisition and analysis costs add up quickly and are largely publicly funded. As a result, it is critical to make such data sets readily available for re-analysis and reuse. This project will engage wide representation from the oceanographic community by providing information and soliciting feedback on community websites, at a town meeting at the AGU Fall 2004 Meeting in San Francisco, at the OOS Tech 2005 workshop, and through email listservs, at a minimum. It will enhance the infrastructure for research and education by improving partnerships among agencies, lead oceanographic and related institutions, as well as by developing concrete metadata guidance for all prospective participants in the ORION and the IOOS. The project will facilitate the wider use of interoperable systems for educational and other community purposes. By making metadata information more readily available to data system developers, it will advance both their capabilities, and expectations within the field of oceanographic data management. This project will also involve a leader in marine GIS, metadata, and education, Dawn Wright, from Oregon State University, and will support the work of one of her graduate students on this project. Results of Prior NSF Support Note that John Graybeal (MBARI), Stephanie Watson (CeNCOOS), and Philip Bogden (GoMOOS/SURA) have no prior NSF support. Stephen Miller and John Helly, UCSD 1. J. Helly, NSF GEO SGER ATM-0333354 ($113,631) Workshop to Advance Long-Term Data and Metadata Interoperability in the Environmental and Earth Sciences, 8/03-8/05. Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 6 2. J. Helly, NSF 9980154/KDI($727,667): A Knowledge Network for Biocomplexity: Building and Evaluating a Metadata-based Framework for Integrating Heterogeneous Scientific Data 3. S. Miller, S. Cande, P. Henkart, I. Zaslavsky, NSF OCE ITR/IM 01-13999 ($497,000) SIOExplorer: Web Exploration of Seagoing Data, 10/01-9/04. 4. B. Schottlaender, J. Helly, S. Miller, H. Staudigel, C. Johnson, NSF DUE NSDL 0121584 ($900,381) Bridging the Gap between Libraries and Data Archives, 10/01-9/03. 5. S. Miller, J. Helly, S. Cande, NSF OCE 02-20942 ($249,942) Multibeam Database: Search and Rescue, 8/02 – 7/04. 6. S. Miller, J. Helly, H. Staudigel, C. Johnson, NSF DUE NSDL 03-33705 ($599,948) Mobilizing Enduring NSDL Resources in Plate Tectonics Research for Earth Science Education, 10/03 – 9/05 Many of the interoperability issues in this proposed new effort are being discussed in depth by a broad spectrum of the NSF community, under award (1) which supported a December 2003 workshop of experienced information system developers, to assess what we have learned about interoperability as a community from the IT investments of NSF over the past decade. A second workshop was held in May 2004 with current and future developers to minimize redundant effort and maximize useful innovation. The Knowledge and Distributed Intelligence (KDI) ecological metadata funding (2) provides a mechanism for discovering, acquiring and understanding the contents of individual data sets. It includes construction of a network-based metadata server and a federated data network on the server side, and client software for inserting the data and metadata as well as querying the catalog in various ways and retrieving selected datasets. This technology is also used by the Ecological Society of America (http://esa.sdsc.edu/datapaper) and by EarthRef (http://earthref.org). The original idea for SIOExplorer came from an ITR award (3) to open access to nearly 800 SIO expeditions for both research and education. The effort was then formalized as a fully-searchable digital library with an NSDL Collections Track project (4) by a group of investigators at SIO, SDSC and the UCSD Libraries (http://SIOExplorer.ucsd.edu), for which Helly is the principal architect. Award (5) rescues data from tape storage, corrects myriad systematic problems, and loads nearly 300 cruises of detailed multibeam data in the digital library, ready for combination with other sources, worldwide. A current NSDL project (6) tests our digital library technology in national teacher workshops which will enable students to conduct their own personal voyages of discovery in the context of cruise data, plate tectonic concepts, and educational standards. Selected publications include: § § § § § Helly, J. "New concepts of publication." [41] Helly, J., et al, "A Method for Interoperable Digital Libraries and Data Repositories," [42] Helly, J., et al. "Controlled Publication of Digital Scientific Data." [43] Helly, J., et al. "Scalable Models of Data Sharing in the Earth Sciences." [44] Miller, S. P., et al. "SIOExplorer: Digital Library Project, in An Ocean Odyssey" [45] Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 7 Robert Arko, LDEO RIDGE 2000 Open Data Exchange System (RODES). OCE03-28117 (2003-2006). The Ridge 2000 Open Data Exchange System [51] has been designed, built, and active online serving the public since December 1, 2003. RODES has been implemented in close collaboration with several related NSF funded database projects (MARGINS Focus Site Data Management, Marine Seismic Reflection Data Management System, and Antarctic Geophysical and Multibeam Synthesis), as part of an overall Marine Geoscience Data Management System [52] using an integrated backend system and shared metadata schema. Two interfaces are currently available: a Web-based search tool [53] and a cross-platform Java application [54]. A set of (data) inventory forms designed to capture metadata in a standardized way has been distributed to each funded investigator on each marine field program. To date, the database has been populated with all Ridge 2000 (and earlier RIDGE) funded programs carried out on the R/V Ewing (1990-2004), and serves basic cruise metadata, links to NSF awards, as well as navigation, gravity, magnetics, and bathymetry data. A Data Management System (DMS) steering committee has been established and will meet in July 2004. Progress has been reported at the Ridge 2000 Community Meeting in November 2003, the AGU Fall Meeting in December 2003, the Ridge 2000 Steering Committee Meeting in April 2004, and the DLESE Data Services Workshop in May 2004. Francisco Chavez, MBARI Collaborative Research: Southern Ocean Iron Fertilization Experiments (SOFeX)-OCE0000364 (Francisco Chavez, PI, Ken Johnson, co-PI). The Monterey Bay Aquarium Research Institute (MBARI) provided instrumented drifter packages and associated support for real-time tracking and in-water measurement (physical, chemical and biooptical) in support of the Southern Ocean iron enrichment experiment (SOFeX). In addition, MBARI provided continuous measurements of dissolvable iron and pCO2 concentrations while the ship was underway in the patch. The use of drifters offered the ability to continuously monitor the fertilized and control areas without committing ship time to the task. There is one publication to date (synthesis is ongoing), in Science: Coale, K.H., et al. "Southern Ocean Iron Enrichment Experiment (SOFeX): Carbon and Silicon Cycling in Antarctic Waters." [50] Project Approach This Marine Metadata Interoperability proposal addresses the need for a collaborative metadata development effort, which can lead to concrete technical deliverables and the establishment of an ongoing metadata development community. Culminating deliverables include: • a centralized community resource on the World Wide Web for metadata activities and information, including the content development described below; and Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 8 • demonstration projects using metadata to work with multiple and varied distributed oceanographic data sets. The web site will serve as a focal point for community metadata efforts, around which knowledge and involvement can coalesce and be organized. The proposed effort will emphasize access to, and reference to, existing standards and work, rather than development of entirely new and separate approaches or solutions. For example, FGDC (and its successor ISO 19115) is a required metadata standard for geospatial data sets collected by a federal agency or other federally funded entity. This standard will be referenced and perhaps augmented, but not diminished, by these efforts. Tasks The primary task will be to create and maintain a central, community-oriented, collaboratively developed web site for oceanographic metadata. This site will contain several topic-specific sections. Each section of the site will provide the content and structure needed to educate and entrain participants interested in that topic. The web site will represent the work of many independent but collaborating organizations. While a few of these organizations will receive funding via this proposal (to contribute specific parts of the common effort), all participants will contribute resources as part of their own metadata standards- and community-building efforts. Particular topics will be included in the web site and correspond to the following tasks (these tasks are described in more detail in the following paragraphs): (1) References development; (2) Ontology development; (3) Standards development process; (4) Cookbook pages; (5) Demonstration Projects. References Development Task References will be documented in three major categories: Standards, Tools, and Data Management Projects. The Standards category is the highest priority, but the other categories will also be important references for system developers. a. Standards i. Vocabularies/ontologies (e.g., SWEET, GCMD) (see Ontology development below) ii. Storage/organization protocols/schema/representation (e.g., FGDC, XML schema, MIF, RDF, OWL) iii. Exchange protocols (e.g., OAI, Z39.50, OPeNDAP) b. Tools i. ontology development (to build ontologies, whether simple dictionaries or complex vocabularies) ii. metadata management (storage, crosswalking, distributed access and exchange) Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 9 iii. for using metadata in data systems (e.g., COARDS CF-compliant tools; metadata entry or formatting tools) c. Data Management Projects Incorporating Metadata i. technologies/approaches; status; lessons and best practices The focus of the reference development effort is to consolidate a list of the relevant efforts in a single place. This enables developers to confidently access the latest information on subjects of interest. It also provides a location for metadata projects to "register" their work, so that others can know about it and find it. The web site will document the status of referenced material whenever possible, and will also include a way for members of the community to post suggestions or comments. Ontology Development Task a. Reference existing dictionaries, thesauri, and other ontologies (see the References Development subsection above) b. Identify tools to develop, maintain, and access ontologies c. Identify areas needing progress d. Conduct a reference development effort (developing a focused ontology) It is not realistic to attempt to develop a complete oceanographic ontology in one year. In fact, many existing ontology development efforts exist. This task will build on the Reference Development effort, evaluating the ontology development efforts and tools, and identifying areas where progress is needed. The ontology survey will have a wide scope, addressing platforms, instruments, sensors, data items, and deployments. As part of the evaluation task, an oceanographic ontology will be developed for a specific domain, following the examples at the Marine Metadata Workshop [47]. The ontology's level of detail (dictionary, thesaurus, etc.) is to be defined. Targeted products include defined metadata parameter sets (i.e., content standard) for the selected domain, a controlled vocabulary, one or more interoperable implementations of the content standard, and applicable tools such as converters and import/export filters. Standards Development Task There are many potential metadata standards which could be addressed as part of the Marine Metadata Interoperability project. At a minimum, the FGDC and the ISO 19115 metadata standards will be included in an analysis of existing standards. One possible product is a superset of metadata elements from all community-identified metadata standards. Such a superset of elements can then be used to implement translators between different metadata standards. See the Intellectual Merit section above for more detailed discussion of this task. Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 10 Metadata Cookbook Task a. Provide general guidance on metadata issues to data system developers i. how to do certain functions; possible resources; best practices b. Provide input to NSF metadata initiatives (including ORION) c. Provide input to the IOOS DMAC implementation effort The Marine Metadata Interoperability project's Metadata Cookbook will document metadata-related issues and tasks for data system developers. It will provide a starting point for those developers who are wondering how to address challenges like controlled vocabularies and metadata development. The Global Spatial Data Infrastructure cookbook (for data systems in general) provides an example of this concept, and may serve as a basis for this work. This project will interface closely with related NSF initiatives, particularly ORION. Specific input on metadata development will also be provided to the IOOS DMAC implementation efforts, so that this project remains aligned with the IOOS DMAC, and reflects the same overall understanding of metadata issues as that group. Metadata Interoperability Demonstrations Participants in this activity will lead two parallel efforts to demonstrate the use of metadata in creating interoperable data systems. An MBARI-led effort will focus on data acquisition and management as part of its Shore Side Data System [46,54], and a SURAled effort will focus on creation of integrated data products leveraging a recent implementation of the OpenGIS Consortium (OGC) web services (see http://www.openioos.org). The MBARI-led effort will also complement a pilot project undertaken by CeNCOOS partners to promote the regional data management systems' use of OPeNDAP as a data transport protocol and Live Access Server for on-line browse. Both testbed demonstration activities represent significant cost-share contributions to the overall goals of this proposal, thereby augmenting the funds requested from NSF. Please note that the MBARI-led effort is pending approval in the annual MBARI proposal approval process; however, much of the required functionality is already developed. Demonstration #1: Data Acquisition & Management This task will use an MBARI-developed data system (the Shore Side Data System [46,54] as the core of a multi-data set, multi-institution demonstration of metadata utilization. The demonstration will show how metadata is useful at various stages of data management, and will allow external clients to make their own demonstrations using the resulting public data sets. Key features of the resulting demonstration will include: • ingest of data from multiple platforms using detailed metadata descriptions Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 11 fixed observational asset with streaming data (mooring) remote sensing platform (satellite, airfcraft, or radar data) mobile autonomous asset (AUV or glider) cabled observing system (prototype, a la MARS, or exemplar) external data set(s) (TBD) processed data set (TBD) search for data based on metadata naming standards (defined ontologies) browse data based on metadata describing the data's life cycle and deployment (e.g., platform, collection time, deployment, mission, or relation to other data sets) display or present data using embedded metadata to control outputs (e.g., plot format, missing data, and plot contents) access to data subsets based on metadata o via standard OPeNDAP interface (for compatible data sets) o via web-accessible URL, integrating multiple data items from different platforms into single merged data set access to descriptive metadata itself o o o o o o • • • • • This data system's core functionality has been developed, although features must be added to illustrate some of these metadata applications. The modularity of the existing system will allow key metadata discoveries and directions to be integrated as the year progresses. Note that while the number of assets and data types to be described and integrated is large, it is by no means comprehensive. Other data such as ship and ROV data, model outputs, climate data products, and historical collections could be addressed in follow-on work. Demonstration #2: Data Sharing and Product Development A SURA-led effort will use evolving metadata standards from this project to build from an existing multi-institutional interoperability demonstration that employs Open GIS Consortium (OGC) standards for web services. This interoperability demonstaration involves a group of NOAA- and ONR-funded ocean observing programs around the U.S. Published data to date include in situ sea-surface temperature, satellite SST, and seasurface winds. Metadata that describe the geospatial characteristics (e.g., projections, map extents, etc.) are invisible to the user because they are built into the software that implements the protocols. However, by design, the OGC protocols do not specify metadata requirements for the data that are transferred and visualized. For this ocean science community, this lack of standard metadata conventions causes problems. The non-uniformity can be seen by querying points on the demo map, which shows that each data provider uses a different set of attributes. Lack of standards also complicates implementation, since we must communicate directly with each data provider to confirm their local standards. This prevents the demonstration from being easily scalable to multiple partners. SURA will support expansion of the existing demonstration to include the following capabilities: Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 12 1. Discovery and search tools, based on standardized metadata attributes, will be added to the OGC client site (which functions as the web server for the integrated products). This capability will leverage metadata standardization and will entail working with the partners to publish their FGDC-compliant metadata to one of the standard clearinghouses, such as the National Spatial Data Infrastructure (NSDI); 2. New data types will be added to the demo, along with the ability to distinguish data sources by measurement platform; 3. We will add easily customized tools that leverage the metadata standards for visualization and query on OGC client/web server (e.g., by using the OGC Style Layer Descriptor specification); 4. "How-to" documents and best practices for participation in the OGC demo will include these new capabilities. To encourage participation in the enhanced demonstration, SURA will continue to provide technical support for all interested participants. As in the first implementation, SURA will hire technical contractors to work with the Information Technology staff of participating institutions. In addition, SURA will support the creation of OGC publication software that simplifies the process that a data provider must follow to publish their data to the Internet with the OGC services (including data served by OPeNDAP). Open Source software tools and documentation will be made freely available to all participants. Project Management and Coordination The Marine Metadata Interoperability effort is organized as an open collaboration. With support from both NSF and NOAA leadership, the project will engage a broad range of the oceanographic metadata community to contribute its knowledge and talents. The project will be led by the PIs and a Steering Committee, with support from a technical team, which provides a primary interface to the larger oceanographic community. Technical tasks performed by the team will include: a. Web site development (standards, procedures, etc.), b. Workshops planning and web site enhancements, and c. Meeting/telecon planning and web site enhancements. The technical effort will be coordinated by the Lead PI, John Graybeal (MBARI), with input from Co-PIs Stephanie Watson (CeNCOOS), Francisco Chavez (MBARI), Stephen Miller (UCSD), Robert Arko (LDEO), Philip Bogden (GoMOOS/SURA), and the guidance of the Steering Committee. These principal PIs are uniquely qualified to lead such a project due to their collective experience in oceanography, metadata architectures, data systems development (MBARI's SSDS and GoMOOS' interoperability demonstrations) and community-building workshops (Marine Metadata Initiative, OOS Tech 2003 and 2004). NSF funds will be extended by in-kind contributions from the SURA Coastal Ocean Observing Program (SCOOP), and by participating institutions. (See attached letters of Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 13 commitment.) All the efforts of participating organizations, institutions, and individuals will be coordinated by the Lead PI, and reviewed by the Steering Committee, to ensure a coherent direction and project-level goals. The goals will be defined to address the needs of the oceanographic science community, both for specific technical results, and for an ongoing community process to achieve further critical objectives. Funded PIs will provide technical staff (building a technical team) toward the common framework, as directed by the Marine Metadata Interoperability project PIs and the Steering Committee. All project partners, Steering Committee members, and participating members of the community will apply their expertise to improve the content, credibility, and usefulness of the collaboration. Each project task (or subtask) will be coordinated by a member of the project management team or technical team, or by another designated person. This coordinator will manage development of web site products for that project topic, under the guidance of the PIs and the Steering Committee. The coordinators of each task will collaborate closely throughout the project. Coordinators will report on their progress to the Steering Committee at monthly Steering Committee telecons. The technical team is made up of the technical staff contributing to the project. One of the key roles of the technical team will be to solicit, obtain, and document information about metadata topics from members of the community. Toward this end, the technical team will actively engage participants in the organizations and projects they represent, facilitating technical exchange between those organizations and the Marine Metadata Interoperability Project. A full-time technical coordinator will be hired to make the technical team's activities as effective as possible. Steering Committee The Steering Committee will be a critical component of the proposed project. Reflecting the inclusive nature of the proposal, the Steering Committee will represent a wide range of oceanographic community interests. The roles of Steering Committee members include reviewing status of the projects, recommending next steps, responding to information requests (particularly, providing or finding information about other metadata initiatives), and contacting others in the community as needed. Steering Committee Representation The Steering Committee will include representatives from as many of the following organizations and groups as possible, with geographic diversity an additional goal: ocean research community; multiple federal agencies; international efforts; knowledge representation community; the Marine Metadata Initiative, and the FGDC community. Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 14 The Lead PI will serve as the chair of the Steering Committee. The following have agreed to be on the Steering Committee (1-2 others will be added): Stephanie Watson (CeNCOOS), Philip Bogden (SURA), Julie Bosch (NOAA), Dawn Wright (OSU), Stephen Miller (UCSD), Roy Lowry (British Oceanographic Data Centre), Robert Arko (LDEO), Bob Weller (WHOI), and Mark Musen (Stanford). Schedule This is a one-year effort. While it is intended to lead into larger efforts by the NSF, ORION, and IOOS communities, it will produce several specific deliverables, including functional demonstration software, by the end of its year of performance. The overall project will be designed to allow a natural transition to another organization without affecting its momentum. After initial planning activities in Fall 2004, an initial meeting of the Metadata Interoperability Steering Committee and presentation of the initiative, will take place at the end of the year (targeted for the Fall AGU meeting in San Francisco). Toward the end of the year, a workshop will be held in late summer on the east coast. SURA/SCOOP has volunteered to host this gathering as their (already-funded) OOS Tech 2005 workshop. The 2005 meeting will serve several purposes: a review of accomplishments and direction, a dissemination of information about the Metadata Interoperability effort, and a planning session for future work in this area. The meeting is likely to be quite large, as interest and participation builds over the course of the year. Project Milestones • • • • • • • Oct 2004: First Steering Committee meeting Nov 2004: All technical team members identified Dec 2004: Initial meeting at AGU Jan 2005: Collocated technical team meeting Feb and May 2005: Steering Committee quarterly review Aug 2005: Metadata Interoperability Workshop and Project Review Sep 2005: Final Steering Committee telecon Technical Milestones • • • • • • Nov 2004: Initial Metadata Interoperability Project web pages on-line Jan 2005: Initiate demonstration project development Apr 2005: Release preliminary metadata references and cookbook pages May 2005: Release preliminary domain-specific metadata ontology July 2005: Present preliminary metadata demonstrations Sep 2005: Final Releases: complete final update of web site, present metadata demonstration functionality (multiple sites), release domain-specific metadata ontology, publish and distribute cookbook and/or other documents, submit technical write-up and/or papers to appropriate journals Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 15 References Cited [1] Data Management and Communications Steering Committee. 2004. Data Management and Communications Plan for Research and Operational Integrated Ocean Observing Systems, Arlington, VA: Ocean.US. p. 28 of 292 pages. [online: http://dmac.ocean.us/dacsc/docs/dmac_part_I_05_10_04.pdf ] [2] Beard, K. 1996. A Structure for Organizing Metadata Collection. Presented at the NCGIA conference, Santa Fe, NM [online: http://www.ncgia.ucsb.edu/conf/SANTA_FE_CDROM/sf_papers/beard_kate/metadatapaper.html] [3] Gregory, J. 2003. The CF Metadata Standard, [online: http://www.cgd.ucar.edu/cms/eaton/cf-metadata/clivar_article.pdf] [4] Gruber, T.R. 1993. A translation approach to portable ontologies. Knowledge Acquisition, 5(2):199-220. [5] McGuinness, D. 2002. Ontologies Come of Age. In Dieter Fensel, Jim Hendler, Henry Lieberman, and Wolfgang Wahlster (Eds). Spinning the Semantic Web: Bringing the World Wide Web to Its Full Potential. Cambridge, MA: MIT Press. [6] NASA Strategic Evolution of ESE Data Systems (SEEDS) [online: http://lennier.gsfc.nasa.gov/seeds/SFormHome.html] [7] Bindoff, N. and Legler D. 2003. The WOCE Global Data Resource: Lessons Learned for CLIVAR Data and Data Requirements. Exchanges, 26 [8] Olsen, L.M., G. Major, S. Leicester, K. Shein, J. Scialdone, H. Weir, S. Ritz, C. Solomon, M. Holland, R. Bilodeau, T. Northcutt, R. Vogel. 2004. NASA/Global Change Master Directory (GCMD) Earth Science Keywords. Version 4.2.2 [online: http://gcmd.nasa.gov/Resources/valids/keyword_list.html] [9] British Oceanographic Data Center’s Oceanography Parameters [online: http://ioc.unesco.org/oceanteacher/resourcekit/M3/ocean_primer.htm] [10] U.S. Joint Global Ocean Flux Study [online: http://usjgofs.whoi.edu/datasys/param_master.html] [11] Glossary of Physical Oceanography and Related Disciplines [online: http://stommel.tamu.edu/~baum/paleo/ocean/ocean.html] [12] IOC GF3 parameter codes [online:http://ioc.unesco.org/oceanteacher/resourcekit/M3/Formats/Integrated/GF 3/GF3.htm] [13] U.S. Globec Thesaurus [online: http://globec.whoi.edu/globec-dir/thesaurus.html] [14] NOAA CO-OPs Tide and Current Glossary [online: http://coops.nos.noaa.gov/publications/glossary2.pdf] [15] SEACOOS Data Set Dictionary [online: http://www.seacoos.org/News/News_Item.TechnologyAssessment] Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 1 [16] MBARI Biological Ocean Group Shipboard CTD Data Dictionary [online: http://www.mbari.org/bog/mmug/MMUG_table.htm] [17] SeaWIFS Bio-optical Archive and Storage System (SeaBASS) [online: http://seabass.gsfc.nasa.gov/cgi-bin/stdfields.cgi] [18] Ocean Biogeographic Information System [online: http://www.iobis.org/] [19] Raskin, R., Pan, M., and C. Mattmann. 2004. Enabling Semantic Interoperability for Earth Science Data. Presented at NASA’s Earth Science Technology Conference. Palo Alto, CA, June 22-24. [online: http://www.esto.nasa.gov/conferences/estc2004/papers/a5p1.pdf] [20] C. Stefanakos, T. Gerostathis, G. Athanasoulis, C. Houstis, E. Vavalis, Building Ontologies of Environmental Applications for a Digital Library of Scientific Collections", in proceedings of the 16th International Conference "Informatics for Environmental Protection", September 25-27, 2002, Vienna (Enviro Info 2002), pp. 436-439. [online: http://dlforum.external.forth.gr:8080/] [21] Patrikalakis, N.M., J.J. McCarthy, A. R. Robinson, H. Schmidt. 2003. Poseidon: A Distributed System for Interdisciplinary Ocean Forecasting with Adaptive Modeling and Sampling: Presented at the ARION Workshop on Federated and Scalable Systems for the Environment panel talk. [online: http://czms.mit.edu/poseidon/new1/] [22] Federal Geographic Data Committee’s Content Standard for Digital Geospatial Metadata [online: http://www.fgdc.gov/metadata/contstan.html] [23] International Standards Organization 19115 Standard for Geospatial Metadata [online: http://www.isotc211.org/scope.htm#19115] [24] National Biological Information Infrastructure Biological Profile [online: http://www.nbii.gov/datainfo/metadata/standards/] [25] Profile for Remote Sensing Data [online: http://www.fgdc.gov/standards/status/csdgm_rs_ex.html] [26] Metadata Profile for Shoreline Data [online: http://www.fgdc.gov/standards/status/sub5_6.html] [27] Directory Interchange Format [online: http://ioc.unesco.org/oceanteacher/resourcekit/Module2/Metadata/Formats/dif/dif guide7b.pdf] [28] Dublin Core Metadata Standard [online: http://dublincore.org/] [29] MARC21 [online: http://lcweb.loc.gov/marc/] [30] Crosswalk Between the FGDC and ISO 19115 [online: http://www.fgdc.gov/CrossWalk/ISO-FGDC-METADATA-CROSSWALKV4.xls] [31] Cooperative Ocean Atmosphere Research Data Service (COARDS) [online: http://ferret.wrc.noaa.gov/noaa_coop/coop_cdf_profile.html] [32] EPIC [online: http://www.pmel.noaa.gov/epic/index.html] Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 2 [33] ARGO/Global Ocean Data Experiment [online: http://www.ifremer.fr/coriolis/cdc/argo/argo-dm-user-manual.pdf] [34] OceanSites Data Format Proposal [online: http://gustnado.mbari.org/marinemetadatawiki/MetadataReferences?action=Attac hFile&do=get&target=oceansites-user-manual1.doc] [35] World Ocean Circulation Experiment Global Ocean Data V3 NetCDF Convention [online: http://www.bom.gov.au/bmrc/ocean/GODAE/Projects/ServerWS/WOCE%20Net CDF%20Convention.pps] [36] Ecological Metadata Language [online: http://knb.ecoinformatics.org/software/eml/] [37] SensorML [online: http://vast.uah.edu/SensorML/] [38] MarineXML [online: http://www.marinexml.net/] [39] Lakshmanan, L.V.S. and F. Sadri. 2003. Information Integration and the Semantic Web. Data Engineering Bulletin, 26(4). [online: ftp://ftp.research.microsoft.com/pub/debull/A03dec/issue1.htm] [40] Martin, D. 2003. The National Oceanographic Partnership Program, Ocean.US, and Real Movement Towards an Integrated and Sustained Ocean Observing System. Oceanography, 16(4): 13-19. [41] Helly, J. "New concepts of publication." Nature 393: 107, 1998. [42] Helly, J., T. T. Elvins, D. Sutton and D. Martinez, A Method for Interoperable Digital Libraries and Data Repositories, Future Generation Computer Systems, Elsevier, 16 (1999), pp. 21-28, 1999. [43] Helly, J., T. T. Elvins, D. Sutton, D. Martinez, S. Miller, S. Pickett, A. M. Ellison, "Controlled Publication of Digital Scientific Data." CACM May (5): 97-101, 2002. [44] Helly, J., H. Staudigel, A. Koppers, "Scalable Models of Data Sharing in the Earth Sciences." Geochemistry, Geophysics, Geoscience. DOI number 10.1029/2002GC000314, 2003. [45] Miller, S. P., J. Helly, A. Koppers and P. Brueggeman, SIOExplorer: Digital Library Project, in An Ocean Odyssey, MTS/IEEE Oceans 2001 Conference Proceedings, MTS 0-933957-28-9, Volume 4, pp. 2288-2296, 2001.[46] [46] Graybeal J, Gomes K, McCann M, Schlining B, Schramm R, Wilkin D. MBARI's SSDS: Operational, Extensible Data Management for Ocean Observatories. 2003. The 3rd International Workshop on Scientific Use of Submarine Cables and Related Technologies. Tokyo. [47] Marine Metadata Workshop [online:http://mmug.calfish.org/metadata.htm] [48] Marine Metadata Initiative [online: http://wiki.mbari.org/marinemetadatawiki/] [49] Standard Naming Exercise [online: http://wiki.mbari.org/marinemetadata/moin.cgi/StandardNameExercise] Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 3 [50] Coale, K.H., K.J. Johnson, F.P. Chavez and others (2004) Southern Ocean Iron Enrichment Experiment (SOFeX): Carbon and Silicon Cycling in Antarctic Waters, Science, 304, 408-414. [51] Ridge 2000 Open Data Exchange System (RODES) [online: http://data.ridge2000.org/] [51] Marine Geoscience Data Management System [online; http://www.marinegeo.org/] [52] Data Link web-based search tool [online: http://www.marine-geo.org/link/] [53] GeoMapApp cross-platform Java application [online; http://www.geomapapp.org/] [54] MBARI Shore Side Data System [online: http://www.mbari.org/rd/projects/2004/moos/600031_ssds_04.html] Collaborative Research: Marine Metadata Interoperability Proposal #6389984 Page 4 BIOGRAPHICAL SKETCH: JOHN GRAYBEAL Professional Preparation: 1982 B.A., Computer Science and Statistics, University of California, Berkeley Other Professional Coursework: 2002 Object-Oriented Design. (TogetherSoft, San Jose, CA). 1999 The ACE ORB—TAO [a CORBA product] (Object Computing, Inc., St. Louis, MO) 1998 The ADAPTIVE Collaboration Environment [ACE] (Object Computing, Inc., St. Louis, MO) 1995 Cryptographic Systems (Spyrus, Inc., Santa Clara, CA) 1991 Total Quality Management (Sterling Software, Inc., Mtn. View, CA) 1990 UNIX System Administration (LURNIX, NASA Ames Research Center, CA) 1985 Designing High Quality User Interfaces (UC Extension, Santa Clara, CA) Positions: 2001-Present Monterey Bay Aquarium Research Institute: Lead, Information Applications Group As IAG Lead, coordinate work of 5-6 team members in development of multiple data management tools and systems. Serve as Project Lead for Shore Side Data System, Video Annotation and Reference System, and several specific science applications. Coordinate technical work supporting operational data systems, applications development, software resource management, and MBARI's core data infrastructure. Develop Charter and policies for Information Applications Group. Share responsibility for development of MBARI policies on data management, public data distribution, and software distribution. Serve on User Committees advising management on MBARI core data processes and standards, ROV issues, mooring issues, and wireless communications. Led development of Shore Side Data System since arrival in 2001, and began implementation in September 2002. Completed an initial release of the software, responsible for acquiring, storing, and serving MOOS Test Mooring data, in November 2002, and operated it without failure since. Have since added multiple platforms and data sources. Implemented features include: acquisition of multiple described data streams; automatic processing of data streams in multiple formats (sequential, binned packets, and netCDF) using XML metadata descriptions; automatic metadata processing into structured repository; data search services based on deployment, instrument, or item name; ability to access any data set or subset, as well as its metadata; real-time addition of, and access to, new instruments or data streams. 1996-2001 1994-1996 1989-1994 1982-1989 Sterling Software at NASA Ames: Manager, SOFIA Observatory Data Systems Sterling Software at NASA Ames: Group Lead, Authentication and Privacy Group Sterling Software at NASA Ames: Project Mgr, Airborne Observatory Data Systems Sterling Software at NASA Ames: Computer Ops/Lead, Kuiper Airborne Observatory NASA Ames positions involved data system development, operations, and management. SOFIA project involved developing and specifying standards-based, real-time data systems, and designing systems to manage scientific data throughout its life cycle. On all projects, coordinated technical development at multiple institutions, designed and tested software interfaces with national and international teams, and achieved multiple successful software releases. Successfully used highly configurable data system to control and test Telescope Assembly, using XML-based configuration files to change systems' functional behaviors and user interfaces without reprogramming. All positions at NASA Ames after 1988 incorporated Technical Lead and/or Project Manager responsibilities, managing groups from 5 to 20 individuals and budgets up to $4,000,000 annually. Management responsibilities included technical oversight, project reporting, budget preparation, fiscal reporting, purchasing, proposal writing, and all personnel management responsibilities. Positions through 1994 involved operational support, including real-time computer support on airborne observatories. Specific tasks included science and mission data and metadata management, computer systems analysis, hardware/software troubleshooting, configuration, programming, maintenance, design, deployment planning, and on-board operation. Publications: 1. Graybeal J, Gomes K, McCann M, Schlining B, Schramm R, Wilkin D. MBARI's SSDS: Operational, Extensible Data Management for Ocean Observatories. 2003. The 3rd International Workshop on Scientific Use of Submarine Cables and Related Technologies. Tokyo. 2. Graybeal J, Brock D, Culp S, Lin L, Schlappe S & Webb G. (2000) SOFIA Misson Control System Software. [Software] 3. Graybeal J, Brock D.& Papke B (2000) The Use of Open Source Software for SOFIA's Airborne Data System. Proceedings of SPIE, 4009-17. 4. Graybeal J, Krzaczek B & Milburn, J (2000) SOFIA's CORBA Experiences: Instances of Software Development. Astronomical Data Analysis Software and Systems X 5. Papke B, Graybeal J & Brock D (2000) An extensible and flexible architecture for the SOFIA Mission Controls and Communications System, Proceedings of SPIE, 4014-35. Software Releases: 1. Graybeal J, Chase A, Gomes K, McCann M, Schlining B, Schramm R and (2004) MOOS Shore Side Data System, Revision 2. [Software] 2. Gomes K., Graybeal J. (2002-2004) MOOS Test Mooring Data Access. [Software] 3. Gomes K., Graybeal J, et al (2004). CIMT Mooring Data Access [Software] [http://ssdspub.mbari.org] 4. Schlining B, Graybeal J & Jacobsen-Stout N. (2004) Video Annotation and Reference System, Iteration 2. [Software] Synergistic Activities: 2001-2004 MBARI's Shore Side Data System. Oral Presentations. MBARI. As part of SSDS project leadership, present the system concept, design, and operational concept to potential users from MBARI and collaborating institutions. 2004 Invited participant in IOOS DMAC Metadata Implementation Plan workshop (S Watson, chair). 2003 Co-sponsored Marine Metadata Workshop. 2002 MBARI's Shore Side Data System. Oral Presentation. American Geophysical Union. (Session OS10, Data Management.) 2002 Application of Data Management Solutions to TOPP Data Management. Invited Panelist, Working Session on Tagging of Pacific Pelagics Data Management. 2002 Data from Observatories and NEPTUNE: A Soup-to-Nuts Primer on Educational Possibilities. Invited Speaker, MBARI Educational Workshop on Data in Education. Current and Pending Federal Funding: (annual total award size) Pending: Collaborative Research: Marine Metadata Interoperability ($435,000). CURRICULUM VITAE: Francisco P. Chavez Monterey Bay Aquarium Research Institute (MBARI) 7700 Sandholdt Road, Moss Landing, CA 95039-9644 Voice: 831-775-1709, FAX: 831-775-1620, Email: [email protected] Education: PhD Duke University BS Humboldt State University Botany Oceanography 1987 1977 Professional Background: 2000-present 2000-present 1996-2000 1992-1996 1990-present 1987-1992 Senior Scientist (IV), MBARI Faculty (courtesy), Stanford University Associate Scientist (III), MBARI Associate Scientist (II), MBARI Research Associate University of California, Santa Cruz Assistant Scientist, MBARI Selected Professional Activities Associate Editor, Geophysical Research Letters Reviewer for Science, Nature, JGR, JMR, L&O, GRL, DSR, JPR, NASA, NOAA, NSF Grant Recipient from NOAA, NSF, ONR, NASA, NOPP Invited Participant Dahlem Konferenzen, NATO ARI Panel member for NSF, NOPP, EPA Member JGOFS time series oversight committee Reviewer, Chilean Oceanographic Program, Peruvian Fisheries Program NSF Alan Waterman award committee NSF Advisory Committee for the Geoscience Directorate Current Research Interests: Biology and chemistry of the ocean in relation to natural climate variability and global change. Instrumentation and systems for long-term ocean observing. Satellite remote sensing. Graduate Advisor: Richard T. Barber Committee Member of Following Graduate Students: Rafael A. Olivieri (UCSC), M. Celia Villac (Texas A&M), Jonathan Phinney (UCSC), Elena Mauri (MLML). Post-Doctoral Advisor for: Chris Scholin, Raphael Kudela, Peter Strutton, Russell Hopcroft, John Ryan, Brad Penta, Victor Kuwahara SELECTED PUBLICATIONS (from over 100) Barber, R.T. and F.P. Chavez. 1983. Biological consequences of El Niño. Science, 222, 12031210. Chavez, F.P. and R.T. Barber (1987). An estimate of new production in the equatorial Pacific. Deep-Sea Research, 34, 1229-1243. Chavez, F.P. (1989). Size distribution of phytoplankton in the central and eastern tropical Pacific. Global Biogeochemical Cycles, 3, 27-35. Chavez, F.P. and J.R. Toggweiler (1995). Physical estimates of global new production: the upwelling contribution, In Upwelling in the Ocean: Modern Processes and Ancient Records. Summerhayes, C.P., Emeis, K.C., Angel, M.V., Smith, R.L., and Zeitzschel, B., (eds.), p. 313-320, J. Wiley & Sons, Chichester. Chavez, F.P. (1996) Forcing and biological impact of onset of the 1992 El Niño in central California. Geophysical Research Letters 23, 265-268. Chavez, F.P., J.T. Pennington, R. Herlien, H. Jannasch, G. Thurmond and G.E. Friederich (1997) Moorings and drifters for real-time interdisciplinary oceanography. Journal of Atmospheric and Oceanic Technology 14, 1199-1211. Chavez, F.P. and C. Collins,eds. (1998) Studies of the California Current System, Deep-Sea Research II, Volume 45. Johnson, K.S., F.P. Chavez and G.E. Friederich (1999) Continental shelf sediment as a primary source of iron for coastal phytoplankton. Nature 398, 697-700. Chavez, F.P., P.G. Strutton, G.E. Friederich, R.A. Feely, G.A. Feldman, D. Foley, and M.J. McPhaden. (1999) Biological and chemical response of the equatorial Pacific Ocean to the 1997-1998 El Niño. Science 286, 2126-2131. Olivieri, R.O. and F.P. Chavez (2000) A model of plankton dynamics for the coastal upwelling system of Monterey Bay, California. Deep-Sea Research II, 47, 1077-1105. Chavez, F.P., D. Wright, R. Herlien, M. Kelley, F. Shane and P.G. Strutton (2000) A device for protecting moored spectroradiometers from bio-fouling. Journal of Oceanographic and Atmospheric Technology, 17, 215-219. Chavez, F.P. and C. Collins,eds. (2000) Studies of the California Current System Part 2, DeepSea Research II, 47, 5-6. Castro, C.G., F.P. Chavez and C. Collins (2001) The role of the California Undercurrent in the export of denitrified waters from the eastern tropical North Pacific. Global Biogeochemical Cycles, 21, 819-830. Chavez, F.P., J.T. Pennington, C.G. Castro, J.P. Ryan, R.M. Michisaki, B. Schlining, P. Walz, K.R. Buck, A. McFayden and C.A. Collins (2002) Biological and chemical consequences of the 1997-98 El Niño in central California waters. Progress in Oceanography, 54, 205232. Chavez, F.P, C.A. Collins, A. Huyer, and D. Mackas eds. (2002) El Niño along the west coast of North America. Progress in Oceanography, 54, 1-6. Chavez, F.P., J.P. Ryan, S. Lluch-Cota and M. Ñiquen C. (2003) From anchovies to sardines and back-Multidecadal change in the Pacific Ocean. Science, 299, 217-221. Curriculum Vitae Stephanie M. Watson Coordinator, Central California Coastal Ocean Observing System (CeNCOOS) Monterey Bay Aquarium Research Institute 7700 Sandholdt Road Moss Landing, CA 95039-9644 [email protected] Professional Preparation University of Maine - Orono Spatial Information Science and Engineering M.S. 2002 University of Maine – Orono Ecology and Environmental Sciences (marine) M.S. 1996 University of Maine – Orono Psychology B.S. 1991 Appointments Coordinator, Central and Northern California Ocean Observing System (CeNCOOS), Monterey Bay Aquarium Research Institute, 2004 - present Marine Mapping Users Group (MMUG) Coordinator/GIS Specialist, Pacific States Marine Fisheries Commission/California Department of Fish and Game, 2003 (one year contract) Research Associate/Project Manager, Department of Spatial Information Science and Engineering, 2001-2003 Coastal Planner, Maine Coastal Management Program, 1998-2000 NOAA Coastal Management Fellow, Florida Coastal Management Program, 1996-1998 (two year contract) Graduate Research Assistant, Department of Resource Economics and Policy, 1995-1996 Publications 1. Watson, S. 2004. Researchers Converge to Improve Metadata Standards, EOS, 85(9), p. 91. 2. Ames, E., S. Watson, and J. Wilson. 2000. Rethinking over fishing: Insights from oral histories with retired fishermen. In B. Neis and L. Felt (Eds.) Finding Our Sea Legs: Linking Fishery People and Their Knowledge with Science and Management. St. John’s, Newfoundland: ISER Books, pp. 153-164. 3. Watson, S. 1999. Where It Counts: Community-based Initiatives in Coastal Management. In Proceedings of the 1999 Coastal Zone Conference, San Diego, CA. 4. Watson, S. 1998. Local Coastal Hazard Mitigation Strategies in Florida. In Proceedings of the th 16 International Conference of The Coastal Society: Minding the Coast – It’s Everybody’s Business, Williamsburg, VA, pp. 304-309. 5. McCleave, J.D., P.J. Brickley, K.M. O’Brien, D. A. Kistner, M.W. Wong, M. Gallagher, and S.M. Watson. 1998. Do leptocephali of the European eel swim to reach continental waters? Status of the question. Journal of the Marine Biological Association of the U.K., 78, 285-306. Synergistic Activities • Development of public web pages providing a centralized resource for metadata development tools, standards, and repositories as well as ocean data standards and Geographic Information System tools. Stephanie Watson 1 • • • Holding Coordinator positions with both the Marine Mapping Users Group (2003-2004) and the Central and Northern California Ocean Observing System (present), both of which are aimed at integrating activities and transferring knowledge in the broad ocean science and coastal management communities Member of the Central Coast Joint Data Committee to promote data sharing along the central California coast Member of the National Federation of Regional Associations to integrate activities of the Regional Associations in IOOS Collaborators and Other Affiliations (a) Collaborators N. Garfield (SFSU), J. Paduan (NPS), M. Moline (Cal Poly SLO), G. Crawford (Humboldt), Y. Chao (JPL), E. Thornton (NPS), J. Doyle (NRL), W. O’Reilly (Scripps), L. Botsford (UC Davis), S. Williams (UC Davis), S. Morgan (UC Davis), J. McWilliams (UC Davis), P. Barnard (USGS), R. Cheng (USGS), D. Wright (OSU) (b) Graduate Advisors Dr. Kate Beard, Department of Spatial Information Science and Engineering, University of Maine Dr. James Wilson, School of Marine Sciences, University of Maine Grants and Awards 1. $10,200,000 “A Coastal Ocean Circulation Monitoring Program for Northern and Central California”, California Coastal Conservancy, 2004 – 2008, co-PI with Garfield, N., J. Paduan, M. Moline, G. Crawford, Y. Chao, E. Thornton, J. Doyle, W. O’Reilly, L. Botsford, S. Williams, S. Morgan, J. McWilliams, P. Barnard, R. Cheng. 2. $100,000 “Development and Implementation of a Marine GIS Curriculum”, NOAA Coastal Services Center, 2003-2005, co-PI with Wright, D. 3. $ 18,000 “Spatial Data Management for the Gulf of Maine Ocean Observing System”, University of Maine, 2001-2002, PI. 4. $101,062 “Clean Marinas and Boatyards: A Casco Bay Demonstration Project”, Environmental Protection Agency, 2000-2003, co-PI with Swanton, S. 5. $52,314 “Weskeag River Estuary Best Management Practices Demonstration”, Environmental Protection Agency, 2000-2002, co-PI with Watson, Susan. 6. $ 4,000 “Nonpoint Source Pollution in the Damariscotta River Watershed – An Education Initiative”, Davis Conservation Foundation, 1999, PI. 7. $ 50,000 “Establishing a Pilot Beach Water Quality Monitoring Program in Florida”, NOAA, 1998-1999, co-PI with Grimm, E. 8. $ 62,000 “Local Coastal Hazard Mitigation in Florida”, Florida Coastal Management Program, NOAA Coastal Management Fellowship, 1996-1998, PI. Stephanie Watson 2 CURRICULUM VITAE Robert Allen Arko Present address: Lamont-Doherty Earth Observatory of Columbia University 61 Route 9W Palisades, NY 10964 USA 845-365-8752 [email protected] Education: Bachelor of Science, Computer Science, Ohio State University, Columbus, OH, Sept. 1992 Master of Science, Computer Science, Columbia University, New York, NY, May 1998 Positions held: Lead Programmer/Analyst, Lamont-Doherty Earth Observatory, Aug. 1998 – present Intermediate Programmer/Analyst, Lamont-Doherty Earth Observatory, Aug. 1995 – Aug. 1998 Programmer/Analyst, Lamont-Doherty Earth Observatory, Sept. 1993 – Aug. 1995 Research Staff Assistant, Lamont-Doherty Earth Observatory, Sept. 1992 – Sept. 1993 Lab Supervisor, Ohio State University, Dec. 1989 – Sept. 1992 Areas of interest: Database design and administration; Metadata interoperability; Real-time data acquisition; Marine, airborne, and coastal geophysics Professional memberships: IEEE Computer Society American Geophysical Union Collaborators: David Becker (TAMU) Robin Bell (LDEO), Suzanne Carbotte (LDEO), Dale Chayes (LDEO), John Diebold (LDEO), Daniel Fornari (WHOI), John Graybeal (MBARI), John Helly (SDSC), Kim Kastens (LDEO), Kerstin Lehnert (LDEO), Stephen Miller (UCSD), William Ryan (LDEO), Timothy Shank (WHOI), Thomas Shipley (UTIG), Stephanie Watson (MBARI), References: Arko, R.A., Chayes, D.N., Carbotte, S.M., Ryan, W.B., Lehnert, K.A., and Shank, T.M. Development of a Rapid, Standardized Data Inventory for R2K Field Programs. Eos Trans. AGU, 84(46), Fall Meet. Suppl., Abstract B12A-0725, 2003. Arko, R.A. and Chayes, D.N. A Web-Based Geospatial Metadata Browser. Eos Trans. AGU, 83(47), Fall Meet. Suppl., Abstract OS62B-0252, 2002. Chayes, D.N. and Arko, R.A. Real-time Metadata Capture Implementations. Eos Trans. AGU, 83(47), Fall Meet. Suppl., Abstract OS62B-0252, 2002. 1
© Copyright 2025