Scientific Annotation Middleware (SAM) Project Status

Quarterly Reports

Status of Deliverables

Phase 1: (~15 months) will include architecture definition, requirements gathering from SciDAC and related projects, technology selection, and basic operational organization. We will begin to build basic SAM components and services according to the requirements and anticipate release of beta capabilities by the end of this phase. Key foci will be development of basic metadata services and a transitional DAV-based notebook capability, reusing DOE2000 technologies that can be delivered to collaboratory pilots and other SciDAC projects.

July 2001

  • Estimated start date of project - funding in place and staff working at proposed levels-of-effort by ~mid-August
  • Cross-site collaboration capabilities in place - NetMeeting and VNC chosen as initial tools

September 2001 (3 months)

  • Preliminary design specification for metadata management services (MMS) - Internal requirements/design documentation 12/01, public 3/02
  • High-level project timeline revised based on discussions with SciDAC pilot and middleware projects - informal, continuing discussions, shifting initial focus towards hardening MMS capabilities, 6/02
  • Project website established - Site at http://collaboratory.pnl.gov/docs/collab/sam/, aliased to http://www.scidac.org/SAM/

December 2001 (6 months)

  • Project software development environment in place - project design notebook running 9/01, ssh-protected CVS repository in place and being populated 12/01
  • Preliminary design specification for semantic services (SS) - Internal requirements/design documentation 12/01, requirements made public 3/02

March 2002 (9 months)

  • Alpha MMS capabilities (basic federation and metadata translation and generation capabilities) Notification and enhanced DAV client, 3/02, Metadata generation 6/02, Translation 9/02
  • Preliminary design specification for notebook services (NS) - Internal requirements documentation 2/02, requirements made public 3/02
  • Preliminary design specification for interface components (IC) - Internal requirements documentation 12/01, requirements made public 3/02
  • Transitional DAV-enabled notebook released - Demonstrated at SC2002, 11/02, released as part of SAM 1.0/1.1 7/03

July 2002 (1 year)

  • System design white paper - "Reintegrating the Research Record" invited paper for Computing in Science and Engineering (CISE) special issue on Scientific Databases, draft 12/02, published 5/03, semantic service directions described in "Multi-scale Science: Supporting Emerging Practice with Semantically-Derived Provenance", Myers,J., Pancerella,C. Lansing, C., Schuchardt, K., Didier, B., submitted 7/03, Semantic Web Technologies for Searching and Retrieving Scientific Data Workshop
  • Alpha SS capabilities (relationship description and query capabilities) - postponed in favor of continued MMS refinement

September 2002 (1 year, 3 months)

  • MMS 1.0 release – including documented API’s and support - software available upon request, 12/02, released as part of SAM 1.0 6/03
  • Preliminary XML/RDF schema for data pedigree information (developed in collaboration with collaboratory pilots) - Preliminary XML/DAV property definitions for pedigree information implemented within CMCS project, 11/02, schema design described in Pancerella, C.M., Myers, J.D., et. al., "Metadata in the Collaboratory for Multi-Scale Chemical Science", accepted, Proceedings of the 2003 Dublin Core Conference: Supporting Communities of Discourse and Practice-Metadata Research and Applications (DC 2003), September 28-October 2, 2003, Seattle, Washington.

Phase 2: (~18 months) will focus on incremental improvement and expansion of SAM capabilities. As we deploy versions of SAM components we will begin to institute monitoring and feedback activities to understand how the system is being used and will use this formation to guide further developments. During this phase we expect to maintain the basic system and add to its capabilities. Key foci will be the implementation of semantic and notebook services.

December 2002 (1 year, 6 months)

  • SS 1.0 release - XML/RDF Export of pedigree/provenance information 8/03, "Data Provenance in the Collaboratory for Multi-scale Chemical Science (CMCS)”, Pancerella, C., Myers, J., Rahn, L., and “Design Constraints for Scientific Annotation Systems”, Myers, J. position papers developed for Workshop on Data Derivation and Provenance, Chicago, Oct. 17-18, 2002
  • Alpha NS capabilities (basic pagination and display capabilities) - available in transitional notebook 11/02
  • Alpha IC capabilities (basic add/query/update/retrieve capabilities, semantic browsing) - SAM-compatible Portal-based compenents for browsing data/metadata and pedigree developed by CMCS project
  • Review of SAM scope and schedule with SciDAC pilot and middleware projects - Ongoing interactions with CMCS and Pervasive Collaborative Computing Environment (PCCE) projects, 12/02
  • Integration of MMS capabilities within SciDAC collaboratory pilot(s) - SAM MMS integrated into CMCS Pilot, planned for use in SciDAC Chemistry collaborations, 11/02

July 2003 (2 years)

  • NS 1.0 release - Initial suite of notebook operation and management services released as part of SAM 1.0/1.1. Prototype notarization service developed. 8/03
  • IC 1.0 release - Annotation Applet developed 8/03
  • Integration of SS capabilities for data pedigree management with SciDAC collaboratory pilot(s)- - Capabilities for XML/RDF and Graph Exchange Language (GXL) export of pedigree/provenance information implemented and used in the Collaboratory for Multiscale Chemical Science

December 2003 (2 years, 6 months)

  • System implementation paper - "Reintegrating the Research Record" invited paper for Computing in Science and Engineering (CISE) special issue on Scientific Databases, draft 12/02, published 5/03
  • MMS 1.5 Release (binary file metadata extraction capabilities) - available in 1.0 release, 6/02, support for web service translators, additional management services, 8/03
  • Alpha SAM-based notebook interface - Annotation Portlet/Applet available 8/03
  • Integration of IC capabilities within SciDAC collaboratory pilot(s) - CMCS-developed components working as integral part of CMCS portal 3/03

Phase 3: (~18 – 24 months) will begin the transition of SAM maintenance to other sources allowing for advanced research into emerging issues related to SAM’s goals of improving the utility and completeness of scientific records. Key foci will be the expanding the functionality of integration components and the SAM notebook interface. As users come to rely on SAM, we will seek to assure long-term support by fully transitioning SAM evolution to the open source community, and/or investigate other methods of support. As this occurs, the project will focus on identifying next-generation annotation challenges for DOE and investigating prototype solutions.

July 2004 (3 years)

  • SS 1.5 release (semantic translation capabilities)
  • SAM 1.0 notebook interface released
  • Formal survey of developers and end-users
  • Review of SAM scope and schedule with SciDAC pilot and middleware projects
  • Formal plan for long-term maintenance and evolution of SAM

July 2005 (4 years)

  • SAM maintenance transition to alternate funding begins
  • System evaluation paper
  • NS 1.5 release (notebook tracking, configurable notebook policies)
  • IC 1.5 release (dynamic notebook interface interaction capabilities)

July 2006 (5 years)

  • Project completion
  • SAM notebook interface 1.5 release
  • White paper analyzing state-of-the-art metadata management and annotation capabilities
  • Report on preliminary investigations of new directions

Last updated: 8/4/03