Catalogue of Data Repositories

The first DReSNet face to face meeting agreed to set up a catalogue of active Data Repositories to help us to create an ontology of E-Science Digital Repositories, and to circumscribe the issues and activities that we want to focus on.

The repositories represented in this catalogue could be international BioScience Gene databases, or lab- or project-specific repositories or even Institutional Repositories with a significant data holding. The important criteria are

  • a significant data is stored in the environment
  • it is accessible through some well specified means
  • data is collected from (or deposited by) users.

This would rule out POWS (plain old web sites), institutional repositories containing a handful of Excel files, third-party aggregations or republished data portals.

Nominations for addition to this list can be emailled to Les Carr.

eCrystals Southampton
Archive for Crystal Structures generated by the Southampton Chemical Crystallography Group and the EPSRC UK National Crystallography Service.

WorldWideMolecularMatrix Collection at Cambridge Institutional Repository
This collection will contain the calculated properties of over 200,000 Open molecules provided by the US National Cancer Institute (NCI), using semi-empirical quantum-mechanical methods. Properties include heat of formation, 3-dimensional structure, dipole moment and ionization potential. Each molecule, in Chemical Markup Language (CML) is in a separate entry indexed by NSC number (NCI).