Speaker
Description
Dataverse is an open source data repository solution being
increasingly adopted by research organizations and user
communities for data sharing and preservation. Datasets
stored in Dataverse are cataloged, described with metadata,
and can be easily shared and downloaded. In the context of
the development of a pilot catchall data repository for the
Portuguese research community we have studied performance,
availability and recovery aspects for such installation.
In this presentation we will focus on the performance
measurements we obtained with different kinds of storage
systems and the backup and recovery architecture which we
developed. We aim at shedding some light on storage and
backup solutions for Dataverse that can be also applied
to other systems.