[Moims-dai] Desirable preservation ecosystem characteristics

Mike Kearney kearneysolutions at gmail.com
Wed Jul 11 14:01:54 UTC 2018


Excellent, David, thanks!  I'm not sure that's exactly what's meant by
biological ecosystem, but regardless it's certainly coming at it from a
different, and productive, direction.  

 

One of your bullets caught my attention.

o   effort can be shared e.g. one repository may look after a variety of
Representation Information to be used by other repositories

 

When I was at the RDA I spoke with Larry Lannom about his Data Type
Registries project.  Take his example and your example, and perhaps there is
a more general statement about registries (public or commercial) for Data
Types, Representation Information, ontologies and other key items/terms
being available to help (1) initialization of a preservation archive (faster
startup); and (2) better consistency across the preservation archives that
use those registries.  Also, that better consistency will enhance
interoperability for users accessing multiple archives because of common
usage of those items/terms.   

 

But perhaps the more general statement is that such registries appear in the
ecosystem and preservation archives evolve to take advantage of them.  

 

   -=- Mike

 

Mike Kearney

Huntsville, Alabama, USA

 

From: MOIMS-DAI [mailto:moims-dai-bounces at mailman.ccsds.org] On Behalf Of
David Giaretta
Sent: Wednesday, July 11, 2018 7:49 AM
To: 'MOIMS-Data Archive Interoperability' <moims-dai at mailman.ccsds.org>
Subject: Re: [Moims-dai] Desirable preservation ecosystem characteristics

 

Hi Mike

 

You have a good list there so I'd like to come at it from another direction.

 

The OED has two definitions for ecosystem (1) "A biological community of
interacting organisms and their physical environment" and (2) "(in general
use) a complex network or interconnected system." 

 

While the second definition is general and can be aligned with technology
rather more easily, I like to bring in the first definition because 

(1)    It brings in people

(2)    It leads one to think of things changing and evolving

(3)    It leads on to think about specialisations, death, food chains and
dependencies

 

That leads one to say that in 50-100 years we should have:

-          people aware in the requirements of preservation 

o   producers of information creating/recording

*  data management plans linked to preservation i.e. capturing all the types
of "metadata" required along the way. This must include the semantic
information about the data/documents

*  support systems should help as much as possible

o   preservers of information being aware of:

*  who they are preserving for i.e. the Designated Community and the
"metadata" likely to be required (OAIS Representation Information including
formats, semantics and software etc)

*  potential uses of data including numerical data

*  the costs of preservation

o   funders (public and commercial) who are aware of:

*  the potential value of information

-          systems/processes which can cope with change and evolution:

o   implementation independent designs

o   standard, long lived / slowly evolving / backward compatible interfaces
to data and systems e.g. programming interfaces for accessing/searching
archives and for extracting components from Archival Information Packages

o   appraisal processes which help to decide if it is worth continuing to
preserve specific information

o   ways to monitor the evolving needs of Designated Communities so that
repositories can ensure that the information they are preserving can be
understood/used etc

-          specialisations, death, food chains and dependencies 

o   each repository is probably good at certain things and not at others, so
it should be clear which is which. As things such as people, technology and
funding change, a repository which is good at one time may not be good later
on.

o   effort can be shared e.g. one repository may look after a variety of
Representation Information to be used by other repositories

o   any particular repository may cease to operate so it will need to be
ready to hand over its holdings to another repository

o   information is linked so that as new information is created it may
affect the value of other previously existing information

 

Regards

 

..David

 

From: MOIMS-DAI <moims-dai-bounces at mailman.ccsds.org
<mailto:moims-dai-bounces at mailman.ccsds.org> > On Behalf Of Mike Kearney
Sent: 10 July 2018 21:36
To: 'MOIMS-Data Archive Interoperability' <moims-dai at mailman.ccsds.org
<mailto:moims-dai at mailman.ccsds.org> >
Subject: [Moims-dai] Desirable preservation ecosystem characteristics

 

Vint Cerf is a preservation evangelist, and in discussions and presentations
he makes, he's considering adding this topic:  Desirable preservation
ecosystem characteristics.  The idea is that if we know what we want the
preservation environment to be 50 or 100 years from now, we will have be
better equipped to chart a course (at least the first steps) to get there.
Vint asked me my opinion on that, and I'm asking the DAI WG community.  

 

I'm doing this off the top of my head right now, so here goes my first cut
at it:   

 

Trustworthiness of archives:  

*         A majority of archives that are certified trustworthy.   

*         Broad community agreement on what constitutes trustworthiness and
the process for certifying it.   

*         Global prioritization of the need for certification of key
archives.  

 

Interoperability of archives:  

*         Well understood and widely available user interface systems that
function broadly for multi-archive access.  

*         Archive-to-Archive communications that are easily and quickly
configured.

*         Distributed archives that allow transparent access to all
preserved data within their distributed "realm."

 

Access to archives (besides above):  

*         (From the RDA) Data sharing without boundaries.  Social and
technical infrastructure that enables open sharing of data.  

*         Well-understood and easily-waiverable system of Intellectual
Property management - for example, a system to quickly secure rights to
download YouTube videos by national archivists.  

*         Effective archive security that does not interfere with authorized
user access.  

 

Future tech:  

*         Storage capacity:  A better, faster, cheaper More's Law.  

*         AI Access to preservation archives; AI helpers that can assist
archive users whether they are part of the Designated Community or not.  

*         Auto-preservation.  Systems that automatically compile the
*correct* metadata (Representation Info) as object data is generated.  

*         AI systems that can generate warnings like "Uh oh. you're not
compiling the correct metadata!"   

 

What's on your list?  

 

   -=- Mike

 

Mike Kearney

Huntsville, Alabama, USA

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.ccsds.org/pipermail/moims-dai/attachments/20180711/b5565bc5/attachment.html>


More information about the MOIMS-DAI mailing list