15
Characterisation Adrian Brown The National Archives, UK

Characterisation Adrian Brown The National Archives, UK

Embed Size (px)

Citation preview

Page 1: Characterisation Adrian Brown The National Archives, UK

Characterisation

Adrian Brown

The National Archives, UK

Page 2: Characterisation Adrian Brown The National Archives, UK

Overview

• Develop tools and services to characterise the significant properties of digital objects, to support:– Development of preservation plans– Validation of preservation actions (evaluating

change)

• The subproject considers:– Representation properties– Inherent properties

Page 3: Characterisation Adrian Brown The National Archives, UK

Aims & Objectives

• To deliver:– Methodologies for describing significant

properties– Tools and services for automating

measurement and comparison of these properties

– Recommendations for improving the preservation characteristics of digital object types

Page 4: Characterisation Adrian Brown The National Archives, UK

Aims & Objectives

Page 5: Characterisation Adrian Brown The National Archives, UK

Achievements (Year 1)

• Characterisation registry

• Property description and extraction methodology and tools

• Characterisation tool framework

Page 6: Characterisation Adrian Brown The National Archives, UK

Characterisation registry

• First iteration registry (bringing PRONOM to its next generation)

• Persistent Unique Identifier scheme for registry information

• Support for registry-driven characterisation tool framework

Page 7: Characterisation Adrian Brown The National Archives, UK
Page 8: Characterisation Adrian Brown The National Archives, UK

Describing and extracting characteristics

• Extensible Characterisation Description Language (XCDL)

• Extensible Characterisation Extraction Language (XCEL)

Page 9: Characterisation Adrian Brown The National Archives, UK

Migrator

tiff

png

Extractor

tiff XCEL png XCEL

... XCEL... XCEL

Comparer

png XCDL

tiff XCDL

93%

XCDL & XCEL

Page 10: Characterisation Adrian Brown The National Archives, UK

XCDL/XCEL tools

• Command line interface for extractor

• Preliminary specification for comparator

• GUI for extractor experiments

Page 11: Characterisation Adrian Brown The National Archives, UK

GUI example

Page 12: Characterisation Adrian Brown The National Archives, UK

Characterisation tool framework

• Registry-driven framework for automated deployment of tools

• Initial tools implemented:– DROID– JHOVE– Java POI (MS Office documents)– JAXP (XML validation)

Page 13: Characterisation Adrian Brown The National Archives, UK
Page 14: Characterisation Adrian Brown The National Archives, UK

Planned activities (Year 2)

• Final XC*L specifications

• Characterisation registry (iteration 2)

• Representation Information Registries White Paper

• XCDL extraction tool

• Characterisation tool wrapper specification

• Emerging technologies report

Page 15: Characterisation Adrian Brown The National Archives, UK

Thank you!