Developing a Knowledge Site in Distributed Information Environments: Using a Phased Approach of Harvesting, Standardizing and Repurposing

Publication Type  Conference Paper
Year of Publication  2003
Authors  Borda, Ann; Beler, Alpay
Conference Name  International Cultural Heritage Informatics Meeting: Proceedings from ichim03
Publisher  Archives & Museum Informatics
Conference Location  École du Louvre, Paris, France
Editor  Perrot, Xavier
Keywords  ichim; ichim03; Harvesting; Dublin core; information seeking; resource discovery; web personalization; content management
Abstract  

This paper describes an approach to creating a publicly accessible ‘knowledge’ site (‘Science & Culture’) by accessing, standardizing, and repurposing digital resources from across distributed environments. The work was undertaken in two key phases: first, implementing a harvesting model to overcome the challenge of retrieving and managing information from distributed source systems; second, integrating and delivering this content to the web and to other channels. Phase 1 involved the development of a batch content hub (‘interim database’), which was built using Dublin Core (DC) fields as the primary data structure and which served as the central container for export files originating from five separate source systems. The interim database also served to normalize data, generate automatic fields, and process data through specific tools. In Phase 2, the data records held in the interim database were extracted as XML-wrapped DC fields to the web content management system (CMS). The extracted metadata was subsequently integrated and managed for display, for building searches and relational links with other data objects, and for supporting personalization functionalities and user tools.
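
The two phases summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the source column names, the field mapping, the identifier scheme, the `record` wrapper element and the sample row are all hypothetical, chosen only to show a source export row being normalized into Dublin Core fields (Phase 1) and then serialized as XML-wrapped DC elements for a CMS (Phase 2).

```python
import xml.etree.ElementTree as ET

# Dublin Core element set namespace (the standard DC 1.1 URI).
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)

# Hypothetical mapping from one source system's export columns to DC elements.
FIELD_MAP = {"obj_name": "title", "maker": "creator", "made": "date"}


def normalize(source_row, source_id):
    """Phase 1 (sketch): map a source export row onto DC fields."""
    record = {}
    for src_field, dc_field in FIELD_MAP.items():
        # Collapse runs of whitespace and drop empty values.
        value = " ".join(source_row.get(src_field, "").split())
        if value:
            record[dc_field] = value
    # Example of an automatically generated field (assumed scheme).
    record["identifier"] = f"{source_id}:{source_row['id']}"
    return record


def to_dc_xml(record):
    """Phase 2 (sketch): serialize a record as XML-wrapped DC fields."""
    root = ET.Element("record")  # hypothetical wrapper element
    for name in sorted(record):
        child = ET.SubElement(root, f"{{{DC_NS}}}{name}")
        child.text = record[name]
    return ET.tostring(root, encoding="unicode")


# Invented sample row, for illustration only.
row = {"id": "42", "obj_name": "  Sample   Object ", "maker": "Doe, Jane", "made": ""}
rec = normalize(row, "sys1")
xml_text = to_dc_xml(rec)
print(xml_text)
```

Keeping the field mapping in a per-source table, as assumed here, is one way a single interim store could accept exports from several source systems while presenting a uniform DC structure downstream.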

URL  http://www.archimuse.com/publishing/ichim03/085C.pdf
