HOW TO SQUEEZE ART OBJECT SUBJECT METADATA OUT OF SCHOLARLY TEXTS
The Basics
- Start with a collection that
  - has descriptive cataloging information at the item level and
  - includes links or potential links to (digital) (art) objects
- Identify or create a target object ‘authority list’ for the collection being cataloged
- Select scholarly texts that provide rich description at the item level for objects in the image collection; scan and encode them with barebones TEI markup
- Process each scholarly text to identify, with a high degree of accuracy, each mention of a target art object (TOI)
- Parse each text to identify noun phrases and other likely metadata-bearing content
- Process each text to identify and correlate all text blocks (sentences, paragraphs, pages, chapters, footnotes, captions, etc.) that appear to refer to specific target objects (SEGMENTATION)
- Identify the important phrases and vocabulary that can be used as metadata
using:
- Format and tag the metadata, incorporate it into the corresponding descriptive item records with links to art objects, and load the records into a bibliographic and/or image search system
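
The mention-finding and segmentation steps above can be illustrated with a minimal sketch. It is not the actual toolchain: it assumes the authority list is a simple mapping from object identifier to name variants, uses a naive regular-expression sentence splitter, and treats a sentence as referring to an object only when a name variant appears in it. The object IDs, object names, and the function names build_pattern, split_sentences, segment_by_object and to_records are all invented for illustration.

```python
import re
from collections import defaultdict

# Hypothetical authority list: object ID -> name variants (invented data).
AUTHORITY = {
    "obj-001": ["Blue Vase", "Large Blue Vase"],
    "obj-002": ["Harbor Window"],
}


def build_pattern(authority):
    """Compile one regex that matches any name variant, longest first."""
    variant_to_id = {}
    for obj_id, variants in authority.items():
        for variant in variants:
            variant_to_id[variant.lower()] = obj_id
    # Longest variants first so "Large Blue Vase" wins over "Blue Vase".
    alternatives = sorted(variant_to_id, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(a) for a in alternatives) + r")\b",
        re.IGNORECASE,
    )
    return pattern, variant_to_id


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def segment_by_object(text, authority):
    """Map each object ID to the sentences that mention it by name."""
    pattern, variant_to_id = build_pattern(authority)
    segments = defaultdict(list)
    for sentence in split_sentences(text):
        ids = {variant_to_id[m.group(0).lower()]
               for m in pattern.finditer(sentence)}
        for obj_id in sorted(ids):
            segments[obj_id].append(sentence)
    return dict(segments)


def to_records(segments):
    """Shape the segments as simple per-object records, ready for tagging."""
    return [
        {"object_id": obj_id, "source_passages": passages}
        for obj_id, passages in sorted(segments.items())
    ]


if __name__ == "__main__":
    sample = (
        "The Blue Vase sits in the east gallery. Its glaze is cobalt. "
        "The Harbor Window, unlike the blue vase, uses leaded glass."
    )
    for record in to_records(segment_by_object(sample, AUTHORITY)):
        print(record)
```

In this sketch the sentence "Its glaze is cobalt." is attached to no object, because it refers to the vase only by pronoun. That gap is why segmentation in practice has to correlate larger text blocks (paragraphs, captions, footnotes) with target objects rather than rely on name matches alone.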