Columbia Escutcheon

Columbia University Libraries Digital Program

Mellon Audio Preservation Project (2008-2010)
Filenaming & BEXT Specs
  Path: Digital Library Projects  : Digital PreservationMellon Audio Preservation :  File naming, BEXT,


File Naming & BEXT Specification, rev. 8/23/2010

NB: "LMS#" is "library management system ID", in Columbia's case the CLIO ID.

A. File Naming

  • Master ("Raw") File Names (96 kHz / 24 bit):

    Last_First_LMS#_TapeSequence#_FaceSequence#_[part#]_m.wav
    (where "part #" appears only if relevant)

  • Rendered ("Cooked") File Names (96 kHz / 24 bit):

    Last_First_LMS#_Session#_[part#]_r.wav
    (where "part #" appears only if relevant)

  • Service Files From Rendered (44 kHz / 16 bit)

    Last_First_LMS#_Session#_[part#]_s.wav

 B. Folder / File Hierarchy

1st level folder name: [Last Name]_[First Name]_[LMS ID]
     Overall METS file, checksum; Complete ADL file, checksum
     2nd level folder name: "Master"
          Master files, METS files checksums,
     2nd level folder name: "Rendered"
          Rendered files, METS file, checksums
     2nd level folder name: "Service"
          Service files, METS file, checksums

Example

Andrews_UJ_6880560
  Andrews_UJ_6880560_mets.xml
  Andrews_UJ_6880560_mets.xml.md5
  Andrews_UJ_6880560.adl
  Andrews_UJ_6880560.adl.md5
  Master
    Andrews_UJ_6880560_01_01_m.wav
    Andrews_UJ_6880560_01_01_m.wav.md5
    Andrews_UJ_6880560_01_02_m.wav
    Andrews_UJ_6880560_01_02_m.wav.md5
    Andrews_UJ_6880560_01_mets.xml
    Andrews_UJ_6880560_01_mets.xml.md5
    Andrews_UJ_6880560_02_01_pt1_m.wav
    Andrews_UJ_6880560_02_01_pt1_m.wav.md5
    Andrews_UJ_6880560_02_01_pt2_m.wav
    Andrews_UJ_6880560_02_01_pt2_m.wav.md5
    Andrews_UJ_6880560_02_02_m.wav
    Andrews_UJ_6880560_02_02_m.wav.md5
    Andrews_UJ_6880560_02_mets.xml
    Andrews_UJ_6880560_02_mets.xml.md5
     
  Rendered
    Andrews_UJ_6880560_01_r.wav
    Andrews_UJ_6880560_01_r.wav.md5
    Andrews_UJ_6880560_02_pt1_r.wav
    Andrews_UJ_6880560_02_pt1_r.wav.md5
    Andrews_UJ_6880560_02_pt2_r.wav
    Andrews_UJ_6880560_02_pt2_r.wav.md5
    Andrews_UJ_6880560_r_mets.xml
    Andrews_UJ_6880560_r_mets.xml.md5
     
Service
Andrews_UJ_6880560_01_s.wav
Andrews_UJ_6880560_01_s.wav.md5
Andrews_UJ_6880560_02_pt1_s.wav
Andrews_UJ_6880560_02_pt1_s.wav.md5
Andrews_UJ_6880560_02_pt2_s.wav
Andrews_UJ_6880560_02_pt2_s.wav.md5
     


 C. Broadcast Wave File Audio Extension chunk (BEXT)

  1. Description field (256 char.):
    • Master ("Raw") File will have the following elements, formatted as shown:

      Element Source CUL Spreadsheet Element Example
      Last Name Last_Name "Baldwin"
      First Name First_Name "James"
      First Session on Tape # Session(s) & [from audio] "Session 2 of 4"
      First Session on Tape Date Recording_Date(s) "10/14/1963"
      Notes Misc_Notes "MOLDY'"

      E.g.,
        "Baldwin; James; session 2 of 4; 10/14/1963; MOLDY" 
         "Andrews; U.J.; session 1 of 1; 8/17/71; with George McElroy, '#41'"
         "Benjamin; Herbert; sessions 11 and 12 of 12; 1/5/77-3/15/77; sessions 1-10 on cassette in box 150A, backside label has false info"

    • Rendered ("Cooked") File will have the following elements, formatted as shown:

      Element Source CUL Spreadsheet Element Example
      Last Name Last_Name "Baldwin"
      First Name First_Name "James"
      Individual Session # Session(s) & from audio review "Session 2 of 4
      Individual Session Date Recording_Date(s) & from audio review "10/14/1963"
      Part # n/a "pt2"

      E.g.,
       "Baldwin; James; Session 2a of 4; 10/14/1963"   
       "Andrews; U.J; Session 1 of 1; 8/17/71; pt2"
      "Jones, Jenny; Session 3 of 3; 4/25/78, 4/27/78"

  2. Originator field (32 char.): "Columbia University Libraries"

  3. Originator Reference (32 char.): [LMS ID] in format  "CLIO:6880560"

  4. Origination Date
    Date file generated. "Ten ASCII characters containing the date of creation of the audio sequence. The format is “yyyy-mm-dd” (year-month-day)."

  5. Coding History fields:
    "Non-restricted ASCII characters, containing a collection of strings terminated by CR/
    LF. Each string contains a description of a coding process applied to the audio data.
    Each new coding application is required to add a new string with the appropriate information." (See R98-1999 format for the <CodingHistory> field in BWF files.)

Previous specs

Change Log:

2010-08-23:

  • File list updated to remove possible service copy filenames ("Andrews_UJ_6880560*_s_*mets.xml" and "Andrews_UJ_6880560*_s_*mets.xml.md5") since these are referenced in the Rendered files METS record.

2009-03-10:

  • Under File Naming added spec for service files.
  • Under Folder/File Hierarchy, added examples of service file folder / names
  • Under BEXT added example of rendered file BEXT chunk where a single interview session extended over more than one day.

2009-02-20:

  • Master File Name spec: 
    • removed "Regionsequence#" element;
    • added "part#" (present only if applicable)

  • Rendered File Name spec:
    • changed "sequence#" to "session#"
    • added "part#" element (present only if applicable)

  • Folder hierarchy: move ADL file to first level

  • Example: 
    • Updated Master file examples to include TapeSequence, Face Sequence, Part#
    • Updated Rendered file exmample to include Part#

  • BEXT
    • Master File Description:
      • Removed "Project ID" element
      • Changed "Session 2a of 4" to "Session 2 of 4"

    • Rendered File Description
      • Changed "Session 2a of 4" to "Session 2 of 4 "
      • Added "Part #" element

2009-02-19:

  • for the master file, the documentation now clarifies that the session number and session date are really just the _first_ session number and date on the master tape; it also inserts "Proj:" ahead of the project number element for intelligibility.

  • for the rendered file, it adds two session-level metadata elements to the Description, for individual session number and session date; it removes the LMS ID element, on the grounds that it will already be in the BEXT segment in the Originator Reference field (true for the master file as well).

 


Columbia Libraries    Digital Program
Last revision: 08/23/10
© Columbia University Libraries