Columbia University Libraries Digital Program

Mellon Audio Preservation Project (2008-2010)
Filenaming & BEXT Specs


 File Naming Specification

  • Master ("Raw") File Names:

    Last_First_CLIO#_TapeSequence#_FaceSequence#_RegionSequence#_m.wav

  • Rendered ("Cooked") File Names:

    Last_First_CLIO#_FileSequence#_r.wav

 Folder / File Hierarchy

1st level folder: [Last Name]_[First Name]_[CLIO ID]
2nd level folder: "Master", "Rendered", "ADL"
3rd level:  [File name]

Example

Andrews_UJ_6880560
  Master
    Andrews_UJ_6880560_01_m.wav
    Andrews_UJ_6880560_01_m.wav.md5
  Rendered
    Andrews_UJ_6880560_01_r.wav
    Andrews_UJ_6880560_01_r.wav.md5
  ADL
    Andrews_UJ_6880560_01.adl
    Andrews_UJ_6880560_01.adl.md5
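
The naming pattern and folder hierarchy above can be sketched in Python as follows. This is a minimal illustration, not part of the specification: the function names (base_name, expected_paths) are hypothetical, and the two-digit zero-padded sequence number is an assumption inferred from the "01" in the example. The sketch follows the single-sequence form shown in the example tree rather than the fuller tape/face/region pattern listed for master files.

```python
from pathlib import PurePosixPath


def base_name(last, first, clio_id):
    """Return the shared prefix: [Last Name]_[First Name]_[CLIO ID]."""
    return f"{last}_{first}_{clio_id}"


def expected_paths(last, first, clio_id, sequence):
    """List the files expected for one sequence number, each with its .md5.

    Assumption: sequence numbers are zero-padded to two digits, as in the
    example tree; the specification does not state the padding explicitly.
    """
    prefix = base_name(last, first, clio_id)
    root = PurePosixPath(prefix)           # 1st level folder
    stem = f"{prefix}_{sequence:02d}"
    files = [
        root / "Master" / f"{stem}_m.wav",    # master ("raw") file
        root / "Rendered" / f"{stem}_r.wav",  # rendered ("cooked") file
        root / "ADL" / f"{stem}.adl",         # ADL file
    ]
    paths = []
    for f in files:
        paths.append(str(f))
        paths.append(str(f) + ".md5")      # checksum file alongside each file
    return paths


if __name__ == "__main__":
    for p in expected_paths("Andrews", "UJ", 6880560, 1):
        print(p)
```

Called with the values from the example, the function yields the six relative paths shown in the tree above, which makes it usable as a checklist when verifying a delivered folder.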

 Broadcast Wave File Audio Extension chunk (BEXT)

  1. Description field (256 char.):
    • Master ("Raw") File will have the following elements, formatted as shown:

      Element                     Source CUL Spreadsheet Element  Example
      Last Name                   Last_Name                       "Baldwin"
      First Name                  First_Name                      "James"
      Project ID                  Project_#                       "Proj: 6b"
      First Session on Tape #     Session(s) & [from audio]       "Session 2a of 4"
      First Session on Tape Date  Recording_Date(s)               "10/14/1963"
      Notes                       Misc_Notes                      "MOLDY"

      E.g.,
        "Baldwin; James; Proj: 6b; session 2a of 4; 10/14/1963; MOLDY"
        "Andrews; U.J.; Proj: 4; session 1 of 1; 8/17/71; with George McElroy, '#41'"
        "Benjamin; Herbert; Proj: 8p; sessions 11 and 12 of 12; 1/5/77-3/15/77; sessions 1-10 on cassette in box 150A, backside label has false info"

    • Rendered ("Cooked") File will have the following elements, formatted as shown:

      Element                  Source CUL Spreadsheet Element         Example
      Last Name                Last_Name                              "Baldwin"
      First Name               First_Name                             "James"
      Individual Session #     Session(s) & from audio review         "Session 2a of 4"
      Individual Session Date  Recording_Date(s) & from audio review  "10/14/1963"

      E.g.,
        "Baldwin; James; Session 2a of 4; 10/14/1963"
        "Andrews; U.J.; Session 1 of 1; 8/17/71"

  2. Originator field (32 char.): "Columbia University Libraries"

  3. Originator Reference (32 char.): [CLIO ID] in format  "CLIO:6880560"

  4. Origination Date (10 char.):
    "Ten ASCII characters containing the date of creation of the audio sequence. The format
    is “yyyy-mm-dd” (year-month-day)."

  5. Coding History fields:
    "Non-restricted ASCII characters, containing a collection of strings terminated by CR/LF.
    Each string contains a description of a coding process applied to the audio data.
    Each new coding application is required to add a new string with the appropriate information."
    (See EBU R98-1999, format for the <CodingHistory> field in BWF files.)
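
The BEXT field rules above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the project's actual tooling: all function and constant names are hypothetical, the "; " separator is taken from the examples, and the ASCII check on the Description and Originator fields is an assumption (the text above states ASCII explicitly only for the Origination Date and Coding History). Writing the values into a WAV file is out of scope here.

```python
from datetime import date

DESCRIPTION_MAX = 256          # Description field (256 char.)
ORIGINATOR_MAX = 32            # Originator field (32 char.)
ORIGINATOR_REFERENCE_MAX = 32  # Originator Reference field (32 char.)
ORIGINATOR = "Columbia University Libraries"


def check_field(value, limit, field):
    """Return value unchanged if it is ASCII and fits in the field."""
    encoded = value.encode("ascii")  # raises UnicodeEncodeError otherwise
    if len(encoded) > limit:
        raise ValueError(f"{field} is {len(encoded)} chars; limit is {limit}")
    return value


def master_description(last, first, project, session, rec_date, notes=""):
    """Description for a master ("raw") file; notes are appended if present."""
    parts = [last, first, f"Proj: {project}", session, rec_date]
    if notes:
        parts.append(notes)
    return check_field("; ".join(parts), DESCRIPTION_MAX, "Description")


def rendered_description(last, first, session, rec_date):
    """Description for a rendered ("cooked") file: no project ID or notes."""
    parts = [last, first, session, rec_date]
    return check_field("; ".join(parts), DESCRIPTION_MAX, "Description")


def originator():
    """Fixed Originator value, checked against the 32-character limit."""
    return check_field(ORIGINATOR, ORIGINATOR_MAX, "Originator")


def originator_reference(clio_id):
    """CLIO ID in the format "CLIO:6880560"."""
    return check_field(f"CLIO:{clio_id}", ORIGINATOR_REFERENCE_MAX,
                       "OriginatorReference")


def origination_date(d):
    """Ten ASCII characters in yyyy-mm-dd form."""
    return d.isoformat()


def coding_history(entries):
    """Join coding-process strings, terminating each one with CR/LF."""
    return "".join(check_field(e, len(e), "CodingHistory") + "\r\n"
                   for e in entries)


if __name__ == "__main__":
    print(master_description("Baldwin", "James", "6b",
                             "session 2a of 4", "10/14/1963", "MOLDY"))
    print(rendered_description("Baldwin", "James",
                               "Session 2a of 4", "10/14/1963"))
    print(originator_reference(6880560))
    print(origination_date(date(2009, 2, 19)))
```

Checking the length at the point where each string is built means an over-long Notes entry is caught before any file is written, rather than being silently truncated by whatever tool embeds the chunk.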

Change Log:

2009-02-19:

  • for the master file, the documentation now clarifies that the session number and session date are really just the _first_ session number and date on the master tape; it also inserts "Proj:" ahead of the project number element for intelligibility.

  • for the rendered file, it adds two session-level metadata elements to the Description, for individual session number and session date; it removes the CLIO ID element, on the grounds that it will already be in the BEXT segment in the Originator Reference field (true for the master file as well).

 

 

Last revision: 02/20/09
© Columbia University Libraries