File Naming & BEXT Specification, rev. 8/23/2010
NB: "LMS#" is "library management system ID", in Columbia's case the CLIO ID.
A. File Naming
- Master ("Raw") File Names (96 kHz / 24 bit):
Last_First_LMS#_TapeSequence#_FaceSequence#_[part#]_m.wav
(where "part#" appears only if relevant)
- Rendered ("Cooked") File Names (96 kHz / 24 bit):
Last_First_LMS#_Session#_[part#]_r.wav
(where "part#" appears only if relevant)
- Service Files From Rendered (44 kHz / 16 bit):
Last_First_LMS#_Session#_[part#]_s.wav
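The three naming patterns above can be checked mechanically. The following is a minimal Python sketch, not part of the spec: the helper name `parse_name` is hypothetical, and the token shapes (names without underscores, a numeric LMS ID, numeric sequence numbers, parts written as "pt1", "pt2") are assumptions inferred from the example file names in this document.

```python
import re

# Assumed token shapes, inferred from the examples in this spec rather than
# stated by it: names without underscores, numeric LMS ID, numeric sequence
# numbers, and parts written as "pt1", "pt2", ...
_NAME = r"(?P<last>[^_]+)_(?P<first>[^_]+)_(?P<lms_id>\d+)"
_PART = r"(?:_(?P<part>pt\d+))?"

# Master: Last_First_LMS#_TapeSequence#_FaceSequence#_[part#]_m.wav
MASTER_RE = re.compile(
    _NAME + r"_(?P<tape>\d+)_(?P<face>\d+)" + _PART + r"_m\.wav"
)
# Rendered / Service: Last_First_LMS#_Session#_[part#]_r.wav or _s.wav
DERIVED_RE = re.compile(
    _NAME + r"_(?P<session>\d+)" + _PART + r"_(?P<kind>[rs])\.wav"
)


def parse_name(filename):
    """Return a dict of name components, or None if no pattern matches."""
    for pattern in (MASTER_RE, DERIVED_RE):
        match = pattern.fullmatch(filename)
        if match:
            return match.groupdict()
    return None
```

For a name with no part element, the "part" entry of the returned dict is None.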
B. Folder / File Hierarchy
1st level folder name: [Last Name]_[First Name]_[LMS ID]
  Overall METS file, checksum; Complete ADL file, checksum
2nd level folder name: "Master"
  Master files, METS files, checksums
2nd level folder name: "Rendered"
  Rendered files, METS file, checksums
2nd level folder name: "Service"
  Service files, METS file, checksums
Example
Andrews_UJ_6880560
    Andrews_UJ_6880560_mets.xml
    Andrews_UJ_6880560_mets.xml.md5
    Andrews_UJ_6880560.adl
    Andrews_UJ_6880560.adl.md5
    Master
        Andrews_UJ_6880560_01_01_m.wav
        Andrews_UJ_6880560_01_01_m.wav.md5
        Andrews_UJ_6880560_01_02_m.wav
        Andrews_UJ_6880560_01_02_m.wav.md5
        Andrews_UJ_6880560_01_mets.xml
        Andrews_UJ_6880560_01_mets.xml.md5
        Andrews_UJ_6880560_02_01_pt1_m.wav
        Andrews_UJ_6880560_02_01_pt1_m.wav.md5
        Andrews_UJ_6880560_02_01_pt2_m.wav
        Andrews_UJ_6880560_02_01_pt2_m.wav.md5
        Andrews_UJ_6880560_02_02_m.wav
        Andrews_UJ_6880560_02_02_m.wav.md5
        Andrews_UJ_6880560_02_mets.xml
        Andrews_UJ_6880560_02_mets.xml.md5
    Rendered
        Andrews_UJ_6880560_01_r.wav
        Andrews_UJ_6880560_01_r.wav.md5
        Andrews_UJ_6880560_02_pt1_r.wav
        Andrews_UJ_6880560_02_pt1_r.wav.md5
        Andrews_UJ_6880560_02_pt2_r.wav
        Andrews_UJ_6880560_02_pt2_r.wav.md5
        Andrews_UJ_6880560_r_mets.xml
        Andrews_UJ_6880560_r_mets.xml.md5
    Service
        Andrews_UJ_6880560_01_s.wav
        Andrews_UJ_6880560_01_s.wav.md5
        Andrews_UJ_6880560_02_pt1_s.wav
        Andrews_UJ_6880560_02_pt1_s.wav.md5
        Andrews_UJ_6880560_02_pt2_s.wav
        Andrews_UJ_6880560_02_pt2_s.wav.md5
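In the example, every file is paired with an .md5 checksum file of the same name. A minimal Python sketch of a check for that pairing follows. It is an illustration only: `md5_hex` and `check_package` are hypothetical helper names, and the sketch assumes each .md5 file begins with the hexadecimal digest, which this spec does not state.

```python
import hashlib
from pathlib import Path


def md5_hex(path, chunk_size=1 << 20):
    """Return the MD5 hex digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def check_package(root):
    """Return a list of problems found under `root`; an empty list means none."""
    root = Path(root)
    problems = []
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        if path.suffix == ".md5":
            continue  # checksum files themselves have no sidecar
        relative = path.relative_to(root).as_posix()
        sidecar = path.with_name(path.name + ".md5")
        if not sidecar.is_file():
            problems.append(f"missing checksum: {relative}")
            continue
        # Assumption: the first whitespace-separated token is the hex digest.
        tokens = sidecar.read_text().split()
        recorded = tokens[0].lower() if tokens else ""
        if recorded != md5_hex(path):
            problems.append(f"checksum mismatch: {relative}")
    return problems
```

Calling `check_package` on the first-level folder (for example "Andrews_UJ_6880560") walks the Master, Rendered, and Service folders as well.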
C. Broadcast Wave File Audio Extension chunk (BEXT)
- Description field (256 char.):
  - Master ("Raw") File will have the following elements, formatted as shown:
Element | Source CUL Spreadsheet Element | Example
Last Name | Last_Name | "Baldwin"
First Name | First_Name | "James"
First Session on Tape # | Session(s) & [from audio] | "Session 2 of 4"
First Session on Tape Date | Recording_Date(s) | "10/14/1963"
Notes | Misc_Notes | "MOLDY"
E.g.,
"Baldwin; James; session 2 of 4; 10/14/1963; MOLDY"
"Andrews; U.J.; session 1 of 1; 8/17/71; with George McElroy, '#41'"
"Benjamin; Herbert; sessions 11 and 12 of 12; 1/5/77-3/15/77; sessions 1-10 on cassette in box 150A, backside label has false info"
  - Rendered ("Cooked") File will have the following elements, formatted as shown:
Element | Source CUL Spreadsheet Element | Example
Last Name | Last_Name | "Baldwin"
First Name | First_Name | "James"
Individual Session # | Session(s) & from audio review | "Session 2 of 4"
Individual Session Date | Recording_Date(s) & from audio review | "10/14/1963"
Part # | n/a | "pt2"
E.g.,
"Baldwin; James; Session 2a of 4; 10/14/1963"
"Andrews; U.J.; Session 1 of 1; 8/17/71; pt2"
"Jones; Jenny; Session 3 of 3; 4/25/78, 4/27/78"
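The Description examples above join their elements with "; ". A minimal Python sketch of that assembly, with a check against the 256-character limit, could look like the following. The function name `build_description` and its parameters are hypothetical, and the ASCII check is an assumption taken from the Broadcast Wave Format definition of this field, not from this spec.

```python
MAX_DESCRIPTION_CHARS = 256  # Description field size given in this spec


def build_description(last, first, session, date, extra=None):
    """Join the Description elements with "; " and enforce the field limit.

    `extra` is the optional trailing element: Notes for a master file,
    Part # for a rendered file. It is left out when empty.
    """
    parts = [last, first, session, date]
    if extra:
        parts.append(extra)
    text = "; ".join(parts)
    # Assumption: the field holds ASCII only; this raises UnicodeEncodeError
    # if any element contains a non-ASCII character.
    text.encode("ascii")
    if len(text) > MAX_DESCRIPTION_CHARS:
        raise ValueError(
            f"description is {len(text)} chars, limit is {MAX_DESCRIPTION_CHARS}"
        )
    return text
```

For a rendered file spanning two days, both dates go into the single `date` argument, as in the "4/25/78, 4/27/78" example.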
- Originator field (32 char.): "Columbia University Libraries"
- Originator Reference (32 char.): [LMS ID] in format "CLIO:6880560"
- Origination Date: Date file generated. "Ten ASCII characters containing the date of creation of the audio sequence. The format is yyyy-mm-dd (year-month-day)."
- Coding History fields: "Non-restricted ASCII characters, containing a collection of strings terminated by CR/LF. Each string contains a description of a coding process applied to the audio data. Each new coding application is required to add a new string with the appropriate information." (See R98-1999 format for the <CodingHistory> field in BWF files.)
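The remaining text fields have fixed formats, so they can be generated and length-checked in one place. The following Python sketch is an illustration under stated assumptions: `bext_text_fields` and the dictionary keys are hypothetical names, and the sketch only prepares strings; it does not write a BEXT chunk into a WAV file.

```python
import datetime

# Field sizes as given in this spec
FIELD_LIMITS = {
    "Originator": 32,
    "OriginatorReference": 32,
    "OriginationDate": 10,
}


def bext_text_fields(lms_id, created, coding_steps):
    """Build the Originator, Originator Reference, Origination Date, and
    Coding History strings, raising ValueError if a sized field is too long.

    `created` is a datetime.date (the date the file was generated);
    `coding_steps` is a list of coding-history strings, one per process.
    """
    fields = {
        "Originator": "Columbia University Libraries",
        "OriginatorReference": f"CLIO:{lms_id}",
        # yyyy-mm-dd, ten ASCII characters
        "OriginationDate": f"{created.year:04d}-{created.month:02d}-{created.day:02d}",
        # each coding-history string is terminated by CR/LF
        "CodingHistory": "".join(step + "\r\n" for step in coding_steps),
    }
    for name, limit in FIELD_LIMITS.items():
        if len(fields[name]) > limit:
            raise ValueError(f"{name} is {len(fields[name])} chars, limit is {limit}")
    for name, value in fields.items():
        value.encode("ascii")  # raises UnicodeEncodeError on non-ASCII input
    return fields
```

The content of each coding-history string is left to the caller, since its format is defined in R98-1999 and not in this document.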
Previous specs
Change Log:
2010-08-23:
- File list updated to remove possible service copy filenames ("Andrews_UJ_6880560*_s_*mets.xml" and "Andrews_UJ_6880560*_s_*mets.xml.md5") since these are referenced in the Rendered files METS record.
2009-03-10:
- Under File Naming added spec for service files.
- Under Folder/File Hierarchy, added examples of service file folder / names
- Under BEXT added example of rendered file BEXT chunk where a single interview session extended over more than one day.
2009-02-20:
- Master File Name spec:
  - removed "Regionsequence#" element;
  - added "part#" (present only if applicable)
- Rendered File Name spec:
  - changed "sequence#" to "session#"
  - added "part#" element (present only if applicable)
- Folder hierarchy: move ADL file to first level
- Example:
  - Updated Master file examples to include TapeSequence, Face Sequence, Part#
  - Updated Rendered file example to include Part#
- BEXT
  - Master File Description:
    - Removed "Project ID" element
    - Changed "Session 2a of 4" to "Session 2 of 4"
  - Rendered File Description
    - Changed "Session 2a of 4" to "Session 2 of 4"
    - Added "Part #" element
2009-02-19:
- for the master file, the documentation now clarifies that the session number and session date are really just the _first_ session number and date on the master tape; it also inserts "Proj:" ahead of the project number element for intelligibility.
- for the rendered file, it adds two session-level metadata elements to the Description, for individual session number and session date; it removes the LMS ID element, on the grounds that it will already be in the BEXT segment in the Originator Reference field (true for the master file as well).