Slides - cosound.dk

Audio Culture Researcher Requirements, Metadata
Design and Potentials of Automated Feature Extraction
Birger Larsen & the RSLIS LARM/CoSound team
Information Systems and Interaction Design
Royal School of Library and Information Science
University of Copenhagen
Denmark
Haakon Lund, RSLIS
Toine Bogers, RSLIS
Marianne Lykke, AAU
Mette Skov, AAU
www.iva.dk/blar
CoSound, June 21, 2013
Slide 2
This talk
• Audio culture researcher requirements for an audio
research infrastructure
• User-driven development of a metadata scheme for
radio broadcast archives
• Reflections on the potential of automated feature
extraction
Slide 3
Audio culture researcher requirements
for an audio research infrastructure
LARM Audio Research Archive
• Research infrastructure for radio and
auditive cultural heritage, 2010-2013
• Funded by the Danish Ministry of Science, Technology
and Innovation
• Consortium of 13 Danish institutions: universities,
Danish Radio, and the State and University Library
Objective: To further radio and audio research by maturing
research source material
• IT-infrastructure (main architecture and interface)
• Solutions to copyright issues
• National bibliography of radio
• Toolbox: tools for adding metadata, annotating, searching and
dissemination
Organisation: technical solutions, national bibliography,
research cases
Slide 4
Two main challenges
• Large heterogeneous collection
• 1 million hours of broadcast radio
• 1937-present (24/7 coverage from 1989)
• Extremely sparse metadata, no funding to index
• Streaming access only because of copyright issues
• Heterogeneous user group
• All audio culture researchers...
• ...but very diverse research interests
• No public access due to copyright, so large-scale
crowdsourcing is not possible
Slide 5
Audio culture researcher requirements
Initial wish: a common philology / metadata / classification
scheme to be used across researchers
How to design a metadata scheme for audio culture research?
Analysis of Humanities information needs
• Questionnaire to Danish and international audio culture
researchers
Analysis of Humanities research tasks and needs
• Wiki for describing and analysing the concepts in use, and
coding wishes and practices
Slide 6
Questionnaire results
Brainstorming + tagging sessions
60%
Respondents: 17 Danish + 51 international
Preferred search entries (how do you/would you like to search for audio
for your research?)
• On average, title, genre and topic are important; media-specific
metadata less so
• But large individual differences
Slide 7
48%
45%
48%
Questionnaire results
Indexing level: which unit of material is important?
• Several levels of indexing are necessary
More detail was needed:
• Wiki for describing and analysing the concepts in use, and
coding wishes and practices
Slide 8
Sample research cases
WP5.8 – The auditive syntax of radio
• Jacob Kreutzfeldt: Urban Soundscapes
• Identifying ‘real sound’ from Copenhagen
• Analysing how the ‘experience of space’ is
constructed
• Investigating the correlation between technology
and the portrayal of ‘the sound of the city’
• Need for search and annotation
• Sound category (e.g. traffic, shop, car, bell, goat)
• Free tagging
• Mix (sound level (fore/background), fade in/out)
• Authenticity (real/scenographic/effect)
• Anchorage (space, language, rhythm, theme)
• Other
• Torben Sangild: DR Jingles and Theme Songs
• Investigating changes of focus over time
• Theme songs (intro, separators, outro)
• Station IDs, Spots
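The annotation needs listed for the Urban Soundscapes case (sound category, free tagging, mix, authenticity, anchorage) can be pictured as one record per annotated segment. The sketch below is illustrative only: the class, field and function names are assumptions and do not come from LARM or larm.fm; only the facet values are taken from the slide.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Facet values as listed on the slide; treating them as closed
# vocabularies is an assumption made for this sketch.
SOUND_LEVEL = {"foreground", "background"}
AUTHENTICITY = {"real", "scenographic", "effect"}
ANCHORAGE = {"space", "language", "rhythm", "theme"}


@dataclass
class SegmentAnnotation:
    """One annotated stretch of a broadcast (hypothetical structure)."""
    start_s: float                       # seconds from broadcast start
    end_s: float                         # seconds from broadcast start
    sound_category: str                  # e.g. "traffic", "shop", "bell"
    tags: List[str] = field(default_factory=list)       # free tagging
    sound_level: Optional[str] = None    # mix: foreground/background
    fade: Optional[str] = None           # mix: "in", "out" or None
    authenticity: Optional[str] = None
    anchorage: List[str] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Reject values outside the vocabularies defined above.
        if self.end_s <= self.start_s:
            raise ValueError("segment must have positive duration")
        if self.sound_level is not None and self.sound_level not in SOUND_LEVEL:
            raise ValueError(f"unknown sound level: {self.sound_level}")
        if self.authenticity is not None and self.authenticity not in AUTHENTICITY:
            raise ValueError(f"unknown authenticity: {self.authenticity}")
        unknown = set(self.anchorage) - ANCHORAGE
        if unknown:
            raise ValueError(f"unknown anchorage values: {sorted(unknown)}")


def find_by_category(annotations: List[SegmentAnnotation],
                     category: str) -> List[SegmentAnnotation]:
    """Return all annotations with the given sound category."""
    return [a for a in annotations if a.sound_category == category]
```

A researcher could then retrieve, for example, every segment annotated as "bell" with find_by_category(annotations, "bell"), while free tags remain available for anything the fixed facets do not cover.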
Slide 9
Developing a metadata scheme
Need for
• effective retrieval of radio broadcasts
• adding research-specific annotations
• at both the broadcast level and the level of broadcast
segments
Needs of humanities researchers are so diverse that it is
unlikely that a single unified subject list will suit all
Main metadata requirements
• easy to work with
• easily extensible
• provides for flexible data exchange
Different sources
• Original archive data
• User generated
• Of a general nature
• Project specific
Slide 10
Conceptual metadata scheme
Slide 11
Archive metadata
• Administrative metadata
• Channel, program title, start/end time, narrative,
creators and roles
LARM metadata
• Administrative metadata
• Program title (missing or wrong),
persons (participants/subject), genre,
related objects, subject, tags, annotation
Project metadata
• Administrative metadata
• [project specific]
Slide 12
Slide 13
Slide 14
Conceptual metadata scheme
Consistency across project metadata?
• Project metadata scheme encoded in streaming
interface
• Coding manual saved in repository
• Documentation
• Reuse
Current state
• 3-level schema implemented in larm.fm
• In use by some research groups
• Individual project schemas need to be hand-coded
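As a rough illustration of the three-level idea (archive, LARM and project metadata attached to one broadcast), the sketch below models a record whose project level is an open dictionary, so that a new project schema can be added without changing the other two levels. All class and field names are assumptions made for this example; it is not the schema implemented in larm.fm.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, List


@dataclass
class ArchiveMetadata:
    """Level 1: original archive data (illustrative field names)."""
    channel: str
    program_title: str
    start_time: str                      # e.g. an ISO 8601 timestamp
    end_time: str
    narrative: str = ""
    creators: Dict[str, str] = field(default_factory=dict)  # name -> role


@dataclass
class LarmMetadata:
    """Level 2: general user-generated metadata shared across projects."""
    corrected_title: str = ""            # used when archive title is missing or wrong
    persons: List[str] = field(default_factory=list)
    genre: str = ""
    related_objects: List[str] = field(default_factory=list)
    subject: List[str] = field(default_factory=list)
    tags: List[str] = field(default_factory=list)
    annotation: str = ""


@dataclass
class BroadcastRecord:
    archive: ArchiveMetadata
    larm: LarmMetadata = field(default_factory=LarmMetadata)
    # Level 3: project name -> fields defined by that project's coding manual.
    projects: Dict[str, Dict[str, Any]] = field(default_factory=dict)

    def display_title(self) -> str:
        """Prefer a researcher-corrected title over the archive title."""
        return self.larm.corrected_title or self.archive.program_title

    def to_json(self) -> str:
        """Serialise all three levels for data exchange."""
        return json.dumps(asdict(self), ensure_ascii=False, sort_keys=True)


# Hypothetical example values, not real archive data.
record = BroadcastRecord(
    archive=ArchiveMetadata("P1", "", "1965-03-01T18:30:00", "1965-03-01T19:00:00")
)
record.larm.corrected_title = "Evening news"
record.projects["urban-soundscapes"] = {"sound_category": "traffic"}
```

Keeping the project level as free-form key/value data trades validation for extensibility; a real system would presumably check each project's fields against its coding manual.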
Slide 15
Reflections on the potential
of automated feature extraction
Flexible annotation made possible
• But still relatively few programs can be annotated
Automatic feature extraction has the potential to enrich the
audio files
• First pass to e.g. segment the files
• More manual passes to refine and correct automatic
output
• Data-driven humanities
For search in radio programs, automatic speech recognition
holds great potential
• But not easy to do well for Danish
• On-going tests on DR radio news and the Queen’s New
Year addresses in collaboration with SpeechOp.com
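To make the "first pass to segment the files" idea concrete, the sketch below splits a mono signal into "sound" and "silence" segments using a simple per-frame energy threshold. This is a deliberately naive stand-in chosen for illustration, not the feature extraction used or planned in LARM/CoSound; the function name, parameters and default threshold are assumptions. The output is meant as a draft that manual passes would refine and correct.

```python
from typing import List, Sequence, Tuple


def segment_by_energy(
    samples: Sequence[float],
    sample_rate: int,
    frame_s: float = 0.5,
    threshold: float = 0.01,
) -> List[Tuple[float, float, str]]:
    """Return (start_s, end_s, label) segments, label is 'sound' or 'silence'.

    A frame counts as 'sound' when its mean squared amplitude reaches
    the threshold; adjacent frames with the same label are merged.
    """
    frame_len = max(1, int(frame_s * sample_rate))
    segments: List[Tuple[float, float, str]] = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        label = "sound" if energy >= threshold else "silence"
        start_s = start / sample_rate
        end_s = (start + len(frame)) / sample_rate
        if segments and segments[-1][2] == label:
            # Same label as the previous frame: extend that segment.
            segments[-1] = (segments[-1][0], end_s, label)
        else:
            segments.append((start_s, end_s, label))
    return segments
```

Each returned tuple could seed a segment-level annotation that a researcher then relabels, splits or merges by hand.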
Slide 16
References and links
LARM – http://www.larm-archive.org
CoSound – http://cosound.dk
Unlocking radio broadcasts: User needs in sound
retrieval / Lykke, Marianne; Skov, Mette. Proceedings of
the 4th International Conference on Information Interaction
in Context. Association for Computing Machinery, 2012. pp.
298-301.
CHAOS: User-driven Development of a Metadata Scheme
for Radio Broadcast Archives / Lykke, Marianne; Bogers,
Toine; Larsen, Birger; Lund, Haakon. iConference 2013
Proceedings. 2013. (Poster)
Slide 17
Search and Interaction in Media Archives
http://bit.ly/DRseminar
Open seminar at DR, 9:00-13:00 Thursday June 27, 2013
Designing the Search Experience
Tony Russell-Rose, Director of UXlabs, UK
larm.fm master class
Andreas Røll Larsen, DR Archive, Research & Rights
Good Practices for Finding Information
Kristian Norling, Enterprise Search Evangelist at Findwise, Sweden
Interaction with Sound Art Installations
Marianne Lykke, Professor at University of Aalborg, Denmark
The Future of Music Interaction
Daniel Boland, PhD student at University of Glasgow, UK