Tayloring The CRM to Archaeological Requirements

Transcription

Tayloring The CRM to Archaeological Requirements
Tayloring The CRM
to Archaeological Requirements
Martin Doerr
Center for Cultural Informatics
Institute of Computer Science
Foundation for Research and Technology - Hellas
ARIADNE is funded by the European Commission's Seventh Framework Programme
The Scholarly Process
Phases of the scholarly process:
–
–
–
–
collecting and organizing evidence (observation and primary sources)
connecting facts via involved items
interpreting facts – contextualizing and hypothesis building
presenting results - publication
Problem: Billions of facts, artefacts and documents possibly shed light on the
past in unexpected contexts across all disciplines and sciences
A research infrastructure must primarily:
– enable homogeneous access to all known facts
– allow for restricting search (querying) to facts likely to contribute to
context and/or hypothesis
– trace provenance of knowledge (dependency, reliability, precision)
– allow for correcting/ refining facts in a scientific discourse
2
The Scholarly Process in a Digital World
Publications
Stories
exhibitions
Research Infrastructure
Refer
interpret
present
Discovery
of Lucy
Johanson's
Donald JohansonExpedition
Search,
correlate,
integrate
discover
collect
aggregate
update
Lucy
Ethiopia
Cleveland Museum
of Natural History
Layer of
“Latest stage of
AL 288-1 Knowledge”
Hadar
“Evidence layer”
Things
Sources
Collections
Corpora
3
Approach for Fact Connection
A Global Ontology
– allows for integrating and connecting evidential data and derived facts
– the CIDOC CRM is a good starting point
The CIDOC CRM (ISO21127)
– is about people and things meeting in space-time, parts and wholes,
use, influence and reference.
– It is not specific about circumstances of observation, find and
measurement, states of matter and geometry.
– It has no notion of input-output or derivation, only of “influence”
Therefore we have organized 4 interdisciplinary meetings
– We have analyzed types of archeological data sets, identified gaps and
set priorities.
– Then, we have been creating extensions of the CRM (and proposing
improvements) collaboratively
4
Types of Data
Types of archaeological data sets :
About things and “finds”:
o Archive/Museum/SMR collection registries: collection level
description
o Archive/Museum collection databases and “corpora”: documentation
of objects collected by subject, such as
o
o
fine arts, sculpture, numismatic, epigraphy, particular excavations,
particular culture
Description and declaration of archaeological sites or monuments
(“SMR”)
o Architectural drawings
About “finding”:
o Survey records
o Remote sensing (see also “measuring”)
o Excavation records: daily context/layer/phase/find descriptions,
description of larger structures, images
5
Types of Data
Types of data sets cont’d:
About measuring things (during excavation, conservation, other specific
research):
o
o
Imaging methods (X-ray, seismic, photogrammetry, laser, 3D surface/density
models)
Physical analytical investigation records:
o measurements of C14, thermoluminescence etc.
o mineralogical/chemical analysis: lithic , metallurgic, ceramic, bone
o
Biological analytical investigation records.
o archaeobotanic observations and measurements
o physical anthropological data, osteological data… (health state of human
remains)
o DNA analysis
o
Analytical reference data:
o
o
o
o
calibration/ reference sets of analytic data: C14, tree ring,..
material provenance reference sets
archaeobotany: species, seeds, pollen, tissues
DNA reference
6
Types of Data
Types of data sets cont’d :
Integrated hypothesis building:
o
o
o
o
o
GIS: Find distributions, landscape, visibility analysis, resources and land use
spatiotemporal distribution of cultures etc.
synthetic data: Simulations (population behavior, land use etc.)
virtual reconstructions
quantitative statistical analysis
About administration:
o protection zones
o conservation planning
o excavation licenses
7
CIDOC CRM Top-level Classes
E55 Types
refer to / refine
E39 Actors
E28 Conceptual Objects
E18 Physical Thing
participate in affect or / refer to
location
E2 Temporal Entities
E52 Time-Spans
at
E53 Places
8
CIDOC CRM Temporal Entities
E68 Dissolution
E69 Death
E64 End of Existence
E6 Destruction
E81 Transformation
E67 Birth
E4 Period
E5 Event
E63 Beginning of Existence
E66 Formation
E65 Creation
E2 Temporal Entity
E83 Type Creation
E12 Production
E11 Modification
E80 Part Removal
E9 Move
E 79 Part Addition
E3 Condition State
E10 Transfer of Custody
E7 Activity
Generalization
E8 Acquisition
E17 Type Assignment
E87 Curation Activity
E14 Condition Assessment
E13 Attribute Assignment
E15 Identifier Assignment
E86 Leaving
E16 Measurement
E85 Joining
9
Historical Events as Meetings
t
Brutus
was present at!
Caesar’s
mother
was present at!
“coherence volume” of
Caesar’s death
was present at!
Caesar
was present at!
was present at!
Brutus’
dagger
“coherence volume”
of Caesar’s birth
?
Rome
Forum Romanum,
S
10
Exchanges of Information as Meetings
t
Victory!!!
coherence volume of
second announcement
coherence volume of
first announcement
other
Soldiers
2nd Athenian
Victory!!!
1st Athenian
runner
coherence volume of the
battle of Marathon
Marathon
Athens
S
11
Observation and Measurement as Meetings
t
coherence volume of
rendering
coherence volume of
mesh-creation
3D
model
meshdata
2nd Computer
scanner
museum
object
scan-data
1st Computer
operator
coherence volume
of acquisition
Museum
It-Lab
S
12
Outcome: CRM compatible Extensions
• CRMgeo: a Spatiotemporal model
– integrates CRM with OGC standards
– a complete model of phenomena occupying spacetime
– integrates geometry- and semantics-derived topological relations
• CRMsci: a Scientific Observation model (descriptive sciences)
–
–
–
–
generalizes over INSPIRE, OBOE, SEEK, Darwin Core
generalizes concepts of units of matter and their “genesis”
introduces concept of observation and data evaluation
also validated in geology and biodiversity
• CRMarchaeo: an Excavation model
– introduces concepts of stratigraphy and excavation
• CRMdig: a model of Digital Provenance
– outcome of project 3D-COFORM being adapted to CRMsci
13
Challenge: Integrating Special and General…
(not core & application profiles !)
CIDOC
Conceptual Reference Model (CRM)
Access all data from any level
by property generalization
Few concepts,
high recall
Thing
Actor
was present at
Dublin Core
Event
happened at
CDWA
Special concepts,
high precision
Acquisition
used object
MIDAS
automatic
data export
Data
14
CIDOC CRM extension suite
CIDOC
Conceptual Reference Model (CRM)
Few concepts,
high recall
CRM
Event
Thing
Actor
happened at
was present at
Special concepts,
high precision
CRMSci
CRMArcheo CRMDig
15
CRMSci (part of)
E7
Activity
CRM Concept
P2 has type
O16 observed value
E13 Attribute
Assignment
CRMSci concept
E1 CRM Entity
E55 Type
E55 Type
S5 Inference Making
O9 observed property
type
S4 Observation
S8 Categorical Hypothesis Building
S6 Data Evaluation
E55 Type
O20 sampled from type of
part
E53 Place
S19 Encounter Event
O8 observed
S7 Simulation or Prediction
O11 described
S21 Measurement
O24 measured
O1 diminished
S2 Sample Taking
S13 Sample
S15 Observable Entity
O12 has dimension
O19 has found
object
S12 Amount of Fluid
O2 removed
O3 sampled from
O5 removed
E54 Dimension
O10 assigned
dimension
S1 Matter Removal
O4 sampled at
S9 Property Type
S10 Material Substantial
O6 forms former or current part of
O15 occupied
S11 Amount of Matter
S14 Fluid Body
E18 Physical Thing
1
CRMSci (part of)
Encounter Event
S19 Encounter Event
Scope Note:
Activities of S4 Observation (substance) where an E39 Actor
encounters an instance of E18 Physical Thing of a kind relevant
for the mission of the observation or regarded as potentially
relevant for some community (identity). This observation
produces knowledge about the existence of the respective thing
at a particular place in or on surrounding matter. This
knowledge may be new to the group of people the actor belongs
to. In that case we would talk about a discovery.
E7 Activity
E16 Measurement
S4 Observation
S21 Measurement
S19 Encounter Event
O21 has found
at
E53 Place
O19 has found
object
E18 Physical Thing
E92 Spacetime Volume
S20 / E26 Physical Feature
O22 partly or completely
contains
O23 is defined
by
E25 Man-Made Feature
E27 Site
S22 Segment of Matter
1
CRMarcheo: Modelling stratigraphic units
…comprises physical
features (S20) that are
either stratigraphic deposit
units (A2) or Stratigraphic
Interfaces (A3)
A8 Stratigraphic Unit
A3 Stratigraphic Interface
A2 Stratigraphic Deposit Unit
18
CRMarcheo: Modelling Excavation Activities
A1 Excavation Process Unit
AP4 created surfaceS1 with
stratigraphic method
A1 Excavation Process Unit
AP4 created surfaceS2 with spit
method
A7Embedding
A3 Stratigraphic Interface
A2 Stratigraphic Deposit Unit
19
Modelling stratigraphic units
A2 Stratigraphic Deposit Unit comprises connected portions of terrain or other solid
structure on, in, or under the surface of earth or seafloor exhibiting some homogeneity of
structure or substance and completely bounded by surfaces or discontinuities in substance or
structure with respect to other portions of the terrain or surfaces of objects/finds … may
contain physical objects.
… can be attributed to a single genesis event or process and have the potential to be
observed.
… is regarded to exist as long as a part of its matter is still in place with respect to a
surrounding reference space
A3 Stratigraphic Interface comprises coherent parts of the boundary surface of one or more
stratigraphic units, which appears as result of a common genesis event or process. In particular
it may be due to a removal process, which may be part of the genesis process of a
Stratigraphic Unit or not. .. …confines (AP12) partly or completely the surface (A3 Stratigraphic
Interface) of an A2 Stratigraphic Deposit Unit. One A3 Stratigraphic Interface may confine two
or more A2 Stratigraphic Deposit Units.
20
CRMarcheo: Observing Physical Constellations
AP11 has
physical
relation
A2 Deposit Unit
(1)
A8 Stratigraphic Unit
above
above
above
A2 Deposit Unit
(4)
A2 Deposit Unit
(2)
fills
above
A3 Interface
[19]
cuts
cuts
A2 Deposit Unit
(15)
cuts
A2 Deposit Unit
(18)
fills
A3 Interface
[3]
21
CRMarcheo: Derivation of Temporal Relations
AP13 has
stratigraphic
relation (is
stratigraphic
relation of)
AP14 justified by
A4 Stratigraphic Genesis
AP11 has
physical
relation
above
AP7 produced
(was produced
by)
A8 Stratigraphic Unit
A8 Stratigraphic Unit
37
above
A8 Stratigraphic Unit
36
A8 Stratigraphic Unit
34
above A8 Stratigraphic Unit
35
22
CRMarcheo: Stratigraphic Genesis
S18 Alteration
E7 Activity
AP13 has stratigraphic relation
A5 Stratigraphic Modification Event
S17 Physical Genesis
A1 Excavation Process Unit
AP1 produced
AP9 took matter from
S10 Material Substantial
A4 Stratigraphic Genesis
S20 Physical Feature
S11 Amount of Matter
AP7 produced
AP8 disturbed
AP10 is part of
AP11 has physical relation
A8 Stratigraphic Unit
A9 Segment of Matter
AP24 is or contains remains of
A2 Stratigraphic Deposit Unit
A3 Stratigraphic Interface
E18 Physical Thing
AP12 is confined by
23
Conclusions
Besides others, we have analyzed
– special collection databases (numismatic etc.)
– 5 different national methods of excavation recording
– 10 different kinds of analytical investigation, including DNA,
thermoluminescence etc,
– temporal gazetteers and “therauri of periods
We have consulted and collaborated with archaeologists, physicists, chemists,
microbiologists, biologists and other specialists and observed a surprising
analogy of procedures, which we could model as generic concepts.
Now we have a coherent global ontology for deep integration of scientific and
cultural-historical evidence and facts, probably the most elaborate and
generic currently existing for descriptive sciences.
We have submitted these extensions to CIDOC for approval and
recommendation, and work on further validation and quality of
documentation.
24
Acknowledgement
ARIADNE is a project funded by the European Commission
under the Community’s Seventh Framework Programme,
contract no. FP7-INFRASTRUCTURES-2012-1-313193.
The views and opinions expressed in this presentation are the
sole responsibility of the authors and do not necessarily reflect
the views of the European Commission.