This version: https://purl.org/episa/archonto/1.0
Latest version: https://purl.org/episa/archonto
Previous version: https://purl.org/episa/archonto/0.9
Editor: Inês Koch
This specification describes ArchOnto, a linked open data model. ArchOnto adopts a semantic web approach and encompasses the CIDOC Conceptual Reference Model and additional ontologies, envisioning interoperability with datasets curated by multiple communities of practice. Its goal is to serve as the foundation for the development of a knowledge graph prototype and new applications for the archives.
This document is a work in progress and may change, since the ontology is under active development.
ArchOnto can currently represent the description of archival records, identifying people, places, dates, and related events. In the future, the ontology will also cover the digital representations associated with each document already described through ArchOnto.
The ArchOnto specification is produced as part of the Entity and Property Inference for Semantic Archives (EPISA) project at INESC TEC to provide authoritative documentation of the contents, status and purpose of the linked data model known as ‘ArchOnto’. Furthermore, this ontology fits into Inês Koch’s doctoral thesis project.
CIDOC CRM ontology created to represent typed properties with an arity higher than two.
Ontology that served as the basis for the creation of the N-ary ontology and, as of version 1.0, replaces it.
Use the prefixes:
Ontology created in order to validate literal values used in properties of the newly created data model.
It has classes and data properties in its base that are used to validate data from simple types in the ontology, ensuring class-based validation.
Use the prefixes:
Ontology created to represent all fields of the ISAD(G) standard.
Its Data Properties allow all ISAD(G) fields to be incorporated. With this ontology, the information in the archival descriptions held in existing files, created over the years by the various archives at the national level, is preserved.
Whenever necessary, the ontology allows the atomized contents to be validated by verifying that they agree with the record that already existed in the ISAD(G) description.
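The check described above can be sketched as follows. This is a minimal illustration and not the project's implementation: the function names, the accent-insensitive substring comparison, and the list of extracted names are all assumptions made for the example. Only the ISAD1 has title property and the sample title come from this specification.

```python
import unicodedata


def normalize(text: str) -> str:
    """Lowercase, strip accents and collapse whitespace for a lenient comparison."""
    decomposed = unicodedata.normalize("NFKD", text)
    without_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(without_accents.lower().split())


def values_missing_from_isad(isad_text: str, atomized_values: list[str]) -> list[str]:
    """Return the atomized values that cannot be found in the legacy ISAD(G) string."""
    haystack = normalize(isad_text)
    return [value for value in atomized_values if normalize(value) not in haystack]


# Legacy title kept verbatim through ISAD1 has title (example value from this document).
isad1_title = (
    "Acção cível de fretes em que é autor António Henriques Leal "
    "e réu António Esteves Costa"
)

# Hypothetical values extracted (atomized) from that title.
extracted_names = ["António Henriques Leal", "Antonio Esteves Costa", "Manuel Lino"]

missing = values_missing_from_isad(isad1_title, extracted_names)
print(missing)  # ['Manuel Lino']
```

A substring test is the simplest possible comparison; a real validation step would probably need a more tolerant matching rule, which this specification does not define.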
All information associated with a unit of description is considered as a physical object; in CIDOC CRM there are attributes associated with the physical object and others with the conceptual object. In this ontology, it is assumed that the physical object incorporates the conceptual object.
With this ontology it will be possible to maintain the interoperability of all data represented in ISAD(G).
Use the prefixes:
ArchOnto is the name of the new model supporting the representation of assets at DGLAB (Direção-Geral do Livro, dos Arquivos e das Bibliotecas). Its goal is to serve as the foundation for developing a knowledge graph prototype and new applications for the archives.
ArchOnto is a modular data model that integrates five ontologies, dealing with complementary aspects of the archival domain. The five ontologies are CIDOC CRM Main (base ontology), CIDOC CRM PC, DataObject, ISAD Ontology, and Link2DataObject. ArchOnto also has its own classes and properties.
The figure below shows an application example of the ArchOnto ontology.
Figure 1. ArchOnto Application Example
ArchOnto is a modular data model that integrates five ontologies, dealing with complementary aspects of the archival domain, as illustrated in Figure 2. The five ontologies are:
CIDOC CRM (base ontology): the core of ArchOnto, providing the concepts and properties to capture archival records’ essential features, e.g., event, date, location, person, group.
CIDOC CRM PC: systematically represents non-binary associations, i.e., those that connect more than two individuals. Representing a person’s role in a given event is an example of this kind of association because it involves relating an event, a person, and a role.
DataObject: auxiliary ontology to deal with literal values and their validation in ArchOnto. Comprises classes and data properties for the simple types in the ontology. In this ontology, the names of classes are preceded by DOE (from DataObject Entity), and the names of properties are preceded by DOP (from DataObject Property).
ISAD Ontology: matches the ISAD(G) standard elements that can be directly obtained from existing records. The information is represented with Data Properties and is not atomized. The ontology can be regarded as an intermediate representation for legacy records, making it possible, whenever necessary, to check the contents of a given element against the information extracted thereof. This ontology is only used for the existing records in DigitArq and not for newly created records based on ArchOnto.
Link2DataObject: connects the CIDOC CRM and DataObject ontologies through a single property, L2DO hasValue.
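To make the roles of CIDOC CRM PC and Link2DataObject more concrete, the sketch below writes out, as plain tuples, the statements needed to say that a person took part in an event in a given role, and to attach a string literal to an entity. It is an illustration under assumptions: the `Triple` type, the helper functions and every `ex:` identifier are hypothetical, and the property names P01 has domain and P02 has range follow the CIDOC CRM PC naming.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


def role_in_event(node: str, event: str, actor: str, role: str) -> list[Triple]:
    """Reify 'event carried out by actor, in the role of role' as one PC14 node."""
    return [
        Triple(node, "rdf:type", "PC14 carried out by"),
        Triple(node, "P01 has domain", event),
        Triple(node, "P02 has range", actor),
        Triple(node, "P14.1 in the role of", role),
    ]


def literal_value(entity: str, data_object: str, value: str) -> list[Triple]:
    """Attach a string literal to a CRM entity through a DOE8 String node."""
    return [
        Triple(entity, "L2DO hasValue", data_object),
        Triple(data_object, "rdf:type", "DOE8 String"),
        Triple(data_object, "DOP7 stringValue", value),
    ]


# All identifiers prefixed with "ex:" are hypothetical.
graph = role_in_event("ex:pc14_1", "ex:production_1", "ex:person_1", "ex:producer")
graph += literal_value("ex:name_1", "ex:string_1", "António Henriques Leal")

# Everything stated about the n-ary node: its type, one event, one actor, one role.
for triple in graph:
    if triple.subject == "ex:pc14_1":
        print(triple.predicate, "->", triple.obj)
```

The intermediate PC14 node is what lets three individuals (event, person, role) take part in a single association, which a binary property alone cannot express.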
In addition to these five ontologies, ArchOnto has its own classes and properties, created as extensions to CIDOC CRM. The classes specialize concepts already present in CIDOC CRM that needed to be adapted to include controlled vocabularies that exist in the archives. The ontology also includes three properties that provide a control structure for the admissible nesting of units of description. Names for classes are preceded by ARE (from ARchival Entity), and those for properties are preceded by ARP (from ARchival Property).
Figure 2. ArchOnto Architecture
Full detail regarding each class and property can be found in the following subsections: ArchOnto, DataObject, CIDOC CRM PC, CIDOC CRM Main, ISAD Ontology and Link2DataObject.
Ontology | Classes |
---|---|
CIDOC CRM Main | E35 Title |
ArchOnto | - ARE2 Formal Title |
ArchOnto | - ARE3 Supplied Title |
CIDOC CRM Main | E39 Actor |
ArchOnto | - ARE12 Organisation |
CIDOC CRM Main | E54 Dimension |
ArchOnto | - ARE4 Extension |
CIDOC CRM Main | E55 Type |
ArchOnto | - ARE1 Level of Description |
ArchOnto | - ARE5 Identifier Type |
ArchOnto | - ARE6 Date Type |
ArchOnto | - ARE7 Name Type |
ArchOnto | - ARE8 Role Type |
ArchOnto | - ARE9 Date Certainty |
ArchOnto | - ARE11 Documentary Typology |
ArchOnto | - ARE13 Subject Type |
ArchOnto | - ARE14 Place Type |
ArchOnto | - ARE15 Acquisition Type |
ArchOnto | - ARE16 Event Type |
DataObject | DOE1 DataObject |
DataObject | - DOE2 AuthorityFile |
DataObject | - DOE3 Boolean |
DataObject | - DOE4 Date |
DataObject | - - DOE9 Approximate |
DataObject | - - DOE10 Instant |
DataObject | - - DOE11 Interval |
DataObject | - DOE5 Decimal |
DataObject | - DOE6 GeospatialCoordinates |
DataObject | - - DOE12 Latitude |
DataObject | - - DOE13 Longitude |
DataObject | - - DOE14 Polygon |
DataObject | - DOE7 Integer |
DataObject | - DOE8 String |
DataObject | - - DOE15 AuthorityString |
DataObject | - - - DOE17 PersonName |
DataObject | - - DOE16 RegexString |
CIDOC CRM PC | PC0 CRM Property |
CIDOC CRM PC | - - PC14 Carried Out By |
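The leading dashes in the Classes column encode subclass depth. As an illustration only, the sketch below turns a few DataObject rows of that column into child-to-parent pairs. The helper names and the parsing rule (one `-` per nesting level) are assumptions about how to read the table, not part of ArchOnto.

```python
def parents_from_rows(rows: list[str]) -> dict[str, str]:
    """Map each class to its parent, reading one leading '-' per nesting level."""
    parents: dict[str, str] = {}
    stack: list[tuple[int, str]] = []  # (depth, class name) of the open ancestors
    for row in rows:
        tokens = row.split()
        depth = 0
        while depth < len(tokens) and tokens[depth] == "-":
            depth += 1
        name = " ".join(tokens[depth:])
        # Close every class that is at the same depth or deeper than this row.
        while stack and stack[-1][0] >= depth:
            stack.pop()
        if stack:
            parents[name] = stack[-1][1]
        stack.append((depth, name))
    return parents


def ancestors(name: str, parents: dict[str, str]) -> list[str]:
    """List the superclasses of a class, nearest first."""
    chain = []
    while name in parents:
        name = parents[name]
        chain.append(name)
    return chain


rows = [
    "DOE1 DataObject",
    "- DOE4 Date",
    "- - DOE9 Approximate",
    "- - DOE10 Instant",
    "- - DOE11 Interval",
    "- DOE8 String",
    "- - DOE15 AuthorityString",
    "- - DOE16 RegexString",
]

parents = parents_from_rows(rows)
print(ancestors("DOE11 Interval", parents))  # ['DOE4 Date', 'DOE1 DataObject']
```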
Ontology | Property | Domain | Range |
---|---|---|---|
ArchOnto | ARP8 upper level | ARE1 Level of Description | ARE1 Level of Description |
ArchOnto | ARP9 lower level | ARE1 Level of Description | ARE1 Level of Description |
Link2DataObject | L2DO hasValue | E1 CRM Entity | DOE1 DataObject |
CIDOC CRM PC | P01 has domain | PC0 CRM Property | E1 CRM Entity |
CIDOC CRM PC | P02 has range | PC0 CRM Property | E1 CRM Entity |
DataObject | DOP1 approximateDateValue | DOE9 Approximate | xsd:dateTime |
DataObject | DOP2 endDateValue | DOE11 Interval | xsd:dateTime |
DataObject | DOP3 fileLocation | DOE2 AuthorityFile | xsd:string |
DataObject | DOP4 hasRegex | DOE16 RegexString | xsd:string |
DataObject | DOP5 name | DOE17 PersonName | xsd:string |
DataObject | DOP6 startDateValue | DOE11 Interval | xsd:dateTime |
DataObject | DOP7 stringValue | DOE8 String | xsd:string |
DataObject | DOP8 timestamp | DOE10 Instant | xsd:dateTime |
CIDOC CRM Main | P2 has type | E1 CRM Entity | E55 Type |
ArchOnto | - ARP12 has level of description | E31 Document | ARE1 Level of Description |
CIDOC CRM Main | P3 has note | E1 CRM Entity | — |
ISAD Ontology | - ISAD1 has title | E31 Document | xsd:string |
ISAD Ontology | - ISAD2 has level of description | E31 Document | xsd:string |
ISAD Ontology | - ISAD3 has reference code | E31 Document | xsd:string |
ISAD Ontology | - ISAD4 has type of title | E31 Document | xsd:string |
ISAD Ontology | - ISAD5 has date | E31 Document | xsd:string |
ISAD Ontology | - ISAD6 has dimension and support | E31 Document | xsd:string |
ISAD Ontology | - ISAD7 has administrative history | E31 Document | xsd:string |
ISAD Ontology | - ISAD8 has archival history | E31 Document | xsd:string |
ISAD Ontology | - ISAD9 has scope | E31 Document | xsd:string |
ISAD Ontology | - ISAD10 has access condition | E31 Document | xsd:string |
ISAD Ontology | - ISAD11 has current quota | E31 Document | xsd:string |
ISAD Ontology | - ISAD12 has old quota | E31 Document | xsd:string |
ISAD Ontology | - ISAD13 has original quota | E31 Document | xsd:string |
ISAD Ontology | - ISAD14 has language | E31 Document | xsd:string |
ISAD Ontology | - ISAD15 has related unit of description | E31 Document | xsd:string |
ISAD Ontology | - ISAD16 has existence and location of copies | E31 Document | xsd:string |
ISAD Ontology | - ISAD17 has publication notes | E31 Document | xsd:string |
ISAD Ontology | - ISAD18 has notes | E31 Document | xsd:string |
ISAD Ontology | - ISAD19 has system of arrangement | E31 Document | xsd:string |
ISAD Ontology | - ISAD20 has physical characteristics | E31 Document | xsd:string |
ISAD Ontology | - ISAD21 has description date | E31 Document | xsd:string |
ISAD Ontology | - ISAD22 has last modification | E31 Document | xsd:string |
ISAD Ontology | - ISAD23 has predominant date | E31 Document | xsd:string |
ISAD Ontology | - ISAD24 has conditions governing reproduction | E31 Document | xsd:string |
ISAD Ontology | - ISAD25 has finding aids | E31 Document | xsd:string |
ISAD Ontology | - ISAD26 has immediate source of acquisition or transfer | E31 Document | xsd:string |
ISAD Ontology | - ISAD27 has accruals | E31 Document | xsd:string |
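The DataObject rows of the table pair each class with typed data properties, which is what makes class-based validation of literals possible. The sketch below shows what such checks could look like for DOE11 Interval and DOE16 RegexString. It rests on assumptions that the specification does not state: that an interval's start must not be later than its end, that a RegexString value must match its pattern in full, and that ISO 8601 strings without time zones stand in for xsd:dateTime. The function names and the sample pattern are hypothetical.

```python
import re
from datetime import datetime


def is_datetime(value: str) -> bool:
    """Accept strings such as 1801-01-01T00:00:00 (a simplification of xsd:dateTime)."""
    try:
        datetime.fromisoformat(value)
    except ValueError:
        return False
    return "T" in value  # reject date-only strings


def validate_interval(props: dict[str, str]) -> list[str]:
    """Check a DOE11 Interval: both bounds present, well formed, and ordered."""
    errors = []
    for prop in ("DOP6 startDateValue", "DOP2 endDateValue"):
        if not is_datetime(props.get(prop, "")):
            errors.append(f"{prop}: missing or not a dateTime")
    if not errors:
        start = datetime.fromisoformat(props["DOP6 startDateValue"])
        end = datetime.fromisoformat(props["DOP2 endDateValue"])
        if start > end:
            errors.append("DOP6 startDateValue is later than DOP2 endDateValue")
    return errors


def validate_regex_string(props: dict[str, str]) -> list[str]:
    """Check a DOE16 RegexString: DOP7 stringValue must fully match DOP4 hasRegex."""
    pattern = props.get("DOP4 hasRegex")
    value = props.get("DOP7 stringValue", "")
    if pattern is None:
        return ["DOP4 hasRegex: missing"]
    if re.fullmatch(pattern, value) is None:
        return [f"DOP7 stringValue {value!r} does not match {pattern!r}"]
    return []


good_interval = {
    "DOP6 startDateValue": "1801-01-01T00:00:00",
    "DOP2 endDateValue": "1820-12-31T23:59:59",
}
bad_interval = {"DOP6 startDateValue": "1820-12-31T00:00:00", "DOP2 endDateValue": "1801"}

print(validate_interval(good_interval))  # []
print(validate_interval(bad_interval))   # ['DOP2 endDateValue: missing or not a dateTime']

# Hypothetical pattern for a reference code; not taken from the specification.
code = {"DOP4 hasRegex": r"PT/[A-Z]+/\d+", "DOP7 stringValue": "PT/TT/123"}
print(validate_regex_string(code))  # []
```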
ARE1 Level of Description |
---|
The first ISAD(G) element to be considered in the description of a document is its Level of Description, represented in ArchOnto through the ARE1 Level of Description class. This class is a subclass of E55 Type and is associated with a document (E31 Document) through the property ARP12 has level of description. |
URI: https://purl.org/episa/archonto/ARE1 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - ARP8 upper level - ARP9 lower level - ARP12 has level of description |
Example values: - Fonds - Series - Section - File - Item |
|
ARE2 Formal Title |
---|
The second ISAD(G) element to be considered in the description of a document is its Title, represented in ArchOnto through the ARE2 Formal Title or ARE3 Supplied Title classes. The ARE2 Formal Title class, a subclass of E35 Title, indicates that the title used in the document record is formal, that is, it originates in the document itself. |
URI: https://purl.org/episa/archonto/ARE2 |
Superclass: E35 Title |
Subclasses: - |
Related properties: - P102 has title |
Example values: - Acção cível de fretes em que é autor António Henriques Leal e réu António Esteves Costa - Gabinete da Área de Sines |
|
ARE3 Supplied Title |
---|
The second ISAD(G) element to be considered in the description of a document is its Title, represented in ArchOnto through the ARE2 Formal Title or ARE3 Supplied Title classes. The ARE3 Supplied Title class, a subclass of E35 Title, indicates that the title used in the document record is supplied, that is, assigned by an archivist. |
URI: https://purl.org/episa/archonto/ARE3 |
Superclass: E35 Title |
Subclasses: - |
Related properties: - P102 has title |
Example values: - Juízo da Índia e Mina - Mosteiro de São Bernardo de Portalegre - Inventário por óbito de Manuel Joaquim Ferreira Lino |
|
ARE4 Extension |
---|
This class is used to indicate the document’s extension, in order to distinguish the different uses given to this field. |
URI: https://purl.org/episa/archonto/ARE4 |
Superclass: E54 Dimension |
Subclasses: - |
Related properties: - P43 has dimension - P90 has value - P91 has unit |
Example values: - 13 maços - 21 folhas |
|
ARE5 Identifier Type |
---|
ISAD(G) allows several types of identifiers. The ARE5 Identifier Type class specifies the identifier type. This class is a subclass of E55 Type and is linked to E42 Identifier through the property P2 has type. |
URI: https://purl.org/episa/archonto/ARE5 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Reference Code - Physical Location - Original Numbering - Previous Location |
|
ARE6 Date Type |
---|
This class is used to indicate a date type, such as a predominant date or a date of production, death, or birth. |
URI: https://purl.org/episa/archonto/ARE6 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Exact dates - Inferred dates - Predominant dates |
|
ARE7 Name Type |
---|
This class is used to indicate a name type, such as a nickname or last name. |
URI: https://purl.org/episa/archonto/ARE7 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Authorized form of name - Another Form of the Name - Parallel Name Form |
|
ARE8 Role Type |
---|
This class is used to indicate the role of an individual in a given event. |
URI: https://purl.org/episa/archonto/ARE8 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P14.1 in the role of |
Example values: - Producer - Material Author - Recipient |
|
ARE9 Date Certainty |
---|
This class is used to indicate the certainty of a date. |
URI: https://purl.org/episa/archonto/ARE9 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Certain - Uncertain |
|
ARE11 Documentary Typology |
---|
This class is used to indicate the documentary typology of a document. |
URI: https://purl.org/episa/archonto/ARE11 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Certificate - Income book - Patent |
|
ARE12 Organisation |
---|
This class is used to indicate that the actor of an activity is an organisation. |
URI: https://purl.org/episa/archonto/ARE12 |
Superclass: E39 Actor |
Subclasses: - |
Related properties: - P107 has current or former member (is current or former member of) |
Example values: - |
|
ARE13 Subject Type |
---|
This class is used to indicate the subject of a document. |
URI: https://purl.org/episa/archonto/ARE13 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Education - Science - Law - Management |
|
ARE14 Place Type |
---|
This class is used to indicate the type of place described, such as a country, municipality or parish. |
URI: https://purl.org/episa/archonto/ARE14 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - |
Example values: - Ocean - Archipelago - Mountain range - Country - District |
|
ARE15 Acquisition Type |
---|
This class is used to indicate how a document was acquired, such as by purchase, transfer, or donation. |
URI: https://purl.org/episa/archonto/ARE15 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Purchase - Giving - Donation - Deposit - Swap - Legacy - Reintegration - Transfer |
|
ARE16 Event Type |
---|
This class is used to indicate the type of event present in a document. |
URI: https://purl.org/episa/archonto/ARE16 |
Superclass: E55 Type |
Subclasses: - |
Related properties: - P2 has type |
Example values: - Evaluation - Expertise - Financial management |
|
ARP8 upper level |
---|
This property relates a level of description to the upper levels in which it may be aggregated. |
URI: https://purl.org/episa/archonto/ARP8 |
Domain: ARE1 Level of Description |
Range: ARE1 Level of Description |
Inverse: ARP9 lower level |
Example values: - The level of description Item (ARE1) has upper level Fonds (ARE1). |
|
ARP9 lower level |
---|
This property relates a level of description to the lower levels that it may aggregate. |
URI: https://purl.org/episa/archonto/ARP9 |
Domain: ARE1 Level of Description |
Range: ARE1 Level of Description |
Inverse: ARP8 upper level |
Example values: - The level of description Fonds (ARE1) has lower level Item (ARE1). |
|
ARP12 has level of description |
---|
This property relates a given document (E31 Document) to its level of description (ARE1 Level of Description). |
URI: https://purl.org/episa/archonto/ARP12 |
Domain: E31 Document |
Range: ARE1 Level of Description |
Inverse: - |
Example values: - The document (E31) has level of description Fonds (ARE1). (fictitious) |
|
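The three ARP properties together give a control structure for nesting units of description. A minimal sketch of how that could be used follows, with ARP9 lower level derived as the inverse of ARP8 upper level. The specific level pairs in `ARP8_UPPER_LEVEL` are hypothetical (only "Item has upper level Fonds" appears as an example in this document), as are the function names.

```python
# Hypothetical ARP8 upper level assertions: level -> levels it may be aggregated in.
ARP8_UPPER_LEVEL: dict[str, set[str]] = {
    "Section": {"Fonds"},
    "Series": {"Fonds", "Section"},
    "File": {"Fonds", "Section", "Series"},
    "Item": {"Fonds", "Section", "Series", "File"},
}


def invert(relation: dict[str, set[str]]) -> dict[str, set[str]]:
    """Derive ARP9 lower level as the inverse of ARP8 upper level."""
    inverse: dict[str, set[str]] = {}
    for lower, uppers in relation.items():
        for upper in uppers:
            inverse.setdefault(upper, set()).add(lower)
    return inverse


ARP9_LOWER_LEVEL = invert(ARP8_UPPER_LEVEL)


def may_nest(child_level: str, parent_level: str) -> bool:
    """True if a unit at child_level may sit under a unit at parent_level."""
    return parent_level in ARP8_UPPER_LEVEL.get(child_level, set())


print(sorted(ARP9_LOWER_LEVEL["Fonds"]))  # ['File', 'Item', 'Section', 'Series']
print(may_nest("Item", "Fonds"))          # True
print(may_nest("Fonds", "Item"))          # False
```

In this reading, the level attached to a document through ARP12 has level of description would be checked against the level of its parent unit before the document is placed in the hierarchy.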