Database Schema Documentation

ConditionVocab

Standardized set of terms describing legitimacy status, marital status, or social condition.
Values are harmonized through controlled vocabularies and mappings to ensure consistency for probabilistic record linkage (PRL).

Property Expected Type Description
label Text Standardized label representing the condition (e.g., “legitimate”, “mestizo”)
type Text Category of the condition: legitimacy, marriage_status, or social_condition

Event

A sacramental event — baptism, marriage, or burial — as recorded in the dataset.

Property Expected Type Description
event_type Text Type of sacramental event (baptism, marriage, burial)
event_date Date Date of the event (recorded or approximate) in ISO 8601 format
event_place Integer FK to the Place where the event occurred
record_id Integer FK to the Record that contains this event

OriginalTerms

Original recorded form of a label before normalization.
These are stored exactly as they appear in the source, without edits, for traceability.

Property Expected Type Description
label_type Text Entity type the label refers to (person, place, event)
label_value Text Original unnormalized textual form
label_language Text Language code in ISO 639 format

Persona

A persona is “each mention instance of a person in a document” (Embley, 2021).
This table stores individual mentions before any PRL aggregation.

Property Expected Type Description
name Text Normalized first and middle name(s)
lastname Text Normalized or inferred surname(s)
birth_date Date Recorded or inferred date of birth in ISO 8601 format
birth_place Integer FK to Place of birth
death_date Date Recorded or inferred date of death in ISO 8601 format
death_place Integer FK to Place of death
notes Text Free-text notes from transcription or data processing

PersonaCondition

Links a persona to a standardized condition (e.g., “legitimate”, “mestizo”) as recorded in a specific event.

Property Expected Type Description
person_id Integer FK to the Persona holding the condition
condition_vocab_id Integer FK to the standardized term in ConditionVocab
event_id Integer FK to the Event where the condition was recorded

PersonaRelationship

Represents an oriented relationship between two personas.
Orientation is from subject (person holding the role) to object (person receiving the role).

Property Expected Type Description
person_subject_id Integer FK to the Persona who is the subject of the relationship (e.g., father, mother, godparent)
person_object_id Integer FK to the Persona who is the object of the relationship (e.g., child, godchild)
relationship_type Text Relationship label oriented from subject to object (e.g., “is father of”)
event_id Integer FK to the Event that recorded this relationship

PersonaRoleInEvent

Relational table linking a persona to their specific role in an event.

Property Expected Type Description
person_id Integer FK to the Persona involved in the event
event_id Integer FK to the Event in which the persona participated
role Text Role label in the event context (e.g., “witness”, “baptized”)

Place

List of standardized places mentioned in the dataset, validated against controlled vocabularies and gazetteers.

Property Expected Type Description
place_label Text Standardized contemporary name of the place
language Text Language code in ISO 639 format
latitude Double Latitude coordinate
longitude Double Longitude coordinate
source Text Source vocabulary or gazetteer
uri Text URI to standardized place metadata
country_code Text ISO 3166 code of the country
part_of Text Administrative unit containing this place
part_of_uri Text URI to standardized metadata of the containing administrative unit
mentioned_as Integer FK to OriginalTerms entry matching the non-standardized name

Record

Identifies the original record in the raw dataset.

Property Expected Type Description
record_type Text Type of record (baptism, marriage, burial)
record_identifier Text Identifier matching the source dataset
record_file Text File name or path as recorded in the source dataset