library(tidyverse)

This metadata dictionary provides comprehensive documentation for all datasets in the data/clean/ folder. Each section includes the dataset structure, a description of the table’s purpose, and detailed field-level documentation with expected data types and descriptions.
The datasets represent historical parish records from Sondondo, Peru, covering vital events (baptisms, marriages, and burials) as well as derived entities (places and personas). All records have been cleaned, normalized, and harmonized to facilitate analysis and integration.
Event tables document vital religious ceremonies recorded in parish registers: baptisms, marriages, and burials. These records form the primary source material for the project, capturing not only the principal individuals involved but also family relationships, social conditions, and geographic information. Each event type has its own structure reflecting the specific information recorded for that ceremony.
baptisms <- readr::read_csv("../../data/clean/bautismos_clean.csv", show_col_types = FALSE)
baptisms %>% glimpse()
Rows: 6,340
Columns: 27
$ file <chr> "APAucará LB L001", "APAucará LB L001", …
$ identifier <chr> "B001", "B002", "B003", "B004", "B005", …
$ event_type <chr> "Bautizo", "Bautizo", "Bautizo", "Bautiz…
$ event_date <date> 1790-10-04, 1790-10-06, 1790-10-07, 179…
$ baptized_name <chr> "domingo", "dominga", "bartola", "franci…
$ baptized_birth_place <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ baptized_birth_date <date> 1790-08-04, 1790-08-04, 1790-08-04, 179…
$ baptized_legitimacy_status <chr> "Hijo legitimo", "Hija legitima", "Hija …
$ father_name <chr> "lucas", "juan", "jacinto", "juan", "san…
$ father_lastname <chr> "ayquipa", "lulia", "quispe", "cuebas", …
$ father_social_condition <chr> NA, NA, NA, "Mestizo", NA, NA, NA, NA, N…
$ mother_name <chr> "sevastiana", "jospha", "juliana", "clem…
$ mother_lastname <chr> "quispe", "gomes", "chinchay", "manco", …
$ mother_social_condition <chr> NA, NA, NA, "India libre", NA, NA, "solt…
$ parents_social_condition <chr> "Indios tributarios de Pampamarca", "Ind…
$ godfather_name <chr> "vicente", "ignacio", NA, NA, NA, NA, NA…
$ godfather_lastname <chr> "guamani", "varientos", NA, NA, NA, NA, …
$ godfather_social_condition <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ godmother_name <chr> NA, NA, "rotonda", "ysabel", "josefa", "…
$ godmother_lastname <chr> NA, NA, "pocco", "guillen", "santiago", …
$ godmother_social_condition <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ event_place <chr> "Pampamarca", "Pampamarca", "Pampamarca"…
$ event_geographic_descriptor_1 <chr> "Aucara", "Aucara", "Aucara", "Aucara", …
$ event_geographic_descriptor_2 <chr> "Pampamarca", "Pampamarca", "Pampamarca"…
$ event_geographic_descriptor_3 <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ event_geographic_descriptor_4 <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ baptized_lastname <chr> "ayquipa", "lulia", "quispe", "cuebas", …
| Property | Expected Type | Description |
|---|---|---|
| file | Text | Source file name from which the record was extracted |
| identifier | Text | Sequential identifier for the baptism event |
| event_type | Text | Type of event (Bautizo) |
| event_date | Date | Date of the baptism event in ISO 8601 format |
| baptized_name | Text | Normalized first and middle name(s) of the baptized individual |
| baptized_lastname | Text | Normalized or inferred surname(s) of the baptized individual |
| baptized_birth_place | Text | Place of birth of the baptized individual |
| baptized_birth_date | Date | Date of birth of the baptized individual in ISO 8601 format |
| baptized_legitimacy_status | Text | Legitimacy status at birth as recorded (e.g., Hijo legitimo, Hija legitima) |
| father_name | Text | Normalized name of the father |
| father_lastname | Text | Normalized or inferred surname(s) of the father |
| father_social_condition | Text | Social, ethnic, or political marker of the father (mestizo, indio, tributario, vecino) |
| mother_name | Text | Normalized name of the mother |
| mother_lastname | Text | Normalized or inferred surname(s) of the mother |
| mother_social_condition | Text | Social, ethnic, or political marker of the mother (mestizo, indio, tributario, vecino) |
| parents_social_condition | Text | Combined social condition of both parents |
| godfather_name | Text | Normalized name of the godfather |
| godfather_lastname | Text | Normalized or inferred surname(s) of the godfather |
| godfather_social_condition | Text | Social, ethnic, or political marker of the godfather (mestizo, indio, tributario, vecino) |
| godmother_name | Text | Normalized name of the godmother |
| godmother_lastname | Text | Normalized or inferred surname(s) of the godmother |
| godmother_social_condition | Text | Social, ethnic, or political marker of the godmother (mestizo, indio, tributario, vecino) |
| event_place | Text | Place where the baptism event took place |
| event_geographic_descriptor_1 | Text | Place or location mentioned in the record |
| event_geographic_descriptor_2 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_3 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_4 | Text | Additional place or location mentioned in the record |
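The expected types listed above can be made explicit when the file is read. The following is a minimal sketch, assuming readr 2.0 or later (for literal input via `I()`). The inline CSV holds two rows and four columns copied from the `glimpse()` output; `baptism_types` and `baptisms_typed` are illustrative names, not part of the project code.

```r
library(readr)

# Two illustrative rows taken from the glimpse() output above;
# only 4 of the 27 columns are included to keep the sketch short.
csv_text <- I("identifier,event_date,baptized_name,baptized_birth_date
B001,1790-10-04,domingo,1790-08-04
B002,1790-10-06,dominga,1790-08-04
")

# Dictionary types mapped to readr collectors:
# Text -> character (the default), Date -> ISO 8601 date.
baptism_types <- cols(
  .default = col_character(),
  event_date = col_date(format = "%Y-%m-%d"),
  baptized_birth_date = col_date(format = "%Y-%m-%d")
)

baptisms_typed <- read_csv(csv_text, col_types = baptism_types)
```

For the real file, the same specification could be passed as `col_types = baptism_types` in the `read_csv()` call on `bautismos_clean.csv`, so that a column that fails to parse as a date surfaces as a parsing problem rather than silently becoming text.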
marriages <- readr::read_csv("../../data/clean/matrimonios_clean.csv", show_col_types = FALSE)
marriages %>% glimpse()
Rows: 1,719
Columns: 55
$ file <chr> "APAucará LM L001", "APAucará LM L001"…
$ identifier <chr> "M001", "M002", "M003", "M004", "M005"…
$ event_type <chr> "Matrimonio", "Matrimonio", "Matrimoni…
$ event_date <date> 1816-12-06, 1816-12-12, 1817-03-05, 1…
$ husband_name <chr> "josé manl manuel", "esteban", "alexan…
$ husband_lastname <chr> "de la roca", "castillo", "ramires", "…
$ husband_social_condition <chr> "don, vecinos de este pueblo [Aucara]"…
$ husband_marital_status <chr> "soltero", "soltero", "soltero", "solt…
$ husband_birth_date <date> NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
$ husband_birth_place <chr> "Ciudad de Huamanga", NA, "Pampamarca"…
$ husband_resident_in <chr> "Aucara", NA, NA, NA, NA, NA, NA, NA, …
$ husband_legitimacy_status <chr> NA, "legítimo", "legítimo", "legítimo"…
$ husband_father_name <chr> "acencio", "matheo", "leonor", "acenci…
$ husband_father_lastname <chr> "roca", "castillo", "romani", "cuchu",…
$ husband_father_social_condition <chr> "don, vecinos de este pueblo [Aucara]"…
$ husband_mother_name <chr> "leonor", "ma maria", "franca francisc…
$ husband_mother_lastname <chr> "guerrero", "torres", "paucar", "antay…
$ husband_mother_social_condition <chr> "doña, vecinos de este pueblo [Aucara]…
$ wife_name <chr> "juana", "ambrocia", "sipriana", "caci…
$ wife_lastname <chr> "rodrigues", "tasqui", "coillo", "flor…
$ wife_social_condition <chr> "doña, vecinos de este pueblo [Aucara]…
$ wife_marital_status <chr> "soltera", "soltera", "soltera", "solt…
$ wife_birth_date <date> NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
$ wife_birth_place <chr> NA, NA, "Pampamarca", "Pampamarca", NA…
$ wife_resident_in <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ wife_legitimacy_status <chr> "legítima", "legítima", "legítima", NA…
$ wife_father_name <chr> "pedro", "pedro", "cristobal", NA, "ag…
$ wife_father_lastname <chr> "rodrigues", "tasqui", "coillo", NA, "…
$ wife_father_social_condition <chr> "don, vecinos de este pueblo [Aucara]"…
$ wife_mother_name <chr> "magdalena", "ma maria", "ma maria", "…
$ wife_mother_lastname <chr> "sotelo", "palomino", "guallpatuiru", …
$ wife_mother_social_condition <chr> "doña, vecinos de este pueblo [Aucara]…
$ godparent_1_name <chr> "ygnacio", "apolin apolinario", "pablo…
$ godparent_1_lastname <chr> "baroti", "condori", "roque", "llamuca…
$ godparent_1_social_condition <chr> "don", NA, NA, NA, NA, NA, NA, NA, NA,…
$ godparent_2_name <chr> "magda magdalena", "petrona", "ma mari…
$ godparent_2_lastname <chr> "sotelo", "ventura", "puma", "guamani"…
$ godparent_2_social_condition <chr> "doña", NA, NA, NA, NA, NA, NA, "doña"…
$ godparent_3_name <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ godparent_3_lastname <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ witness_1_name <chr> "agustin", "pedro", "marcelo", "pablo"…
$ witness_1_lastname <chr> "castro", "manco", "llamuca", "roque",…
$ witness_2_name <chr> "mariano", "carlos", "julian", "antoni…
$ witness_2_lastname <chr> "castro", "canto", "urbano", "urbano",…
$ witness_3_name <chr> "juan", "pedro", "antonio", "cristobal…
$ witness_3_lastname <chr> "baldes", "guamani", "urbano", "coillo…
$ witness_4_name <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ witness_4_lastname <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ event_place <chr> "Aucara", "Aucara", "Aucara", "Pampama…
$ event_geographic_descriptor_1 <chr> "Aucara", "Aucara", "Aucara", "Aucara"…
$ event_geographic_descriptor_2 <chr> "Huamanga", "Colca", "Pampamarca", "Pa…
$ event_geographic_descriptor_3 <chr> "Coracora", NA, NA, NA, NA, NA, NA, NA…
$ event_geographic_descriptor_4 <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ event_geographic_descriptor_5 <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ event_geographic_descriptor_6 <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
The marriages table contains cleaned and standardized records of marriage events extracted from parish registers. Each row represents a unique marriage event with associated attributes for both spouses, their families, witnesses, and godparents.
| Property | Expected Type | Description |
|---|---|---|
| file | Text | Source file name from which the record was extracted |
| identifier | Text | Sequential identifier for the marriage event |
| event_type | Text | Type of event (Matrimonio) |
| event_date | Date | Date of the marriage event in ISO 8601 format |
| husband_name | Text | Normalized first and middle name(s) of the husband |
| husband_lastname | Text | Normalized or inferred surname(s) of the husband |
| husband_social_condition | Text | Social, ethnic, or political marker of the husband (mestizo, indio, tributario, vecino, don) |
| husband_marital_status | Text | Marital status of the husband at the time of marriage (soltero, viudo) |
| husband_birth_date | Date | Date of birth of the husband in ISO 8601 format |
| husband_birth_place | Text | Place of birth of the husband |
| husband_resident_in | Text | Recorded place of residence of the husband at the time of the event |
| husband_legitimacy_status | Text | Legitimacy status of the husband at birth (legítimo, ilegitimo, natural) |
| husband_father_name | Text | Normalized name of the husband’s father |
| husband_father_lastname | Text | Normalized or inferred surname(s) of the husband’s father |
| husband_father_social_condition | Text | Social, ethnic, or political marker of the husband’s father (mestizo, indio, tributario, vecino, don) |
| husband_mother_name | Text | Normalized name of the husband’s mother |
| husband_mother_lastname | Text | Normalized or inferred surname(s) of the husband’s mother |
| husband_mother_social_condition | Text | Social, ethnic, or political marker of the husband’s mother (mestizo, indio, tributario, vecino, doña) |
| wife_name | Text | Normalized first and middle name(s) of the wife |
| wife_lastname | Text | Normalized or inferred surname(s) of the wife |
| wife_social_condition | Text | Social, ethnic, or political marker of the wife (mestizo, indio, tributario, vecino, doña) |
| wife_marital_status | Text | Marital status of the wife at the time of marriage (soltera, viuda) |
| wife_birth_date | Date | Date of birth of the wife in ISO 8601 format |
| wife_birth_place | Text | Place of birth of the wife |
| wife_resident_in | Text | Recorded place of residence of the wife at the time of the event |
| wife_legitimacy_status | Text | Legitimacy status of the wife at birth (legítima, ilegitima, natural) |
| wife_father_name | Text | Normalized name of the wife’s father |
| wife_father_lastname | Text | Normalized or inferred surname(s) of the wife’s father |
| wife_father_social_condition | Text | Social, ethnic, or political marker of the wife’s father (mestizo, indio, tributario, vecino, don) |
| wife_mother_name | Text | Normalized name of the wife’s mother |
| wife_mother_lastname | Text | Normalized or inferred surname(s) of the wife’s mother |
| wife_mother_social_condition | Text | Social, ethnic, or political marker of the wife’s mother (mestizo, indio, tributario, vecino, doña) |
| godparent_1_name | Text | Normalized name of the first godparent |
| godparent_1_lastname | Text | Normalized or inferred surname(s) of the first godparent |
| godparent_1_social_condition | Text | Social, ethnic, or political marker of the first godparent (mestizo, indio, tributario, vecino, don, doña) |
| godparent_2_name | Text | Normalized name of the second godparent |
| godparent_2_lastname | Text | Normalized or inferred surname(s) of the second godparent |
| godparent_2_social_condition | Text | Social, ethnic, or political marker of the second godparent (mestizo, indio, tributario, vecino, don, doña) |
| godparent_3_name | Text | Normalized name of the third godparent (when applicable) |
| godparent_3_lastname | Text | Normalized or inferred surname(s) of the third godparent (when applicable) |
| witness_1_name | Text | Normalized name of the first witness |
| witness_1_lastname | Text | Normalized or inferred surname(s) of the first witness |
| witness_2_name | Text | Normalized name of the second witness |
| witness_2_lastname | Text | Normalized or inferred surname(s) of the second witness |
| witness_3_name | Text | Normalized name of the third witness (when applicable) |
| witness_3_lastname | Text | Normalized or inferred surname(s) of the third witness (when applicable) |
| witness_4_name | Text | Normalized name of the fourth witness (when applicable) |
| witness_4_lastname | Text | Normalized or inferred surname(s) of the fourth witness (when applicable) |
| event_place | Text | Place where the marriage event took place |
| event_geographic_descriptor_1 | Text | Place or location mentioned in the record |
| event_geographic_descriptor_2 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_3 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_4 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_5 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_6 | Text | Additional place or location mentioned in the record |
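Because witnesses are stored in numbered column pairs (`witness_1_name`, `witness_1_lastname`, and so on), analyses that treat each witness as a row may find a long layout easier to work with. The sketch below is one possible reshaping, assuming the numbered naming pattern shown in the table holds for all witness columns. `marriages_demo` is a hand-built tibble using values from the `glimpse()` output, and `witnesses_long` and `witness_order` are illustrative names.

```r
library(dplyr)
library(tidyr)

# Illustrative subset: two marriages, witness slots 1, 2 and 4
# (slot 4 is empty in these records, as in the glimpse() output).
marriages_demo <- tibble(
  identifier = c("M001", "M002"),
  witness_1_name = c("agustin", "pedro"),
  witness_1_lastname = c("castro", "manco"),
  witness_2_name = c("mariano", "carlos"),
  witness_2_lastname = c("castro", "canto"),
  witness_4_name = c(NA_character_, NA_character_),
  witness_4_lastname = c(NA_character_, NA_character_)
)

# One row per (marriage, witness slot); ".value" keeps name and
# lastname as separate columns while the slot number moves to rows.
witnesses_long <- marriages_demo %>%
  pivot_longer(
    cols = starts_with("witness_"),
    names_to = c("witness_order", ".value"),
    names_pattern = "witness_(\\d+)_(name|lastname)"
  ) %>%
  # Drop slots where no witness was recorded.
  filter(!is.na(name) | !is.na(lastname))
```

The same pattern would likely apply to the `godparent_N_*` columns, with `social_condition` added to the alternatives in `names_pattern`.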
burials <- readr::read_csv("../../data/clean/entierros_clean.csv", show_col_types = FALSE)
burials %>% glimpse()
Rows: 2,192
Columns: 26
$ file <chr> "APAucará LD L001", "APAucará LD L001", …
$ identifier <chr> "E001", "E002", "E003", "E004", "E005", …
$ event_type <chr> "Entierro", "Entierro", "Entierro", "Ent…
$ event_date <date> 1846-10-06, 1846-10-07, 1846-11-02, 184…
$ doctrine <chr> "Parroquia de Aucará", "Parroquia de Auc…
$ event_place <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ deceased_name <chr> "julian", "joce", "martina", "dorotea", …
$ deceased_lastname <chr> "xavies", "raime", "condori", "ccoyllo",…
$ deceased_birth_date <date> NA, 1821-10-13, 1766-11-21, 1806-12-18,…
$ deceased_birth_place <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ deceased_social_condition <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ deceased_marital_status <chr> "\"marido que fue\"", "\"marido que fue\…
$ deceased_legitimacy_status <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ father_name <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, "jua…
$ father_lastname <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, "her…
$ mother_name <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, "cle…
$ mother_lastname <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, "man…
$ husband_name <chr> NA, NA, "luciano", "josé", "mariano", "g…
$ wife_name <chr> "mercedes", "francisca", NA, NA, NA, NA,…
$ burial_place <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ event_geographic_descriptor_1 <chr> "Aucará", "Aucará", "Aucará", "Aucará", …
$ event_geographic_descriptor_2 <chr> "Lucanas", "Lucanas", "Lucanas", "Lucana…
$ event_geographic_descriptor_3 <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ event_geographic_descriptor_4 <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ husband_lastname <chr> NA, NA, "ccoyllo", "espinosa", "huallpat…
$ wife_lastname <chr> "lupa", "cucho", NA, NA, NA, NA, "javier…
The burials table contains cleaned and standardized records of burial events extracted from parish registers. Each row represents a unique burial event with associated attributes such as date, location, and information about the deceased individual and their surviving family members.
| Property | Expected Type | Description |
|---|---|---|
| file | Text | Source file name from which the record was extracted |
| identifier | Text | Sequential identifier for the burial event |
| event_type | Text | Type of event (Entierro) |
| event_date | Date | Date of the burial event in ISO 8601 format |
| doctrine | Text | Name of the parish or doctrine where the burial was registered |
| event_place | Text | Place where the burial event took place |
| deceased_name | Text | Normalized first and middle name(s) of the deceased individual |
| deceased_lastname | Text | Normalized or inferred surname(s) of the deceased individual |
| deceased_birth_date | Date | Date of birth of the deceased individual in ISO 8601 format |
| deceased_birth_place | Text | Place of birth of the deceased individual |
| deceased_social_condition | Text | Social, ethnic, or political marker of the deceased (mestizo, indio, tributario, vecino, don, doña) |
| deceased_marital_status | Text | Marital status of the deceased at the time of death (soltero/soltera, casado/casada, viudo/viuda, marido que fue, mujer que fue) |
| deceased_legitimacy_status | Text | Legitimacy status of the deceased at birth (legítimo, ilegitimo, natural) |
| father_name | Text | Normalized name of the deceased’s father |
| father_lastname | Text | Normalized or inferred surname(s) of the deceased’s father |
| mother_name | Text | Normalized name of the deceased’s mother |
| mother_lastname | Text | Normalized or inferred surname(s) of the deceased’s mother |
| husband_name | Text | Normalized name of the deceased’s husband (when deceased was married) |
| wife_name | Text | Normalized name of the deceased’s wife (when deceased was married) |
| burial_place | Text | Specific location where the deceased was buried |
| event_geographic_descriptor_1 | Text | Place or location mentioned in the record |
| event_geographic_descriptor_2 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_3 | Text | Additional place or location mentioned in the record |
| event_geographic_descriptor_4 | Text | Additional place or location mentioned in the record |
| husband_lastname | Text | Normalized or inferred surname(s) of the deceased’s husband |
| wife_lastname | Text | Normalized or inferred surname(s) of the deceased’s wife |
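Since both `event_date` and `deceased_birth_date` are documented as ISO 8601 dates, a simple consistency check is possible. The following is a rough sketch, not part of the project pipeline: it flags records whose birth date falls after the burial date and derives an approximate age at burial. `burials_demo` holds three rows copied from the `glimpse()` output; `age_at_burial` and `date_order_ok` are hypothetical column names, and dividing by 365.25 is only an approximation of elapsed years.

```r
library(dplyr)

# Three illustrative rows taken from the glimpse() output above.
burials_demo <- tibble(
  identifier = c("E001", "E002", "E003"),
  event_date = as.Date(c("1846-10-06", "1846-10-07", "1846-11-02")),
  deceased_birth_date = as.Date(c(NA, "1821-10-13", "1766-11-21"))
)

burials_checked <- burials_demo %>%
  mutate(
    # Approximate completed years between birth and burial (NA if birth date is missing).
    age_at_burial = floor(as.numeric(event_date - deceased_birth_date) / 365.25),
    # A missing birth date is not treated as an inconsistency.
    date_order_ok = is.na(deceased_birth_date) | deceased_birth_date <= event_date
  )
```

Note that `event_date` is the date of the burial event, so the derived value is an age at burial rather than an exact age at death.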
The places table represents a controlled vocabulary of geographic locations extracted from all event records and normalized through a combination of manual curation and automated gazetteer matching. Each place has been geocoded and linked to external authorities (GeoNames, Getty Thesaurus of Geographic Names) when possible, enabling spatial analysis and visualization of the historical records.
places <- readr::read_csv("../../data/clean/unique_places.csv", show_col_types = FALSE)
places %>% glimpse()
Rows: 74
Columns: 16
$ place_id <dbl> 1, 4, 6, 8, 12, 13, 14, 15, 16, 17, 18, 19, …
$ manually_normalized_place <chr> "Acobamba", "Andamarca", "Apongo", "Aucará",…
$ standardize_label <chr> "Acobamba", "Andamarca", "Apongo", "Aucará",…
$ language <chr> "es", "es", "es", "es", "es", "es", "es", "e…
$ latitude <dbl> -12.07757, -15.63833, -14.01327, -14.25000, …
$ longitude <dbl> -74.87127, -70.58848, -73.93247, -74.08333, …
$ source <chr> "GeoNames", "GeoNames", "GeoNames", "GeoName…
$ id <dbl> 8663907, 3947725, 3947431, 3947087, 3939003,…
$ uri <chr> "http://sws.geonames.org/8663907/", "http://…
$ country_code <chr> "PE", "PE", "PE", "PE", "PE", "PE", "PE", "P…
$ part_of <lgl> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ part_of_uri <lgl> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, …
$ confidence <dbl> 100, 100, 100, 100, 100, 100, 100, 100, 100,…
$ threshold <dbl> 90, 90, 90, 90, 90, 90, 90, 90, 90, 90, 90, …
$ match_type <chr> "exact", "exact", "exact", "exact", "exact",…
$ mentioned_as <chr> "['Acobamba']", "['Andamarca']", "['Apongo']…
The places table contains unique geographic locations mentioned in the parish records, along with their standardized names and geographic coordinates.
| Property | Expected Type | Description |
|---|---|---|
| place_id | Numeric | Unique identifier for the place |
| manually_normalized_place | Text | Place name as normalized through manual curation |
| standardize_label | Text | Name standardized using external gazetteers |
| language | Text | Language of the place name (e.g., es, en) |
| latitude | Numeric | Latitude coordinate of the place |
| longitude | Numeric | Longitude coordinate of the place |
| source | Text | Source gazetteer used for standardization (e.g., GeoNames, TGN) |
| id | Text | Identifier of the place in the source gazetteer |
| uri | Text | URI linking to the place in the source gazetteer |
| country_code | Text | ISO country code of the place |
| part_of | Text | Higher-level administrative division the place belongs to |
| part_of_uri | Text | URI of the higher-level administrative division |
| confidence | Numeric | Confidence score of the place standardization (0-100) |
| threshold | Numeric | Threshold used for the place standardization |
| match_type | Text | Type of match made during standardization (e.g., exact, fuzzy) |
| mentioned_as | Text | Original text mention of the place in the records |
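In the `glimpse()` output, `mentioned_as` appears as a bracketed, single-quoted list stored in one text value (for example `['Acobamba']`). The sketch below shows one way to expand it into one row per mention while keeping only matches whose `confidence` meets the `threshold`. It assumes mentions contain no commas or apostrophes, which the source does not confirm. `places_demo` is hand-built, and the second `mentioned_as` value is a hypothetical multi-mention example.

```r
library(dplyr)
library(stringr)
library(tidyr)

places_demo <- tibble(
  place_id = c(1, 4),
  standardize_label = c("Acobamba", "Andamarca"),
  confidence = c(100, 100),
  threshold = c(90, 90),
  # First value is from the glimpse() output; the second is a
  # hypothetical example with two variant spellings.
  mentioned_as = c("['Acobamba']", "['Andamarca', 'Anda marca']")
)

place_mentions <- places_demo %>%
  # Keep only places whose match score reaches the threshold.
  filter(confidence >= threshold) %>%
  mutate(
    mention = mentioned_as %>%
      str_remove_all("\\[|\\]|'") %>%  # strip brackets and quotes
      str_split(",\\s*")               # split the remaining comma-separated text
  ) %>%
  unnest(mention)
```

If mentions can contain commas or apostrophes, a proper parser for the list syntax would be a safer choice than this string splitting.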
Personas represent individual person mentions extracted from all event records and restructured into a person-centric format. Unlike the event tables which are organized around ceremonies, this table focuses on individuals and their attributes as documented across multiple events. Each row is a unique person mention with inferred demographic information, though multiple mentions may refer to the same historical individual. Future work will link these mentions through probabilistic record linkage to reconstruct life histories.
personas <- readr::read_csv("../../data/clean/personas.csv", show_col_types = FALSE)
personas %>% glimpse()
Rows: 47,072
Columns: 15
$ event_idno <chr> "bautizo-1", "bautizo-1", "bautizo-1", "bautizo-1"…
$ original_identifier <chr> "APAucará-LB-L001_B001", "APAucará-LB-L001_B001", …
$ persona_type <chr> "baptized", "father", "mother", "godfather", "bapt…
$ name <chr> "domingo", "lucas", "sevastiana", "vicente", "domi…
$ birth_place <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ birth_date <date> 1790-08-04, NA, NA, NA, 1790-08-04, NA, NA, NA, 1…
$ legitimacy_status <chr> "legitimo", NA, NA, NA, "legitimo", NA, NA, NA, "l…
$ lastname <chr> "ayquipa", "ayquipa", "quispe", "guamani", "lulia"…
$ persona_idno <chr> "persona-1", "persona-2", "persona-3", "persona-4"…
$ social_condition <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ marital_status <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ resident_in <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ death_place <chr> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
$ death_date <date> NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA, N…
$ gender <chr> "male", "male", "female", "male", "female", "male"…
Personas represent individual mentions of people in the historical records before any aggregation through probabilistic record linkage (PRL). Each row corresponds to a unique mention with associated attributes such as name, birth and death details, and places.
| Property | Expected Type | Description |
|---|---|---|
| event_idno | Text | Identifier of the event in which the person is mentioned (e.g., bautizo-1) |
| original_identifier | Text | Original identifier from the source document |
| persona_idno | Text | Unique identifier for the person mention (e.g., persona-1) |
| name | Text | Normalized first and middle name(s) |
| lastname | Text | Normalized or inferred surname(s) |
| persona_type | Text | Role of the person in the event (e.g., baptized, father, mother, godfather) |
| birth_date | Date | Recorded or inferred date of birth in ISO 8601 format |
| birth_place | Text | Place of birth |
| death_date | Date | Recorded or inferred date of death in ISO 8601 format |
| death_place | Text | Place of death |
| gender | Text | Inferred gender (male, female, unknown) |
| resident_in | Text | Recorded place of residence at the time of the event |
| legitimacy_status | Text | Legitimacy status at birth (legitimo, ilegitimo) |
| marital_status | Text | Marital status at the time of the event (soltero, casado) |
| social_condition | Text | Social, ethnic, or political marker (mestizo, indio, tributario, vecino) |
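Person mentions can be traced back to their source event. In the first rows of the `glimpse()` output, `original_identifier` (`APAucará-LB-L001_B001`) looks like the event table's `file` value with spaces replaced by hyphens, followed by an underscore and the event `identifier`. The sketch below relies on that pattern, which is an assumption inferred from those rows and not a documented rule. The fifth persona row (its `original_identifier` and full `name`) is likewise assumed, and all object names are illustrative.

```r
library(dplyr)
library(stringr)

# Illustrative rows based on the glimpse() output of both tables.
baptisms_demo <- tibble(
  file = "APAucará LB L001",
  identifier = c("B001", "B002"),
  event_date = as.Date(c("1790-10-04", "1790-10-06"))
)

personas_demo <- tibble(
  event_idno = c(rep("bautizo-1", 4), "bautizo-2"),
  original_identifier = c(rep("APAucará-LB-L001_B001", 4), "APAucará-LB-L001_B002"),
  persona_type = c("baptized", "father", "mother", "godfather", "baptized"),
  name = c("domingo", "lucas", "sevastiana", "vicente", "dominga"),
  persona_idno = paste0("persona-", 1:5)
)

# Rebuild the assumed key on the event side, then attach event attributes.
baptism_keys <- baptisms_demo %>%
  mutate(original_identifier = paste0(str_replace_all(file, " ", "-"), "_", identifier)) %>%
  select(original_identifier, event_date)

personas_with_event <- personas_demo %>%
  left_join(baptism_keys, by = "original_identifier")
```

Rows left with a missing `event_date` after the join would indicate mentions whose key does not follow the assumed pattern.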