Integrating Categorical Legacy Data in Spatial Models: A Unique Dataset from Southeast Asia
Author(s): Brian McCray; Jesse Stephen; Christopher Nicholson
Year: 2024
Summary
This is an abstract from the "SAA 2024: Individual Abstracts" session, at the 89th annual meeting of the Society for American Archaeology.
Despite advances made in open access publishing, significant archaeological information remains confined to the grey literature or to unpublished, internal reports in the possession of institutions. For at least twenty years, archaeologists have realized that digital archiving could make this material more accessible at a larger scale, but the tools to access these documents and convert them to be a legible part of a comparative dataset are still developing. This poster presents the results of an archival research project using natural language processing to make 159 legacy reports digitally accessible for incorporation in spatial models using generalized linear modeling and machine learning approaches. These data were gathered from archaeological reports generated by the Defense POW/MIA Accounting Agency (DPAA) and its antecedents, in partnership with the center for Digital Antiquity at Arizona State University. Though challenging to implement, we found that context-specific categorical data, such as aircraft type, could be extracted from reports and used to refine generalized linear and machine learning spatial models. The project shows how creative digital archaeology approaches can work with diverse datasets.
Cite this Record
Integrating Categorical Legacy Data in Spatial Models: A Unique Dataset from Southeast Asia. Brian McCray, Jesse Stephen, Christopher Nicholson. Presented at The 89th Annual Meeting of the Society for American Archaeology. 2024 ( tDAR id: 499889) ; doi:10.48512/XCV8499889
This Resource is Part of the Following Collections
Keywords
General
Digital Archaeology: Simulation and Modeling
•
Historic
Geographic Keywords
Asia: Southeast Asia
Spatial Coverage
min long: 100.986; min lat: 8.581 ; max long: 109.951; max lat: 15.792 ;
Record Identifiers
Abstract Id(s): 40119.0
File Information
Name | Size | Creation Date | Date Uploaded | Access | |
---|---|---|---|---|---|
SAA_2024_copy2.pdf | 5.68mb | Jun 21, 2024 2:10:04 PM | Public |