Mining the Past – Data-Intensive Knowledge Discovery in the Study of Historical Textual Traditions

Bibliographic Details
Published in: Journal of Cognitive Historiography
Authors: Nielbo, Kristoffer Laigaard (Author) ; Slingerland, Edward G. 1968- (Author) ; Nichols, Ryan (Author)
Format: Electronic Article
Language: English
Published: Equinox Publ. [2016]
Further subjects: Historical research
Methodology
Quantitative text analysis
Text mining
Online Access: Full text (publisher)
Full text (DOI)
Description
Summary: Text-heavy and unstructured data constitute the primary source materials for many historical reconstructions. In history and the history of religion, text analysis has typically been conducted by systematically selecting a small sample of texts and subjecting it to highly detailed reading and mental synthesis. But two interrelated technological developments have made a new data-intensive paradigm possible in the study of historical textual traditions, one that can usefully supplement qualitative analysis. First, the availability of significant computing power has made it possible to run algorithms for automated text analysis on most personal computers. Second, the rapid increase in full-text digital databases relevant to the study of religion has considerably reduced costs related to data acquisition and digitization. However, a limited understanding of the scope, advantages, and limitations of data-intensive methods, combined with overly enthusiastic praise of big data by policy-makers and data scientists, has created real obstacles to the implementation of this paradigm in historical research. This is unfortunate, because history offers a rich and uncharted field for data-intensive knowledge discovery, and historians already have the much sought-after and necessary domain expertise. In this article we seek to remove obstacles to the data-intensive paradigm by presenting its methods and models for handling text-heavy data.
ISSN: 2051-9680
Persistent identifiers: DOI: 10.1558/jch.31662