Toiling with the Pāli Canon

The paper describes the preparation of a Buddhist corpus in the Middle Indo-Aryan language Pāli, which is available only in a flat TEI format, for content-based analysis. This task includes transforming the file into a hierarchical TEI P5 representation, followed by tokenisation (including sandhi re...

Full description

Saved in:  
Bibliographic Details
Authors: Elwert, Frederik (Author) ; Sellmer, Sven 1969- (Author) ; Wortmann, Sven (Author) ; Pachurka, Manuel (Author) ; Knauth, Jürgen (Author) ; Alfter, David (Author)
Format: Electronic Article
Language:English
Check availability: HBZ Gateway
Fernleihe:Fernleihe für die Fachinformationsdienste
Published: Institute of Computer Science, Polish Academy of Sciences 2015
In: Proceedings of the Workshop on Corpus-Based Research in the Humanities (CRH)
Year: 2015, Pages: 39-48
Online Access: Volltext (kostenfrei)
Volltext (kostenfrei)

MARC

LEADER 00000caa a22000002 4500
001 1700655671
003 DE-627
005 20210112153915.0
007 cr uuu---uuuuu
008 200616s2015 xx |||||o 00| ||eng c
020 |a 9788363159191 
024 7 |a 10.15496/publikation-52722  |2 doi 
024 7 |a 10900/111346  |2 hdl 
035 |a (DE-627)1700655671 
035 |a (DE-599)KXP1700655671 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 0  |2 ssgn 
100 1 |e VerfasserIn  |0 (DE-588)142107360  |0 (DE-627)704145839  |0 (DE-576)327927631  |4 aut  |a Elwert, Frederik 
109 |a Elwert, Frederik 
245 1 0 |a Toiling with the Pāli Canon  |c Frederik Elwert, Sven Sellmer, Sven Wortmann, Manuel Pachurka, Jürgen Knauth, David Alfter 
264 1 |c 2015 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
520 |a The paper describes the preparation of a Buddhist corpus in the Middle Indo-Aryan language Pāli, which is available only in a flat TEI format, for content-based analysis. This task includes transforming the file into a hierarchical TEI P5 representation, followed by tokenisation (including sandhi resolution), lemmatisation, and POS tagging. 
700 1 |e VerfasserIn  |0 (DE-588)120492539  |0 (DE-627)080709176  |0 (DE-576)17919240X  |4 aut  |a Sellmer, Sven  |d 1969- 
700 1 |e VerfasserIn  |0 (DE-588)143919032  |0 (DE-627)704643111  |0 (DE-576)339696958  |4 aut  |a Wortmann, Sven 
700 1 |a Pachurka, Manuel  |e VerfasserIn  |4 aut 
700 1 |a Knauth, Jürgen  |e VerfasserIn  |4 aut 
700 1 |a Alfter, David  |e VerfasserIn  |4 aut 
773 0 8 |i Enthalten in  |a Corpus-based Research in the Humanities (1. : 2015 : Warschau)  |t Proceedings of the Workshop on Corpus-Based Research in the Humanities (CRH)  |d Warschau : Institute of Computer Science, Polish Academy of Sciences, 2015  |g (2015), Seite 39-48  |h VI, 123 S.  |w (DE-627)1700654985  |z 9788363159191  |7 nnnm 
773 1 8 |g year:2015  |g pages:39-48 
856 4 0 |u http://dx.doi.org/10.15496/publikation-52722  |x Resolving-System  |z kostenfrei  |3 Volltext 
856 4 0 |u http://hdl.handle.net/10900/111346  |x Resolving-System  |z kostenfrei  |3 Volltext 
936 u w |j 2015  |h 39-48 
951 |a AR 
ELC |a 1 
LOK |0 000 xxxxxcx a22 zn 4500 
LOK |0 001 3687474836 
LOK |0 003 DE-627 
LOK |0 004 1700655671 
LOK |0 005 20210111171011 
LOK |0 008 200616||||||||||||||||ger||||||| 
LOK |0 040   |a DE-Tue135  |c DE-627  |d DE-Tue135 
LOK |0 092   |o n 
LOK |0 852   |a DE-Tue135 
LOK |0 852 1  |9 00 
LOK |0 866   |x Erfassung nach Autorenmeldung 
LOK |0 935   |a ixau  |a rwzw 
OAS |a 1 
ORI |a SA-MARC-ixtheoa001.raw 
REL |a 1 
SUB |a REL