Toiling with the Pāli Canon
The paper describes the preparation of a Buddhist corpus in the Middle Indo-Aryan language Pāli, which is available only in a flat TEI format, for content-based analysis. This task includes transforming the file into a hierarchical TEI P5 representation, followed by tokenisation (including sandhi re...
Authors: | ; ; ; ; ; |
---|---|
Format: | Electronic Article |
Language: | English |
Check availability: | HBZ Gateway |
Fernleihe: | Fernleihe für die Fachinformationsdienste |
Published: |
Institute of Computer Science, Polish Academy of Sciences
2015
|
In: |
Proceedings of the Workshop on Corpus-Based Research in the Humanities (CRH)
Year: 2015, Pages: 39-48 |
Online Access: |
Volltext (kostenfrei) Volltext (kostenfrei) |
MARC
LEADER | 00000caa a22000002 4500 | ||
---|---|---|---|
001 | 1700655671 | ||
003 | DE-627 | ||
005 | 20210112153915.0 | ||
007 | cr uuu---uuuuu | ||
008 | 200616s2015 xx |||||o 00| ||eng c | ||
020 | |a 9788363159191 | ||
024 | 7 | |a 10.15496/publikation-52722 |2 doi | |
024 | 7 | |a 10900/111346 |2 hdl | |
035 | |a (DE-627)1700655671 | ||
035 | |a (DE-599)KXP1700655671 | ||
040 | |a DE-627 |b ger |c DE-627 |e rda | ||
041 | |a eng | ||
084 | |a 0 |2 ssgn | ||
100 | 1 | |e VerfasserIn |0 (DE-588)142107360 |0 (DE-627)704145839 |0 (DE-576)327927631 |4 aut |a Elwert, Frederik | |
109 | |a Elwert, Frederik | ||
245 | 1 | 0 | |a Toiling with the Pāli Canon |c Frederik Elwert, Sven Sellmer, Sven Wortmann, Manuel Pachurka, Jürgen Knauth, David Alfter |
264 | 1 | |c 2015 | |
336 | |a Text |b txt |2 rdacontent | ||
337 | |a Computermedien |b c |2 rdamedia | ||
338 | |a Online-Ressource |b cr |2 rdacarrier | ||
520 | |a The paper describes the preparation of a Buddhist corpus in the Middle Indo-Aryan language Pāli, which is available only in a flat TEI format, for content-based analysis. This task includes transforming the file into a hierarchical TEI P5 representation, followed by tokenisation (including sandhi resolution), lemmatisation, and POS tagging. | ||
700 | 1 | |e VerfasserIn |0 (DE-588)120492539 |0 (DE-627)080709176 |0 (DE-576)17919240X |4 aut |a Sellmer, Sven |d 1969- | |
700 | 1 | |e VerfasserIn |0 (DE-588)143919032 |0 (DE-627)704643111 |0 (DE-576)339696958 |4 aut |a Wortmann, Sven | |
700 | 1 | |a Pachurka, Manuel |e VerfasserIn |4 aut | |
700 | 1 | |a Knauth, Jürgen |e VerfasserIn |4 aut | |
700 | 1 | |a Alfter, David |e VerfasserIn |4 aut | |
773 | 0 | 8 | |i Enthalten in |a Corpus-based Research in the Humanities (1. : 2015 : Warschau) |t Proceedings of the Workshop on Corpus-Based Research in the Humanities (CRH) |d Warschau : Institute of Computer Science, Polish Academy of Sciences, 2015 |g (2015), Seite 39-48 |h VI, 123 S. |w (DE-627)1700654985 |z 9788363159191 |7 nnnm |
773 | 1 | 8 | |g year:2015 |g pages:39-48 |
856 | 4 | 0 | |u http://dx.doi.org/10.15496/publikation-52722 |x Resolving-System |z kostenfrei |3 Volltext |
856 | 4 | 0 | |u http://hdl.handle.net/10900/111346 |x Resolving-System |z kostenfrei |3 Volltext |
936 | u | w | |j 2015 |h 39-48 |
951 | |a AR | ||
ELC | |a 1 | ||
LOK | |0 000 xxxxxcx a22 zn 4500 | ||
LOK | |0 001 3687474836 | ||
LOK | |0 003 DE-627 | ||
LOK | |0 004 1700655671 | ||
LOK | |0 005 20210111171011 | ||
LOK | |0 008 200616||||||||||||||||ger||||||| | ||
LOK | |0 040 |a DE-Tue135 |c DE-627 |d DE-Tue135 | ||
LOK | |0 092 |o n | ||
LOK | |0 852 |a DE-Tue135 | ||
LOK | |0 852 1 |9 00 | ||
LOK | |0 866 |x Erfassung nach Autorenmeldung | ||
LOK | |0 935 |a ixau |a rwzw | ||
OAS | |a 1 | ||
ORI | |a SA-MARC-ixtheoa001.raw | ||
REL | |a 1 | ||
SUB | |a REL |