From Networks to Named Entities and Back Again: Exploring Classical Arabic Isnad Networks

This paper explores new methods for disambiguating the identity of individuals in classical Arabic citations (isnāds) using a network-based approach. After training a model to extract name mentions from classical Arabic, we embed these mentions in vector space using fine-tuned BERT representations a...

Ausführliche Beschreibung

Gespeichert in:  
Bibliographische Detailangaben
Nebentitel:Mit arabischen Schriftzeichen im Text
VerfasserInnen: Muther, Ryan (VerfasserIn) ; Smith, David (VerfasserIn) ; Savant, Sarah Bowen (VerfasserIn)
Medienart: Elektronisch Aufsatz
Sprache:Englisch
Verfügbarkeit prüfen: HBZ Gateway
Journals Online & Print:
Lade...
Fernleihe:Fernleihe für die Fachinformationsdienste
Veröffentlicht: Université du Luxembourg 2023
In: Journal of historical network research
Jahr: 2023, Band: 8, Seiten: 1-20
weitere Schlagwörter:B Hadith
B name disambiguation
B Natural Language Processing
B Network Analysis
Online Zugang: Volltext (kostenfrei)
Volltext (kostenfrei)

MARC

LEADER 00000naa a22000002 4500
001 1886026033
003 DE-627
005 20240417071902.0
007 cr uuu---uuuuu
008 240417s2023 xx |||||o 00| ||eng c
024 7 |a 10.25517/jhnr.v8i1.135  |2 doi 
035 |a (DE-627)1886026033 
035 |a (DE-599)KXP1886026033 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 0  |2 ssgn 
100 1 |a Muther, Ryan  |e VerfasserIn  |4 aut 
245 1 0 |a From Networks to Named Entities and Back Again  |b Exploring Classical Arabic Isnad Networks 
246 1 |i Abweichender Titel  |a Mit arabischen Schriftzeichen im Text 
264 1 |c 2023 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
520 |a This paper explores new methods for disambiguating the identity of individuals in classical Arabic citations (isnāds) using a network-based approach. After training a model to extract name mentions from classical Arabic, we embed these mentions in vector space using fine-tuned BERT representations and use community detection to infer clusters of coreferent mentions. The best-performing clustering approach reduces error on the CoNLL metric by 30%. Then, as a case study, we examine the problem of determining the number of direct transmitters to Ibn ʿAsākir (d. 1176) in a set of isnāds taken from the 12th century historical text Taʾrīkh Madīnat Dimashq (TMD, History of Damascus), using our method to replicate human judgement. 
601 |a Network 
650 4 |a Hadith 
650 4 |a name disambiguation 
650 4 |a Natural Language Processing 
650 4 |a Network Analysis 
700 1 |a Smith, David  |e VerfasserIn  |4 aut 
700 1 |e VerfasserIn  |0 (DE-588)1044734957  |0 (DE-627)772631107  |0 (DE-576)398088535  |4 aut  |a Savant, Sarah Bowen 
773 0 8 |i Enthalten in  |t Journal of historical network research  |d Luxembourg : Université du Luxembourg, 2017  |g 8(2023), Seite 1-20  |h Online-Ressource  |w (DE-627)1000904911  |w (DE-600)2908863-X  |w (DE-576)494553545  |x 2535-8863  |7 nnns 
773 1 8 |g volume:8  |g year:2023  |g pages:1-20 
856 4 0 |u http://jhnr.uni.lu/index.php/jhnr/article/view/135  |x Verlag  |z kostenfrei  |3 Volltext 
856 4 0 |u https://doi.org/10.25517/jhnr.v8i1.135  |x Resolving-System  |z kostenfrei  |3 Volltext 
951 |a AR 
ELC |a 1 
LOK |0 000 xxxxxcx a22 zn 4500 
LOK |0 001 4512816984 
LOK |0 003 DE-627 
LOK |0 004 1886026033 
LOK |0 005 20240417071902 
LOK |0 008 240417||||||||||||||||ger||||||| 
LOK |0 040   |a DE-Tue135  |c DE-627  |d DE-Tue135 
LOK |0 092   |o n 
LOK |0 852   |a DE-Tue135 
LOK |0 852 1  |9 00 
LOK |0 935   |a ixzs  |a ixzo 
OAS |a 1 
ORI |a TA-MARC-ixtheoa001.raw 
REL |a 1 
SUB |a REL