Coreference-resolution

  • Published on
    This paper presents a cross-document coreference resolution approach for low-resource languages, with a focus on Thai. The authors adapt an existing English model that uses agglomerative clustering to identify and group coreferent entities across documents. The study also compares manual and automatic span detection and finds that a fine-tuned Longformer model performs best, achieving a CoNLL F1 score of 72.87. The resulting framework could be extended to other low-resource languages.
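
The summary does not give the details of the clustering step, so the following is only a minimal sketch of the general idea: mentions from different documents are merged bottom-up while the closest pair of clusters stays within a distance threshold. The average linkage, the cosine distance, the threshold value, the function names, and the toy mention vectors are all assumptions made for illustration, not the authors' configuration.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity; assumes both vectors are non-zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def agglomerative_cluster(vectors, threshold):
    """Average-linkage agglomerative clustering (hypothetical helper).

    Starts with one cluster per mention and repeatedly merges the two
    closest clusters until the smallest average distance exceeds
    `threshold`. Returns clusters as sorted lists of mention indices.
    """
    clusters = [[i] for i in range(len(vectors))]
    while len(clusters) > 1:
        best = None  # (average distance, index a, index b)
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                dists = [
                    cosine_distance(vectors[i], vectors[j])
                    for i in clusters[a]
                    for j in clusters[b]
                ]
                avg = sum(dists) / len(dists)
                if best is None or avg < best[0]:
                    best = (avg, a, b)
        if best[0] > threshold:
            break  # no remaining pair is close enough to merge
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Toy data (invented): (document id, mention text, mention embedding).
mentions = [
    ("doc1", "the prime minister", [1.0, 0.1, 0.0]),
    ("doc2", "the premier", [0.9, 0.2, 0.0]),
    ("doc1", "Bangkok", [0.0, 0.1, 1.0]),
    ("doc3", "the capital", [0.1, 0.0, 0.9]),
]

# The threshold of 0.3 is an arbitrary illustrative value.
clusters = agglomerative_cluster([m[2] for m in mentions], threshold=0.3)
for cluster in clusters:
    print([(mentions[i][0], mentions[i][1]) for i in cluster])
```

In a real system the vectors would come from an encoder over the detected spans, and the threshold would be tuned on held-out data; because clustering ignores document boundaries, mentions from different documents can land in the same cluster.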