Editor for this issue: Zackary Leech <zleechlinguistlist.org>
The third International Evaluation of Ancient Chinese Information Processing (sentence segmentation and punctuation for ancient Chinese)
Short Title: EvaHan 2024
Date: 25-May-2024 - 25-May-2024
Location: Torino, Italy
Contact: Bin Li
Contact Email: [email protected]
Meeting URL: https://circse.github.io/LT4HALA/2024/EvaHan
Linguistic Field(s): Computational Linguistics
Subject Language(s): Chinese, Old (och)
English (eng)
Meeting Description:
Registration
If you are interested in participating, please fill out the electronic application form: https://forms.office.com/r/jxDBanU7pd. When filling it out, please make sure your information is correct and your email address is working. After receiving your registration information, we will send you an email to notify you, please pay attention to check it.
Submitting Runs
Each team can submit runs for two tasks. A run should be produced according to the ‘closed modality’. The second run will be produced according to the ‘open modality’. The closed run is compulsory, while the open run is optional.
Once the system has produced the results for the task over the test set, participants have to follow these instructions to complete their submission:
Name the runs with the following filename format: testID_teamName_systemID_modality.txt For example: testa_unicatt_1_closed.txt would be the first run of a team called unicatt using the closed modality for the task using testa.txt document. testb_unicatt_2_open.txt would be the second run of a team called unicatt using the open modality for the task using the blind testb.txt document.
Send the file to the following email address: libin.njnu[AT]gmail.com, using the subject “EvaHan Submission: task - teamName”, where the “task” is either testa or testb.
Each team could submit up to 2 running files for each test file in each modality. Thus, each team could submit up to 8 running files in total.
Writing the Technical Report
Technical reports will be included in the proceedings of the Workshop on Language. Technologies for Historical and Ancient Languages 2024 (LT4HALA 2024) as short papers and published alongside the LREC-COLING proceedings.
All the reports must:
• be submitted through the START platform: START submission page of the workshop.
• use the official LREC-COLING style templates.
• not exceed four (4) pages of content (excluding references)
• contain (at least) the following sections: description of the system, results, discussion, and reference.
Reports will receive a light review: we will check for the correctness of the format, the exactness of results and ranking, and overall exposition. If needed, we will contact the authors asking for corrections.
EvaHan 2024 is the third International Evaluation of Ancient Chinese Information Processing, focusing this year on the intricate tasks of sentence segmentation and punctuation for ancient Chinese.
EvaHan third edition has one task (i.e. a joint task of Sentence Segmentation and Punctuation.
EvaHan 2024 is organized by Bin Li, Bolin Chang, Minxuan Feng, Chao Xu, Liu Liu, Dongbo Wang.
Important Dates
8 January 2024: training data available
Evaluation Window
8 March 2024: test data available
15 March 2024: system results submission deadline
24 March 2024: paper submission deadline
30 March 2024: notification of acceptance
10 April 2024: camera-ready paper submission
20-25 May 2024: workshop
How to Participate
Participants will be required to submit their runs and to provide a technical report for the task they participated in.
Page Updated: 07-Feb-2024
LINGUIST List is supported by the following publishers: