این سایت در حال حاضر پشتیبانی نمی شود و امکان دارد داده های نشریات بروز نباشند
صفحه اصلی
درباره پایگاه
فهرست سامانه ها
الزامات سامانه ها
فهرست سازمانی
تماس با ما
JCR 2016
جستجوی مقالات
دوشنبه 4 اسفند 1404
Journal of Artificial Intelligence and Data Mining
، جلد ۹، شماره ۲، صفحات ۱۸۱-۱۹۲
عنوان فارسی
چکیده فارسی مقاله
کلیدواژههای فارسی مقاله
عنوان انگلیسی
ParsNER-Social: A Corpus for Named Entity Recognition in Persian Social Media Texts
چکیده انگلیسی مقاله
Named Entity Recognition (NER) is one of the essential prerequisites for many natural language processing tasks. All public corpora for Persian named entity recognition, such as ParsNERCorp and ArmanPersoNERCorpus, are based on the Bijankhan corpus, which is originated from the Hamshahri newspaper in 2004. Correspondingly, most of the published named entity recognition models in Persian are specially tuned for the news data and are not flexible enough to be applied in different text categories, such as social media texts. This study introduces ParsNER-Social, a corpus for training named entity recognition models in the Persian language built from social media sources. This corpus consists of 205,373 tokens and their NER tags, crawled from social media contents, including 10 Telegram channels in 10 different categories. Furthermore, three supervised methods are introduced and trained based on the ParsNER-Social corpus: Two conditional random field models as baseline models and one state-of-the-art deep learning model with six different configurations are evaluated on the proposed dataset. The experiments show that the Mono-Lingual Persian models based on Bidirectional Encoder Representations from Transformers (MLBERT) outperform the other approaches on the ParsNER-Social corpus. Among different Configurations of MLBERT models, the ParsBERT+BERT-TokenClass model obtained an F1-score of 89.65%.
کلیدواژههای انگلیسی مقاله
Named Entity Recognition, Natural Language Processing, Social Media Corpus, Persian Language
نویسندگان مقاله
M. Asgari-Bidhendi |
Computer Engineering School, Iran University of Science and Technology, Tehran, Iran.
B. Janfada |
Computer Engineering School, Iran University of Science and Technology, Tehran, Iran.
O. R. Roshani Talab |
Computer Engineering School, Iran University of Science and Technology, Tehran, Iran.
B. Minaei-Bidgoli |
School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran.
نشانی اینترنتی
http://jad.shahroodut.ac.ir/article_2023_ef4c0c677865f8f5a25263a9ae6e57d8.pdf
فایل مقاله
فایلی برای مقاله ذخیره نشده است
کد مقاله (doi)
زبان مقاله منتشر شده
en
موضوعات مقاله منتشر شده
نوع مقاله منتشر شده
برگشت به:
صفحه اول پایگاه
|
نسخه مرتبط
|
نشریه مرتبط
|
فهرست نشریات