این سایت در حال حاضر پشتیبانی نمی شود و امکان دارد داده های نشریات بروز نباشند
صفحه اصلی
درباره پایگاه
فهرست سامانه ها
الزامات سامانه ها
فهرست سازمانی
تماس با ما
JCR 2016
جستجوی مقالات
سه شنبه 25 آذر 1404
رایانش نرم و فناوری اطلاعات
، جلد ۹، شماره ۱، صفحات ۱۸-۲۷
عنوان فارسی
A Speech Act Classifier for Persian Texts and its Application in Identifying Rumors
چکیده فارسی مقاله
Speech Acts (SAs) are one of the important areas of pragmatics, which give us a better understanding of the state of mind of the people and convey an intended language function. Knowledge of the SA of a text can be helpful in analyzing that text in natural language processing applications. This study presents a dictionary-based statistical technique for Persian SA recognition. The proposed technique classifies a text into seven classes of SA based on four criteria: lexical, syntactic, semantic, and surface features. WordNet as the tool for extracting synonym and enriching features dictionary is utilized. To evaluate the proposed technique, we utilized four classification methods including Random Forest (RF), Support Vector Machine (SVM), Naive Bayes (NB), and K-Nearest Neighbors (KNN). The experimental results demonstrate that the proposed method using RF and SVM as the best classifiers achieved a state-of-the-art performance with an accuracy of 0.95 for classification of Persian SAs. Our original vision of this work is introducing an application of SA recognition on social media content, especially identifying the common SA in rumors and its application in the rumor detection. Therefore, the proposed system utilized to determine the common SAs in rumors. The results showed that Persian rumors are often expressed in three SA classes including narrative, question, and threat, and in some cases with the request SA. Also, the evaluation results indicate that SA as a distinctive feature between rumors and non-rumors improves the accuracy of rumor identification from 0.762 (based on common context features) to 0.791 (the combination of common context features and four SA classes).
کلیدواژههای فارسی مقاله
عنوان انگلیسی
A Speech Act Classifier for Persian Texts and its Application in Identifying Rumors
چکیده انگلیسی مقاله
Speech Acts (SAs) are one of the important areas of pragmatics, which give us a better understanding of the state of mind of the people and convey an intended language function. Knowledge of the SA of a text can be helpful in analyzing that text in natural language processing applications. This study presents a dictionary-based statistical technique for Persian SA recognition. The proposed technique classifies a text into seven classes of SA based on four criteria: lexical, syntactic, semantic, and surface features. WordNet as the tool for extracting synonym and enriching features dictionary is utilized. To evaluate the proposed technique, we utilized four classification methods including Random Forest (RF), Support Vector Machine (SVM), Naive Bayes (NB), and K-Nearest Neighbors (KNN). The experimental results demonstrate that the proposed method using RF and SVM as the best classifiers achieved a state-of-the-art performance with an accuracy of 0.95 for classification of Persian SAs. Our original vision of this work is introducing an application of SA recognition on social media content, especially identifying the common SA in rumors and its application in the rumor detection. Therefore, the proposed system utilized to determine the common SAs in rumors. The results showed that Persian rumors are often expressed in three SA classes including narrative, question, and threat, and in some cases with the request SA. Also, the evaluation results indicate that SA as a distinctive feature between rumors and non-rumors improves the accuracy of rumor identification from 0.762 (based on common context features) to 0.791 (the combination of common context features and four SA classes).
کلیدواژههای انگلیسی مقاله
Speech Act, Persian text classification, Feature Extraction, WordNet, Rumor detection
نویسندگان مقاله
Zoleikha Jahanbakhsh-Nagadeh |
Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran.
Mohammad-Reza Feizi-Derakhshi |
Department of Computer Engineering, Faculty of Electrical and Computer Engineering, University of Tabriz, Iran.
Arash Sharifi |
Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran.
نشانی اینترنتی
http://jscit.nit.ac.ir/article_103557_39ee4141ca88e2610c15237386cdb480.pdf
فایل مقاله
اشکال در دسترسی به فایل - ./files/site1/rds_journals/834/article-834-2465432.pdf
کد مقاله (doi)
زبان مقاله منتشر شده
fa
موضوعات مقاله منتشر شده
نوع مقاله منتشر شده
برگشت به:
صفحه اول پایگاه
|
نسخه مرتبط
|
نشریه مرتبط
|
فهرست نشریات