این سایت در حال حاضر پشتیبانی نمی شود و امکان دارد داده های نشریات بروز نباشند
پردازش علائم و داده ها، جلد ۲۲، شماره ۲، صفحات ۱۲۷-۱۳۸

عنوان فارسی ارائه روشی جدید برای طبقه‌بندی چندبرچسبی برمبنای شبکه‌های عصبی
چکیده فارسی مقاله
طبقه‌­بندی چندبرچسبی نوعی از طبقه‌­بندی است که در آن نمونه‌­ها می­‌توانند صفر، یک یا بیش از یک برچسب داشته باشند؛ به‌عبارت‌دیگر هر نمونه به‌وسیله یک مجموعه از برچسب‌­ها نمایش داده می­‌شود. با توجه به پژوهش‌های اخیر، درنظرگرفتن ارتباط بین برچسب­‌ها نتایج بهتری را حاصل می­‌کند. در این مقاله برای درنظرگرفتن ارتباط بین برچسب­‌ها، در مرحله نخست از خوشه‌­بندی  k- میانگین با محدودیت استفاده و در مرحله دوم برای هر خوشه یک شبکه ­عصبی پرسپترون چندلایه درنظر گرفته شده‌است؛ درنهایت با ترکیب برچسب‌­های پیش­‌بینی‌شده به‌وسیله طبقه‌­بند­ها، برچسب‌های نهایی به‌دست می­‌آید. با توجه به اینکه تعداد شبکه‌های عصبی نسبت به حالت معمول افزایش و به‌تبع آن‌زمان آموزش داده‌­ها بیشتر می­‌شود، روش جدیدی برای کاهش ابعاد با استفاده از جمع پراکنده به‌کار برده شده‌است. با ارزیابی روش پیشنهادی بر روی مجموعه‌داده‌های موجود در مقایسه با روش‌های پیشین این نتیجه حاصل شد که روش پیشنهادی در سه مجموعه‌داده از نوع متن در بسیاری از معیارها مانند دقت، صحت و فاصله همینگ در بین الگوریتم‌­ها رتبه نخست را داشته است.
کلیدواژه‌های فارسی مقاله طبقه‌بندی، طبقه‌بندی چندبرچسبی، خوشه‌بندی، شبکه‌های عصبی

عنوان انگلیسی Presenting a new method for multi label classification based on neural network
چکیده انگلیسی مقاله
The problem of classification can be divided into two categories: single-label and multi-label. Single-label classification consists of binary and multi-class classification. In binary classification, the task is to predict one in two possible classes, such as distinguishing between spam and non-spam emails. In multi-class classification, the goal is to classify instances into more than two classes, such as identifying different species of flowers based on petal measurements. In contrast to single-label classification, multi-label classification is more complex because each instance could belong to multiple categories simultaneously. In multi-label learning, instead of assigning a single label to each instance, a set of labels is assigned. This means that each sample may have zero, one, or more than one associated label. For example, in a text classification task, a news article about technology and business might be labeled as both "Technology" and "Business". To handle multi-label classification, several approaches have been developed. One of the simplest methods is Binary Relevance (BR), which transforms the multi-label problem into multiple independent binary classification tasks—one for each label. Although this approach is easy to implement, it treats each label independently and ignores possible relationships among them. However, in real-world applications, labels are often correlated; for instance, in medical diagnosis, certain diseases frequently appear together. In another approach, Label Powerset (LP), considers label dependencies by treating each unique combination of labels as a separate class. While this method captures relationships between labels, it suffers from scalability issues while dealing with a large number of labels, as the number of possible label combinations increases exponentially. To address these challenges, the proposed method incorporates k-means constraint clustering to group both labels and features prior to applying classification. In the first step, clustering is performed to group similar labels together, ensuring that label correlations are preserved. This also helps to mitigate the issue of imbalanced classification, where certain labels may be underrepresented in the dataset. Once the labels are being clustered, a separate multi-layer neural network would be assigned to each cluster. Instead of using a single large neural network for all labels, multiple smaller networks would be trained for different label clusters. This approach enhances learning efficiency and improves accuracy by focusing on relevant label groups. However, using multiple classifiers increases computational costs and training time. To mitigate this issue, a scatter-add dimension reduction technique is applied. Using scatter-add, attributes are efficiently assigned to the input of each neural network, ensuring that each classifier receives only the relevant feature subset. Each neural network then predicts labels within its designated cluster. Eventually, the predictions from all classifiers are combined to generate the final multi-label output for each instance. To evaluate the effectiveness of the proposed method, experiments were conducted on various text datasets. The results were compared with traditional multi-label classification methods, including Binary Relevance and Label Powerset. The evaluation has been based on several performance metrics, such as accuracy, precision, and hamming-loss. The results demonstrated that the proposed approach achieved superior performance across multiple datasets, ranking first in several evaluation criteria. Notably, it outperformed existing methods by a margin of approximately 1% in accuracy. These findings suggest that clustering-based multi-label classification using k-means constraint clustering and multi-layer neural networks is a promising approach. By leveraging label correlations and reducing dimensionality, the proposed method effectively improves classification performance while addressing issues such as label imbalance and computational inefficiency. Future research may further explore optimization techniques to reduce training time while maintaining high accuracy.
کلیدواژه‌های انگلیسی مقاله Classification, Multi-Label Classification, Clustering, Neural Networks

نویسندگان مقاله محسن نصیری | Mohsen Nasiri
Msc, Computer Engineering Department, Shahid Rajaee Teacher Training University, Tehran, Iran
کارشناس‌ارشد دانشکده مهندسی کامپیوتر، دانشگاه تربیت دبیر شهید رجایی، تهران، ایران

نگین دانشپور | Negin Daneshpour
Associate Professor, Computer Engineering Department, Shahid Rajaee Teacher Training University, Tehran, Iran
دانشیار دانشکده مهندسی کامپیوتر، دانشگاه تربیت دبیر شهید رجایی، تهران، ایران


نشانی اینترنتی http://jsdp.rcisp.ac.ir/browse.php?a_code=A-10-815-8&slc_lang=fa&sid=1
فایل مقاله فایلی برای مقاله ذخیره نشده است
کد مقاله (doi)
زبان مقاله منتشر شده fa
موضوعات مقاله منتشر شده مقالات پردازش داده‌های رقمی
نوع مقاله منتشر شده پژوهشی
برگشت به: صفحه اول پایگاه   |   نسخه مرتبط   |   نشریه مرتبط   |   فهرست نشریات