Zagazig University Digital Repository
Home
Thesis & Publications
All Contents
Publications
Thesis
Graduation Projects
Research Area
Research Area Reports
Search by Research Area
Universities Thesis
ACADEMIC Links
ACADEMIC RESEARCH
Zagazig University Authors
Africa Research Statistics
Google Scholar
Research Gate
Researcher ID
CrossRef
Arabic text clustering using improved clustering algorithms with dimensionality reduction
Faculty
Computer Science
Year:
2019
Type of Publication:
ZU Hosted
Pages:
Authors:
Mohammed Abdel Basset Metwally Attia
Staff Zu Site
Abstract In Staff Site
Journal:
Cluster Computing Springer
Volume:
Keywords :
Arabic text clustering using improved clustering
Abstract:
Arabic Text document clustering is an important aspect for providing conjectural navigation and browsing techniques by organizing massive amounts of data into a small number of defined clusters. However, Words in form of vector are used for clustering methods is often unsatisfactory as it ignores relationships between important terms. Cluster analysis separates data into groups on clusters for the purposes of improved understanding or summarization. Clustering has a long history and many techniques developed in statistics, data mining, pattern recognition and other fields. This research proposes three approaches; Unsupervised, Semi Supervised techniques and Semi Supervised with dimensionality reduction to construct a clustering based classifier for Arabic text documents. Using k-means, incremental k-means, Threshold + k-means and k-means with dimensionality reduction, after document preprocessing removing stop words and gets the root for each term in each document. Then apply a term weighting method to get the weight of each term with respect to its document. Then apply a similarity measure method to each document and its similarity with other documents. And using F-measure, entropy and support vector machine (SVM) for calculate accuracy. The datasets are online dynamic datasets that are characterized by its availability and credibility on the internet. Arabic language is a challenging language when applied in an inference based algorithm. So, selecting the appropriate dataset is a principal factor in such research. The accuracy of those methods compared with other approaches and the proposed methods shows better accuracy and fewer errors for new classification test cases. Considering that the dimension reduction process is very sensitive because increasing the ratio of reduction can destroy important terms.
Author Related Publications
Mohammed Abdel Basset Metwally Attia, "Discrete greedy flower pollination algorithm for spherical traveling salesman problem", Springer, 2019
More
Mohammed Abdel Basset Metwally Attia, "A New Hybrid Flower Pollination Algorithm for Solving Constrained Global Optimization Problems", Natural Sciences Publishing Cor., 2014
More
Mohammed Abdel Basset Metwally Attia, "A novel equilibrium optimization algorithm for multi-thresholding image segmentation problems", Springer London, 2021
More
Mohammed Abdel Basset Metwally Attia, "An efficient binary slime mould algorithm integrated with a novel attacking-feeding strategy for feature selection", Pergamon, 2021
More
Mohammed Abdel Basset Metwally Attia, "An efficient teaching-learning-based optimization algorithm for parameters identification of photovoltaic models: Analysis and validations", Pergamon, 2021
More
Department Related Publications
Abdul Wahid Ibrahim Mahmoud Khamis, "Cameraphone Recognition of Arabic Fingerspelling", International Journal of Computer Science and Information Technology & Security (IJCSITS), 2013
More
Mohammed Abdel Basset Metwally Attia, "A hybrid flower pollination algorithm for solving ill-conditioned set of equations", Int. J. Bio-Inspired Computation, 2016
More
Zaher Awad Aboelenieen Elhendy, "NEW APPROACH TO IMAGE EDGE DETECTION BASED ON QUANTUM ENTROPY", JOURNAL OF RUSSIAN LASER RESEARCH, 2016
More
Ibrahiem Mahmoud Mohamed Elhenawy, "A hybrid whale optimization algorithm based on local search strategy for the permutation flow shop scheduling problem", North-Holland, 2018
More
Mohammed Abdel Basset Metwally Attia, "A hybrid whale optimization algorithm based on local search strategy for the permutation flow shop scheduling problem", North-Holland, 2018
More
جامعة المنصورة
جامعة الاسكندرية
جامعة القاهرة
جامعة سوهاج
جامعة الفيوم
جامعة بنها
جامعة دمياط
جامعة بورسعيد
جامعة حلوان
جامعة السويس
شراقوة
جامعة المنيا
جامعة دمنهور
جامعة المنوفية
جامعة أسوان
جامعة جنوب الوادى
جامعة قناة السويس
جامعة عين شمس
جامعة أسيوط
جامعة كفر الشيخ
جامعة السادات
جامعة طنطا
جامعة بنى سويف