Determining Extractive Summary for a Single Document Based on Collaborative Filtering Frequency Prediction and Mean Shift Clustering

Faculty Computer Science Year: 2019
Type of Publication: ZU Hosted Pages: . 494-505
Authors:
Journal: IAENG International Journal of Computer Science International Association of Engineers Volume: 46
Keywords : Determining Extractive Summary , , Single Document Based    
Abstract:
This paper presents a new unsupervised algorithm for determining extractive summary for a single document using term frequency prediction, which is obtained from memory-based collaborative filtering (CF) approach, and Mean Shift Clustering algorithm. The new algorithm uses Term-Sentence Collaborative Filtering (TSCF) for predicting term frequency. These term frequencies are used in sentence ranking according to the presence percentage of each word/term in each sentence. TSCF computes term frequencies for either terms present or missing (sparse) in a sentence via collaborative filtering prediction algorithm. The new algorithm uses Mean Shift Clustering algorithm as a final framework to group sentences according to their ranks to get more coherent summaries. Experiments show the effect of using different weighting functions including: Term Frequency (TF), Term Frequency Inverse Document Frequency (TFIDF) and binary TF. In addition, they show the effect of using different distance metrics that support sparse matrices representations including: Cosine, Euclidean and Manhattan. Experiments also, show the effect of using L1 and L2 normalization. ROUGE is used as a fully automatic metric in text summarization on DUC2002 datasets. Results show ROUGE-1, ROUGE-2, ROUGE-L and ROUGE-SU4 average recall, precision and f-measure scores, which show the effectiveness of the new algorithm. Results show that the proposed TSCF algorithm has promising results and outperforms related baseline techniques in many ROUGE scores.
   
     
 
       

Author Related Publications

  • Ahmed Raafat Abass Mohamed Saliem, "BERT-CNN: A Deep Learning Model for Detecting Emotions from Text", Tech Science Press, 2021 More
  • Ahmed Raafat Abass Mohamed Saliem, "Using General Regression with Local Tuning for Learning Mixture Models from Incomplete Data Sets", ScienceDirect, 2010 More
  • Ahmed Raafat Abass Mohamed Saliem, "On determining efficient finite mixture models with compact and essential components for clustering data", ScienceDirect, 2013 More
  • Ahmed Raafat Abass Mohamed Saliem, "Unsupervised learning of mixture models based on swarm intelligence and neural networks with optimal completion using incomplete data", ScienceDirect, 2012 More
  • Ahmed Raafat Abass Mohamed Saliem, "Adaptive competitive learning neural networks", ScienceDirect, 2013 More

Department Related Publications

  • Amr Mohammed Abdel Latif Emam, "An Implementation of a Fast Threaded Nondeterministic LL (*) Parser Generator", International Journal of Computer Applications, 2015 More
  • Ibrahiem Mahmoud Mohamed Elhenawy, "Solving 0–1 knapsack problem by binary flower pollination algorithm", Springer, 2018 More
  • Mohammed Abdel Basset Metwally Attia, "Solving 0–1 knapsack problem by binary flower pollination algorithm", Springer, 2018 More
  • Doaa El-Shahat Barakat Mohammed, "Solving 0–1 knapsack problem by binary flower pollination algorithm", Springer, 2018 More
  • Mohammed Abdel Basset Metwally Attia, "Neutrosophic AHP-Delphi Group decision making model based on trapezoidal neutrosophic numbers", Springer‏, 2018 More
Tweet