التنقيب عن العلاقات الإكلينيكية من ملفات المرضى

Faculty Computer Science Year: 2024
Type of Publication: ZU Hosted Pages:
Authors:
Journal: Volume:
Keywords : التنقيب , العلاقات الإكلينيكية , ملفات المرضى    
Abstract:
This thesis focuses on extracting relationships from patient narratives. The Clinical E-Science Framework (CLEF) project was used to extract important information from medical texts by building a system for the purpose of clinical research, evidence-based healthcare and genotype-meets-phenotype informatics. The system is divided into two parts, one part concerns with the identification of relationships between clinically important entities in the text. The full parses and domain-specific grammars had been used to apply many approaches to extract the relationship. In the second part of the system, statistical machine learning (ML) approaches are applied to extract relationship. A corpus of oncology narratives that hand annotated with clinical relationships can be used to train and test a system that has been designed and implemented by supervised machine learning (ML) approaches. Many features can be extracted from these texts that are used to build a model by the classifier. The goal of this thesis is to analyze the implementation of multiple supervised learning algorithms for relationship extraction. As well as the effects of adding the features, changing the size of the corpus, and changing the uneven margin parameter of the algorithm on relationship extraction are examined. Also the evaluation of the supervised learning algorithms and comparing them based on the performance. The results of the performance of these algorithms for extracting clinical relationships from medical text are very important. SVM with uneven margin is much suitable algorithm which achieves high accuracy, but it takes more time in the run than Perceptron with uneven margin. Perceptron with uneven margin is very fast algorithm than others as well as the accuracy is relatively near to SVM, there is small change in between. Increasing the value of τ leads to improve the performance to reach the value that has high performance where τ = 0.8 after that point the performance dropped. Adding new feature sets like non-syntactic features improves the performance. Adding the syntactic features leads to small drop in the performance that unclear. Changing the size of training corpus leads to improve the performance although required increasing the classification time.
   
     
 
       

Author Related Publications

  • Wafaa Tawfik Abdelmoniem, "تحليل البيانات الضخمة بالحوسبة المتوازية والموزعة", 2024 More
  • Wafaa Tawfik Abdelmoniem, "Clinical Relationships Extraction Techniques from Patient Narratives", International Journal of Computer Science, 2013 More
  • Wafaa Tawfik Abdelmoniem, "GRAPH MINING TECHNIQUES FOR GRAPH CLUSTERING: STARTING POINT", Journal of Theoretical and Applied Information Technology, 2019 More
  • Wafaa Tawfik Abdelmoniem, "A Practical Comparison of Local Graph Clustering Algorithms", International Journal of Engineering Trends and Technology (IJETT), 2019 More

Department Related Publications

  • Khalid Aly Eldrandaly Mohamed Saeed Eldrandaly, "An Expert GIS-Based ANP-OWA Decision Making Framework for Tourism Development Site Selection", MECS Publisher, 2014 More
  • Khalid Aly Eldrandaly Mohamed Saeed Eldrandaly, "A Modified Artificial Bee Colony Algorithm for Solving Least-Cost Path Problem in Raster GIS", Natural Sciences Publishing Corporation., 2015 More
  • Mohamed Monier Hassan Mohamed Hassan, "A Modified Artificial Bee Colony Algorithm for Solving Least-Cost Path Problem in Raster GIS", Natural Sciences Publishing Corporation., 2015 More
  • Nabil Moustafa AbdelAziz, "A Modified Artificial Bee Colony Algorithm for Solving Least-Cost Path Problem in Raster GIS", Natural Sciences Publishing Corporation., 2015 More
  • Abdelnaser Hessien Reyad Zaied , "An Integrated Framework for Project Management Success Factors", the Egyptian International Journal of Engineering Science and Technology (EIJEST),, 2014 More
Tweet