Deep Learning-based Method for Enhancing the Detection of Arabic Authorship Attribution using Acoustic and Textual-based Features

Mohammed Al-Sarem, Faisal Saeed*, Sultan Noman Qasem, Abdullah M. Albarrak

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Authorship attribution (AA) is defined as the identification of the original author of an unseen text. It is found that the style of the author’s writing can change from one topic to another, but the author’s habits are still the same in different texts. The authorship attribution has been extensively studied for texts written in different languages such as English. However, few studies investigated the Arabic authorship attribution (AAA) due to the special challenges faced with the Arabic scripts. Additionally, there is a need to identify the authors of texts extracted from livestream broadcasting and the recorded speeches to protect the intellectual property of these authors. This paper aims to enhance the detection of Arabic authorship attribution by extracting different features and fusing the outputs of two deep learning models. The dataset used in this study was collected from the weekly livestream and recorded Arabic sermons that are available publicly on the official website of Al-Haramain in Saudi Arabia. The acoustic, textual and stylometric features were extracted for five authors. Then, the data were pre-processed and fed into the deep learning-based models (CNN architecture and its pre-trained ResNet34). After that the hard and soft voting ensemble methods were applied for combining the outputs of the applied models and improve the overall performance. The experimental results showed that the use of CNN with textual data obtained an acceptable performance using all evaluation metrics. Then, the performance of ResNet34 model with acoustic features outperformed the other models and obtained the accuracy of 90.34%. Finally, the results showed that the soft voting ensemble method enhanced the performance of AAA and outperformed the other method in terms of accuracy and precision, which obtained 93.19% and 0.9311 respectively.
    Original languageEnglish
    Pages (from-to)41-51
    Number of pages11
    JournalInternational Journal of Advanced Computer Science and Applications
    Volume14
    Issue number7
    DOIs
    Publication statusPublished (VoR) - 31 Jul 2023

    Funding

    This research is funded by the Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia through project number IFP-IMSIU202106.

    FundersFunder number
    Deputyship for Research & Innovation
    Ministry of Education in Saudi ArabiaIFP-IMSIU202106

      Keywords

      • Authorship attribution
      • acoustic features
      • fusion approach
      • Deep learning
      • CNN
      • ResNet34

      Fingerprint

      Dive into the research topics of 'Deep Learning-based Method for Enhancing the Detection of Arabic Authorship Attribution using Acoustic and Textual-based Features'. Together they form a unique fingerprint.

      Cite this