Comparative study for Stylometric analysis techniques for authorship attribution
Scribed by Raafat, Maryam A. (author); El-Wakil, Rania Abdel-Fattah (author); Atia, Ayman (author)
- Publisher: IEEE
- Year: 2021
- Leaves: 6
- Category: Scientific
Synopsis
A text is a meaningful source of information. Capturing the right patterns in written text yields metrics for measuring and inferring how strongly a text is associated with a specific author. This research introduces a new feature that goes deeper into language structure: it attempts to differentiate stylistic variation among authors by the sentence structures each author uses. The study examined the effect of adding this feature to machine learning models. Author prediction improved when sentence structure was included as an additional feature: F1 scores increased by 0.3% with the feature alone, and by 5% when the data was also normalized.
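The synopsis does not say how the sentence-structure feature is computed. Purely as an illustration of the general idea, the sketch below derives a few simple per-text sentence statistics that could be appended to a stylometric feature vector. The function names, the naive sentence splitter, and the three chosen statistics are all assumptions made for this example, not the authors' method.

```python
import re
from statistics import mean


def split_sentences(text):
    # Naive splitter (an assumption): break after '.', '!' or '?' followed by whitespace.
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def sentence_structure_features(text):
    # Hypothetical feature set: average words and commas per sentence,
    # plus the share of sentences that end with a question mark.
    sentences = split_sentences(text)
    if not sentences:
        return {"avg_words": 0.0, "avg_commas": 0.0, "question_ratio": 0.0}
    word_counts = [len(s.split()) for s in sentences]
    comma_counts = [s.count(",") for s in sentences]
    questions = sum(1 for s in sentences if s.endswith("?"))
    return {
        "avg_words": mean(word_counts),
        "avg_commas": mean(comma_counts),
        "question_ratio": questions / len(sentences),
    }


sample = "Call me Ishmael. Why, you ask? Because the sea, vast and grey, was calling."
features = sentence_structure_features(sample)
print(features)
```

In a real attribution pipeline these values would be combined with other stylometric features before training a classifier; a proper parser would likely capture sentence structure better than these surface counts.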
SIMILAR VOLUMES
This book presents methods and approaches used to identify the true author of a doubtful document or text excerpt. It provides a broad introduction to all text categorization problems (like authorship attribution, psychological traits of the author, detecting fake news, etc.) grounded in styli
Brings together techniques for the design and analysis of comparative studies. Methods include multivariate matching, standardization and stratification, analysis of covariance, logit analysis, and log linear analysis. Quantitatively assesses techniques' effectiveness in reducing bias. Discusses hyp
The book first explores the cybersecurity's landscape and the inherent susceptibility of online communication system such as e-mail, chat conversation and social media in cybercrimes. Common sources and resources of digital crimes, their causes and effects together with the emerging threats fo
Authorship attribution, the science of inferring characteristics of the author from the characteristics of documents written by that author, is a problem with a long history and a wide range of application. It is an important problem not only in information retrieval but in many other disciplines as