Create your own natural language training corpus for machine learning. This example-driven book walks you through the annotation cycle, from selecting an annotation task and creating the annotation specification to designing the guidelines, creating a "gold standard" corpus, and then beginning the a
Natural Language Annotation for Machine Learning: A guide to corpus-building for applications
β Scribed by James Pustejovsky, Amber Stubbs
- Publisher
- O'Reilly Media
- Year
- 2012
- Tongue
- English
- Leaves
- 350
- Edition
- Early Release
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Create your own natural language training corpus for machine learning. This example-driven book walks you through the annotation cycle, from selecting an annotation task and creating the annotation specification to designing the guidelines, creating a "gold standard" corpus, and then beginning the actual data creation with the annotation process.
Systems exist for analyzing existing corpora, but making a new corpus can be extremely complex. To help you build a foundation for your own machine learning goals, this easy-to-use guide includes case studies that demonstrate four different annotation tasks in detail. Youβll also learn how to use a lightweight software package for annotating texts and adjudicating the annotations.
This book is a perfect companion to O'Reillyβs Natural Language Processing with Python, which describes how to use existing corpora with the Natural Language Toolkit.
π SIMILAR VOLUMES
Create your own natural language training corpus for machine learning. This example-driven book walks you through the annotation cycle, from selecting an annotation task and creating the annotation specification to designing the guidelines, creating a "gold standard" corpus, and then begin
<DIV><p>Create your own natural language training corpus for machine learning. Whether youβre working with English, Chinese, or any other natural language, this hands-on book guides you through a proven annotation development cycleβthe process of adding metadata to your training corpus to help ML al
<DIV><p>Create your own natural language training corpus for machine learning. Whether youβre working with English, Chinese, or any other natural language, this hands-on book guides you through a proven annotation development cycleβthe process of adding metadata to your training corpus to help ML al