This book provides a linguist with a statistical toolkit for exploration and analysis of linguistic data. It employs R, a free software environment for statistical computing, which is increasingly popular among linguists. How to do Linguistics with R: Data exploration and statistical analysis
Programming for Corpus Linguistics: How to Do Text Analysis with Java
Scribed by Oliver Mason
- Publisher
- Edinburgh University Press
- Year
- 2022
- Tongue
- English
- Leaves
- 250
- Category
- Library
Synopsis
The ability to program a computer has become increasingly important in work that involves corpora. Specialised research needs can no longer be met by available software, and purchasing customised programs is usually not an option. This book enables the researcher to write programs for text and corpus processing. Useful techniques are illustrated with the popular programming language Java, which is very well suited for handling textual data, and at the same time easy to learn.
Key Features
- a general introduction to programming for readers with a linguistic background
- a practical introduction to corpus linguistics for readers with a programming background who are new to corpus processing
- a guide to relevant aspects of Java which will be useful for text processing
- a variety of sample programs which are in themselves useful tools for corpus research.
SIMILAR VOLUMES
Paradoxically, doing corpus linguistics is both easier and harder than it has ever been before. On the one hand, it is easier because we have access to more existing corpora, more corpus analysis software tools, and more statistical methods than ever before. On the other hand, reliance on these exis
The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistic
The rapidly growing volume of digital natural language text and the complexity of data abstracted from it have increasingly rendered traditional corpus linguistic analytical methodology obsolete. This book describes a cluster analytic methodology for generating linguistic hypotheses on the basis of