𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Data Mining Methods for the Content Analyst: An Introduction to the Computational Analysis of Content

✍ Scribed by Kalev Leetaru


Publisher
Routledge
Year
2011
Tongue
English
Leaves
121
Series
Routledge Communication Series
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


With continuous advancements and an increase in user popularity, data mining technologies serve as an invaluable resource for researchers across a wide range of disciplines in the humanities and social sciences. In this comprehensive guide, author and research scientist Kalev Leetaru introduces the approaches, strategies, and methodologies of current data mining techniques, offering insights for new and experienced users alike.

Designed as an instructive reference to computer-based analysis approaches, each chapter of this resource explains a set of core concepts and analytical data mining strategies, along with detailed examples and steps relating to current data mining practices. Every technique is considered with regard to context, theory of operation and methodological concerns, and focuses on the capabilities and strengths relating to these technologies. In addressing critical methodologies and approaches to automated analytical techniques, this work provides an essential overview to a broad innovative field.

✦ Table of Contents


DATA MINING METHODS FOR THE CONTENT ANALYST An Introduction to the Computational Analysis of Content
Copyright
Contents
List of Tables and Figures
Acknowledgments
1 Introduction
What Is Content Analysis?
Why Use Computerized Analysis Techniques?
Standalone Tools or Integrated Suites
Transitioning from Theory to Practice
Chapter in Summary
2 Obtaining and Preparing Data
Collecting Data from Digital Text Repositories
Are the Data Meaningful?
Using Data in Unintended Ways
Analytical Resolution
Types of Data Sources
Finding Sources
Searching Text Collections
Sources of Incompleteness
Licensing Restrictions and Content Blackouts
Measuring Viewership
Accuracy and Convenience Samples
Random Samples
Multimedia Content
Converting to Textual Format
Prosody
Example Data Sources
Patterns in Historical War Coverage
Competitive Intelligence
Global News Coverage
Downloading Content
Digital Content
Print Content
Preparing Content
Document Extraction
Cleaning
Post Filtering
Reforming/Reshaping
Content Proxy Extraction
Chapter in Summary
3 Vocabulary Analysis
The Basics
Word Histograms
Readability Indexes
Normative Comparison
Non-word Analysis
Colloquialisms: Abbreviations and Slang
Restricting the Analytical Window
Vocabulary Comparison and Evolution/Chronemics
Advanced Topics
Syllables, Rhyming, and β€œSounds Like”
Gender and Language
Authorship Attribution
Word Morphology, Stemming, and Lemmatization
Chapter in Summary
4 Correlation and Co-occurrence
Understanding Correlation
Computing Word Correlations
Directionality
Concordance
Co-occurrence and Search
Language Variation and Lexicons
Non-co-occurrence
Correlation with Metadata
Chapter in Summary
5 Lexicons, Entity Extraction, and Geocoding
Lexicons
Lexicons and Categorization
Lexical Correlation
Lexicon Consistency Checks
Thesauri and Vocabulary Expanders
Named Entity Extraction
Lexicons and Processing
Applications
Geocoding, Gazetteers, and Spatial Analysis
Geocoding
Gazetteers and the Geocoding Process
Operating Under Uncertainty
Spatial Analysis
Chapter in Summary
6 Topic Extraction
How Machines Process Text
Unstructured Text
Extracting Meaning from Text
Applications of Topic Extraction
Comparing/Clustering Documents
Automatic Summarization
Automatic Keyword Generation
Multilingual Analysis: Topic Extraction with Multiple Languages
Chapter in Summary
7 Sentiment Analysis
Examining Emotions
Evolution
Evaluation
Analytical Resolution: Documents versus Objects
Hand-crafted versus Automatically Generated Lexicons
Other Sentiment Scales
Limitations
Measuring Language Rather Than Worldview
Chapter in Summary
8 Similarity, Categorization and Clustering
Categorization
The Vector Space Model
Feature Selection
Feature Reduction
Learning Algorithm
Evaluating ATC Results
Benefi ts of ATC over Human Categorization
Limitations of ATC
Applications of ATC
Clustering
Automated Clustering
Hierarchical Clustering
Partitional Clustering
Document Similarity
Vector Space Model
Contingency Tables
Chapter in Summary
9 Network Analysis
Understanding Network Analysis
Network Content Analysis
Representing Network Data
Constructing the Network
Network Structure
The Triad Census
Network Evolution
Visualization and Clustering
Chapter in Summary
References
Index


πŸ“œ SIMILAR VOLUMES


Spatial Context: An Introduction to Fund
✍ Christopher Gold πŸ“‚ Library πŸ“… 2016 πŸ› CRC Press 🌐 English

<P>Many disciplines are concerned with manipulating geometric (or spatial) objects in the computer – such as geology, cartography, computer aided design (CAD), etc. – and each of these have developed their own data structures and techniques, often independently. Nevertheless, in many cases the objec

Content Analysis: An Introduction to Its
✍ Dr. Klaus H. Krippendorff πŸ“‚ Library πŸ“… 2003 🌐 English

The Second Edition of Content Analysis: An Introduction to Its Methodology is a definitive sourcebook of the history and core principles of content analysis as well as an essential resource for present and future studies. The book introduces readers to ways of analyzing meaningful matter such as

Content Analysis: An Introduction to Its
✍ Klaus Krippendorff πŸ“‚ Library πŸ“… 2018 πŸ› SAGE Publications 🌐 English

Since the first edition published in 1980, <i>Content Analysis</i> has helped shape and define the field. In the highly anticipatedΒ Fourth Edition, award-winning scholar and author Klaus Krippendorff introduces you to the most current method of analyzing the textual fabric of contemporary society. S

Metaverse: Concept, Content and Context
✍ Shenghui Cheng πŸ“‚ Library πŸ“… 2023 πŸ› Springer 🌐 English

<p><span>The metaverse, a hybrid society of the real and the virtual is attracting significant attention from academia to industry and is starting to change the world. Composed of ten chapters, this book introduces the metaverse from three aspects – concept, content and context. It starts with numer

Metaverse: Concept, Content and Context
✍ Shenghui Cheng πŸ“‚ Library πŸ“… 2023 πŸ› Springer Nature 🌐 English

The metaverse, a hybrid society of the real and the virtual is attracting significant attention from academia to industry and is starting to change the world. Composed of ten chapters, this book introduces the metaverse from three aspects – concept, content and context. It starts with numerous conce