Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to sign up for SAGE Journal Email Alerts today!

Sign In to gain access to subscriptions and/or personal tools.
Social Science Computer Review
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Perrin, A. J.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Other

The CodeRead System

Using Natural Language Processing to Automate Coding of Qualitative Data

Andrew J. Perrin

University of North Carolina, Chapel Hill

Most social science research uses data that originate, in one form or another, as written or spoken text. Quantitative researchers code these data very strictly, categorizing answers to questions into fixed groups. In contrast, qualitative researchers typically code free-form text by marking it up according to a set of ideas about the nature and content of the text. This article suggests the use of some elementary techniques from the field of statistical natural language processing to partially automate the process of coding large quantities of free-form textual data. The article presents CodeRead, a set of tools that implement these techniques. The system’s principal innovation is its ability to generate coding rules from a precoded sample of text. This capacity allows for the analysis of much longer textual data than was previously practical. It also insures that the rules used for coding such data are specific and uniformly applied.

Key Words: coding • content analysis • text analysis • automated coding

Social Science Computer Review, Vol. 19, No. 2, 213-220 (2001)
DOI: 10.1177/089443930101900207


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?