Since its inception, R has become one of the preeminent programs for statistical computing and data analysis. The ready availability of the program, along with a wide variety of packages and the supportive R community make R an excellent choice for almost any kind of computing task related to stat
Data Manipulation with R
โ Scribed by Statistical Computing Facility Phil Spector (auth.)
- Publisher
- Springer-Verlag New York
- Year
- 2008
- Tongue
- English
- Leaves
- 157
- Series
- Use R!
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Since its inception, R has become one of the preeminent programs for statistical computing and data analysis. The ready availability of the program, along with a wide variety of packages and the supportive R community make R an excellent choice for almost any kind of computing task related to statistics. However, many users, especially those with experience in other languages, do not take advantage of the full power of R. Because of the nature of R, solutions that make sense in other languages may not be very efficient in R. This book presents a wide array of methods applicable for reading data into R, and efficiently manipulating that data.
In addition to the built-in functions, a number of readily available packages from CRAN (the Comprehensive R Archive Network) are also covered. All of the methods presented take advantage of the core features of R: vectorization, efficient use of subscripting, and the proper use of the varied functions in R that are provided for common data management tasks.
Most experienced R users discover that, especially when working with large data sets, it may be helpful to use other programs, notably databases, in conjunction with R. Accordingly, the use of databases in R is covered in detail, along with methods for extracting data from spreadsheets and datasets created by other programs. Character manipulation, while sometimes overlooked within R, is also covered in detail, allowing problems that are traditionally solved by scripting languages to be carried out entirely within R. For users with experience in other languages, guidelines for the effective use of programming constructs like loops are provided. Since many statistical modeling and graphics functions need their data presented in a data frame, techniques for converting the output of commonly used functions to data frames are provided throughout the book.
Using a variety of examples based on data sets included with R, along with easily simulated data sets, the book is recommended to anyone using R who wishes to advance from simple examples to practical real-life data manipulation solutions.
Phil Spector is Applications Manager of the Statistical Computing Facility and Adjunct Professor in the Department of Statistics at University of California, Berkeley.
โฆ Table of Contents
Front Matter....Pages i-ix
Data in R....Pages 1-11
Reading and Writing Data....Pages 13-41
R and Databases....Pages 43-56
Dates....Pages 57-66
Factors....Pages 67-74
Subscripting....Pages 75-85
Character Manipulation....Pages 87-99
Data Aggregation....Pages 101-130
Reshaping Data....Pages 131-147
Back Matter....Pages 149-152
โฆ Subjects
Statistics and Computing/Statistics Programs
๐ SIMILAR VOLUMES
One of the most important aspects of computing with data is the ability to manipulate it to enable subsequent analysis and visualization. R offers a wide range of tools for this purpose. Data from any source, be it flat files or databases, can be loaded into R and this will allow you to manipulate d
This book is for all those who wish to learn about data manipulation from scratch and excel at aggregating data effectively. It is expected that you have basic knowledge of R and have previously done some basic administration work with R.</div> <br> Abstract: This book is for all those w
<b>Perform groupwise data manipulation and deal with large datasets using R efficiently and effectively</b><h2>About This Book</h2><ul><li>Perform factor manipulation and string processing</li> <li>Learn group-wise data manipulation using plyr</li> <li>Handle large datasets, interact with database s
Since its inception, R has become one of the preeminent programs for statistical computing and data analysis. The ready availability of the program, along with a wide variety of packages and the supportive R community make R an excellent choice for almost any kind of computing task related to statis
Since its inception, R has become one of the preeminent programs for statistical computing and data analysis. The ready availability of the program, along with a wide variety of packages and the supportive R community make R an excellent choice for almost any kind of computing task related to stat