R Dataset Guide
2023-04-19
Chapter 1 Introduction
1.1 Guide to tables
The tables in the following chapters provide detailed information about datasets in R packages. The complete list of output columns is as follows. Columns not initially visible can be viewed by clicking the Column Visibility
button.
Output columns |
---|
package name of package (optional)
|
name name of dataset
|
nr_or_len number of rows or length() (whichever is not NULL)
|
nc number of columns
|
add_dim additional dimensions (>= 3, such as for tables)
|
first_class first class listed
|
n_cols number of numeric columns
|
i_cols number of integer columns
|
f_cols number of factor columns
|
c_cols number of character columns
|
d_cols number of date columns
|
other_cols number of other columns
|
missing proportion of missing values overall
|
allclasses full list of classes (optional)
|
Table columns can be sorted and filtered. Dataset names are linked to documentation if available. Datasets without links are usually included in documentation for a dataset in the same package with a similar name. For example, documentation for alr4::BGSboys
and alr4::BGSgirls
is included on the help page for alr4::BGSall
.
1.2 Exploring data in packages locally
The main function used here is data_xray()
from the datacat package. You can explore datasets in R packages locally as follows.
Install datacat:
::install_github("jtr13/datacat") remotes
View information with:
library(datacat)
data_xray("ggplot2") %>% View()
1.3 Contributing
I welcome your suggestions on packages to add to this resource. You can either open an issue or add the package name to this file and create a pull request.
I have intentionally left out packages with nondescriptive dataset names such as ch5ex11
: AMCP, [Devore7]