When working with new datasets it is helpful to know what type of data is included and easily retype the columns and provide new column names. This function identifies the type of data in each column and provides unique examples data within each column. The results are written to an Excel workbook. The new column names and re-defined column types can easily be added to the Excel workbook, imported into R, and assigned to the original dataset. The goal is to reduce the logistical burden of the user.
This function will likely evolve overtime.
Re-running this command will overwrite previous versions of the file!!
Usage
dataset.summary(
dataset,
ExcelFileName,
n.examples = 4,
overwriteXLS = FALSE,
group.same.cols = TRUE
)
Arguments
- dataset
tibble
ordata.frame
of interest- ExcelFileName
string indicating the Excel workbook filename. The value is passed to
WriteXLS::WriteXLS()
. Re-running this command will overwrite previous versions of the file!!- n.examples
integer value indicating the number examples to return. Passed to
size
ofextract.unique()
.- overwriteXLS
logical to overwrite existing Excel workbook; default is
FALSE
- group.same.cols
logical indicating if the columns should be grouped by those with the same information.
Author
Emilio Xavier Esposito emilio.esposito@gmail.com (https://github.com/emilioxavier)