autocodebook: Automatic Codebook and Tracking for 'Spark' and 'dplyr'
Pipelines
Wraps 'dplyr' verbs (mutate, summarise, filter) to automatically
capture variable metadata (type, source columns, categories, and source
code), producing a codebook and eligibility tracking table with zero
manual documentation. Works with both 'sparklyr' (tbl_spark) and local
data frames. Adds big-data optimizations (caching, assume-unique
counting, checkpointing) and a standardized report module with an
eligibility flowchart, editable codebook export (HTML, DOCX, XLSX), and
cross-sectional or longitudinal variable inspection. The eligibility
flowchart follows the CONSORT statement (Schulz, Altman and Moher (2010)
<doi:10.1136/bmj.c332>) and the reporting of observational cohort studies
follows the STROBE recommendations (von Elm and others (2007)
<doi:10.1371/journal.pmed.0040296>).
| Version: |
0.1.0 |
| Imports: |
dplyr (≥ 1.1.0), rlang (≥ 1.0.0), tibble, gt, grid |
| Suggests: |
sparklyr, dbplyr, testthat (≥ 3.0.0), tidyplots, ggplot2, patchwork, rmarkdown, knitr, officer, flextable, openxlsx, scales, rvg, devEMF, svglite |
| Published: |
2026-06-08 |
| DOI: |
10.32614/CRAN.package.autocodebook (may not be active yet) |
| Author: |
Patricia Fortes C. de Macedo [aut, cre] |
| Maintainer: |
Patricia Fortes C. de Macedo <macedopatriciafortes at gmail.com> |
| BugReports: |
https://github.com/patriciafortesm/autocodebook/issues |
| License: |
MIT + file LICENSE |
| URL: |
https://github.com/patriciafortesm/autocodebook |
| NeedsCompilation: |
no |
| Materials: |
README, NEWS |
| CRAN checks: |
autocodebook results |
Documentation:
Downloads:
Linking:
Please use the canonical form
https://CRAN.R-project.org/package=autocodebook
to link to this page.