Downloads pre-processed, subsampled datasets from GitHub Releases
for use in vignettes and tutorials. The datasets are attached to
GitHub releases via the piggyback package.
Usage
copro_download_data(
dataset = c("colon_d3", "colon_d9", "kidney", "organoid", "brain_merfish"),
destdir = NULL,
tag = "data-v1",
overwrite = FALSE
)Arguments
- dataset
Character string specifying which dataset to download. One of
"colon_d3","colon_d9","kidney","organoid", or"brain_merfish".- destdir
Directory to save the downloaded file. Defaults to a package-specific cache directory via
tools::R_user_dir().- tag
The GitHub release tag to download from. Defaults to
"data-v1".- overwrite
Logical; if
TRUE, re-download even if the file already exists locally. DefaultFALSE.
Details
Available datasets:
colon_d3Colon Day 3 organoid data (Epithelial, Fibroblast, Immune). Demonstrates cross-cell-type co-progression with orthogonal CCA axes.
colon_d9Colon Day 9 organoid data (multiple slides). Demonstrates multi-slide analysis and score transfer.
kidneyKidney seqFISH data (tubular and vascular cells). Demonstrates supervised/guided spatial gradient detection.
organoid72hr organoid culture (single cell type). Demonstrates within-cell-type spatial pattern detection.
brain_merfishBrain MERFISH data (D1/D2 neurons). Demonstrates two-cell-type co-progression.
The data files are hosted as GitHub Release assets and are typically 5–30 MB each. They are subsampled from the full datasets to allow fast vignette execution while preserving biological signal.
Examples
if (FALSE) { # \dontrun{
# Download and load the colon D3 dataset
path <- copro_download_data("colon_d3")
dat <- readRDS(path)
} # }