Link Search Menu Expand Document

Manifest Types

A manifest is a file provided by a Stage 1 pipeline or data provider in a project zip that enumerates a related set of data samples. Scripts in the manifestTypes folder:

  • declare expected metadata in a specific type of manifest file
  • provide handler functions used by sourceFileUpload and related modules

All manifestTypes, e.g., manifestTypes$xyz, must be a list with members:

manifestTypes$xyz <- list(
    patterns = character(),
    load = function(file) ...,
    parse = function(data.frame) ...
)

where:

  • $patterns = file suffixes for manifest files that match the manifestType
  • $load = a function to read the manifest file from disk
  • $parse = a function that takes the data frame from ‘load’ and returns a list of two data frames:
    • $manifest = all input rows, potentially with modified column names/data types
    • $unique = one row per unique sample, often just a repeat of $manifest

In addition to any desired sample value columns, the parsed manifest data frames must ALWAYS have Project, Sample_ID, and Description columns - if necessary, create and fill those columns with a constant value (e.g., ‘all_samples’ or ‘NA’). ‘Project’ and ‘Sample_ID’ are concatenated to create unique sample identifiers, while ‘Description’ is a short, human readable, sample name that is the default value for the editable sample name shown in the UI.

Data frame $parsed$unique may optionally have Yield and Quality columns, when appropriate (they will be displayed as NA if absent).