Data processing instructions are stored as a set of scripts under the analysis/scripts
folder. The following flowchart shows the relationship between scripts and data files, leading to the final packaged dataset:
The scripts are typically run indirectly via make, but instructions for running them individually can be found at the top of each script.
Data files are stored under analysis/raw, analysis/intermediate, and analysis/out. Most intermediate files are stored in SQLite format so that they can be easily queried by the viewer app.
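
Because the intermediate files are SQLite databases, they can also be inspected outside the viewer app. The following is a minimal sketch using Python's standard sqlite3 module. The helper name list_tables and the file name in the usage comment are hypothetical, since the actual file and table names are not listed here.

```python
import sqlite3
from pathlib import Path


def list_tables(db_path):
    """Return the names of all tables in an SQLite file, sorted alphabetically."""
    # Open the file read-only so that inspecting it can never modify it.
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [name for (name,) in rows]


# Hypothetical usage; substitute a real file from analysis/intermediate:
# print(list_tables("analysis/intermediate/example.sqlite"))
```

Opening the database in read-only mode is a deliberate choice in this sketch: it avoids accidentally changing a file that a later pipeline step or the viewer app depends on.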