-
Notifications
You must be signed in to change notification settings - Fork 0
Downstream analysis
After running ACES, there are multiple files that will be generated that can be used in various downstream analyses. For instance the .GFA
file can be viewed in Bandage (https://rrwick.github.io/Bandage/) which can give insights about the conservation of your element of interest. For an example of this see Figure 1 of the ACES manuscript.
A second analysis that can be done is using PHAST (http://compgen.cshl.edu/phast/) to identify conserved elements within the alignment file. To do so please run the following steps after installing PHAST:
- Estimate a neutral model from the alignment
phyloFit --EM --tree <ACES_raxml_tree>.phy -out-root <file_prefix> <ACES MSA ouput>
- Generate conserved and nonconserved models from the data
phastCons --estimate-trees <file_prefix> <file_prefix>.mod --no-post-probs
- Identify conserved elements using the onserved and nonconserved models
phastCons <ACES MSA ouput> <file_prefix>.cons.mod,<file_prefix>.noncons.mod > <file_prefix>_scores.bed
The final file, <file_prefix>_scores.bed
, will contain the locations of conserved elements within you sequence of interest and the log odds score for that sequence being a conserved element. For detailed usage on phastCons see http://compgen.cshl.edu/phast/phastCons-HOWTO.html, this analysis here is based off option 2 from this page and estimates all parameters for the models from the data.