Evaluating HISAT2, Salmon and Kallisto alignment using Checkpoint Blockade treated CT26 Experimental Data
Cancer immunotherapy and checkpoint blockade therapy specifically have recently emerged as a big part of the current arsenal of treatments available for cancer patients. (Korman et al., 2006)(Hargadon et al., 2018) By targeting the immmunosuppressive microenvironment through PD-L1 and CTLA-4 blockade, this class of therapy re-activates exhausted effector T-cells ultimately driving tumor regression.(Wei et al., 2018) Today, this therapeutic modality is no longer just a monotherapy. It has extended into combination therapies with small-molecules (Manni et al., 2018), other antibodies (Taylor et al., 2019), and even adoptive T-cell transfer. (Grosser et al., 2019) The biggest issue this therapy faces is the lack of understanding on who are most suited to receiving the treatment. Since this type of treatment relies on the presence of tumor infiltrating lymphocytes (TIL), the initial assumption was that the presence of TILs was sufficient for success, but that is not the case. More often than not patients that are supposed to respond to thsi theraputic stratergy will relapse despite the presesce of TILs. (Liu et al., 2019)
Taylor et al, tackles this through a better understanding of mouse models that can assess the efficacy of different combination therapies. In their study they sequenced CT26 tumors that were treated with or without checkpoint inhibitor blockades on day 7 and day 14 post treatment among a variety of other benchmarks. (Taylor et al., 2019) Although they went on to assess the status of different immune cells and pathway analysis as part of their characterization, this is not something replicated here. This project will use the day 7 isotype vs treatment data and assess the differentially expressed genes. They had 5 biological replicates for each group, so this analysis was performed on 10 samples total.
Quantifying differential gene expression is on of the most popular uses for RNA-seq analysis. The method of quantification, however, can vary vastly with the multitude of options currently available. The tools that perform initial quantification can be divided into two main classes, aligners (STAR, HISAT2) and psudoaligners (Salmon, Kallisto). If you chose an aligner for this first step you will need to use a counter to quantify this alignment (Stringtie, HTSeq-count, featureCounts), something a psudoaligners does automatically. At the next stage of analysis, you can use a variety of tools to perform differential gene expression analysis (edgeR, DESeq2).
Each combination of tools has its own set of advantages and disadvantages and depending on the biological question being investigated there may be multiple paths that can provide the optimal answers. For example, if your goal were to find novel transcripts or splice variations, kallisto or salmon would not be an appropriate start to your pipeline since they discard all reads that do not match a reference transcriptome. In terms of biology, the accuracy of counts is the most important since this is the metric used to determine the specific differences between whatever condition you are testing.
In this project three different aligners/psudoaligners specifically, HISAT2, Kallisto and Salmon will be compared. We will compare them based on the correlation of the following criteria:
- Differentially expressed genes
- Abundance and raw counts between kallisto and salmon
- Calculated log2 fold change and adjusted p-value based on DESeq2 output
The methods section of this paper was not very specific on the settings and thresholds used when running their analysis. For this reason, most settings were left at the default level. The general tools used by the paper include Salmon, DESeq2 without any trimming. Here these basic things have also been followed. In addition, there was no gene list provided by the authors that detailed what genes they found as differentially expressed, so there was no way to compare their specific output to what was found here. They did provide overall numbers for significant genes, so that was used as a guide for what to expect from our analysis.
We will be exploring three pipelines in this project. All the data was initially put through FASTQC
to assess the quality of the run. We then processed the .fastq
files.
The Hierarchical Indexing for Spliced Alignment of Transcripts 2 (Kim et al., 2019) or HISAT2 route, as mentioned earlier, is an aligner that is splice aware and maps reads to a reference genome. It is based on the Ferragina Manzini index (FMI) that is the bases for bowtie2. This method takes .fastq
data and aligns reads to the FMI for a given reference and generates a .sam
file that tabulates the location of each read on each of the chromosomes. We then take this and use Samtools
to convert this data into sorted and filtered .bam
files that we count using the featureCounts
function within the Rsubread
Bioconductor package. The count data from this is used within DESeq2
to generate log2FoldChange
and padj
values for comparisons between the control and treatment group on day 7.
The other two methods used to generate counts are both psudoaligners. Salmon is one of the most popular methods used to quickly align and quantify transcript abundance in RNA-sequencing. (Patro et al., 2017) Unlike HISAT2 it aligns raw . fastq
reads to the index of a supplied transcriptome using quasi-mapping and followed by a quantification of abundance. It can quantify . bam
files but this is not a functionality used in this project. Counts generate here are once again supplied to DEseq2 for log2FoldChange
and padj
values for comparisons between the control and treatment group on day 7.
Kallisto is another psudoaligner that was used here to quantify abundance. (Bray et al., 2016) Similar to Salmon, it does not look for the specific coordinate position where the read maps to the genome. It maps to the transcriptome and quantifies abundance that are used within DEseq2. The motivation to choose these methods was to compare whether there are significant similarities or differences in what is called significant depending on the method used.
All comparisons between pipeline were assessed on the R^2 value between comparable metrics. The goal of this exercise was to determine what was similar and what was different between the different counting methods. In some instances, it was not possible to compare certain information because that parameter was not generated as part of the count process. This was not as much of a problem when it came to DESeq2 outputs since at that point everything was once again standardized. Unlike Taylor et al. we do not go into pathway level analysis or anything beyond basic DGE. The crucial comparison at the end is the number of similar genes that were significantly differentialy expressed (i.e., log2FoldChange > 1 and padj < 0.05) across the three count methods employed.
The first comparison made was between the raw counts and abundance generated by Kallisto and Salmon after the tximport
prior to DGE analysis. Since these pseudoaligners are most similar in alignment method, but use slightly different methods of quantification, here the result of this difference was explored. For the sake of simplicity, biological replicates were collapsed into their respective groups prior to plotting.
In all the four samples compared here (Fig. 6 and 7), count data was very consistent with the R^2 above 0.98 across all samples. This can be accounts for from the similarity in the alignment mechanism between the two methods. The abundance values were also quite consistent with R^2 values above 0.80. This might indicate slight differences in how the two software handled the counts downstream to calculate transcripts per million (TMP) values.
The next comparison between all three methods was done with a Ratio intensity or MA plot. (Fig. 8) As a general trend across the three methods, there seems to be broad consensus that the majority of significant genes are downregulated. The noise seems to be empirically higher in HISAT2 data. This might be down to the specific settings used to run featureCounts
.
This trend is also seen when we look that the common genes that are commonly up or down regulated between the three methods in Fig. 9 and 10, with the majority being down regulated (368 common genes across all three methods)
Overall, there seems to be more overlap in genes selected by Salmon and Kallisto vs these two methods and HISAT2. With over 200 genes in the HISAT2 category that do not seem to be present in the Salmon and Kallisto set.
When the DEG common to all three analyses were compared for the calculated log2 fold change (log2FC) and adjusted p-values (padj), there was interesting trend where the log2FC was highly constant across genes (R^2 > 0.95), but the padj for the same genes were all under 0.05 but varied greatly in absolute value (R^2 ~ 0.4 - 0.7). (Fig 11 and 12)
When the top 20 genes that are selected by HISAT2 but not by the other two aligners are compared, you see a common trend where the log2FC is mostly similar across the board, but it is the adjusted p-value that caused the exclusion of these genes. In some cases (ENSMUSG00000039252) the adjusted p-value for Salmon and Kallisto was NA, for a positive log2FC but for HISAT2 the gene was significant and downregulated.
Gene ID | log2FC.kal | padj.kal | log2FC.sal | padj.sal | log2FC.hisat2 | padj.hisat2 |
---|---|---|---|---|---|---|
ENSMUSG00000020658 | -1.356535 | 0.53097689 | -1.357175 | 6.111774e-01 | -1.000226 | 0.029095355 |
ENSMUSG00000015709 | -34.300029 | NA | -33.589563 | 2.890000e-12 | -1.014931 | 0.012030225 |
ENSMUSG00000024044 | -7.558069 | 0.14659854 | -6.668639 | NA | -1.023715 | 0.007608310 |
ENSMUSG00000030854 | -1.167033 | 0.16784485 | -1.218756 | 2.873419e-01 | -1.052755 | 0.038806329 |
ENSMUSG00000053113 | -1.010064 | 0.00343756 | -1.192120 | 6.746481e-02 | -1.075291 | 0.001207374 |
ENSMUSG00000026770 | -1.077229 | 0.06709004 | -1.061079 | 6.178079e-02 | -1.075449 | 0.047186361 |
ENSMUSG00000037709 | -1.158644 | 0.09655698 | -1.320613 | 5.427227e-02 | -1.078484 | 0.026103079 |
ENSMUSG00000024810 | -1.082532 | 0.06440562 | -1.055606 | 4.521379e-02 | -1.102773 | 0.021926060 |
ENSMUSG00000097113 | -1.017096 | 0.07880397 | -1.018722 | 6.474304e-02 | -1.107354 | 0.020556920 |
ENSMUSG00000039252 | 2.394058 | NA | 3.183328 | NA | -1.111985 | 0.026702981 |
ENSMUSG00000035448 | -1.130945 | 0.06308394 | -1.035933 | 9.499252e-02 | -1.120976 | 0.035099793 |
ENSMUSG00000080848 | -1.429144 | 0.70390475 | -1.608505 | NA | -1.124491 | 0.035932036 |
ENSMUSG00000021675 | -1.010883 | 0.10962906 | -1.132175 | 7.698013e-02 | -1.127357 | 0.046448316 |
ENSMUSG00000090084 | -1.050869 | 0.06038621 | -1.160448 | 3.970894e-02 | -1.135746 | 0.028466701 |
ENSMUSG00000096929 | -1.251021 | 0.22166135 | -1.018699 | 2.904156e-01 | -1.139663 | 0.029137355 |
ENSMUSG00000079033 | -1.312127 | 0.07277804 | -1.167783 | 2.159020e-01 | -1.149691 | 0.029944489 |
ENSMUSG00000028713 | -1.122202 | 0.07060479 | -1.141025 | 6.148902e-02 | -1.152508 | 0.029090021 |
ENSMUSG00000031442 | -1.153959 | 0.04717234 | -1.051022 | 1.388943e-01 | -1.153654 | 0.004888088 |
ENSMUSG00000022144 | -1.092337 | 0.02166857 | -1.927377 | NA | -1.155644 | 0.007176521 |
ENSMUSG00000038112 | -1.124223 | 0.12558901 | -1.137111 | 1.721675e-01 | -1.168819 | 0.032574158 |
In this project, we evaluated the differences in what genes were identified as differentially expressed based on the use of Salmon, Kallisto or HISAT2. The data indicated a strong correlation based on raw counts between the two psudoaligners, Salmon and Kallisto, and log2FolfChange values for genes that were found to be significant across all three methods. There were slightdifferences in abundance values when Salmon and Kallisto were compared, but there was an overall closer consensus between the psudoaligners on what is differentially expressed. Some differences may be present because Salmon accounts for sample specific biases throughout a run unlike Kallisto and uses a different quantification pipeline. HISAT2 may be identifying more genes over all because it is better at identifying genes that are lowly expressed compared to the two psudoaligners (Wu et al., 2018) leading to the overall higher number in DEG in the HISAT2 group. Depending on the biological question at hand this may or may not be of importance when choosing one method over the other. Ultimately repetiton of this analysis most be done with simulated dataset to have a ‘true’ value to compare against.
All data and full analysis can be found at: github.com/sk7-dotcom/Evaluating_RNAseq_aligners
Methods | Count Generation | DESeq2 pipeline | Comparisons
Taylor, M. A. et al. Longitudinal immune characterization of syngeneic tumor models to enable model selection for immune oncology drug discovery. J Immunother Cancer 7, 328 (2019).
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol 37, 907–915 (2019).
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods 14, 417–419 (2017).
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol 34, 525–527 (2016).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
Wu, D. C., Yao, J., Ho, K. S., Lambowitz, A. M. & Wilke, C. O. Limitations of alignment-free tools in total RNA-seq quantification. Bmc Genomics 19, 510 (2018).
Wei, S. C., Duffy, C. R. & Allison, J. P. Fundamental Mechanisms of Immune Checkpoint Blockade Therapy. Cancer Discov 8, 1069–1086 (2018).
Manni, W., Liu, Y., Cheng, Y., Xiawei, W. & Yuquan, W. Immune checkpoint blockade and its combination therapy with small-molecule inhibitors for cancer treatment. Biochimica Et Biophysica Acta Bba - Rev Cancer 1871, 199–224 (2018).
Liu, D., Jenkins, R. W. & Sullivan, R. J. Mechanisms of Resistance to Immune Checkpoint Blockade. Am J Clin Dermatol 20, 41–54 (2019).
Hargadon, K. M., Johnson, C. E. & Williams, C. J. Immune checkpoint blockade therapy for cancer: An overview of FDA-approved immune checkpoint inhibitors. Int Immunopharmacol 62, 29–39 (2018).
Korman, A. J., Peggs, K. S. & Allison, J. P. Checkpoint Blockade in Cancer Immunotherapy. Adv Immunol 90, 297–339 (2006).
Grosser, R., Cherkassky, L., Chintala, N. & Adusumilli, P. S. Combination Immunotherapy with CAR T Cells and Checkpoint Blockade for the Treatment of Solid Tumors. Cancer Cell 36, 471–482 (2019).