-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Updating docstring and setting up sqanti rules
- Loading branch information
1 parent
caab797
commit 5eb3ede
Showing
2 changed files
with
47 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,46 @@ | ||
# Sqanti related quality-control and filtering rules, | ||
# Sqanti is being used to annotate/characterize novel | ||
# isoforms and to build an even higher-confidence, | ||
# filtered set of unique transcripts from flair. | ||
# The resulting annotation/transcriptome will be | ||
# used to quantify known/novel isoforms. | ||
rule sqanti_qc: | ||
""" | ||
Data-processing step to characterize the input transcriptome | ||
by computing a series of attributes by transcript, which are | ||
written to the classification file, and a series of attributes | ||
by junction, which are written to the junctions file. Please | ||
note although we are running SQANTI3, the actual version of | ||
the tool we are using is 'v5.1.2'. For more information, | ||
please read through sqanti3's documenation: | ||
https://github.com/ConesaLab/SQANTI3/wiki/ | ||
Github: https://github.com/ConesaLab/SQANTI3 | ||
@Input: | ||
High-confidence Isoforms (FASTA) from flair collapse | ||
@Output: | ||
Sqanti Classification file (TSV), | ||
Corrected Annotation (GTF), | ||
Corrected Transcriptome (FASTA) | ||
""" | ||
pass | ||
|
||
|
||
rule sqanti_ml_filter: | ||
""" | ||
Data-processing step to filter the sqanti qc output. The auhtor | ||
from sqanti highly recommends filtering its output before using | ||
it in down-stream analysis. Sqanti has a new filtering method | ||
that employs random forest to discriminate potential artifacts | ||
from true isoforms without the need for user-defined rules or | ||
manually-set thresholds (i.e. previous method). For more info, | ||
please read through sqanti3's documenation: | ||
https://github.com/ConesaLab/SQANTI3/wiki/ | ||
Github: https://github.com/ConesaLab/SQANTI3 | ||
@Input: | ||
Sqanti Classification file (TSV) | ||
@Output: | ||
ML Filtered Sqanti Classification file (TSV), | ||
ML Filtered Corrected Annotation (GTF), | ||
ML Filtered Corrected Transcriptome (FASTA) | ||
""" | ||
pass |