Epitranscriptomics

A high-throughput analysis pipeline for RNA modifications

Introduction

Post-transcriptional mRNA modifications play substantial roles in regulating biological processes in plants. This workflow classifies, characterizes, and compares a variety of RNA modifications identified by the HAMR workflow from RNA-sequencing data.

General analysis of each dataset

Rationale

Each raw dataset is processed with the same parameters to characterize the distribution patterns and other genomic features of the different types of modifications. Five analyses are performed for each dataset, as follows:

Genomic annotation of modifications based on the gene annotation of the respective species' genome
Calculation of the number of modifications and the proportion of modified reads over mapped reads at both the gene and single-locus levels
Comparison of modification numbers among syntenic genes
Identification of enriched motif(s) for each type of modification
Comparison of gene density, modification numbers, and adjacent transposable element frequency within the same dataset
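As an illustration of the second analysis, the proportion of modified reads can be computed per locus and then aggregated per gene. The sketch below is not taken from the repository scripts: the record layout (gene ID, position, mapped reads, modified reads) and the function name `modified_fraction` are assumptions, and the denominator is assumed to be the total number of reads mapped at the locus.

```python
from collections import defaultdict

def modified_fraction(records):
    """Return per-locus and per-gene fractions of modified reads.

    records: iterable of (gene_id, position, mapped_reads, modified_reads).
    This tuple layout is hypothetical; adapt it to the actual HAMR output.
    """
    per_locus = {}
    gene_totals = defaultdict(lambda: [0, 0])  # gene -> [mapped, modified]
    for gene_id, pos, mapped, modified in records:
        if mapped <= 0:
            continue  # skip loci without coverage to avoid division by zero
        per_locus[(gene_id, pos)] = modified / mapped
        gene_totals[gene_id][0] += mapped
        gene_totals[gene_id][1] += modified
    # Gene-level fraction pools reads across all modified loci of the gene.
    per_gene = {g: mod / mapped for g, (mapped, mod) in gene_totals.items()}
    return per_locus, per_gene

# Toy input, invented for illustration only.
example = [
    ("geneA", 101, 40, 10),
    ("geneA", 250, 60, 30),
    ("geneB", 77, 20, 5),
]
per_locus, per_gene = modified_fraction(example)
```

Pooling reads per gene weights each locus by its coverage; averaging the per-locus fractions instead would be an equally defensible choice and gives different numbers when coverage is uneven.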

Input data

Reference genome sequences (FASTA)
Reference genome gene annotation (GFF/GFF3/GTF)
Positions of known modifications identified by HAMR (BED)
Mapped read counts and modified read counts for known modifications identified by HAMR (txt)
Syntenic gene list
Transposable element annotation (GFF)
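To show how the BED and GFF3 inputs fit together, here is a minimal sketch of assigning modification positions to genes. It is an illustration under stated assumptions, not the pipeline's own code: the function names are hypothetical, only `gene` features are read, strand is ignored, and the gene identifier is taken from the `ID` attribute. The key detail is the coordinate conversion, since BED intervals are 0-based and half-open while GFF3 intervals are 1-based and inclusive.

```python
def parse_gff3_genes(lines):
    """Collect (chrom, start, end, strand, gene_id) for GFF3 'gene' features."""
    genes = []
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments, and blank lines
        f = line.rstrip("\n").split("\t")
        if len(f) < 9 or f[2] != "gene":
            continue
        attrs = dict(kv.split("=", 1) for kv in f[8].split(";") if "=" in kv)
        genes.append((f[0], int(f[3]), int(f[4]), f[6], attrs.get("ID", "")))
    return genes

def annotate_bed(bed_lines, genes):
    """Label each BED interval with overlapping gene IDs or 'intergenic'."""
    hits = []
    for line in bed_lines:
        if not line.strip() or line.startswith(("#", "track", "browser")):
            continue
        f = line.rstrip("\n").split("\t")
        chrom, start, end = f[0], int(f[1]), int(f[2])
        # BED is 0-based half-open; convert to 1-based inclusive for GFF3.
        first, last = start + 1, end
        overlapping = [gid for c, gs, ge, _strand, gid in genes
                       if c == chrom and gs <= last and first <= ge]
        hits.append((chrom, start, end, overlapping or ["intergenic"]))
    return hits

# Toy records, invented for illustration only.
gff = [
    "##gff-version 3",
    "Chr1\tsrc\tgene\t100\t200\t.\t+\t.\tID=geneA;Name=A",
    "Chr1\tsrc\tmRNA\t100\t200\t.\t+\t.\tID=geneA.1;Parent=geneA",
]
bed = ["Chr1\t149\t150\tmod1", "Chr1\t299\t300\tmod2"]
hits = annotate_bed(bed, parse_gff3_genes(gff))
```

The linear scan over all genes is fine for a small example; for genome-scale inputs an interval index or a dedicated tool such as bedtools would be the more practical choice.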

Integrated analysis for certain experimental setup

Rationale

The general analyses of the individual datasets are integrated to address specific biological questions about RNA modifications. The capabilities of the pipeline can be summarized as:

Comparison of differentially modified genes (DMGs) from multiple experiments
Generation of enriched gene ontology terms and pathways for selected gene lists
Comparison of modifications of syntenic genes across species
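The README does not state which statistical test is used for the ontology and pathway enrichment. A common choice for this kind of gene-list analysis is a one-sided hypergeometric test, sketched below as an assumption rather than a description of the repository's method. The function name and all four parameters are hypothetical, and no multiple-testing correction is applied.

```python
from math import comb

def hypergeom_enrichment_p(n_background, n_term, n_list, n_overlap):
    """One-sided hypergeometric p-value P(X >= n_overlap).

    n_background: genes in the background set
    n_term:       background genes annotated with the term
    n_list:       genes in the list of interest (e.g. modified genes)
    n_overlap:    genes in the list that carry the term
    """
    total = comb(n_background, n_list)
    upper = min(n_term, n_list)
    # Sum the probabilities of observing n_overlap or more annotated genes.
    tail = sum(
        comb(n_term, k) * comb(n_background - n_term, n_list - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total

# Toy numbers, invented for illustration: 10 background genes, 4 carry the
# term, 3 genes are in the list, and 2 of those carry the term.
p = hypergeom_enrichment_p(10, 4, 3, 2)
```

When many terms are tested, the resulting p-values would normally be adjusted, for example with the Benjamini-Hochberg procedure, before calling a term enriched.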

Input data

List of modified genes
Modification statistics derived from the general analysis
Comparative genomics results (gene synteny) from inter-species comparisons
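One way these inputs could be combined is to join per-gene modification counts from two species through the syntenic gene pairs. The sketch below is illustrative only: `compare_syntenic`, the pair list, and the count dictionaries are hypothetical structures, and treating a gene that is absent from the statistics as having zero modifications is an assumption.

```python
def compare_syntenic(pairs, counts_a, counts_b):
    """Join modification counts of two species through syntenic gene pairs.

    pairs:    iterable of (gene_in_species_a, gene_in_species_b)
    counts_a: dict mapping species A gene ID -> number of modifications
    counts_b: dict mapping species B gene ID -> number of modifications
    Returns rows of (gene_a, gene_b, count_a, count_b, difference).
    """
    rows = []
    for gene_a, gene_b in pairs:
        a = counts_a.get(gene_a, 0)  # assumption: missing gene = unmodified
        b = counts_b.get(gene_b, 0)
        rows.append((gene_a, gene_b, a, b, a - b))
    return rows

# Toy data, invented for illustration only.
pairs = [("A1", "B1"), ("A2", "B2")]
rows = compare_syntenic(pairs, {"A1": 3, "A2": 1}, {"B1": 1})
```

Raw count differences are sensitive to sequencing depth and expression level, so a real comparison would likely normalize the counts before interpreting them.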

[Figure: schematic workflow of the integrated analysis]

