Articles, Abstracts, and Reports

RepeatModeler2 for automated genomic discovery of transposable element families.

Jullien M Flynn, Institute for Systems Biology, Seattle, WA 98109.
Robert Hubley
Clément Goubert, Institute for Systems Biology, Seattle, WA 98109.
Jeb Rosen
Andrew G Clark
Cédric Feschotte
Arian F Smit, Institute for Systems Biology, Seattle, WA 98109.Follow

Document Type

Article

Publication Date

4-28-2020

Publication Title

Proceedings of the National Academy of Sciences of the United States of America

Abstract

The accelerating pace of genome sequencing throughout the tree of life is driving the need for improved unsupervised annotation of genome components such as transposable elements (TEs). Because the types and sequences of TEs are highly variable across species, automated TE discovery and annotation are challenging and time-consuming tasks. A critical first step is the de novo identification and accurate compilation of sequence models representing all of the unique TE families dispersed in the genome. Here we introduce RepeatModeler2, a pipeline that greatly facilitates this process. This program brings substantial improvements over the original version of RepeatModeler, one of the most widely used tools for TE discovery. In particular, this version incorporates a module for structural discovery of complete long terminal repeat (LTR) retroelements, which are widespread in eukaryotic genomes but recalcitrant to automated identification because of their size and sequence complexity. We benchmarked RepeatModeler2 on three model species with diverse TE landscapes and high-quality, manually curated TE libraries:

Specialty

Institute for Systems Biology

Recommended Citation

Flynn, Jullien M; Hubley, Robert; Goubert, Clément; Rosen, Jeb; Clark, Andrew G; Feschotte, Cédric; and Smit, Arian F, "RepeatModeler2 for automated genomic discovery of transposable element families." (2020). Articles, Abstracts, and Reports. 3038.
https://digitalcommons.providence.org/publications/3038

Link to Full Text

PSJH Full Text

COinS

Articles, Abstracts, and Reports

RepeatModeler2 for automated genomic discovery of transposable element families.

Document Type

Publication Date

Publication Title

Abstract

Specialty

Recommended Citation

Browse

Links

Search

PSJH Research

Articles, Abstracts, and Reports

RepeatModeler2 for automated genomic discovery of transposable element families.

Authors

Document Type

Publication Date

Publication Title

Abstract

Specialty

Recommended Citation

Share

Browse

Links

Search

PSJH Research