Monarch geneset OGS2.0

DPOGS215510
Transcript: DPOGS215510-TA | 1299 bp
Protein: DPOGS215510-PA | 432 aa
Genomic position: DPSCF300132 + 351547-353139
RNAseq coverage: 1x (Rank: top 95%)
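
The lengths and coordinates listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state.

```python
# Values copied from the record; variable names are illustrative only.
transcript_bp = 1299
protein_aa = 432
genomic_start, genomic_end = 351547, 353139

# A complete coding sequence has one codon per residue plus one stop codon.
expected_cds_bp = 3 * (protein_aa + 1)

# ASSUMPTION: coordinates are 1-based and inclusive (not stated in the record).
genomic_span_bp = genomic_end - genomic_start + 1

print(expected_cds_bp == transcript_bp)  # True
print(genomic_span_bp)                   # 1593
print(genomic_span_bp - transcript_bp)   # 294
```

Under that assumption the genomic span is 294 bp longer than the transcript, which would be consistent with non-coding sequence such as introns inside the gene model, although the record itself does not list the exon structure.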
Annotation
Heliconius: (no value listed)
Bombyx: (no value listed)
Drosophila: (no value listed)
EBI UniRef50: UniRef50_Q8MY33 | 6e-45 | 41.72% | Reverse transcriptase n=9 Tax=Endopterygota RepID=Q8MY33_9NEOP
NCBI RefSeq: XP_001949771.1 | 7e-30 | 32.54% | PREDICTED: similar to Putative 115 kDa protein in type-1 retrotransposable element R1DM (Putative 115 kDa protein in type I retrotransposable element R1DM) (ORF 2) [Acyrthosiphon pisum]
NCBI nr blastp: gi|22004010 | 2e-45 | 42.07% | reverse transcriptase [Papilio xuthus]
NCBI nr blastx: gi|22004004 | 2e-52 | 42.07% | reverse transcrpitase [Papilio xuthus]
Group
KEGG pathway 
InterPro domain: [5-125] IPR005135 | 1.1e-10 | Endonuclease/exonuclease/phosphatase
Orthology group: MCL16725 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215510-TA
ATGCCAGTTAGGCCCATAGGTAAAGTTCTTCAGGGGAACTTAAACCACGCGGTCGCAGCACAGGACCTCTTGTACCAGACTGTGGCCGAGTGGAACATAAATGTAGCTATCGTTGCGGAGCCATACTCTATTCCCCGAACCCATAAATGGGCCGGGTCCGTGGATGGTTCCGCGGCTATTTTCTTTCCCGGCGTGGCCTGCACTCACTCCGTTGTGGAGAGAGGAGCGGGCTTTGTGGCAGCTCGATGGGGAGAAGTAGTGGTGGTCTCTACATACTTCTCCCCAAACCGCAGCCGGGCCGACTTTGAGTCGTTCCTGGCTACGGTTGAAGGAGTCATCCTTCGGGTGGCCCCCAGTCCGGTGCTGCTGGGTGGCAGACCACCTTTCCTAAGGTGGTCACTCGTTCGCCTCCAGCCCGATGTGGCTGAGGAGGCAGCGATGGTGAGAGCATGGGCCGCAGTGCCCGACACCATGGCTGGGGATGCCGACTGTATGGCGGACCTTTTCGCGGACGACATTAAGGTTGTCTGCGATGCCGCTATGCCGAGGACGCAGGCCTGCCCCCGAAACAGAGGGCAGGTATACTGGTGGACGCAGGAACTGTCCAGCCTACGTACCGCCAGTATGGGGGCTAGGCGCGCCTACCAGCGTTACCGTAGGCGCGCCCGAGGAACGCTCGGTGTAGAAGAAAGTCTATACCGGGCCTACCAGGATGCCAACAAGGCATTGCGGACGGCCATTCGCAAGGCCAAAGAGGATGCCTGGGACCAGTTCCTGGGCATACTCAATAACGACCCCTGGGGTAGGCCCTACAGGACGATTAGGGGGAAATTCTCTACTCCAGCTTCTCCTACCTCCTGTATGGAGCCTGGGTTGCTGCGGAGGGTACTTGGGACGTTGTTCCCTGATCCTGGACCGTTCGCACCTCCGCGCATGACTACTGCAGATCTCGCTCAAGGGGAGCGGGTCGACGGCCCTCCCGTGTCGGATGCTGAATTCAGCACGATCCGTTTGAGGCTCCGGTGCAAACGCAAGGCGCCGGGGCCGGATGGGGCCCCCTCCAAGGTGTTGGCTATCGCCTTAGGGCCCCTGGAGGACCGGTACCGCGCAGTGCTCAACACCTGCATTGCGGCGGCCCACTTCCTCAGGCGATGGAGAGTACGGCGGCTCTGTCTACTCCGTAAGGAGAACCGTCCGGCGGATGCCCCAGAGGGCTACCGGCCAGTGGTGTTACTGAATATTGTAATTAATAACAATATAGACATTATGCAATTGGACAACTTCATTGAAATACCGTGA

Protein sequence:

>DPOGS215510-PA
MPVRPIGKVLQGNLNHAVAAQDLLYQTVAEWNINVAIVAEPYSIPRTHKWAGSVDGSAAIFFPGVACTHSVVERGAGFVAARWGEVVVVSTYFSPNRSRADFESFLATVEGVILRVAPSPVLLGGRPPFLRWSLVRLQPDVAEEAAMVRAWAAVPDTMAGDADCMADLFADDIKVVCDAAMPRTQACPRNRGQVYWWTQELSSLRTASMGARRAYQRYRRRARGTLGVEESLYRAYQDANKALRTAIRKAKEDAWDQFLGILNNDPWGRPYRTIRGKFSTPASPTSCMEPGLLRRVLGTLFPDPGPFAPPRMTTADLAQGERVDGPPVSDAEFSTIRLRLRCKRKAPGPDGAPSKVLAIALGPLEDRYRAVLNTCIAAAHFLRRWRVRRLCLLRKENRPADAPEGYRPVVLLNIVINNNIDIMQLDNFIEIP-
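
The protein sequence above should correspond to a translation of the transcript, with the trailing "-" marking the stop codon. The following is a minimal sketch for checking this, assuming the standard genetic code; the function name `translate` and the `stop_symbol` parameter are hypothetical and not part of the geneset's tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as `stop_symbol`, matching the "-" that ends
    the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five codons and last three codons of DPOGS215510-TA.
print(translate("ATGCCAGTTAGGCCC"))  # MPVRP
print(translate("ATACCGTGA"))        # IP-
```

Passing the full DPOGS215510-TA sequence to `translate` and comparing the result with DPOGS215510-PA would check the whole record. Only the two ends are shown here.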