Monarch geneset OGS2.0

DPOGS213969
Transcript: DPOGS213969-TA; 1410 bp
Protein: DPOGS213969-PA; 469 aa
Genomic position: DPSCF300498 + 39586-41075
RNAseq coverage: 1x (Rank: top 95%)
Annotation
Heliconius: none listed
Bombyx: BGIBMGA013961-TA; 7e-16; 36.23%
Drosophila: none listed
EBI UniRef50: UniRef50_Q868Q4; 3e-61; 46.28%; Reverse transcriptase n=42 Tax=Endopterygota RepID=Q868Q4_BOMMO
NCBI RefSeq: XP_002430678.1; 2e-31; 30.62%; endonuclease/reverse transcriptase, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|22004004; 9e-61; 45.07%; reverse transcriptase [Papilio xuthus]
NCBI nr blastx: gi|28569894; 6e-63; 45.11%; reverse transcriptase [Bombyx mori]
Group
KEGG pathway: none listed
InterPro domain: [147-353]; IPR005135; 3.1e-35; Endonuclease/exonuclease/phosphatase
Orthology group: MCL16725; Insect specific

Nucleotide sequence:

>DPOGS213969-TA
ATGGAAAGGGCAGTACGAGAAGCAGTCGCCTTGTCAAGCGCGAGACTGGATGCGCGGCTGGAGTCGCTTGAGACCAGGCTCCTGCCCGCGCCACGTTTCTGCCCTCCCCTGGCCTTCGACAAGAAGGAGACGGATGCGCAGTCAGCTATGCCAAGGCCTCGGCAGAAAACTCCAGCCCCGGAAGCCGCGGTGACGACTCTGTCGCCTCCGTCGAACCCCAGTCCGGAGGAAAAGAAAAGGAGGAAAGCAAGGGCGTCCGCTGCGGCAAAAGAAGCGGCAGTGACCAGAAAGGACGCCTCCAAATCACAAGCGGCGACAAAAGCCCCCGCTGCCAGTGAATGGACAAGAGTCGCAACTAAGAAGGGAAACAAGGGGCAAAAGAAGCAACAGAGGGCAGCGCCACCGCAAAAGAAAGAGGGGAAGAAGAGAAATCTGCAAGCCCCAAAGTCCGCGGCGGTGGGCAACTTGAACTACTGCGCTCGCTCGCAAGATCTCCTGGATCACTATGCAGCGCAGTGGTCAATAGACATGGATATGGTGTGCGAGCCGTACCGCGTCCTCCCCCGAGATAATTGGATCGGGAACGTCGACTCGACGGTCGCGTTGGTCTTCGGCCCTAACATTGCGCCTCCGTGTTTCGGGGGTACTATTAAGGGTCGCGACTATGTTGGTGGCAAATTGGGGTCCGTTACCGTGTTGGGGGTCTACTTCTCCCCAAACAGGCCCCTGGCCCAGTTCGAAGCCTTCCTCCTCCAGTTGACCAGTGTGGTGAATGGAGCGACGGACCCAGTGATCGTTGCAGGGGACTTCAACGCCAAGTCGGCGCTGTGGGGCTGCGCTGCAACGGACGCTAGAGGCCGTGCGATGGAGCGCTGGATGGCGTCTACGGACCTATTGCCTGTCAGTCGCGGATCAGTGAGCACTTGCGTGCGTCAACAGGGCGAGTCAATCGTCGACGTAACGCTCGTCAGTTCGCCGGCCGCCCACCGTATCGCCAATTGGCGGGTACAGGAGGGTACGGAAACTCTGTCGGATCACAGATTCATCCGCTTCGACGTAATTTCACACACGTCGGAGCCGGCCTCCACAACACCACCTGGTGGAGGGCCAAGATGGTCTGTGAGACGGCTGGACCGTAACCTCCTCGAAGAGGCTGCTGTCGTCGCGTCCTGGGTGCCCCTACCTTCAGCCGACGCCGCGGCTTGTGCGGAATGGCTCAACGAGGCGTCGCGCAACATCTGCGACGCCTCCATGCGGCGAGTGGGCAATACCGGACGGCGACGACAGGCGTACTGGTGGCGGCCTGAGAGCGGAATGGCTCAACGAGGCGTCGCGCAACATCTGCGACGCCTCAATGCCATGAGTGGGCAATACCGGACGGCGACGACAGGCGTACTGGTGGCGGCCTGA

Protein sequence:

>DPOGS213969-PA
MERAVREAVALSSARLDARLESLETRLLPAPRFCPPLAFDKKETDAQSAMPRPRQKTPAPEAAVTTLSPPSNPSPEEKKRRKARASAAAKEAAVTRKDASKSQAATKAPAASEWTRVATKKGNKGQKKQQRAAPPQKKEGKKRNLQAPKSAAVGNLNYCARSQDLLDHYAAQWSIDMDMVCEPYRVLPRDNWIGNVDSTVALVFGPNIAPPCFGGTIKGRDYVGGKLGSVTVLGVYFSPNRPLAQFEAFLLQLTSVVNGATDPVIVAGDFNAKSALWGCAATDARGRAMERWMASTDLLPVSRGSVSTCVRQQGESIVDVTLVSSPAAHRIANWRVQEGTETLSDHRFIRFDVISHTSEPASTTPPGGGPRWSVRRLDRNLLEEAAVVASWVPLPSADAAACAEWLNEASRNICDASMRRVGNTGRRRQAYWWRPESGMAQRGVAQHLRRLNAMSGQYRTATTGVLVAA-