Monarch geneset OGS2.0

DPOGS215241
TranscriptDPOGS215241-TA1362 bp
ProteinDPOGS215241-PA453 aa
Genomic positionDPSCF300047 - 586603-588035
RNAseq coverage0x (Rank: top 99%)
Annotation
Heliconius% 
BombyxBGIBMGA013961-TA2e-1323.64% 
Drosophila% 
EBI UniRef50UniRef50_Q8MY333e-9044.75%Reverse transcriptase n=9 Tax=Endopterygota RepID=Q8MY33_9NEOP
NCBI RefSeqXP_002430678.18e-3829.30%endonuclease/reverse transcriptase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|220040047e-9245.61%reverse transcrpitase [Papilio xuthus]
NCBI nr blastxgi|20552761e-10347.87%Pol protein [Bombyx mori]
Group
KEGG pathway 
InterPro domain[3-189] IPR0051354.5e-37Endonuclease/exonuclease/phosphatase
Orthology groupMCL16725 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215241-TA
ATGGCCGGACATCTCAGTTTCCTGCAGGCCAACGGGAACCACGCCGCCGGGGCTCAGGACCTCTTCCTGCAGTCCATGGTGGAGTGGGCCGTTGACGTGGCGGTCATAGCGGAGCCGTACTATGTCCCTGCCCAACCTCATTGGGCGGGAGACACGGGCGGCTCCGTGGCGATCGTCACGCGGCCGGGCGCCGTGCCCCCCTCGCAGTCAAGGCGAGGGGCCACGGCCTTCGAGAGTTTTCTCGACACTCTCGGGCCGGTGGTAGGGCAGTTAGCGCCTCTCCAGGTGGTTGTGATGGGGGACCTCAACGCCAAGTCCACGGCGTGGGGAAACCCCCTCACAACACCCAGAGGGAGGGAGCTGGAGGAATGGGCGCTAACTGCCGGGCTGTCCCTACTCAACACGGGGACAGTCCAGACGTGCGTGCGAAGGTCGGGAGGTTCAGTGGTGGACGTCTCGTTCGCCACTCCGACCATCGCACGCAGAGTGGAGGGATGGAGGGTTGAGGCAGGAGTGGAGACGCTTTCGGACCACCGCTACATACGGTTCGAAGTGTCTGCCACTCTTGCCGGTCGCCGGAGTCCAGCGTCGAGTTCCTCGTCGTCCCGGGAGAGAAGTCGGTTTCCGCGTTGGGCCCTATCAAGGCTCAACCGGGAACTGGCTGAGGAAGCGGCCGTTGTCGGCCGCTGGAGTCTCCCGGGGAGTGCGGAGTTGGGGGTGGACGAGGGGGCTAGTCGCTTTGGGGACGTTCTCCATAACGTCTGCAGAGCGGCGATGCCCCCCGTTGGACGTCGACCCCCGCGGGGTGCGGCGTACTGGTGGTCGGACCACATTTCCGACCTCCGGGTCGCCTGCAACGGGGCCAGGAGGGCATACACCCGGAGCAGGCGACGCCGCCCCCAGGACGAGGAGCGTGATGGCCGGCTGTACAGGGTCTACGTGGCGAAAAAGTTGATCCTGCAGCAGGCCATCTGCCGAGCCAAGGAGGCAGCCTGGCTGGAGCTGGTGGAGGGGCTCGATCGAGACCCCTGGGGCCGACCGTACAAGCGGGCGCGGAACAAAATCTGCGCCCAGTCGGCCCCCATCACGGAGGTGCTCCAGCCGGCTGTTCTGAGGGGGATCGTCGGGGAACTCTTCCCCGACGCCCCGGCGGGATTCATTCCCCCCAGAATGACTCGGCAGACGCTGGAAGAGGGCGACCGCGTTCCGCCCACCGTCACGGAGTCTGAAATGGAGGCGATTTTGGCTCGCCTCCAGAGTAAGAAGAGCGCACCGGGTCCGGACGGGGTGCACGGGAGGGTTCTGGCTCTCTCCCTGGTGCACCTCGGGGGAGCCCTCAGGGAGCTGTCGACCTCTGTCTGA

Protein sequence:

>DPOGS215241-PA
MAGHLSFLQANGNHAAGAQDLFLQSMVEWAVDVAVIAEPYYVPAQPHWAGDTGGSVAIVTRPGAVPPSQSRRGATAFESFLDTLGPVVGQLAPLQVVVMGDLNAKSTAWGNPLTTPRGRELEEWALTAGLSLLNTGTVQTCVRRSGGSVVDVSFATPTIARRVEGWRVEAGVETLSDHRYIRFEVSATLAGRRSPASSSSSSRERSRFPRWALSRLNRELAEEAAVVGRWSLPGSAELGVDEGASRFGDVLHNVCRAAMPPVGRRPPRGAAYWWSDHISDLRVACNGARRAYTRSRRRRPQDEERDGRLYRVYVAKKLILQQAICRAKEAAWLELVEGLDRDPWGRPYKRARNKICAQSAPITEVLQPAVLRGIVGELFPDAPAGFIPPRMTRQTLEEGDRVPPTVTESEMEAILARLQSKKSAPGPDGVHGRVLALSLVHLGGALRELSTSV-