Monarch geneset OGS2.0

DPOGS215162
TranscriptDPOGS215162-TA1425 bp
ProteinDPOGS215162-PA474 aa
Genomic positionDPSCF300539 - 9494-21207
RNAseq coverage1x (Rank: top 95%)
Annotation
HeliconiusHMEL0121622e-0730.00% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_Q868Q46e-8244.32%Reverse transcriptase n=42 Tax=Endopterygota RepID=Q868Q4_BOMMO
NCBI RefSeqXP_001949771.11e-4331.55%PREDICTED: similar to Putative 115 kDa protein in type-1 retrotransposable element R1DM (Putative 115 kDa protein in type I retrotransposable element R1DM) (ORF 2) [Acyrthosiphon pisum]
NCBI nr blastpgi|20552762e-8347.99%Pol protein [Bombyx mori]
NCBI nr blastxgi|20552761e-9447.99%Pol protein [Bombyx mori]
Group
KEGG pathway 
InterPro domain[24-86] IPR0051355.7e-09Endonuclease/exonuclease/phosphatase
Orthology groupMCL16725 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215162-TA
ATGGCCACTGTCCGATGCAACATGGCCACTGCAACCAACAACTCTGGCACCGACTGTGCACAGACAATACAAATCAATGTATTCGTAAGTGTATCGGGGACAGTCCAGACGTGCGTGCGACGGTTGGGAGGTTCAGTGGTGGACGTCTCGTTTGCCACTCCGACCATTGCACGCAGAGTGGAGGGATGGAGGGTTGAGGCAGGAGTGGAGACGCTTTCGGACCACCGCTACATACGGTTCGAAGTGTCTGCCGCTCCTGCCGGTCACCGGAGTCCAGCGTCGAGTTCCTCGTCGTCCCGGGAGAGAAGTCGGTTCCCGCGTTGGGCCCTGTCAAGGCTCAATAGGGAACTGGCTGAGGAAGCGGCCGTTGTCGGCCGCTGGAGTCTCCCCGGGAGTGCGGAGTTGGGGGGGGACGAGGGGGCTAGTCGCTTTGGGGACGTTCTACATAACGTCTGCAGAGCGGCGATGCCCCCCGTTGGACGTCCACCCCCGCGGGGTGCGGTGTACTGGTGGTCGGACCACATTTCCGACCTCCGGGTCGCCTGCAACGGGGCCAGGAGGGCATACACCCGGAGCAGGCGACGCCGCCCCCAGGACGAGGAGCGTGATGGCCGGCTGAATAGGGTCTACGTGGCGAAAAAGTTGATCCTGCAGCAGGCCATCCGCCGGGCCAAGGAGGCAGCCTGGCTGGAGCTGGTGGAGGGGCTCGATCGAGACCCCTGGGGCCGACCGTACAAGCGGGCGCGGAATAAAATCTGCGCCCAGTCGGCCCCCATCACGGAGGTGGTCCAGCCGGCTGCTCTGAGAGGGATCGTCGGGGAACTCTTCCTTGACGCCCCGGCGGGATTCACTCCCCCCAGAATGGCTCGGCAGACGCTGGAAGAGGGCGACCGCGTTCCGCCCACCGTCACGGAGTCTGAAATGGAGGCGATTTTGGCTCGCCTCCAGAGTAAGAAGAGCGCACCGGGTCCGGACGGGGTGCACGGGAGGGTTCTGGCCCTCTCCCTGGTGCACCTCGGAGGTGCCCTCAGGGAGCTGTTCGACCTCTGCCTGAGGTCCGGGCAGTTCCCGAGGGCCTGGAAGGAAGGCCGGCTTTGCCTACTCCCAAAAGGCAGCCGGCCTCTGGACTCGGTTTTGGCGGTGCGACCGGTGGTTGTGCTGAACGAGGCGGGGAAGGCCCTGGAGAAGATAATGGCCACCCGTCTCGTTCGGCACCTGGAGGAAGGCTCGGGCCCGGGACTGTCAGAGTCCCAATTTGGGTTCCGAGCCCGTCGGTCGACCGTCGATGCCCTCAAACGCCTGAGGGCGGTGACGGAAGAGGCGGAACGCAGAGGAGAGAGCCCTGAATACAGGGATTTGTGTGAGCTAGAGCTCAATTTATCTCACCACTGTGTATATATGACGCACATGATAACACCGAAATAA

Protein sequence:

>DPOGS215162-PA
MATVRCNMATATNNSGTDCAQTIQINVFVSVSGTVQTCVRRLGGSVVDVSFATPTIARRVEGWRVEAGVETLSDHRYIRFEVSAAPAGHRSPASSSSSSRERSRFPRWALSRLNRELAEEAAVVGRWSLPGSAELGGDEGASRFGDVLHNVCRAAMPPVGRPPPRGAVYWWSDHISDLRVACNGARRAYTRSRRRRPQDEERDGRLNRVYVAKKLILQQAIRRAKEAAWLELVEGLDRDPWGRPYKRARNKICAQSAPITEVVQPAALRGIVGELFLDAPAGFTPPRMARQTLEEGDRVPPTVTESEMEAILARLQSKKSAPGPDGVHGRVLALSLVHLGGALRELFDLCLRSGQFPRAWKEGRLCLLPKGSRPLDSVLAVRPVVVLNEAGKALEKIMATRLVRHLEEGSGPGLSESQFGFRARRSTVDALKRLRAVTEEAERRGESPEYRDLCELELNLSHHCVYMTHMITPK-