Monarch geneset OGS2.0

DPOGS205343
TranscriptDPOGS205343-TA1602 bp
ProteinDPOGS205343-PA533 aa
Genomic positionDPSCF300292 + 201079-206251
RNAseq coverage46x (Rank: top 71%)
Annotation
Heliconius% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_E2AA348e-4044.65%Putative uncharacterized protein n=1 Tax=Camponotus floridanus RepID=E2AA34_CAMFO
NCBI RefSeqXP_001949771.13e-1830.34%PREDICTED: similar to Putative 115 kDa protein in type-1 retrotransposable element R1DM (Putative 115 kDa protein in type I retrotransposable element R1DM) (ORF 2) [Acyrthosiphon pisum]
NCBI nr blastpgi|3071824833e-3944.65%hypothetical protein EAG_11668 [Camponotus floridanus]
NCBI nr blastxgi|20552761e-4543.85%Pol protein [Bombyx mori]
Group
KEGG pathway 
Orthology groupMCL16725 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205343-TA
ATGTCACGGAGTAAGGGCGGAGGACTAGTGCGGTGTGATAAGAGTAGAGCGAGGCTTGTTTATGACGCGAGTACATACGTTACGCTCTTTCGGCTCGACCGGGAACTGGCTGAAGAAGCGGCCGTTGTCGGCCGCTGGAGCCTCCCGGAGAGTGCGGAGTTGGGGGGGTGGATGGGGGCTAGTCGCTTTGGAGACGTTCTCCAAAACGTCTGCAGAGCGGCGATGCCCCCCGTAGGACGTCCCCCCCCGCGGGGAGCGGTGTACTGGTGGTCGGACAACATCTCCGACCTCCGGGTCGCCTGCAACGGGGCCAGGAGGGCATACACCCGGAGCAGGCGACGCCGCCCCCAGGACGAGGAGCGTGATGGCCGGCTGTACAGGGTCTACGTGGCGAAGAAGTTGATCCTGCAGCAGGCCATCCGCCGAGCCAAGGAGGCAGCCTGGCTGGAGCTGGTGGAGGGGCTCGATCGAGACCCCTGGGGCCGACCGTACAAGCGGGCGCGGAATAAAATCTGCGCCCAGTCGGCCCCCATCACGGAAGTGCTCCAGCCGACTGTTCTGAGGGGGATCGTCGGGGAACTCTTCCCCGACGCCCCGGCGGGATTCACTCCCCCCAGAATGGCTCGGCAGACGCTGGAAGAGGGCGACCGCGTTCCGCCTACCGTCACGGAATCTGAAATGGAGGCAATTCTGGCTCGCCTCCAGAGTAAGAAGAGCGCACCGGGTCCGGACGGGGTGCACGGGAGGGTTCTGGCTCTCTCCCTGGTGCACCTCGGAGAAGCCCTCAGGGAGCTGTTCGACCTCTGTCTGAGGTCCGAGCAGTTCCCGAGGGCCTTGTTGGGGGTAGGTTTCTTTGGGCTGATCATCCCATGTCAATGTATAGATTTCATGGACGTTATCCTGAATGATACAAACATAGAACAGGAATGGGCATCCTCGGACCAGGAAGCTGAAAATTTTGTAAAGAATATAAAAGTTTTAAAACTACTTCCTTTGAATAAAGAATGGAAAAGCAAATTCGGTGAGAACGCGATACAAGCGGACGGCAAAAAGATGGAGTCTGTGCCATTAGTCTTAATACTGGCTGCAGCTGGCGGTTCTGAAGAAATGAGGGATAGATCGCAACCAGAGACCTCTGTTGATATGAAGTCGATACTTATTAGTGACGAAGATTTAGAAAATATTAGAGAAGAAGATGAAGATCAAAATACAGATTCAAATCTAAGTTTAACTGTAACAAATCCAGAAGTAGAAGATTCCGTTAAAAAGATGGAAGAAGCTTTGGATATTGTCAGAGAAGAATTAAGAACAACCGACCAGCCGTTTAATGATCAAAACAAAGTTCAAGAAATAGATAAAGAACTTGTAGCAGAAATAGAAGAAATCAGGAATGATATAGATCAAGCTATATTGGATTGCAAGAAGGACAAAAATTGCAAATTTACACCAGACGTGGACAGTTCAGAGGTGGCAGAAGATATATTTTTTAGTCAAGAGAAATACAGTTTAGGGGATTCAGATCAGCGTTTAGTTAATGAGAAAAGTTTAGATAAAACTTATAAACTATCGATCGTGATCAACAAATATATAATTGTTAAATAA

Protein sequence:

>DPOGS205343-PA
MSRSKGGGLVRCDKSRARLVYDASTYVTLFRLDRELAEEAAVVGRWSLPESAELGGWMGASRFGDVLQNVCRAAMPPVGRPPPRGAVYWWSDNISDLRVACNGARRAYTRSRRRRPQDEERDGRLYRVYVAKKLILQQAIRRAKEAAWLELVEGLDRDPWGRPYKRARNKICAQSAPITEVLQPTVLRGIVGELFPDAPAGFTPPRMARQTLEEGDRVPPTVTESEMEAILARLQSKKSAPGPDGVHGRVLALSLVHLGEALRELFDLCLRSEQFPRALLGVGFFGLIIPCQCIDFMDVILNDTNIEQEWASSDQEAENFVKNIKVLKLLPLNKEWKSKFGENAIQADGKKMESVPLVLILAAAGGSEEMRDRSQPETSVDMKSILISDEDLENIREEDEDQNTDSNLSLTVTNPEVEDSVKKMEEALDIVREELRTTDQPFNDQNKVQEIDKELVAEIEEIRNDIDQAILDCKKDKNCKFTPDVDSSEVAEDIFFSQEKYSLGDSDQRLVNEKSLDKTYKLSIVINKYIIVK-