Monarch geneset OGS2.0

DPOGS214051
TranscriptDPOGS214051-TA1575 bp
ProteinDPOGS214051-PA524 aa
Genomic positionDPSCF300171 - 317849-322033
RNAseq coverage218x (Rank: top 45%)
Annotation
HeliconiusHMEL0212938e-9867.67% 
BombyxBGIBMGA010570-TA2e-10588.41% 
DrosophilaCG13295-PA1e-9138.17% 
EBI UniRef50UniRef50_E2AZ371e-11847.05%Uncharacterized protein C3orf23-like protein n=15 Tax=Endopterygota RepID=E2AZ37_CAMFO
NCBI RefSeqXP_001603699.12e-11946.93%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|2700070772e-11845.51%hypothetical protein TcasGA2_TC013527 [Tribolium castaneum]
NCBI nr blastxgi|2700070773e-11745.51%hypothetical protein TcasGA2_TC013527 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL12499 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214051-TA
ATGCAAGATCGACTGCGCATATGTAAATATCGGCACATTTACAAGCGAAAGGGTTCACCAGAGCGGCGCACTAACGCGTTGCGCGGGAACCAGTCCGGTGAGATTCCTTCGACGTCCGATAGAATAATGTCGAATGAACTTCAGAAGGCTCATGGGAAATATGTTTTTAGATGTGGTCGTCTTATAGGAATTATAAGTAGAAATTTAAGTTCTGCTGAAATATCTACTGCTCTAAGACCCTTCTACTTTAGTGTTCACCCTGATCTTTTTGGGAAGTACCCAGAACAAAGAAAAACAAATGAACATTCTCTCCAACAACTGAGTGCTTTGTTGGAAGCACAGCAATCTAATCGTAATATGACAATGTCAACACTACCATTTTATTTACGTCAAAAGGATATGCCAGAAGGTAACTTCAAATTAGTTAACATTAATCTTAATAGTAAAAATGTGAGAGAAACTGTAGTGAAGATCTTAAATGCTTGTGATATATCAACTAACTATGTAGACAAAATACCAAGGAGCGCACCCAAATCTATCAACAGGGATATAGATTTCACAAAGGCTTATAAAGAATATGATACGGAATTTGAAAAAACTGTACGAATGAAAAGAAAAGTGGAGGAAAAGAAAGCTATAACTAACATAATTGACTGGATATCTGATAATAGTCTACATGGCAGAGAAAAGTATGAACTGACGAGTGCAACACGCGAGCAAGTGAAGTCTCTTATAAATGAGCTCTGCAATATTTATGGTATAAAAGAAGTAAAGTATGACAGTGGTTGGAATATCAGTCACATTAGAGGTGCCTTGCAGAGTCTAGTGTCTATGGCCTCACAGCATTCAAAGCACATGAAGAATTTGAAAGGTAGAACTATTGCATTGGGACAATTCACGGGTGTCAGTTTAGACGGTGACGTTTTTCTTAATATAATTGATGTTAGAAACGAATGGCTCTCGCTTATCAAAAAAGTCAGTCAAGAAGATGGTGCCTTAACTGAAATTCCCAATTATGAGAAAGCCCTGTCTAGTATTTTAAGAGATATTCACATATGTAGAAGAAAGTTTATGCCCAAAGTATCAGCAGCCCAATACTGCAGTCATCTACGTCAATTGATCACATCTATTGGGGATTTCTATGGAAGTGGCAAGAAATTTCCGGAATCTGTGCCAGAATCACTCAGCAAATATGAGATAGTAGTAGAGCCTGAAGCGGGTCCATTGATGGTGTCCCCCACTGGTCAGTTCATCACGCCATCATCGTGCCCAGCTGATGAACTCATAACATTCATCACTCATCATTTAGATGAAGCCACTTTGTTACTTACTGAATATAGCATCAATAAACACGTTGAGAAAAAGCTATGTAAAGAGGTCAAGGAACGCTTTGGTCTCATAGATTTGAATAAAGATGACAGCATCACCCCTGGTCTCATGATAATGTGTTGTCAGAGACTTCTCACGAGAATAGATAAACTTGATACGAAATTAAAAGGGAACATTCTTTACGTGACACATTACTACTCAGTTTTATCCGAGGGTGTGTTATGCATACCGTGGAATTTCAAATGA

Protein sequence:

>DPOGS214051-PA
MQDRLRICKYRHIYKRKGSPERRTNALRGNQSGEIPSTSDRIMSNELQKAHGKYVFRCGRLIGIISRNLSSAEISTALRPFYFSVHPDLFGKYPEQRKTNEHSLQQLSALLEAQQSNRNMTMSTLPFYLRQKDMPEGNFKLVNINLNSKNVRETVVKILNACDISTNYVDKIPRSAPKSINRDIDFTKAYKEYDTEFEKTVRMKRKVEEKKAITNIIDWISDNSLHGREKYELTSATREQVKSLINELCNIYGIKEVKYDSGWNISHIRGALQSLVSMASQHSKHMKNLKGRTIALGQFTGVSLDGDVFLNIIDVRNEWLSLIKKVSQEDGALTEIPNYEKALSSILRDIHICRRKFMPKVSAAQYCSHLRQLITSIGDFYGSGKKFPESVPESLSKYEIVVEPEAGPLMVSPTGQFITPSSCPADELITFITHHLDEATLLLTEYSINKHVEKKLCKEVKERFGLIDLNKDDSITPGLMIMCCQRLLTRIDKLDTKLKGNILYVTHYYSVLSEGVLCIPWNFK-