Monarch geneset OGS2.0

DPOGS212026
Transcript: DPOGS212026-TA (4830 bp)
Protein: DPOGS212026-PA (1609 aa)
Genomic position: DPSCF300054 - 500926-510617
RNAseq coverage: 538x (Rank: top 23%)

Annotation
Heliconius | HMEL012974 | 0.0 | 74.79%
Bombyx | BGIBMGA010180-TA | 0.0 | 68.57%
Drosophila | woc-PB | 0.0 | 46.61%
EBI UniRef50 | UniRef50_Q17D61 | 0.0 | 49.69% | WOC protein, putative n=4 Tax=Endopterygota RepID=Q17D61_AEDAE
NCBI RefSeq | XP_001648900.1 | 0.0 | 49.69% | WOC protein, putative [Aedes aegypti]
NCBI nr blastp | gi|157105506 | 0.0 | 49.69% | WOC protein, putative [Aedes aegypti]
NCBI nr blastx | gi|157105506 | 0.0 | 45.83% | WOC protein, putative [Aedes aegypti]
Group |
KEGG pathway |
InterPro domain | [1487-1606] | IPR021893 | 1.5e-40 | Protein of unknown function DUF3504
Orthology group | MCL10480 | Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212026-TA
ATGGATGAAAAAGAAATACCTGAGAATCTCGGTGAAAACGACAATGAGACCGGTGATGGGATAAATACGACTGACATAAAAGAAGTAAGCGAAAATAATTGTACTAAAAGTGAAGAAAATAGTGAATGTGGACCAAGTGAACCTAATAGTATACAAAATCATCTAGAATGTGACGTTATCAAAAGTGATGTTGTAGAAAAGAGTGAAAAATGTGACGAACAAGGTGATGTAGAAGTATCTGTCAGTGTTGGTGATGATACAAAAGATATAGGAGAGATTCAAAAAGAGGAACTTCAAGAAATCTCAGAATTAGACAATGAATCAGTAGAGCCAGAAAAAGAAATACAAGATTTAGAAAAGCAAGTTTTAGACTCAGATTCAAAACGTGATGTCCCTCTGGAGAAACCTAGCTTGTGTGATGCTAATGAAGGAGTTTCTAAATCTGAAATTAGTGAAGAGCTTGTTGTTAACAATGCCACAGACCCCACTGTATCAAAAGAACTATTGAAAGAAGATAGTGCAGTTACAAAATTAGAGACTGATGAAACCCGCAATGCTGAATGTATAGAGGACATAAAATTAAGTATTGACAACCCTGATACTGAACATTCTGAATGTATAGAAAACATAAAATCAAATATTGACAACCCTGACACTGAAAATAAAAGCCCTAATATAGAAAGTATTACAAAAGAACCGAGTCAAGAACATGCAAATGAGGCTAATGAAGAGGATTATGAAAAAAGGGAGGTCGGCAAACCCGAGTTTGAAACAGCCCCTAATGTGTCTATGGATGATCACCACGAAGATCATACACAGGCTTTAGACCCATTTGATGCTCTTTTGAAAGACCGAACAGATGCAGCGGAAACATCAGAGACAGCCACTCAAGCTGATCTGTCCTCAATTAATATAGATGATGATGATCATCACAACGCCGATGATGCCCATGATATGATTCCTGATGATGAAGATGAACATCATCCGGACGATGAGAGTGGAATACCAGTTATAACAGAGCAACCAGGAGCTGATGAAGAGGTATGCTTGTTACCCGACACCGAAAGAGAAATATCTGAGGCTGATAAAGCGGCTGCTGAAAAGGTTTTAGCTGAGAAGAGAAAAAGAGATGAAGAATTGGCTATGCCCAAAGAGGAAGCTGAAGCGAGTGAATCTAATGAAGGTGTTGCAGAAGAACAGGTGCAGGAAGATGTAAACGAAAGTCTGGAAGGGGAGGGGGAGGGGGAGTCTGAAATATCCACGGGAGATGCGTCACAGGATAATGTAGGGGAAAAAGAAAACGATCAGGAAGATGCAGTCGATTACGAACAGGAAGAGCCAGAACCTAATTCAATTCGACAAATAAGCGCCACCGACTCCGTCTGTGTACAGTGTGACGAAGAGAAGTCATGTCAGTACAGATATTCAGACAAAGACGGAGCCGTACATTATATATGCGTTCTCAGTTGTGTTAAATTGTTCCAAGCGGCGCACGCGGGGCAGTACATCGTTGTAAATAAAAAATATATGGTCGAAGAAATTGCACCCAAAATACTGACGTGTTCGGAATGCGAGGAGAAAAAGACTTGCTACTTTTATTACAACTTCGACGGCGAAGACACGAACTACTGTTCAGTAGAGTGTCTGCACAGCATGATGGCGGATGAAAGAGACAAGTACACGTTCAAAAGACGAAGGATCACCGTCGAGGAAAAGTCGCCCAAGGAAGATCAATGTTGCGTCTGTGAGAGCACCAAGGAGTGTATATATAGTTTGACGCGCTACGGACAACAACTTTGGATTTGTGAACAAACATGCCTCAGGGCGATCAATTGTAAAGAAAACGGCAGGTATTTATTGAGAAAGAAGAGAGTGCAACGAGTACAAGCACAAAAACCCGTCAAGACTAATCCGCCCCTGCTGAAACTGAAAGTTATTAGTAATGCGACTGATAAATATTTAGACGAGGCTTACAAAGTGCAAGGAAAAACGCCGGCGAT
GGTTCAAGCGGCCAGGGAAGAGAGGGAACGGACGTTTATAAGGTCTTGCATGAATTGTCACATGATCCTCAATAATGAGGAAAAAATGCTAACGTGGGAAGCTATGGACTATTGTAACGAGACATGTCTGGGAAGATATCAGAATAAATTCGGTTCCAAATGCACCAATTGCAAAGTACACGTGCAGCATACGAGCATAGGGAAGTACTGCGTCAGGTTCGGATATAATATACGACAGTTCTGTAACTCGGCGTGTTTGGAGGACTTCAAAAAGGGTTTGAAAATTTGTTGTTACTGTCAGAAAGATATATCGGACGGCAGTCAAGGGTTCCTAGCGCCGGTGGGCGACAAGGGCCAGTTCAAAGACTTCTGTTCGCAGCTGTGTATGGAGAAGTTCGACAAGATGAGCAAGAACCCAGTTCCTAGGCCTGTTTGGGCGAAGTGCGCGGTCTGTTCGCTGGAGAAAGCTACCACAATCGAGGTGGAAGTCGCTCCCGATGAATCACAAAGACTGTGCTCCGATCCTTGCTTTGCTGCTTTCAAATTCGTCAACAACATTTTCCCTGATCAGTGCCGATGGTGTAAAATATATTTTGAGAGGAAAATAAGTCAATTTTTTACGATATACGAGGGTTCATCACCTCAGTGTTTCTGTTCCAAGTCCTGTATGAATATCTATATAAGTAATTCTAGGCACATAGTACCCTGTAACTGGTGCAAAGTTAAAAAGTACAATTTCGATATGATCAAACGCGTCCAACCCAACGGCCAGGACATTATGATGTGCTCCGTAAACTGCCTGAATCTATATCAAGTGTCCATCAACGCTGTGTCCTCGAGGAGAACGAAATGTGACCTGTGCAAGAATTCTGCTCTGGCGCAATATCACCTCACCATGTCCGACGCCACAGTCCGAAACTTCTGTACATACCAGTGTGTGATGACATTCCAGGGACAATATTCAAAACAACCGGCCCCGTTAATGTCTGGAGATTCTATCGACCAACAGAAAGCCGTTCCCACGGGCGCCCCCAGGCGCACGTATAACGCGAACACTCACAAAAATAACACTATGAAGTGTCAAAACCGTTCCGGTTCAGGCATGCCTGTGATATCCAACGTGCAGTCATTAGCTGCGCCGCCGCCTCTAGTGCCTACGAACGCTCGCAACAAGACCAAACGAGTAACACCCGAGGAGAACGCGGCGGAGCCCGCCATCCCGCAACCGCCTAGACCGCCCACACCCCCACCACCGCCCCCCAAGATATACAACCACGTGATCGTCAAGACTCTCCCGCCACAAGAAGTCGCCAACAAAGCCACTATGTCCAAACCCATGATGGTGTCCAAAGGAGTGTCGTGTAGACCCCATCCATGCACGAAAGAGTGTCAGACAGACCCCAGCCTGGAGCGTCGTGTTCTGATACCGGTTCCAGTTCCTATATACGTTCCGGTCCCCTGTGTGATGTGGTCGCTTCCGTTTCCGGTCCCCGTGCCCATACCGATTCCCATACCGACGCCGGTGTTTATACCAACTACGAGGAATTCGGCTAAGGGCATAATGAAAGAAATTAATAAGATCCATGACAAAATGCCGACGGATCCCTTCGAGGCTGAACTACTGATGATGGCGGAGATGGTGGCGGGAGACAAGAAGAAGGATCACAGCGACTCGGACACCGAGGACGAGAACGAGGAAGGTTTCAGTCCGGTGGCCGGTATGGACGGTAACAATGCGTTCGGCGAGGACGTGCTGCAGATGGCATTGAAAATGGCCACCGAGTACGAAGACCAGCCCGTGGACCTGGAGTCAGCTATGACCGCCAACACCATCACACCCAGCTCACATCCCGGAATGCCAGGTCTAGAAGGCGAGGGCATGCACCAGCACCATATGATGGTACTGGAACAGCAGCGTGCTGTGGCAGCCCTGCGCGCATCAAGCGTGGGCGGCGTGGGCGTGGGCGGTGTCGGTGTAGGCGTGGGCGTTGCTCGGAAAC
GAGCGCCCGCGGTTGCGCCTCGTGGGCGACCCTCCAAGCGACGGCGGGAGCCAGCGCCCGCGCCGCCGCCCGACCCGCCTCGCGAACCACAGGAGAAACCGGATGCTAATATGTGTCTTAAGTACACTTTCGGCGTCAACGCGTGGAAGCAGTGGGTGATGACGAAGAACGCAGAAATAGAGAAGAGTTCGATAAGACGAAAACCTTTTAAATCTGAAATATTACAGCTGACGGCCGACGAGTTGAACTATTCCCTTTGTTTGTTTGTTAAAGAGGTGCGGAAACCTAACGGCAGTGAATACGCACCGGACACTATTTATTATTTGGTTTTAGGAATTCAACAGTATCTGTTTGAAAACGGTAGGATAGACAATATATTCACGGATCCATATTACGAAAAGTTCACCGACTGTTTGGATGAAGTTGCTAGAAAATTTTCAGTTTTATATAACGATTCCCAGTACATCGTGACCCGTGTGGAGGAGGAGCACCTCTGGGAGAGTAAACAACTCGGCGCACACTCTCCACACGTGCTGTTGTCAACTCTAATGTTCTTTAACACCAAACATTTTAATCTAGTAACGGTAGAGGAACACATGCAATTATCATTCTCACATATAATGAAGCACTGGAAGCGAAATCCCAACCAGCCGGGACAAGCCAAAATACCCGGCTCTAGGAACGTTCTGCTCAGATTCTACCCTCCACAGTCAGCTCTAGAGGCGAATTCAAGAAAAAAGAAAGTTTATGAACAACAAGAGAATGAGGAGAACCCGCTGAGATGTCCCGTTAAATTATATGAATTTTATATATCGAAATGGTATGTTTGA

Protein sequence:

>DPOGS212026-PA
MDEKEIPENLGENDNETGDGINTTDIKEVSENNCTKSEENSECGPSEPNSIQNHLECDVIKSDVVEKSEKCDEQGDVEVSVSVGDDTKDIGEIQKEELQEISELDNESVEPEKEIQDLEKQVLDSDSKRDVPLEKPSLCDANEGVSKSEISEELVVNNATDPTVSKELLKEDSAVTKLETDETRNAECIEDIKLSIDNPDTEHSECIENIKSNIDNPDTENKSPNIESITKEPSQEHANEANEEDYEKREVGKPEFETAPNVSMDDHHEDHTQALDPFDALLKDRTDAAETSETATQADLSSINIDDDDHHNADDAHDMIPDDEDEHHPDDESGIPVITEQPGADEEVCLLPDTEREISEADKAAAEKVLAEKRKRDEELAMPKEEAEASESNEGVAEEQVQEDVNESLEGEGEGESEISTGDASQDNVGEKENDQEDAVDYEQEEPEPNSIRQISATDSVCVQCDEEKSCQYRYSDKDGAVHYICVLSCVKLFQAAHAGQYIVVNKKYMVEEIAPKILTCSECEEKKTCYFYYNFDGEDTNYCSVECLHSMMADERDKYTFKRRRITVEEKSPKEDQCCVCESTKECIYSLTRYGQQLWICEQTCLRAINCKENGRYLLRKKRVQRVQAQKPVKTNPPLLKLKVISNATDKYLDEAYKVQGKTPAMVQAAREERERTFIRSCMNCHMILNNEEKMLTWEAMDYCNETCLGRYQNKFGSKCTNCKVHVQHTSIGKYCVRFGYNIRQFCNSACLEDFKKGLKICCYCQKDISDGSQGFLAPVGDKGQFKDFCSQLCMEKFDKMSKNPVPRPVWAKCAVCSLEKATTIEVEVAPDESQRLCSDPCFAAFKFVNNIFPDQCRWCKIYFERKISQFFTIYEGSSPQCFCSKSCMNIYISNSRHIVPCNWCKVKKYNFDMIKRVQPNGQDIMMCSVNCLNLYQVSINAVSSRRTKCDLCKNSALAQYHLTMSDATVRNFCTYQCVMTFQGQYSKQPAPLMSGDSIDQQKAVPTGAPRRTYNANTHKNNTMKCQNRSGSGMPVISNVQSLAAPPPLVPTNARNKTKRVTPEENAAEPAIPQPPRPPTPPPPPPKIYNHVIVKTLPPQEVANKATMSKPMMVSKGVSCRPHPCTKECQTDPSLERRVLIPVPVPIYVPVPCVMWSLPFPVPVPIPIPIPTPVFIPTTRNSAKGIMKEINKIHDKMPTDPFEAELLMMAEMVAGDKKKDHSDSDTEDENEEGFSPVAGMDGNNAFGEDVLQMALKMATEYEDQPVDLESAMTANTITPSSHPGMPGLEGEGMHQHHMMVLEQQRAVAALRASSVGGVGVGGVGVGVGVARKRAPAVAPRGRPSKRRREPAPAPPPDPPREPQEKPDANMCLKYTFGVNAWKQWVMTKNAEIEKSSIRRKPFKSEILQLTADELNYSLCLFVKEVRKPNGSEYAPDTIYYLVLGIQQYLFENGRIDNIFTDPYYEKFTDCLDEVARKFSVLYNDSQYIVTRVEEEHLWESKQLGAHSPHVLLSTLMFFNTKHFNLVTVEEHMQLSFSHIMKHWKRNPNQPGQAKIPGSRNVLLRFYPPQSALEANSRKKKVYEQQENEENPLRCPVKLYEFYISKWYV-
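
The transcript and protein lengths in this record can be cross-checked against each other: 4830 bp is 1610 codons, which gives 1609 residues plus one stop codon (the trailing "-" in the protein sequence). The Python sketch below illustrates that check and translates the first and last few codons of DPOGS212026-TA. It assumes that the transcript is a coding sequence with no untranslated regions and that the standard genetic code applies; the function names `translate` and `expected_protein_length` are illustrative and not part of any MonarchBase tool.

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stops are written as stop_symbol.

    The default "-" mirrors how this record marks the stop codon
    (an assumption based on the protein sequence shown above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


def expected_protein_length(cds_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of cds_bp nucleotides."""
    codons, remainder = divmod(cds_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    # Lengths reported in the record: 4830 bp transcript, 1609 aa protein.
    print(expected_protein_length(4830))
    # First five codons and last four codons of DPOGS212026-TA.
    print(translate("ATGGATGAAAAAGAA"))
    print(translate("TGGTATGTTTGA"))
```

To check the whole record, the three sequence lines under `>DPOGS212026-TA` would first need to be joined into one string before being passed to `translate`.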