Monarch geneset OGS2.0

DPOGS216202
TranscriptDPOGS216202-TA4911 bp
ProteinDPOGS216202-PA1636 aa
Genomic positionDPSCF300080 + 345931-380661
RNAseq coverage133x (Rank: top 56%)
Annotation
HeliconiusHMEL0073550.081.50% 
BombyxBGIBMGA004549-TA0.083.87% 
Drosophilacas-PA3e-8551.60% 
EBI UniRef50UniRef50_D2A6448e-14442.39%Putative uncharacterized protein GLEAN_15058 n=2 Tax=Tribolium castaneum RepID=D2A644_TRICA
NCBI RefSeqXP_973001.12e-14442.39%PREDICTED: similar to transcription factor castor [Tribolium castaneum]
NCBI nr blastpgi|2700085323e-14342.39%hypothetical protein TcasGA2_TC015058 [Tribolium castaneum]
NCBI nr blastxgi|3503977023e-15033.67%PREDICTED: hypothetical protein LOC100749506 [Bombus impatiens]
Group
KEGG pathway 
Orthology groupMCL14970 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216202-TA
ATGTCTCGAGGCGAGTTTAAAAAGCTCCGTTTCGAAGTAAAGAATGTTGTGTACAATCTCGGTTCAGACAAAAAAGTCGGACTTGCGCGTCGCGGTTATCAAGGTTCAAGAGCAATGACTACTGCGATGGCACATCAGGAATCCATACACGAGAGACTGCAGTCGGGACCCGAGGAACCGCTGTACCTGCACCATGGAGCATCCTGGAACTCAGAGACGTCGGAACACCAAGAGGATCACAGCACAGACATGTCGCCCTACTACAACATGCCAAAAAGAAATAAGAGAAAGAATTTCAAACCACGATGTGCTCCCAACGCTACGACGTTCTCTGACGACTCCAACGGAGGCAACTGCGAGAGCAGCCTCACAAACGGAAATGACAGAACTACACCTGAACAGGCTGATGAGGAAACGGATGGCTACAATAAAATTGGTGTTAAGAACTTCGGTCTGATAAAAGGTGGAGCTGTCGACCTCAGCCGGCGATTGAGCACTGATTCCAATGAAAACATAGAATCCACCTACCAGAACATCACGGACGAGAAGAAAATAGACTCGTTCCAGCGCTCGTACATACACAGCCTGAAGGTGAACCAGCCGGCCGAGAGAGATCACACCGTGCTTAACTTTAGTAATTTACAAAACAAAACATTCTGTGCCAAAAATCCTTTCGCCATATCCCAGTTAATAAAAACCAATTCGCCTCCACGGAGAAGGGAATCCGATTCCGACGATGAGGCTAAGGGTCAGGATATAAGTGCTAATGACAGGTTGGAGAACCAGGTTATAGGGGAGTCTAATGAAGGTCAACAAAACGGTGATCCGTCTAACGCTACAAATAGAATCTTGAGGGATTACGCTATGAACACCATGAAAGAGCTATTGGGTATCTATGGCTTAAGAGCTTTGGAGGCTCTTCAAGGTAAACCCTTCACTCAGTTACCTTTGAGTCTGAAGCGACGTCATGATTCAGGGGATGGAACTGATAGAAGCATCCGAAGATCCTCTTCAGCGACTGAAGATGAAGGATCCACCAACATAACCCCTCCAAAAACACCAGACAACGACATAGAAGCCCAAAGACAAAGGGCCCTTCGGATAATAAATCAGCAATCAATGCGTCTGTTTCGTTCTGGGCGAGAGGAAGACAGTTCTGATGATGGAGACCAGACCCTCGACCTCAGTGTGTCCAGAGATACAGACGGACAGGAAATGTCTACAACTTCGAGCGCCGGTAACAGCGTGTCGGAGTACGATCAGAACATGATGATGAGGAAGAACCTCGAGGACCTGGCGAATTTACAAATGCAGAATTTCTTCCAAAAACAATCCATGATGGATCCCGAAGACTCTCAGGAACCGAAACTCCAAGCCAGTGGACTGCTGTCAGCCTTGAATCATCTGGATCAGGCTCGCAAGAACTCCATGCAAACGACCGTGGATTACTCCAGATACGTCAAAAGGTACAGTTCAACACTAGAGTGTGGTTCATCATATTGTAAGGATTTGGGATATCGAGAACATTTTCATTGCATGGATTGCACTGCGAGGGTCTTTTGCAAAAAGGAAGAAATGATAAGACATTTCAAGTGGCACAAAAAACGAGACGAATCTTTAGCGCATGGTTTTATGAGATATTCTCCCCTCGACGACTGCTCCGAGAGGTTCAGTGACTGTCCTCATAATCGAAGTCAAACACATTACCACTGCATACAGGATGGCTGTGATAAGGTGTACATATCTACATCGGATGTCCAGATGCATGCGAACTATCACCGCAAGGACTCTGCCATACTTCAAGAAGGGTTCCAAAGGTTCCGCGCCACTGAGAGCTGTTCCGCTCCACATTGTATGTTCGCCGGTCAACGGACCACGCACTTCCACTGTCGAAGGCCTGGATGCACCTACACATTTAAAAACAAAGCCGACATGGAGAAACACAAGACTTACCACATAAAAGACGAGGCGTTATCAAAGGACGGTTTCAAAAAGTATATGAAGAGCGAAGCCTGCCCGTATAGAGATTGCAGGTTTAGTAGGACCTGCAACCACATACACTGCATCAGACCGCACTGCAACTACGTACTGCACTCCAGCAGCCAGATCTTCACACACAAACGGAAACACGAGAGGAAGGAACAGGAAATGAGTTTTGGGTTGCCAACGTCTGTGATCCAAAATGCTTTAATGCAAGGAGATCTGTCATATGCCATGCATGATGACGACATATCCGTTGAAGATTACACCCAAGCTTATGTGAGTAATGAAATTCAAAACCAACCCTTGGTGGATGAAGTAGCACGCAAGTATATTGAAACGTTCAACGGCGCGAAGGAAGCTGAAGACAAGTGCAGCAAATGTAGCGCCTTCAACAAACACTACCACTGTTTGGCTGAAGACTGCAAGATGGCGCATAGTTGTCACAACACAATGGTAAAGCACGTCATGGAACACGAACAACAGAGCCAAGTAACGGAAGCTTACTTCGTGACGTATACGAGAAATAATCCTTGTTCCAATTCGGCGTGCCAGCATATCAAAGACATCACCCATTATCACTGTATTTGGGAGAACTGTGGAGCTGTCATACTATCTTCAGAGGAACATCCCTTCAGACGCTTGGAGCACTATCGCCAACACGCCAATCTCAGTCCCATGTCGCCAAACTTGGCAGCCAGATTTGCGCCCACGTTCACTCCGCAACTATCGGCTCAATTACACCCAAACATGGCGGCCCAACTTGGCCAGGTGACAAATTTACCCAATCTTCCCCCTAATTTGGCTCAGTTGGCCCAAATGGCACCAAGTCTAACGTCCCAATTGACGCCGAACTTACAAGCTCAATTGAATCTTGGGCCTAATCCCACGATTGCACCACAACAAGTGAACCCATCGAGTTCGTTGGACGAGATGTTTAGCCGGAAGAGGGGGAGACCTCCAAAGAATCGTGTTGTAGAGGTCTGGACAGATAATGTGACTCCTCAAGCTATATTTACATCGTTCAAACTGCCGAAGAGCAACCAACTGCCGCCTCTAGTACAATCACCAATTATTGCTACAGTCGAAGAATTCCCAAGAATCAGATTCCAGAATTTCGAAGTGTTTACAACACCTGATTCGTGCATTCAGTATTATGCGAATTGTGCATTAAGATCAACAAGGTTGCATTTCCATTGTTTCAACTGTAGCTTCACGGGTGAAACCCACGTGACAATGGAAACTCACATGAGGGAATCCCATTCGATAGCCCCGCTATTGGAAGGCTTCGACTATTTCTCCTCCTTCGACCAATGCGGTGGAAAGAGCTGTTTTAAGAACCAATTAAGATCGCACTTCCATTGCGCCCAAGGAAATAATTGTCCGGCGATTTTAACCCAATATTCAGCATTGGCCACCCATAAGCACGGAAGGGGTACTCCTGTTATTAAGAGTGAATCAGAACAAGAGAAATCTTATGAAGAAGCCAAAGATTATTCTATTAAAGAACGAGCTTCAGCCGACAGCGTTGCTTCCATGAAAGATCAAGCTGAATCATTTACACCCACAAATTTCTCTATCAAAAGCGAATTCGAAAGAGAACAGGATAATATTTCGGTAGTAAAAGCGACAGGAACTTTCTACCCGTCATCTCATCTCAGGAACTCGTCTCCATCCCCAAGAAGTATCTACGACGCCTCCCCGTCTAAGGATATTAAGTACGAAAAGAAATTCGAGCCAAAGTTAACTGGGCCAACAATACCGCCGTCTTGTCACGATCAAAACTGCCCGTTGAACAAAAACGCTGGCGCAATGAACTTGCACTACCACTGCCCGCACTGCAGCCAAGCCTATGTGGATTTGAAGCTGTTATTCAGTCACATGGTCAAAAAACATAGCAACGCCATCGAACCAGGGCCAGTCGGATTGGCCGCTACAAAGGAAACGCTGGAGAGAGAGTACCCAGAAATTTCAATTCTACCGTCAACAGCGACATCTTCAAATGCCCAAACACCACAACAAAGTCATCGACCTCCGAATCCTGCTGAACAGGTGCAAGCTGTTCAATCCCTACTGCTGCAACAGTACTTGGGTTCAGGTCGCAAATCGCTCCAGGATCAATTGAAGATGCAGCAGTACTCCTCACTGGCAGGGCTGCCTGGATTAGCTCAAGTGGCGTTATTCTCTCAGGGTGGTTCCGCATTCCCTATGTATCCGACTATGTTGTATCCGCCCGAGCTTCTTCTTGAGCAGAGTCTTCTTCAAAATCACGGTCTGCCGCCAGGCCTGGACAAAGAGGCGGAAATGATCGCTAAATCCAGGAGAAGTACTGGAGCTAGAGGACCTCATATGAGGGTTTTAAAGGATGAACCCATACCGGATGGATACTTGCGTTTCCGGTTTAATGAGGATTGCGCCTATCAGCAGTGCGGATATAGGGAACATCAAACTCACTTCCACTGCACAAGAAAGGATTGCGGTTACTCATTCTGCGATAAAACAAGATTCGTCCAACACACGGCTAGACATGAACGTTTGGACACTCTAATGGGTGGGGACTTCCAACAGTACCGAGCGAACGTGTACTGCCAGCGACCAGAGTGCCCTCACGCCTCCACATTCGGCACGGGACAGAACAAAGCTTCCCATTTTCACTGTCTCAAGTGTGACTTCGTATGTACGGACACCAATAAGGTTGTTGCTCATCGCCGACAACATCAGAAGCTCGACTCCATACAGGCCGCTGGCTTCCAGAAATTTCCTCCAAGCAAAGCGTGTGGTTACGAACCCCAGTGCATTCACAGCAAGAAACAGACCCACTACCACTGTTTGCAATGCGGTTTCGCTGTTCTTGGGTTATCGCAGATGACGTCACACAAGTACAAGCACCAGGAGGCGAGCCTCGGACCGTCGACCAGCTCGACCAACTGA

Protein sequence:

>DPOGS216202-PA
MSRGEFKKLRFEVKNVVYNLGSDKKVGLARRGYQGSRAMTTAMAHQESIHERLQSGPEEPLYLHHGASWNSETSEHQEDHSTDMSPYYNMPKRNKRKNFKPRCAPNATTFSDDSNGGNCESSLTNGNDRTTPEQADEETDGYNKIGVKNFGLIKGGAVDLSRRLSTDSNENIESTYQNITDEKKIDSFQRSYIHSLKVNQPAERDHTVLNFSNLQNKTFCAKNPFAISQLIKTNSPPRRRESDSDDEAKGQDISANDRLENQVIGESNEGQQNGDPSNATNRILRDYAMNTMKELLGIYGLRALEALQGKPFTQLPLSLKRRHDSGDGTDRSIRRSSSATEDEGSTNITPPKTPDNDIEAQRQRALRIINQQSMRLFRSGREEDSSDDGDQTLDLSVSRDTDGQEMSTTSSAGNSVSEYDQNMMMRKNLEDLANLQMQNFFQKQSMMDPEDSQEPKLQASGLLSALNHLDQARKNSMQTTVDYSRYVKRYSSTLECGSSYCKDLGYREHFHCMDCTARVFCKKEEMIRHFKWHKKRDESLAHGFMRYSPLDDCSERFSDCPHNRSQTHYHCIQDGCDKVYISTSDVQMHANYHRKDSAILQEGFQRFRATESCSAPHCMFAGQRTTHFHCRRPGCTYTFKNKADMEKHKTYHIKDEALSKDGFKKYMKSEACPYRDCRFSRTCNHIHCIRPHCNYVLHSSSQIFTHKRKHERKEQEMSFGLPTSVIQNALMQGDLSYAMHDDDISVEDYTQAYVSNEIQNQPLVDEVARKYIETFNGAKEAEDKCSKCSAFNKHYHCLAEDCKMAHSCHNTMVKHVMEHEQQSQVTEAYFVTYTRNNPCSNSACQHIKDITHYHCIWENCGAVILSSEEHPFRRLEHYRQHANLSPMSPNLAARFAPTFTPQLSAQLHPNMAAQLGQVTNLPNLPPNLAQLAQMAPSLTSQLTPNLQAQLNLGPNPTIAPQQVNPSSSLDEMFSRKRGRPPKNRVVEVWTDNVTPQAIFTSFKLPKSNQLPPLVQSPIIATVEEFPRIRFQNFEVFTTPDSCIQYYANCALRSTRLHFHCFNCSFTGETHVTMETHMRESHSIAPLLEGFDYFSSFDQCGGKSCFKNQLRSHFHCAQGNNCPAILTQYSALATHKHGRGTPVIKSESEQEKSYEEAKDYSIKERASADSVASMKDQAESFTPTNFSIKSEFEREQDNISVVKATGTFYPSSHLRNSSPSPRSIYDASPSKDIKYEKKFEPKLTGPTIPPSCHDQNCPLNKNAGAMNLHYHCPHCSQAYVDLKLLFSHMVKKHSNAIEPGPVGLAATKETLEREYPEISILPSTATSSNAQTPQQSHRPPNPAEQVQAVQSLLLQQYLGSGRKSLQDQLKMQQYSSLAGLPGLAQVALFSQGGSAFPMYPTMLYPPELLLEQSLLQNHGLPPGLDKEAEMIAKSRRSTGARGPHMRVLKDEPIPDGYLRFRFNEDCAYQQCGYREHQTHFHCTRKDCGYSFCDKTRFVQHTARHERLDTLMGGDFQQYRANVYCQRPECPHASTFGTGQNKASHFHCLKCDFVCTDTNKVVAHRRQHQKLDSIQAAGFQKFPPSKACGYEPQCIHSKKQTHYHCLQCGFAVLGLSQMTSHKYKHQEASLGPSTSSTN-