Monarch geneset OGS2.0

DPOGS204490
Transcript | DPOGS204490-TA | 3471 bp
Protein | DPOGS204490-PA | 1156 aa
Genomic position | DPSCF300002 + 1228875-1260440
RNAseq coverage | 472x (Rank: top 26%)
Annotation
Heliconius | HMEL014689 | 0.0 | 78.10%
Bombyx | BGIBMGA007834-TA | 0.0 | 87.18%
Drosophila | sick-PB | 7e-60 | 42.49%
EBI UniRef50 | UniRef50_F4WXJ5 | 1e-72 | 36.27% | Protein sickie (Fragment) n=1 Tax=Acromyrmex echinatior RepID=F4WXJ5_ACREC
NCBI RefSeq | XP_967205.2 | 9e-79 | 56.34% | PREDICTED: similar to neuron navigator 2 [Tribolium castaneum]
NCBI nr blastp | gi|189241497 | 2e-77 | 56.34% | PREDICTED: similar to neuron navigator 2 [Tribolium castaneum]
NCBI nr blastx | gi|189241497 | 7e-136 | 37.75% | PREDICTED: similar to neuron navigator 2 [Tribolium castaneum]
Group
KEGG pathway
Orthology group | MCL17381 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204490-TA
ATGGGTAATACAAATTCGGGGCAAGGACATTCACGTCATCACAACAAGAGTAGGAAATCGAGGAGTTTGCCAACTTCGCCAGAAGTGAGAAGACAAGTCTCTGTGCTGCAGACGTCTATCCCTCTGCCGGCGTCTGCGGCGGCGGTGAGACGCGCTCCACCCGACAAACGACCTCTGCCAGCGACCCCTGTACACGGAAAGAGTGGGCTCTCAGCGAGCTCAAGTCGGTCAACAAGTCCTCTTGGTCAAGGAGCAGTAAGTTTTATACCTCGGGCGCCTGCATCTTCTTCAGGTTCCAGACCTAATTCGTCGTTACTTGTACCCGGTAGCAAAATTCCTTCGGCCTTATCCCAACCTCAACATTCGCAAAACATAACTCATCAAAATGGTGCTCCAAATAAACAGTCCATGCTAGATAAGTTAAAACTCTTTAATAAAGACAAAGTTAGTAGTGAAAAACAAAGCAGCAAAAGTACCGCGGTATCAAAACGTACAAGTTCATCAAGTGGATTCTCATCAGCAAAAAGCGAGCGATCTGATTCTAGTTTGAGTCTAAACGAATCTGCGAATGCTACAAACACACATATCAAATCATCTAGTTTGATAGGACCTAAAAGTGTCAGACCACACAATGATACATTAACAAAAGATAGATCTGGTAAAAACGTGAAATCTAAACTAGTAAATTCGAAGATTGTGAAGGAATCATCGACAACTTTAAATAAAACAAATGTCAAATCAAGTAATGAAAAGTCAAGAAGTAGTCCGAAATTACCTGCGCGTGACAAAGAATCGAGGTTAGCTGCTCCAAAGTCGGTTAGTAATACGAAACTAAATCAAATAGAAGACCATTCAAAAAGTCCGAAGTCGAGACTGGAGTCGAAAATGGTGAAGTTGTCTGGGAGTCAAATGAAACTGTCTGAACGAACTGAGAATAGTGACGTTAAAACTTATGAGAATACAAAAAATCATTCTCATCAAATACAACCACCTTCACCAACTAGTGCTTCCCAAGTGCCTCTTCAAAATCAGACAGGTATTCCTAAACCTACTGCAGCAGTAAAAGGGACAACGAAAATATCAAAAGATGAAAAACACGGTATCCATAAAAGTACAAATAGTCAATTAAGTCCTATTCAAAGTAATTCTAGCTTAAACTCTACCAATAATGTTAGTGCTCTTCCTAAGGAAGTAAATCAGAAACAAACTCTCGCTGTCTCTCCTATGCCTGTTATAAACAGTGGAAATCAAAATCCACCATCCCAAATGTCTGAGAGTTCGCATTCAAATTCAACACATAGTACCACTGGCCAACAATCTAATTCTAGTGATGGCAGTGTCATATACCGCCCATCAAGTGAATCGGGATCCGAAATTTCAAAAGGAAGCAATTTAGTGTGTAACAAAAGAATAGATATGAATACAACATATATCAATGACGTAATAAATGAAGCAGAAATATCAGAAAAAGAAGCTGCACAAAAACGAACTACTGATAAACCTAGTCCCAAGCCAATATTTGATGCTAATAAAACTCTTACGGAATATGATAAGAACGATAGTCGTTCTAGTACCCCGTCTCATTGTAGGGATAACTCCCTAGGAGAAGATGAAAACCCTTTAATGAATGTTCTACCGATGAGACCTTTACTCAGGGGATATAATAGTCATTTAACTTTACCAATGAGAACAACTGGGTTGGCGCAAAAAAATATACCTGGATATCCCCATCATGCAAATACTGTCAAAGCTAATTTTGGTAGGGACAACATGGCATTGCGTGAGCGAATAAATTATGGGCCTGGATTTTCTAACCCTGACTATTGTGACCTTGATATTGCCTCTGGTTACATGTCCGACGGGGATTGTCTGAGGCGAATAAATGTAAATGAAATGGACTGTGAACGTAATAATGATATTATGGACGGATATATGTCAGAAGGTGGAGCATCTTTATATGGCCGGAGAATGAATTATCAACAATCATCACAGTTCCAACA
ACTTGATGAGAGACGTGGTCGTAGAGGCATGGAAGGCGGAAGCGGTGTGGTGTACCGAGTGGTAGGCCGTAATCGCAGTAAGGCTGACTGCGGCCAACAAACCGAACGCCAACCGCCAGCCCCCAGACAAGATACCACCTGGAAGAAATATACCGACTCCCCTGGTGTTGGAACTCCACCTTCGAACCAGCCAACACCAGCTCCACCAAGTCCATCTCATGGTAGGAAAGGCGAACGACGTGCTGGACACCATTCGCCACAACATCACAAGAGAGAAAAGCTCACTGCTGCCCAACAGTTGGGAATCGCGCCACATCCTCAATATCCAGCTTCAAGCCAATCAAGTCAACATTCGTCAAGAAGTGGAGGATCGCAGTTACAGTCACCTAGTGGTTCATCGCGACCGTCCAGTGGGAATGGTAGTTGTTCCAACAAAGCCAAGGTCCCACAGAACTTTGGTTACGTCAAAAGACAAAACGGCCAGCCACCTCCGCCACCTCCAAATGGACCTCCACAACATGCACACGGTGGAAGAACAGCTCAAGTATCAGCAGTGCCCAGAACAAAAGTTAAAGTTTCTGGCGGAACACAAACATGTACTCAAGACCTCCAAATACACAAGAATGGCATAGGTCCGAAGTCCTTCTCCCTCCAAGGCACGGCGGCGGCACAGCTCTCTGCATCAGTTCGTGAGAGACTCCTCGGCTCACAGTCACTACCAAAGCCAGGGACACATGAATTCGCCGCCCTCTTTCATCATCATAGGCCGTCGCCGAGGGGAGGTATGAAGATCAGCGATGGAAGTCTCTCCGATACACAAACGTACTCCGAAGTGAAATCCGACTACGGCATACCATACGCTCCATGGCTGAGACATAGTAATACATACACAAGTGGAAGGCTGTCTGAAGGCGAGTCTATGGAGTCGCTAACGTCGCTGCACTCCGCGCAACACACACAATCCCCTAACTCACGCAGCTCACTCACACATAACAAGCTCATCATGCATCGAGACGCACAGAGTACGAGATTGAACAGGAGCAACAGCATCAGGTCAACGAAATCCGAGAAACTATACCCGTCAATGCTTCAAAGGTCGTCCGAGAGTGACTATGAACCGTACTACTGTTTACCAGTTCAATATGGACCGAGTGGACAAGGTATAAGTTACGGCGTGTCCGAGCCGCCGTCTCCGTCGCCGCGATCAGCTCTGAGCCCAACACATGCGCCTGCCATCCACACGCCGAGACACTCACACCATTATCCCAAGAAAAATGACGACGTTCACGGTTCAACGGCGTCTCTGGTATCAACGGCGTCATCACTAGCGGCCGGCGCGGGATCAGACGAGAGACATAATCATGAGGTTCGAAAGTTACGGAGAGAGCTGGCCGACGCGAAGGAAAAGGTGCACACCCTGACGACGCAGCTGACCACCAATGACGAGGTGGTCTTTACTGAATCGGCTTAA

Protein sequence:

>DPOGS204490-PA
MGNTNSGQGHSRHHNKSRKSRSLPTSPEVRRQVSVLQTSIPLPASAAAVRRAPPDKRPLPATPVHGKSGLSASSSRSTSPLGQGAVSFIPRAPASSSGSRPNSSLLVPGSKIPSALSQPQHSQNITHQNGAPNKQSMLDKLKLFNKDKVSSEKQSSKSTAVSKRTSSSSGFSSAKSERSDSSLSLNESANATNTHIKSSSLIGPKSVRPHNDTLTKDRSGKNVKSKLVNSKIVKESSTTLNKTNVKSSNEKSRSSPKLPARDKESRLAAPKSVSNTKLNQIEDHSKSPKSRLESKMVKLSGSQMKLSERTENSDVKTYENTKNHSHQIQPPSPTSASQVPLQNQTGIPKPTAAVKGTTKISKDEKHGIHKSTNSQLSPIQSNSSLNSTNNVSALPKEVNQKQTLAVSPMPVINSGNQNPPSQMSESSHSNSTHSTTGQQSNSSDGSVIYRPSSESGSEISKGSNLVCNKRIDMNTTYINDVINEAEISEKEAAQKRTTDKPSPKPIFDANKTLTEYDKNDSRSSTPSHCRDNSLGEDENPLMNVLPMRPLLRGYNSHLTLPMRTTGLAQKNIPGYPHHANTVKANFGRDNMALRERINYGPGFSNPDYCDLDIASGYMSDGDCLRRINVNEMDCERNNDIMDGYMSEGGASLYGRRMNYQQSSQFQQLDERRGRRGMEGGSGVVYRVVGRNRSKADCGQQTERQPPAPRQDTTWKKYTDSPGVGTPPSNQPTPAPPSPSHGRKGERRAGHHSPQHHKREKLTAAQQLGIAPHPQYPASSQSSQHSSRSGGSQLQSPSGSSRPSSGNGSCSNKAKVPQNFGYVKRQNGQPPPPPPNGPPQHAHGGRTAQVSAVPRTKVKVSGGTQTCTQDLQIHKNGIGPKSFSLQGTAAAQLSASVRERLLGSQSLPKPGTHEFAALFHHHRPSPRGGMKISDGSLSDTQTYSEVKSDYGIPYAPWLRHSNTYTSGRLSEGESMESLTSLHSAQHTQSPNSRSSLTHNKLIMHRDAQSTRLNRSNSIRSTKSEKLYPSMLQRSSESDYEPYYCLPVQYGPSGQGISYGVSEPPSPSPRSALSPTHAPAIHTPRHSHHYPKKNDDVHGSTASLVSTASSLAAGAGSDERHNHEVRKLRRELADAKEKVHTLTTQLTTNDEVVFTESA-
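
The stated lengths are consistent with a complete coding sequence: 3471 bp = 3 x (1156 residues + 1 stop codon). The following is a minimal Python sketch of how such a check might be run on FASTA text like the two records above. The helper names `read_fasta` and `lengths_consistent` are hypothetical, the toy records are illustrative and are not the monarch sequences, and the sketch assumes that a trailing `-` or `*` in the protein marks the stop codon.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first token after '>' as the record id.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def lengths_consistent(cds, protein):
    """True if the CDS length equals 3 * (residue count + 1 stop codon)."""
    # Assumption: a trailing '-' or '*' in the protein marks the stop codon.
    residues = protein.rstrip("-*")
    return len(cds) == 3 * (len(residues) + 1)


# Toy records (hypothetical): Met-Gly-Asn followed by a stop codon,
# with the nucleotide sequence wrapped over two lines.
toy = ">toy-TA\nATGGGT\nAATTAA\n>toy-PA\nMGN-\n"
recs = read_fasta(toy)
print(lengths_consistent(recs["toy-TA"], recs["toy-PA"]))
```

Because wrapped sequence lines are joined per record, a nucleotide sequence that spans more than one line is handled the same way as a single-line one.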