Monarch geneset OGS2.0

DPOGS200161
Transcript: DPOGS200161-TA, 3321 bp
Protein: DPOGS200161-PA, 1106 aa
Genomic position: DPSCF300128 + 192082-200721
RNAseq coverage: 301x (Rank: top 37%)
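The lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the transcript length counts the coding sequence plus one stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated in the record.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding-sequence length in bp implied by a protein length in residues."""
    # Three bases per residue, plus one stop codon if it is counted.
    return 3 * protein_aa + (3 if include_stop else 0)


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
print(expected_cds_length(1106))     # 3321, equal to the listed transcript length
print(genomic_span(192082, 200721))  # 8640
```

Under these assumptions the 1106 aa protein accounts for the full 3321 bp transcript, while the locus spans 8640 bp on the scaffold. The difference would be consistent with intronic sequence, although the record does not list the exon structure.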
Annotation
Heliconius: HMEL005834, E-value 0.0, identity 76.04%
Bombyx: BGIBMGA002912-TA, E-value 0.0, identity 73.60%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_D6X1B4, E-value 2e-102, identity 43.47%, Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6X1B4_TRICA
NCBI RefSeq: XP_972982.2, E-value 7e-103, identity 43.47%, PREDICTED: similar to Nck-associated protein 5 (NAP-5) (Peripheral clock protein) [Tribolium castaneum]
NCBI nr blastp: gi|270013063, E-value 7e-102, identity 43.47%, hypothetical protein TcasGA2_TC011613 [Tribolium castaneum]
NCBI nr blastx: gi|242022892, E-value 5e-118, identity 33.09%, conserved hypothetical protein [Pediculus humanus corporis]
Group
KEGG pathway: (none listed)
Orthology group: MCL17974, Insect specific

Nucleotide sequence:

>DPOGS200161-TA
ATGGCTCTAACGCCGCGCTCGAATTCCGTAACCAATGCATCAGCGTCGCACTACCAAAGCCAACTCTCAGTTCTGTCTGCTGAGAACGAGCGTCTCCGTGAAGAACTGACAGCGCTGTCGGCTGGCGTCCGTGACTCTGAACACGAAAAGAGGCTAGATGATGTGGCGCAGCAGGTCGTGCGAGCTTTACTCTCGCAGAAGAGTGTTCGTGAGGAGTTGGGCTGCGCTCGCGCACGTATCCGCGAGTTGGAAGCTCAGAACCGAGCGTTGAGTTCGCTCCTCGTACGACAGCTGAGACCACAACCGAGACCCTCGCCAGCCACACCGCTCACACCACACACCAATAGGGACCTCCAAGTACACCTGGTGGATGTCGGTTCGTGCGGCTCGTTGGTATCGTTCAGGGACTCCCCTCCCCCCGCGCCGCCGGCTCCTCAACCTCACCCCCTCAGCCAGGACGACAAGCGGAGACATCAAATACTAGCTGATATCTGGACTGAGCTGAAGGGTCTGGAGGTGACTCCCGCCAACTTAGCGCGCGCGTTGTCCGCGGTGGACCCCACGTTGTGGGCACCGCCGGCGAGGCCCGCGACGCTCAGCCTCAGCGTGCTCCAACCCACCGAACAGACACAAAGTGCTACGAGGGGGAAAGCGACGGAAGAGGAAACCGGCGAAGCGGGCGTCGAGTCACCGGAAAGCGGGGCGAAGGACGAGGGTTACTCGACGATGTCTAGCGACGTGCAAGCTGACGCGTCACGACAGAGTGACCACGTGGGCGACCCCCTGCCGGACCTCAACGAAGCCTCCGACGAAACGGACAACCAGACCATCGTTTCCATCAACCCCAGAGAACCCCGGCGTCGCGCCAGACTGATAGCTGAGGCTGATTATATATATTTTCCTATAGGTGTAGCATTCGCTGGTATAAGAGGCAGCTACCCGCCCTCGCGGCCGGTGTTACCTTTCCAGCACGTCGTGAGAAGTTTCTCAGACTCTCATTTGTGCTTAAAGTTATTGACCAGCACGTCGTGTCCTCCGAGCTGCTTGGAAACACCATCGCCGAGCTCGGGCATCTTAGTTTTAGATCTGAAACCTGCTCCAGAGAGGCCACTGAGACGGCCAGCTGTAGCGTCTACCACGAGCTCTGAGAGAGTGTCGTGGGGCAGCACCATCGATGAGCGTGCTGACGTCTCACAGTACGACGCTGATTACGTTCAGCATTGGCTAGAATTAGACGACGCCAGATCTGCACTGCAACAAAGACATAGGGATCTCGCAGACTTGGAGTACGATAGAGCGGAATTAGAAGACTGGAGTCTGTCACTGTCGTGTGAGGATCTTAGAGACAGACAATCTCCGTTTGCGGAGATAACTACCCCCGGACAGATATCTCATTCAACATTACCAAGCATCAGGGAAGACGATGCGCTAGAGCTGGAGGAGGACGTCGGTGATTGTTTGTGGAATGACTGCGGATTCGCGACGGTTGAAATCGATGAATGCAGAATAGGCGACGAAGTTGAAAACTCAGAGAAAAGATGGGAGTACACGGGAACACATTCCCCGGGCGGATCCTGGTCCAGCGCATCCGATGCTCCTGAAAAGCGATCTAGTACAGCTTTGAGTGAAGACGGCGACTGCGCTAACATAGGACTCGATTTTACGAGGGATTTTTACAGACTCGTCAAATATGAAAGCACGAAGAGTTTAGCATCCAATTCATCGAAAGGTGTCACAGCTCAGGATCCAGCAAACCATTTAAGAATAACGGATGTTCAGACTGTGGGTTTGCAGGATCGTGAACAGGCACTTCAGAATGTTCTCAATTTTATAGCAGAACAGCAGAAGTACTGTCGTGACAGAGAAGAATCCGATTCTATGTCTTCTCGTCCTGTGTCCGAAATACGCGAACTTCCACCTCCGTACGCCGCCGCTGATTTTGACGACGAGTCCGTAGATCCCCGCAGTGAAATATCAGAAGACAGGCAGAGACCGGATTC
CTTTGGCAGTTTTTCTGAAAACGATTCATGCGACGTCATCCCTCTAGACAGAACGAGACTCCCCGTTTGCGAAAATATATCCGAGCCGAGATCGACACCTAGATTCATAGAGCGCGAAGATCCTTACGTGGAATCTGAAGACTACTACGACAGGTCGCGCGTCGAAAATGCGGAAAATGAAATCGATGCTCACCATCTTTTAAAGGTACAGCGAAAAAATGAAATTAATAGAAATATTGACATCAATAACTTAGCCGACATAGAGCCGCCGAGCTCGGAGAAAGACGACGAAACTTGTGATAACGAATCTAGTCGAACCTTAGAACATAACTCGGCGTTGAAAGACAATACTGTCAACATCACGTCGAGCAAAGAAGGGTCTAGCGAAAGAGAGACGGAGGCGTCCCTCGCCAAGTCCAGCAGTCTTCATAGCGCTGTGGAAAGTGAAATATCCGTCGTCGACGAAACGTTGACTATTTGTAGAAGAACGTCACTCGGCACCGTGCCCGAAGAAGAGGAGAGCTCCTCGCCCGAGACGAGTTCTCCGCAAATGACTGAATCAAACACAACGAGCACGTCGACAGCTGAAACTGTGATAGTTAGTAATAAGAATGAGAGTTTCAACAGAGAAGTCAGGCGCAGAAACGACAAAAGCCGGATACCGACTCTGACGGGCGGCAAGCGACCGCCGTCCTCACCGCACAAGGCGAGGTCGAAGATCCCAGTCTCGGACAGAGGCAAACCAACCCAAAAACAAGCGACGCCGCCCCCAGAACCCATCATCGTGAAGCAAGAAAACACACTGAGCTTTCACGAAGCTGCTACCTCGAAGGAGGTCATAGAAGAACTTAACAGGATGATTCGCCAAAGCGAAGGTGCAGCGACAACGACCGACGTGAAGACCGAAGAGGGCCAAGAGAAACCGTACGGACAAAAGGATAGTGCGTTATGGGCGCCCACGGGTTGGGTTCATGTCGAAAAAGACATCGACTTCAGTGACCCAAAGGCGCGCGCTAATCTTCTGGACGTGATGCTGGCCTCAAGTGACTCGTCTCCATCGTCCTGCGGCTCGTCGCCGGCGGAACAGCCGCCCTACTCCCGCCTCCACCGCCTCCACCGCTCAAGACGACAGAAGACCGCGGCTGCGCTGCGAGTCCGCGGCCTGGGAGCCCTACGGCACGCCAGGCACCGCCGCCCCTCCATACTCGGCCGCGACGGCTTCTTCGTCCGCTACGCCGAGCCCGAGAAGGCCGCCGTCGCCACGTTCGACTTCCTCGATGAGCTCTCGGCCGGATCCTCGCCTGACTCCAAACACAAGTAG

Protein sequence:

>DPOGS200161-PA
MALTPRSNSVTNASASHYQSQLSVLSAENERLREELTALSAGVRDSEHEKRLDDVAQQVVRALLSQKSVREELGCARARIRELEAQNRALSSLLVRQLRPQPRPSPATPLTPHTNRDLQVHLVDVGSCGSLVSFRDSPPPAPPAPQPHPLSQDDKRRHQILADIWTELKGLEVTPANLARALSAVDPTLWAPPARPATLSLSVLQPTEQTQSATRGKATEEETGEAGVESPESGAKDEGYSTMSSDVQADASRQSDHVGDPLPDLNEASDETDNQTIVSINPREPRRRARLIAEADYIYFPIGVAFAGIRGSYPPSRPVLPFQHVVRSFSDSHLCLKLLTSTSCPPSCLETPSPSSGILVLDLKPAPERPLRRPAVASTTSSERVSWGSTIDERADVSQYDADYVQHWLELDDARSALQQRHRDLADLEYDRAELEDWSLSLSCEDLRDRQSPFAEITTPGQISHSTLPSIREDDALELEEDVGDCLWNDCGFATVEIDECRIGDEVENSEKRWEYTGTHSPGGSWSSASDAPEKRSSTALSEDGDCANIGLDFTRDFYRLVKYESTKSLASNSSKGVTAQDPANHLRITDVQTVGLQDREQALQNVLNFIAEQQKYCRDREESDSMSSRPVSEIRELPPPYAAADFDDESVDPRSEISEDRQRPDSFGSFSENDSCDVIPLDRTRLPVCENISEPRSTPRFIEREDPYVESEDYYDRSRVENAENEIDAHHLLKVQRKNEINRNIDINNLADIEPPSSEKDDETCDNESSRTLEHNSALKDNTVNITSSKEGSSERETEASLAKSSSLHSAVESEISVVDETLTICRRTSLGTVPEEEESSSPETSSPQMTESNTTSTSTAETVIVSNKNESFNREVRRRNDKSRIPTLTGGKRPPSSPHKARSKIPVSDRGKPTQKQATPPPEPIIVKQENTLSFHEAATSKEVIEELNRMIRQSEGAATTTDVKTEEGQEKPYGQKDSALWAPTGWVHVEKDIDFSDPKARANLLDVMLASSDSSPSSCGSSPAEQPPYSRLHRLHRSRRQKTAAALRVRGLGALRHARHRRPSILGRDGFFVRYAEPEKAAVATFDFLDELSAGSSPDSKHK-
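As a quick consistency check between the two sequences above, the following sketch translates a nucleotide string with the standard genetic code. It is a minimal illustration, not the pipeline used to build the gene set: the `translate` helper is hypothetical, and writing the stop codon as `-` is an assumption chosen to match the trailing `-` in the protein record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; stop codons are written as "-"
# (an assumption, to match the trailing "-" of the protein record above).
AMINO_ACIDS = (
    "FFLLSSSSYY--CC-W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str) -> str:
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    seq = "".join(nucleotides.split()).upper()  # drop line breaks and spaces
    usable = len(seq) - len(seq) % 3            # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First and last codons of DPOGS200161-TA, copied from the record above.
print(translate("ATGGCTCTAACGCCGCGCTCG"))  # MALTPRS
print(translate("AAACACAAGTAG"))           # KHK-
```

The two fragments reproduce the start (`MALTPRS`) and the end (`KHK-`) of DPOGS200161-PA. Passing the full transcript, with its line break removed, to the same function should yield the complete protein if the record is internally consistent.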