Monarch geneset OGS2.0

DPOGS213788
TranscriptDPOGS213788-TA3789 bp
ProteinDPOGS213788-PA1262 aa
Genomic positionDPSCF300212 + 699670-710446
RNAseq coverage532x (Rank: top 24%)
Annotation
HeliconiusHMEL0160980.068.90% 
BombyxBGIBMGA009269-TA0.075.74% 
Drosophilakibra-PA2e-13336.77% 
EBI UniRef50UniRef50_UPI00020639E90.048.41%UPI00020639E9 related cluster n=1 Tax=unknown RepID=UPI00020639E9
NCBI RefSeqXP_396884.30.047.74%PREDICTED: similar to WW, C2 and coiled-coil domain containing 1 [Apis mellifera]
NCBI nr blastpgi|3838632800.047.42%PREDICTED: protein kibra-like isoform 1 [Megachile rotundata]
NCBI nr blastxgi|3838632800.049.34%PREDICTED: protein kibra-like isoform 1 [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL10985 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213788-TA
ATGTTCCCTCTACCTGAACTCTCAGCCACCAGATGTGCCACATCTCGGATCAGACATTCCGTACTGAAGTCATATGGTCGCGATGGAGCCGGGAGCGGGCCCTTCGGAGAGATCGACCTCCGAAATGCGCCCGAGATTGCATTCTTTCGGCTGCGCGGGCGCATTCCACTGAACGGCCAGAGCACCGCGCGCCCGCGCCAGTCGACGCGCGCCCTTTTGGGGGTTAAGATGTGTCCGTCGCGGCGGGCGATCGCTATGTGGTCCAAACAACCGCCTCCCGAACTAGAGACTTTCCGCGCCCGAACAGCTTCCTTGACACACACGTGTTACAGGGGATCCTATGGTTCAGGTCATATAGCAAGGATGGTCCGCGAACCTCCCGAGTATGGAGTCCGTGTAGATCCAACCTCCATTCGTCCTCCCGGCCTTCGCGTTCGCGATGAGCCGCAGTATCAAAATTTGGACGAACTCCGCAAGAATCCGGAATACGCTAATCTGGATGAGATCAGTCGCGAATACGGTTATCGGCTCTGTCCCGAAGGTCAGAATCTTGAGGACGAGGCTATATACGAGAACATATTATCCGTGCGTCAGATAAGTTCCGCGGAAGTGCGTCTAGTGCCTGTGTCTGAAGTGCCGTACTACGAGGATCAAGGTTACGTGGCTACCCAAGTATTGAGCAGAAACGGCACCTTCCCTCAATGTCCTCCGCATACCTGGAGCCGTCTCCAGCAAGCCGGCAGGATGACTGGCTCCTTGAGGCTCTTCACTGCGAAGCGGGAAATGTACGATGTCAAAAAACAGAGACTATGTCTGGCTCAAGATGAGTACAAGCACCTCAACAACGCGCTGTCCACACTCGGAGCGTCCAGGACTAGCTTGTGTTCGTCAACTACATCAGTGACAACAACGTCGACGACTTCTCGTCATGACCCCGACCAGCTTCGTGTTGAGGTCACTCAAGCCAGGGGTCGGCTGGCTCAGCTCAGGAAAGAGTTGAGACAAGCAAGGGCGGAAGTAGCCAGCGCGAGACGCGGTTTTGACACGTTGGCTGAGGTAGAGCAAAAACTTAGCGCTCAACAAGGTTGTTACAATATAACAGAAGCTCAAGCTATTATGACTGAACTGAAGAATATACAGAAATCTCTCACTTCCGGTGAGAAGGAGAGAGCAGATCTTATGCAATCTTTAGCCAAGCTAAAGGATGACTTAACGAGGCTGCAATTAGGTGACGCTTCCCCGGAACTGTCCACTTTAAGTTTACCGCAGGAGAAGTTAAGCACTGCTTCCCAAACGGATCTTTGTGCGGATTTAGTACCAATAGGCACTAGACTAGCTGAAATGGCGCGGGTTCGCCTACAATACGACGAGGCTAGGAAGAGAATTCAACAGATCCAGCAACAGCTGGCTGATCTGGAGGACAAAGTCCAACCAGGTCAGGCGGAGTCTGATAAAGATAGGCTCTTACTGTTCCAAGAGAAGGAGCAATTACTGAGAGAATTGAGGAGTATCACACCAAGGACGAGGTCGAAACAGGAAATGAGCGACATCCAAACTGAATGTAAGAAATTGGAGCAAGATCTGAAGAATGCTTTCGAAATGTCCAACAAGTGTATAGCGGATAGGTTAAGGCTGCACGAAGAGAAACAGCTATTGTTGCAGCAGTTAAAGGACGCTTTAACTTCTATGACTGCGTTGGAAGGACAATTGAAGACTTTATCAGCTTCAACGTTGTCAGTGTCTAGCAGTTCCAGCTTGGGATCGCTGTCCACAGCGAGCAGTAAAGGCTCACTAAGTTCCGGGATAAGCTTCACGGATATTTACGGTGGACCACAAATAGCTACGTCCTTCCAAGCAGACAAACCAATTGATATGGTGGATCTTCATAGGCGAGTTGAAAGGTTACTCCGGGGTTCGTATGCTGAGCCTCTCACCAGCTCGCCGTCACAGCCGTCTTTATCACCACGGAGTAGTCTCTCCTCAGCGTCACCACCACCTCCACCATCCTATCATCAGGTGGAAAGACAAAGGCGACAGCAGAAGGAGTTGGAGGACAAACTGGCTGAGATGAGAATCGGCGTCGCTACCAGCCTCAGTGAAGTCACGGGACTTTCAACAATTCCAGTACAGCTACAGGGTCCGGGTCGGCCGGCGGAGCCTCTCTCTCCAATATCGGAGACGCCACCAACCGCTTCCTCTAGTGGCACGAACACTAGATCGGTATCAGCGGCTGTCAGTGATGAATCCGTTGCTGGTGACTCAGGGGTGTTTGAGGCGGCGCAGGCCGGGGAAGCCGGGTGCGTGAACAGTGCTCAAATTGAGATCAAATTGCGTTACTGTTCGGACGAGAGCGCATTAGAAATAGGCATCCTGCGGGCGAGGAACCTCCACGCGCTGTACATAGACGTGGGAACCGAAGTGTGTATCCGCGGAGCTCTGGTGGTAGGTGGTGGGGGTTCCGTGTCGTTCACGAGTCGCCCTCTGGTGTGGGCGGGGTCAACGCTGCTCTTCCGCTGGCAGCAGCGAGCAGCGCTGCAGCAGAGAGCACTGCAGGGGGGCACGCTACAGGTCAACGTCGCTGCCGGCGCTGAATGCCTGGGCTGTACACAAGTCAGTCTGGCTGACTTCGACCCTGACACTGTGTCACAGAAGTGGTACAACGTGCTCAGCTTCCGAAGCATAAGGAGAGATGAGAGCTCAGACGAATCCACTGTCATATCGTCTCAGACCTCCACACTAACTAGGAACAGAGGTCCGGAGAGTATGGCCGCAGCTGAATGTAACGCTGATTGCTGTGATAACTCAGCATCCGAAGACGAGGAATCCAGGGAACCGCTCAATCAGATAGTTGAGGAGGACTCCTTTGAGGACTACATTCCTGAAGAGGACATACATTTGGAGGACGAGTACCTCCCGTCCACAGCGGAAAAGGAAACCAACACCGAATGCAACTTCTGTCCGGAGGGAGCGAGACAGTTGCACAGACGAAAAAGTCAACAAGTAAGCTCAAACACAGAAGAGTCGCTGGCGACTATTAAACGATCTCAGACGTTCTCACCGCAACAGGGTGTTGGGAAGGGACAGTATTTATGCAGGCTCAATCGTTCCGAATCTGATTCGTCGATGTCCGCGCACCGCCGGTTGACGCTGCGGCCGCCTTCCCTCGGGATACCTTTCGATAGGAGCGTCAGGGAAAGAAGATCTTTAAGATGGTCCAAATCCGCTGGCCGTCCTTCTCGTCGCTCGGGTCGCACCTCGCTAGACCTCAGCCTGGACCTCCGAGCGCAGCACGAGAGACTCACGGATCTTAGAAACGAGATCGCTCATCTCACAACGCTCAAGAAGAGGATTGACTTTAAAAGCAATGACCCCGCTGTAGCTTCGTGGGTCACCGAGGACGAAACACTACAGAAGCTGTTATCAGCGCCGGCTGATAAGGATCGAGTGGCGAGACTCTTCACGAGGACCTGCAAGGAGGTGTTCAGATTGAGGAAGTCGAGGCAGGGCGGGCGGAAACCGGATCTGGTGACGTTCAAAGAAAAAATGGCGTTCTTCACTCGCAGCAAGTCGATCACGATAGCCGACGACGAGGAGCTGTCCGACGATTGGGAGGAACTGGAAAGACGGACTCCCGACGGGAGGTCTTCCAAGGTGTCGTACGAGAGAGGCCTGGTCGTGGAGTGCAAGAACCCCTGCATCCCCGACAAGATGGACGGCATCAGTATAGACGGCAACAACGACGACGTCTACGAGTACGTCGTGGACAGGGCGCTCGGGGTCCAAGTCTGA

Protein sequence:

>DPOGS213788-PA
MFPLPELSATRCATSRIRHSVLKSYGRDGAGSGPFGEIDLRNAPEIAFFRLRGRIPLNGQSTARPRQSTRALLGVKMCPSRRAIAMWSKQPPPELETFRARTASLTHTCYRGSYGSGHIARMVREPPEYGVRVDPTSIRPPGLRVRDEPQYQNLDELRKNPEYANLDEISREYGYRLCPEGQNLEDEAIYENILSVRQISSAEVRLVPVSEVPYYEDQGYVATQVLSRNGTFPQCPPHTWSRLQQAGRMTGSLRLFTAKREMYDVKKQRLCLAQDEYKHLNNALSTLGASRTSLCSSTTSVTTTSTTSRHDPDQLRVEVTQARGRLAQLRKELRQARAEVASARRGFDTLAEVEQKLSAQQGCYNITEAQAIMTELKNIQKSLTSGEKERADLMQSLAKLKDDLTRLQLGDASPELSTLSLPQEKLSTASQTDLCADLVPIGTRLAEMARVRLQYDEARKRIQQIQQQLADLEDKVQPGQAESDKDRLLLFQEKEQLLRELRSITPRTRSKQEMSDIQTECKKLEQDLKNAFEMSNKCIADRLRLHEEKQLLLQQLKDALTSMTALEGQLKTLSASTLSVSSSSSLGSLSTASSKGSLSSGISFTDIYGGPQIATSFQADKPIDMVDLHRRVERLLRGSYAEPLTSSPSQPSLSPRSSLSSASPPPPPSYHQVERQRRQQKELEDKLAEMRIGVATSLSEVTGLSTIPVQLQGPGRPAEPLSPISETPPTASSSGTNTRSVSAAVSDESVAGDSGVFEAAQAGEAGCVNSAQIEIKLRYCSDESALEIGILRARNLHALYIDVGTEVCIRGALVVGGGGSVSFTSRPLVWAGSTLLFRWQQRAALQQRALQGGTLQVNVAAGAECLGCTQVSLADFDPDTVSQKWYNVLSFRSIRRDESSDESTVISSQTSTLTRNRGPESMAAAECNADCCDNSASEDEESREPLNQIVEEDSFEDYIPEEDIHLEDEYLPSTAEKETNTECNFCPEGARQLHRRKSQQVSSNTEESLATIKRSQTFSPQQGVGKGQYLCRLNRSESDSSMSAHRRLTLRPPSLGIPFDRSVRERRSLRWSKSAGRPSRRSGRTSLDLSLDLRAQHERLTDLRNEIAHLTTLKKRIDFKSNDPAVASWVTEDETLQKLLSAPADKDRVARLFTRTCKEVFRLRKSRQGGRKPDLVTFKEKMAFFTRSKSITIADDEELSDDWEELERRTPDGRSSKVSYERGLVVECKNPCIPDKMDGISIDGNNDDVYEYVVDRALGVQV-