Monarch geneset OGS2.0

DPOGS213297
Transcript: DPOGS213297-TA (5250 bp)
Protein: DPOGS213297-PA (1749 aa)
Genomic position: DPSCF300479 + 24566-32920
RNAseq coverage: 51x (Rank: top 70%)
Annotation
Heliconius       HMEL007120         0.0      87.26%
Bombyx           BGIBMGA011636-TA   0.0      75.44%
Drosophila       CG32333-PA         1e-146   53.95%
EBI UniRef50     UniRef50_Q9W0I1    2e-144   53.95%   CG32333, isoform A n=10 Tax=melanogaster subgroup RepID=Q9W0I1_DROME
NCBI RefSeq      XP_001600120.1     1e-150   55.81%   PREDICTED: similar to CG32333-PA [Nasonia vitripennis]
NCBI nr blastp   gi|307207170       7e-151   59.47%   Protein FAM135B [Harpegnathos saltator]
NCBI nr blastx   gi|195376523       1e-153   40.25%   GJ12143 [Drosophila virilis]
Group
KEGG pathway
InterPro domain  [1479-1670]  IPR007751  6.5e-57  Domain of unknown function DUF676, lipase-like
                 [33-89]      IPR022122  7e-15    Protein of unknown function DUF3657
Orthology group  MCL11695  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS213297-TA
ATGGAACTGAAACACCAAGTATTTGGTTCAGTGGATACTATACTTGATAGCCGCAACCTTAAAGAGTCCTTGGAAAGGGCGGAATGGAGTCTTGGTGTTGAACTCTGGTGCAGTGAGGGTGCCCAGTCTAGTACTTTGACTCCTGTCTCCTGCCGCGTGCTGAAGCTGCACTTTCAGCCTTCCCAGGGCTTGCACTACCATCTGCCTGTGCTGTTTGACTACTTCCATCTCTCTGCAGTATCTGTCACCATCCACGCTAGCTTGGTTGCCTTACATCAGCCGTATATCAACACGCCCCGCTCTAGCAAACAGTGGGGTAAACTGATGAAGAGTGCAGGTTCAAGCTTCAATGCTACTGGCAGCTTGGAGAACATTCTATTTGGATCGAGTGGAAAATGCGCTGGAGGAAATTCAAGCAAAATAGCTCACGCCAGACAAGTTCATGCTGAAGTTTGCGGTATATTATTGGGTTCATTGGAAGGGCTTCAGAGTGCATTAACTGTGTGCGGTTCAACTGGAGTGTTGTTGACTCCTGAAGAGTCTGCTTCCAACTCTGTGGTCGGTGAAGTAGCACAGCGGATAAAAGACTTGAGTGATTTGGGCAAAAAGTCTGAAGCTGGGGAAGAAGAAGAGTGGGCAATGGGAGCTGCCCATGACATTGCTCGGCTCTGCGCAGAAGTAACACTACGATGGCGTGCTTTTCTTCACCACACCTCTGATAGGCCACACGTTCATCACGCGCTTGCAGCCAGACATCATGCACTACGCATCAAACGCTTCTCCGAGTGTTTCTTCGTATTGGACAATCCTCGTCATTCCCTCGCCGGCTGTTACGACAGTAACTACCAGTCCTACCAGAGCGTGGGTGAGACCGCCCGGCGTTCGCGTTACCTGGCGCTTCTGCCACCATTGCCAGTACACTGCATTGCTTTGGATGGCGATCCACATATAATGCCTATTATATTTGAGGATAGATACCAGGACACAGCCGAGTTTGCAAGACGGCGATCGTTTTTAAATTCTAACGCCAATAAAGCTGGAAGTGATCCGTTCCTATGTTCGGACCCTGCCGGAGAAGACTGTGCTTGCAGTGTTAGTTCTATAATGGACTCTAGGAGACCGTCATCAGAAGCTTCTCGCGTAACTTTATCATTGCCAAAAAATATAATAGATGTTCTCGAACCACATTTTGATAACCAAAAAGCACCCACAGGTGTAGTACAAGCCACTTTAACGTTAGCTCCTCAAAAATCAGCCAGAGGATCTCCAAAGCGAACACTTGTAAAACAACACGTCGTAGAGATGACTAAACCCCATAATAAGCAAGTAGAACTTACGGGAAGAAATAGATACATTGTACAAAACAATCAAATTAGCGAAATCCAACTTCCCCCTAATAATGTCTTTGCTTCATGTCCTATCCAGCAAGGACAACAAATGTCCAGATACTCAGCCTCAACGCTATTAGACATAAACTCCGCCAAGAGTACAACAAATGTTTCACATTTAGGTAAGGAGAAAACTGATCGTGATGAAGTTTTCAATCAAAAGCATGACATTACAAGTGGCGTTAGTACATTACCGGCAAGACATAGTAAATCTTTAGATCAGTTAAGAGTATCACAAATGGCGCCTCTAAGTGTCCAAACATTATCGAAAAATGTGAAAAATAGCTCAAACAACTCCGTGAATTATCACAAATCATACGCACCGCCACAACAACAAGTAAAACCAACTACTCTCCCTAGAAGTATTACAACTACATCTTATCCTACTAGCACAACGACAAGATGTGGATCGAAAGGTGATGAATATAGACGTAATATAAACGAATTTAGAGAAAAATATAAAAATCCAGGTAATTCTGTTAATTTTCATGGTTTAAATAATGAAGTATTTCACTCTGTAGGCCATAAGTACGATGGGACAATGAAAGGAGGTAGTAATCAAATTTATGGTTACACTTTCGGTAATCAGGGTAATATGACAGTGAATAATAC
TATTCGTAATCCTTTAGAGTTTCAAAGAGGCTACAGTGTAAATTTACCCCGTCCTATGTTGCCTCCTCGTAATTCTTATCCACCTGTTAGTGGAAAAACACAAGTGACTGAACAGAGCAATTTAGCTTCTAGCAAACATGTTCATAGATCGAATTCAACTCATGTTTTCAACCCAAATGTAGAAAAACAACACATGGACATTGAGGCCGCACGAAATCAATTAAGACACGAATTAAACTATTACCTTCAAAAAGGGGACTGGAGCTATGACTTTAACACAGGGAGTCTTATTCAAAATTATCAAAAGAAATCGAGTAAAAGTAGTGATGATAGCGGCTTTGTAATGACTTTAGACCGGAGTAGCGATGGAACATCAAAATCAACATATTCAAAAACTCCACCATCAACTCCTCATTCGACATTACCATCTGTAAGACATAAAAAGAGTCGTTCTAAAGATTCTAAATTAAACCTTAATGGAACTGAAATTACAAAGATCAATTCAAGTCGCGATTCTCTGAGTAGCCATCATTCTGTCAAATCTAGAGACAGCAAATCCGATCGACAGACGCCAAAATCAGATAGAAGTACTCCAAAATCTGAAAAAGAGACACCTAAATCAGGTAAGGCTACGCCTAAATCAGGAGTAGTAACTCCGAGATCTGGTTTAGCTACACCAAAATCAGGTGGAACAACGCCCAAGAGTGGTAGATCAAGCGGTAAACCAGAAAAACGAGGAAAACATAAAGCATCCGAGAAAATGACTCAACTAATGTCAGAGGCAGCGTCGTCATCTAGTGATGAAAGCATACCTTTTGAACTGAGGCGCTTAGAATCTTCATCCAGTGTACCTTATAGTTTAGATCACGGCACTGAGCTACGACATTGTCGAAGTGCTGTATCAATAATAGAGGAAGCCAATCTTCACAACAACTTTGGATCAGAGTCTTTACCAAATTTAGCTCCACCGCCAGCATTTGAGTCACCGCCTCTAGATATAACTGATAAAGATTTTAAAATATTACCACCAGATAATTTTCTCAACAAAGATGAAAAACACAGTAACTCAACATCGAGTCTAAGCGAACAAAGTGGTTGGGTTTCTAGTGGACGTAGTTCAGGCCCTTCTTCGCCGGATAATGGGAGTTGTCAATTAAGTCAAAATTTTCCACCCACTAAACCCACAACAATGACGAAGACTAGTAGCAAGAAAGAGGATGAAGCTACTGAAACTCGGAAAAAAGATGGAGATAAAATTTGCAACTTTAAAAGAACAGTTTTAAATGGAGAACAACTACGGGAACGCCTACTTAAATTGGCAATAAAAGCAGCTCAAAGTAGTTCAACTGAAAAAATAGAGTGTTGTGATAAAGAAGTATGTGAAGAATGTACAATATGTAGTGATTCTATATGCACAGACAGCCAGTGTGAATATAATAAAATGATGCATACCAACGGAATTGATGCGGGATGTAATTCCGATACATGCACAGCCAATTGCTTTCAACAGTATCCTGAAACTAAAGTTAATAAAATTAACCAAAGACATAATTCATTCAGTGCAATTCAAGGAAAGACTTATAGCACTGTAAGTAGAAATTCTAGTGAGGCTACAGTAAAAAAATCACAAACAGTTAGTGATAACTTACATCCATATATAAGAAAGGAAATAGGTTCAAAAACACAACCCCCCACTACACAAGCAGGGGTTCCGGACAAACAGACAGATAAAAAGTCCAGCACGCTAGATAAAACTAAATTAAAAGATAGAATACAATACACAGAAAAGTTAATAGCTGCAGAAATTTTAAGTTTAACTGTCGACAGATCATGTAAACCATTTACACCAAGTGGTGCTGTGAGTGCTGCAAAAACATACCAGGGTAAAGCAAAATCTGAAGTGGATTTGAGTCACTTTGGAACGAACGTTTACGAACATTTACCACCACCGCAACAATTTAGAGATGCTCCACCACCACCGGAACAATTTAAGGACCCAC
CATCTAAGTCTCCAATGTTAAGCCGTCAAAGTAGTAAATCTTCAACACCTTCGAAAGGTCAACGAAAAGCTTCTCAGCCTACTACGTTGCAGCTAGACAATCCTCTATATCATGTGTGTGAAGGAATATTGGAAAGACGAAAGTTAAGGCCAAACCAATCAATGACTTACAATGCTACACCCTCATCTAGTACACTAAACAAAAGCCAGAGCACAGGTGAATTAGCAAGCAGAAACGAACTTAATAAAGAACAACGCCAAAGCAACCTTATCAGTATTTTTGGATTGGCTGAATCTGAGAGGCTGTTTGTGAAATGTAGGGAAGAGTTTAGACAAAGCGTTAAATACCCAGGAGCGATATATTCGGATTTTCCACCTGTCGAAAATTCCTTGCCATATTTCCATATAAGCGACGAATACAGAATGTTTAATCCTGAAGGATTACATCTCATCATATGTGTACACGGCCTAGATGGCAATGCAGCCGACCTACGGTTGGTTAAAACATATTTGGAGTTAGGATTACCAGGCGCGCGGTTAGACTTTCTTATGTCTGAGAGGAATCAAGGTGATACATTTTCTGACTTTGACACAATGACGGACAGACTGATTCAGGAGATCATGACACACATACAAAGCTCAAACGAGCCGGCGAGAATTAGTTTTGTTGGTCATTCATTAGGTACCATCATCATTAGGTCTGCCCTTGCAAGACCACAAATGAAACCATTTTTGGGCAAGCTGCATACCTTTTTGTCGCTAAGCGGGCCTCATTTAGGAACTCTATACAACTCTAGTGGACTTGTGAATGCAGGAATGTGGTTTATGCAGAAATGGAAAAAGTCTGGATCCTTACTCCAGTTATCACTTCGTGACGCCTCCGATCCCCGAAAATCGTTTCTGTATCGTCTAAGCGAACGAAGTCAATTGCACCAATTCAAACATATCTTACTTTGTGGTTCAGGGCAAGATAGATATGTGCCCTTACACTCAGCGAGACTGGAGCTGTGCAAAGCTGCTGCCAAGGACACATCTTTATTGGGACAGGCTTATAGAGAAATGGTCCACAATATGGTATCTCCTCTCGCTGCACGTGCTTCCTCAGTGTCCGTCGTTCGTTATGATGTCCAGCACGCATTGCCGCACACCGCCAGCGCTCTCGTAGGCCGTGCAGCACATATTGCAGCATTGGACTCCGACCTCTTCATTGAGAAGTTTTTATTGGTCTCCGCTCTGAAATACTTCCGTTAA

Protein sequence:

>DPOGS213297-PA
MELKHQVFGSVDTILDSRNLKESLERAEWSLGVELWCSEGAQSSTLTPVSCRVLKLHFQPSQGLHYHLPVLFDYFHLSAVSVTIHASLVALHQPYINTPRSSKQWGKLMKSAGSSFNATGSLENILFGSSGKCAGGNSSKIAHARQVHAEVCGILLGSLEGLQSALTVCGSTGVLLTPEESASNSVVGEVAQRIKDLSDLGKKSEAGEEEEWAMGAAHDIARLCAEVTLRWRAFLHHTSDRPHVHHALAARHHALRIKRFSECFFVLDNPRHSLAGCYDSNYQSYQSVGETARRSRYLALLPPLPVHCIALDGDPHIMPIIFEDRYQDTAEFARRRSFLNSNANKAGSDPFLCSDPAGEDCACSVSSIMDSRRPSSEASRVTLSLPKNIIDVLEPHFDNQKAPTGVVQATLTLAPQKSARGSPKRTLVKQHVVEMTKPHNKQVELTGRNRYIVQNNQISEIQLPPNNVFASCPIQQGQQMSRYSASTLLDINSAKSTTNVSHLGKEKTDRDEVFNQKHDITSGVSTLPARHSKSLDQLRVSQMAPLSVQTLSKNVKNSSNNSVNYHKSYAPPQQQVKPTTLPRSITTTSYPTSTTTRCGSKGDEYRRNINEFREKYKNPGNSVNFHGLNNEVFHSVGHKYDGTMKGGSNQIYGYTFGNQGNMTVNNTIRNPLEFQRGYSVNLPRPMLPPRNSYPPVSGKTQVTEQSNLASSKHVHRSNSTHVFNPNVEKQHMDIEAARNQLRHELNYYLQKGDWSYDFNTGSLIQNYQKKSSKSSDDSGFVMTLDRSSDGTSKSTYSKTPPSTPHSTLPSVRHKKSRSKDSKLNLNGTEITKINSSRDSLSSHHSVKSRDSKSDRQTPKSDRSTPKSEKETPKSGKATPKSGVVTPRSGLATPKSGGTTPKSGRSSGKPEKRGKHKASEKMTQLMSEAASSSSDESIPFELRRLESSSSVPYSLDHGTELRHCRSAVSIIEEANLHNNFGSESLPNLAPPPAFESPPLDITDKDFKILPPDNFLNKDEKHSNSTSSLSEQSGWVSSGRSSGPSSPDNGSCQLSQNFPPTKPTTMTKTSSKKEDEATETRKKDGDKICNFKRTVLNGEQLRERLLKLAIKAAQSSSTEKIECCDKEVCEECTICSDSICTDSQCEYNKMMHTNGIDAGCNSDTCTANCFQQYPETKVNKINQRHNSFSAIQGKTYSTVSRNSSEATVKKSQTVSDNLHPYIRKEIGSKTQPPTTQAGVPDKQTDKKSSTLDKTKLKDRIQYTEKLIAAEILSLTVDRSCKPFTPSGAVSAAKTYQGKAKSEVDLSHFGTNVYEHLPPPQQFRDAPPPPEQFKDPPSKSPMLSRQSSKSSTPSKGQRKASQPTTLQLDNPLYHVCEGILERRKLRPNQSMTYNATPSSSTLNKSQSTGELASRNELNKEQRQSNLISIFGLAESERLFVKCREEFRQSVKYPGAIYSDFPPVENSLPYFHISDEYRMFNPEGLHLIICVHGLDGNAADLRLVKTYLELGLPGARLDFLMSERNQGDTFSDFDTMTDRLIQEIMTHIQSSNEPARISFVGHSLGTIIIRSALARPQMKPFLGKLHTFLSLSGPHLGTLYNSSGLVNAGMWFMQKWKKSGSLLQLSLRDASDPRKSFLYRLSERSQLHQFKHILLCGSGQDRYVPLHSARLELCKAAAKDTSLLGQAYREMVHNMVSPLAARASSVSVVRYDVQHALPHTASALVGRAAHIAALDSDLFIEKFLLVSALKYFR-
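
The lengths reported in this record (5250 bp transcript, 1749 aa protein) are consistent with a coding sequence made up of 1749 codons plus one stop codon: 3 x (1749 + 1) = 5250. The sketch below shows one way such a check could be scripted for FASTA records like the two above. The function names (read_fasta, strip_stop, cds_matches_protein) are hypothetical helpers written for illustration, not part of any database tooling, and the check assumes that the transcript is a complete CDS with no UTRs and that a trailing "-" or "*" in the protein marks the stop codon.

```python
from typing import Dict, Iterable


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into {record_id: sequence}.

    Sequence lines that are wrapped across several lines are joined,
    and blank lines are ignored.
    """
    records: Dict[str, str] = {}
    name = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token after '>' as the ID.
            tokens = line[1:].split()
            name = tokens[0] if tokens else ""
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def strip_stop(protein: str) -> str:
    """Remove a trailing stop marker ('-' or '*') from a protein sequence."""
    return protein.rstrip("-*")


def cds_matches_protein(nt_len: int, aa_len: int) -> bool:
    """True if nt_len equals three bases per residue plus one stop codon.

    Assumes the nucleotide sequence is a complete CDS (start to stop).
    """
    return nt_len == 3 * (aa_len + 1)


if __name__ == "__main__":
    # Lengths taken from the record header above.
    print(cds_matches_protein(5250, 1749))
```

With the sequences saved to a file, the same check could be applied to the parsed records by passing len(transcript) and len(strip_stop(protein)) to cds_matches_protein.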