Monarch geneset OGS2.0

DPOGS208821
Transcript | DPOGS208821-TA | 5097 bp
Protein | DPOGS208821-PA | 1698 aa
Genomic position | DPSCF300036 + 439445-461219
RNAseq coverage | 165x (Rank: top 51%)
Annotation
Heliconius | HMEL004916 | 0.0 | 78.20%
Bombyx | BGIBMGA007928-TA | 0.0 | 83.36%
Drosophila | sws-PA | 0.0 | 61.63%
EBI UniRef50 | UniRef50_D6W9Q7 | 0.0 | 61.40% | Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W9Q7_TRICA
NCBI RefSeq | XP_001120383.1 | 0.0 | 62.09% | PREDICTED: similar to swiss cheese CG2212-PA, isoform A [Apis mellifera]
NCBI nr blastp | gi|328777223 | 0.0 | 61.09% | PREDICTED: neuropathy target esterase sws [Apis mellifera]
NCBI nr blastx | gi|380030030 | 0.0 | 61.60% | PREDICTED: neuropathy target esterase sws-like [Apis florea]
Group
Gene Ontology | GO:0008152 | 4.1e-47 | metabolic process
Gene Ontology | GO:0006629 | 2.3e-20 | lipid metabolic process
KEGG pathway 
InterPro domain | [1275-1443] | IPR016035 | 4.1e-47 | Acyl transferase/acyl hydrolase/lysophospholipase
InterPro domain | [158-304] | IPR018490 | 7.7e-26 | Cyclic nucleotide-binding-like
InterPro domain | [152-291] | IPR014710 | 3.1e-24 | RmlC-like jelly roll fold
InterPro domain | [1282-1443] | IPR002641 | 2.3e-20 | Patatin/Phospholipase A2-related
InterPro domain | [183-277] | IPR000595 | 1.3e-16 | Cyclic nucleotide-binding domain
Orthology group | MCL10958 | Multiple-copy universal gene

Nucleotide sequence:

>DPOGS208821-TA
ATGGATGTGGTAGGTCTTTTAAATAACATAAATGATAAGACTGATATGTTTGCCGTTAAAACATGGACTTCAGAGTGGACAAATAGTTTTCAAGACAATCAAGTGCTGTGGTCATTCTGTGGCTGTTTGTTGGTTTCAGTTTTAGTCGTTTTTTTCTATTATTACAAAAGATGGAAATCAAAAGAGCCGGCTGGAGGCGCTGGGGCCACGGCTGCCGGGGAACCGGCGAAACGTTTCCGGAAACGCGACAAGATGTTATTTTATGGCAGACGTATGTTACGGAAGGTGAAGTCCATATCCAATTCCGGGCAGGGCAGGAAGCGGCGCGCCGTGATGAGGTTCGCCAGGAAGTTGCTGCAGCTGAAGAAAGAATCAGCTCCTGAACAATTGAAGGTCCTGGAACCCCCAGCGGAATATCTCGAAGAGGACTTGACGAATGATGATCGAGTCCCGCCAGACGCTCTCTACATGCTGCACAGCATACGAGTGTTCGGACACTTCGAGAAGCCAGTCTTCCTCATGCTCTGTAAACATACAGAGATATTGAACCTGCCCGCTGGATCTTTCCTCTTTAAAGTTGGTGATACAGATGAGAACGTGTACGTGGTTCAGAACGGCCGTGTGAACGTTTACATCACCAACCCTGATGGCAGCAGTCTGTCGCTGAAGATCGTCCGCGCCGGTGAGAGCGTCACCTCGCTCCTGAGTTTCACCGACGTGCTCACGGGTCACTCTCAGCCATACAAAACAGTGAATGCGAAGGCCCTAGAAGATTCCCAAGTCATTAAGTTACCAATGAGGGCTTTCCAAGAGGTCTTCAAGGAGTATCCAGACATATTTGTCAGAGTTATACAGATAATTATGGTGCGCCTCCAGAGAGTGACCTTCACAGCTCTCCACCAGTACCTGGGTCTCAGTGCTGAGTTGGTGAATCCCGGTCGTGAGAAACGCCGACCGGCCACAGCTCCTACGTCTTCACCAGCCAAGGTCAGGGTTGACAACACCCTGAACTCGCCGCACCATGAGAAGATCGAATTGGTGGAGGGATCTCAAGTCTCCTCTCCCATCCACATCCCCACCCGGAAGCGACCCGACATGGTACCGGATGTAACATCCAACACGCCCCAAAATACAACGCAGCTTCAACCGGACGTGCAACCAACATCATCGTTCCAAAGATCAAAAGAGGGCTCCTTTAAGAAACCAAATACTGACAATTTGGATGAGCAGGCTCTCATAAAGATTGCATCAGAGGCTTTCGTGAAGGAATTGGGTTTAGACAATGATCAGATACTGAAAGGGAATGTTCAAGTCAGAGATCTCCCGGCTGGGACTTACATCATGAAGGAGGAAAGTCACAAGGATGTAGCGCTAGTGTATCTATTGTCAGGCGCTCTGTTGGTGTCACAACGTGTGGCCGAGGGGGAGGGGGAGGTCCATATGTTCACCGCATATCCAGGTGAAGTGGAAGGCGGCCTGGCGGTGCTGACGGGGGAGCCGAGCTTCTTTTCAATACGAGCAAAGCATTTCTCTCGCATCGGTCTGTTGTCTAAGACAACGGTGTATAGTATTATGAGAGAACGCCCGTCCGTAGTACTTCACATAGCTAACACGGTAGTCAGAAGACTATCGCCTTTCGTTAGACAAGTGGATTTCGCCTTGGACTGGGTGTTCCTGGAATCAGGTCGGGCTGTGTACCGTCAGGACGAGGAATCCGGCTCAACGTTCATAGTACTCAGCGGACGACTTCGATCAGTCATCACGCACCCCAATGGAAAGAAGGAACTTGTTGGGGAATACGGCAAGGGCGATTTAGTTGGCATTGTGTTCCTGGAATCAGGTCGGGCTGTGTACCGTCAGGACGAGGAATCCGGCTCAACGTTCATAGTACTCAGCGGACGACTTCGATCAGTCATCACGCATCCCAATGGAAAGAAGGAACTTGTTGGGGAATATGGCAAGGGCGATTTAGTTGGCATTGTAGAGATGGTGACTCAAAC
CCGTCGCAGTACGACCGTCATGGCGGTGAGGGACTCCGAGCTCGCTAAACTCCCTGAAGGACTGTTCAACGCTATCAAGCTGCGGTTCCCCGTCGTGGTGACGCGACTGATCAATTTATTGGGTCACAGAATTCTAGGATCCTGGCAGAAGCCCACCCGCGGTCTGGGCACTGCTGCTATCGAGAGTCGCCCATCTCAACACAACTTCTCAACGGTGGCCGTGGTGCCCGTCAGTGACGACGTGCCGCTCACAGCATTCACTTACGAGCTATATCACTCACTGTGCGCTATCGGTCCGACGGTTCGTTTGACGTCTGACGTCATCCGAAAACTTTTGGGTTTGACCATAATGGATCCGAACAACGAGTATCGTCTCAGCTCCTGGCTCGCACAACAAGAGGACAAGCACAAAGTAGCGTTATATCAATGTGACCCAAGTCTCACTCAGTGGACCCAGCGATGCATTCGACAAGCAGATTGTATATTGATAGTAGCTCTTGGAGATAAGCAACCCAGTATCGGCAAAATTGAGAAAGAGATCGAGCGGCTAGCCATCCGTACTCAGAAGGAGCTAGTATTGCTACACCGTGAGGGAGGTCCCAACCCATCGGGGACTGTGCACTGGCTGAACATGAGGTCATGGGTCAGCCAGCACCATCACGTCCGCTGCCCCCACAGAATGTTCACCAGGAAGAGCCAGTATAGAATTAGTGAGCTGTACAGTAAAGTTCTGATGTCGGAGGCCAGCGTGCATTCAGATTTCTCTCGACTTGCTCGCTGGCTGACTGCCACGGCTGTAGGACTAGTGCTGGGAGGGGGCGGAGCCCGGGGCGCCGCACACGTCGGAATGATAAGAGCCATACAGGTGTTCCTGGAATCAGGTCGGGCTGTGTACCGTCAGGACGAGGAATCCGGCTCAACGTTCATAGTACTCAGCGGACGACTTCGATCAGTCATCACGCATCCCAATGGAAAGAAGGAACTTGTTGGGGAATATGGCAAGGGCGATTTAGTTGGCATTGTAGAGATGGTGACTCAAACCCGTCGCAGTACGACCGTCATGGCGGTGAGGGACTCCGAGCTCGCTAAACTCCCTGAAGGACTGTTCAACGCTATCAAGCTGCGGTTCCCCGTCGTGGTGACGCGACTGATCAATTTATTGGGTCACAGAATTCTAGGATCCTGGCAGAAGCCCACCCGCGGTCTGGGCACTGCTGCTATCGAGAGTCGCCCATCTCAACACAACTTCTCAACGGTGGCCGTGGTGCCCGTCAGTGACGACGTGCCGCTCACAGCATTCACTTACGAGCTATATCACTCACTGTGCGCTATCGGTCCGACGGTTCGTTTGACGTCTGACGTCATCCGAAAACTTTTGGGTTTGACCATAATGGATCCGAACAACGAGTATCGTCTCAGCTCCTGGCTCGCACAACAAGAGGACAAGCACAAAGTAGCGTTATATCAATGTGACCCAAGTCTCACTCAGTGGACCCAGCGATGCATTCGACAAGCAGATTGTATATTGATAGTAGCTCTTGGAGATAAGCAACCCAGTATCGGCAAAATTGAGAAAGAGATCGAGCGGCTAGCCATCCGTACTCAGAAGGAGCTAGTATTGCTACACCGTGAGGGAGGTCCCAACCCATCGGGGACTGTGCACTGGCTGAACATGAGGTCATGGGTCAGCCAGCACCATCACGTCCGCTGCCCCCACAGAATGTTCACCAGGAAGAGCCAGTATAGAATTAGCGAGCTGTACAGTAAAGTTCTGATGTCGGAGGCCAGCGTGCATTCAGATTTCTCTCGACTTGCTCGCTGGCTGACTGCCACGGCTGTAGGACTAGTGCTGGGAGGGGGCGGAGCCCGGGGCGCCGCACACGTCGGAATGATAAGAGCCATACAGGAGGCCGGCATTCCCATAGACATGGTGGGTGGAGTCAGCATTGGTGCTTTCATGGGGGCGTTGTGGTGTATGGACAGGAATATAACCACTGTGA
CACAGAAAGCTAGGGAGTGGTCCACGAAAATGACGCAATGGGGTAAGCAGCTCTTGGACCTGACATACCCGGCGACCTCTATGTTCTCCGGCAAGCAGTTCAACACAACCATAAGGACCACCTTCGGAGAGGTCCACATCGAGGACCTCTGGCTGCCGTACTTCACAGTCACTACAGACATTAGTTCCAGTTGTATGAGGATTCATAGACACGGTTCACTATGGCGTTACATACGCGCCTCGATGTCTTTGAGCGGGTACATGCCCCCACTCTGCGACCCCGTAGACGGCCACCTCCTATTGGACGGCGGTTACGTCAACAACCTCCCAGGGATGTTGTGGAGATATTGCCGCGCGTCTATGAGCATCGCCGGCATCTTCCCGCCGATATGCGACCCCATCGATGGACACTTGCTTCTGGACGGTTGCTATGTTAACAATGTGCCCGCTGATGTGATGAGATCACTCGGCGCCAAACACATTCTGGCTATAGACGTTGGTTCTCAAGATGACACGGATCTCACCAATTACGGTGACGACTTGTCCGGGTGGTGGTTGCTTTGGAAACGGTGGAATCCATTCACGACACCGGAAGTCAAGAAATCCGATTACTGCGAATACATACGCCCGCCAATAGACGCGTACAAGACGCTGCAGTTCGGATCGTTCGATGAGATCCGCGAGGTCGGCTACCGGCATGGATCGGCGTACTTCGAGGGCCAGAGACGTGGCGGCGGAGGCGGCGTCAGTGGTGCTGCTGCTGAGGGCAGGAAACACTCCGCACAGCCGGCCCTGACTGATTACACGTTCACGGATCTGGCGCAAATGGTGTGCTCAGTGAGGACAGCGCGAGACGACAACGACACCAGCTCGGAGTCCGACTACGAGGATCAGAGACACTTCGAGGGATACGCCTCCGAGCCCAGCGGTGGGATACTAGAGATGTCTTCCAGCGTTGAGGACGGCAACGCCTGGATCAGCGACACGGAACTGGAGGGTCTCAGGACCCGCCGTGTTGGAGGATCGCTCTCGTTATCGGAGGACGAAGTGGACTCCGAGGCCGAGATCTACGAGTCGATGAACAAACGGATCAGATGA

Protein sequence:

>DPOGS208821-PA
MDVVGLLNNINDKTDMFAVKTWTSEWTNSFQDNQVLWSFCGCLLVSVLVVFFYYYKRWKSKEPAGGAGATAAGEPAKRFRKRDKMLFYGRRMLRKVKSISNSGQGRKRRAVMRFARKLLQLKKESAPEQLKVLEPPAEYLEEDLTNDDRVPPDALYMLHSIRVFGHFEKPVFLMLCKHTEILNLPAGSFLFKVGDTDENVYVVQNGRVNVYITNPDGSSLSLKIVRAGESVTSLLSFTDVLTGHSQPYKTVNAKALEDSQVIKLPMRAFQEVFKEYPDIFVRVIQIIMVRLQRVTFTALHQYLGLSAELVNPGREKRRPATAPTSSPAKVRVDNTLNSPHHEKIELVEGSQVSSPIHIPTRKRPDMVPDVTSNTPQNTTQLQPDVQPTSSFQRSKEGSFKKPNTDNLDEQALIKIASEAFVKELGLDNDQILKGNVQVRDLPAGTYIMKEESHKDVALVYLLSGALLVSQRVAEGEGEVHMFTAYPGEVEGGLAVLTGEPSFFSIRAKHFSRIGLLSKTTVYSIMRERPSVVLHIANTVVRRLSPFVRQVDFALDWVFLESGRAVYRQDEESGSTFIVLSGRLRSVITHPNGKKELVGEYGKGDLVGIVFLESGRAVYRQDEESGSTFIVLSGRLRSVITHPNGKKELVGEYGKGDLVGIVEMVTQTRRSTTVMAVRDSELAKLPEGLFNAIKLRFPVVVTRLINLLGHRILGSWQKPTRGLGTAAIESRPSQHNFSTVAVVPVSDDVPLTAFTYELYHSLCAIGPTVRLTSDVIRKLLGLTIMDPNNEYRLSSWLAQQEDKHKVALYQCDPSLTQWTQRCIRQADCILIVALGDKQPSIGKIEKEIERLAIRTQKELVLLHREGGPNPSGTVHWLNMRSWVSQHHHVRCPHRMFTRKSQYRISELYSKVLMSEASVHSDFSRLARWLTATAVGLVLGGGGARGAAHVGMIRAIQVFLESGRAVYRQDEESGSTFIVLSGRLRSVITHPNGKKELVGEYGKGDLVGIVEMVTQTRRSTTVMAVRDSELAKLPEGLFNAIKLRFPVVVTRLINLLGHRILGSWQKPTRGLGTAAIESRPSQHNFSTVAVVPVSDDVPLTAFTYELYHSLCAIGPTVRLTSDVIRKLLGLTIMDPNNEYRLSSWLAQQEDKHKVALYQCDPSLTQWTQRCIRQADCILIVALGDKQPSIGKIEKEIERLAIRTQKELVLLHREGGPNPSGTVHWLNMRSWVSQHHHVRCPHRMFTRKSQYRISELYSKVLMSEASVHSDFSRLARWLTATAVGLVLGGGGARGAAHVGMIRAIQEAGIPIDMVGGVSIGAFMGALWCMDRNITTVTQKAREWSTKMTQWGKQLLDLTYPATSMFSGKQFNTTIRTTFGEVHIEDLWLPYFTVTTDISSSCMRIHRHGSLWRYIRASMSLSGYMPPLCDPVDGHLLLDGGYVNNLPGMLWRYCRASMSIAGIFPPICDPIDGHLLLDGCYVNNVPADVMRSLGAKHILAIDVGSQDDTDLTNYGDDLSGWWLLWKRWNPFTTPEVKKSDYCEYIRPPIDAYKTLQFGSFDEIREVGYRHGSAYFEGQRRGGGGGVSGAAAEGRKHSAQPALTDYTFTDLAQMVCSVRTARDDNDTSSESDYEDQRHFEGYASEPSGGILEMSSSVEDGNAWISDTELEGLRTRRVGGSLSLSEDEVDSEAEIYESMNKRIR-
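
Consistency check (illustrative): the record lists a 5097 bp transcript and a 1698 aa protein, and gives InterPro domain spans in protein coordinates. The sketch below shows how those numbers relate. It assumes the transcript is coding sequence only (no UTRs) ending in a single stop codon, which the record does not state explicitly. The function names are hypothetical and not part of any database tool. The resulting positions are relative to the transcript, not to scaffold DPSCF300036.

```python
from typing import Tuple


def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript length equals the coding codons plus one stop codon.

    Assumption: the transcript contains only the CDS (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + 1)


def protein_span_to_cds(start_aa: int, end_aa: int) -> Tuple[int, int]:
    """Map a 1-based inclusive protein span to 1-based inclusive CDS coordinates."""
    if start_aa < 1 or end_aa < start_aa:
        raise ValueError("expected 1 <= start_aa <= end_aa")
    return (3 * (start_aa - 1) + 1, 3 * end_aa)


if __name__ == "__main__":
    # Lengths as listed in the record: 5097 bp transcript, 1698 aa protein.
    print(cds_length_matches(5097, 1698))
    # IPR016035 span [1275-1443] from the InterPro rows, in transcript coordinates.
    print(protein_span_to_cds(1275, 1443))
```

With the listed values, 3 x (1698 + 1) = 5097, so the lengths agree under the stated assumption, and the IPR016035 span [1275-1443] corresponds to transcript positions 3823-4329.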