Monarch geneset OGS2.0

DPOGS203957
TranscriptDPOGS203957-TA3681 bp
ProteinDPOGS203957-PA1226 aa
Genomic positionDPSCF300005 + 370202-382416
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0038022e-2927.68% 
BombyxBGIBMGA014489-TA0.079.09% 
DrosophilaCG3280-PD2e-14857.17% 
EBI UniRef50UniRef50_Q7QJR80.050.24%AGAP007628-PA n=5 Tax=Arthropoda RepID=Q7QJR8_ANOGA
NCBI RefSeqXP_308243.40.050.24%AGAP007628-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582853220.050.24%AGAP007628-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3214572650.053.31%hypothetical protein DAPPUDRAFT_63207 [Daphnia pulex]
Group
Gene OntologyGO:00160212.7e-39integral to membrane
KEGG pathway 
InterPro domain[579-694] IPR0124962.7e-39TMC
Orthology groupMCL10738 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203957-TA
ATGGATGCCAAGGGCTCCTATGACCGTCCTCCTGGTTCACCGCAACCGTCCGTTGATCGATGGTCAGTCACCTGGGCGGACCCTGGCAGTATGACTTCTTCGGGTGTCAATCTTTTTCATCTACCTGCCGACACACCAGCTAGTAATGAAGTCAAATTTTCGCCAACAGCGGCAGATGGCTCTGAAGATGAAGATTACTCAAGTTTGAGTGCCGTTCTGAAACAACGTCGAGCTAGTGTGCGTCGATCTCGTAAGGGTCGGTCAAGACGCCCATCTTCTCCATTCTTGCCAGACGAAAATCGTTCAAGAAGAAGATCCTCCGTTTTTACTACCAGTTCTGGAGACACTGCTATATCCATTGATGAGCAACCAGTTGTTACCCAGGAACAAATTTTTGAAAACATACATCTTCACAAGGAAGTTCTTGGTTCAGTCAAACAACAACCACTTGGAATGAGAAGGAAATTAAAAATTGTTCATCAGGCAAAAGGTTATGTTAAACGACATGAAGGACAACTGCAAGAAAGACTTGCTCAATCCAAAAGTACGAGAGACATCTATGCGAGGTTTAATATACTTCTAGCAAGTAAATGGCAACAAATGAAAAGGGAAGCTGCAAATACATCAAATCTACTCATTCCTTGGGAATTGCGCATAAAAGAAATTGAGTCGCATTTTGGCTCTGTCGTTGCATCTTATTTTACATTTTTGAGATGGCTGTTCTGGGTGAACTTAGTCATTGGTTTTATATTGCTCGTATTTGTAATAATACCTGAGTACCTTACGGCAAATCCTCTTCAAGACGGAGAAAGAAAAATTATTATGGACGATGAGTATCGTAATGCAACAAATTTATTAACGCTTTGGGAATTCGAAGGTGTCCTTAAATACTCTCCTATATTTTATGGCTACTATAGCAACGTGGAACGAGACCAATATAGAATGCCACTCGCTTATTTTCTCACTGGTTTGGTCGTGTACATTTATAGTTTTGTCGCCATATTAAGAAAAATGGCTGAAAACTCTCGAATGTCTAAGTTGTCGGAAAAAGAAGATGAATGTATTTTTTCTTGGAAGCTTTTTACTGGTTGGGACTTTATGATAGGTAATGCTGAAACCGCACATAATCGTATTGCATCAGTTATATTAGGATTCAAGGAAGCTCTTCTAGAAGAAGCCGAAAAGAAAAAGAACTTAAGAAATTGGCGAATAATATCCTTGAGAGCAGTCGTAAACATCTGCGTTATTATTCTTTTGGCGGTTTCAGCGTATGCCGTTGTGACAGTAGTGTATCGTTCAGACGATAACGCTAAGATACGAAGTTGGTGGCGTGAAAATGAAACCACTATAGTGGTTACTGTAATTTCAATTACATTTCCATTGTTATTTGAACTTCTTGGTCTTTTGGAACACTATCATCCAAGAAAACAACTCAGATTGCAACTAGCCAGAATTATGCTTCTTAATTTACTTAACTTATATTCACTTATATTTGCCTTATTTAGTAAGATTGAGGGCATGAGTAAGGAATTAATTTCATTACAACCAACTTTAAACTTGAATGCCACAATTTTTTCTGAAAATACTTTAAATATGAGCGAAACTCTAATAGGCACCTTACCAATAACATGTTTTGAAATAGCCGTTCCCTTTTATGACACCGTATTTTTTGATAAATATCATAGAAAAATGTATAACTTGACCATGGGTACAAGAAAAAAGTTATTGAAACTTTGTTGGGAAACCATGTTTGGTCAGGAATTGGTGAAATTAACAATGATGGATCTTGTTTTTGTTTTGTTAGGAACTTTATTTACAGATTTTTTCCGGGCATTATTTGTCAGATATATGAATAGATGTTGGTGTTGGGATTTGGAAAAGAAATGTCCTGAATATGGAGACTTTAAAATAGCCGAAAACATCTTACATCTTATTAACAATCAAGGGATGGTTTGGATGGGAATGTTTTTTTCGCCAGGACTCGTTGTATTAAATGTTGTAAAACTGATGATAATGATGTATTTAAGATCGTGGGCAGTCATGACCTGCAACGTACCACATGAAGTAGTGTTTAGAGTATCTAAGAGTAACAATTTTTATTTAGCATTGTTATTGACAATGTTATTTCTTTGCGTTCTACCAGTTGGCTATACCATTGTTTGGGTAACACCATCATGGCACTGTGGTCCTTTTTCAGAATATGACAAGATTTACAAGATATTGACAAATAATATCTATAAAATTTTACCAACCAGTCTTAATTTTACATTGGAGTATATTGCTTCGCCAGCAATAGTCATACCGTTGCTAGTTTTGTTAATATTAATTATTTACTACTTAACATCGTTGACCAATTCTCTAAGAGAAGCTAATAATGATTTAAAAATTCAACTAAGAAGAGAACGCACTGAAGAACGTAGAAAGATGTTTCAATTAGCAGATACAAGAAGAAGGGGAGGATCTTCGTCAATAGACAACACACCTTTTTCAAGATGGAAAAAAGCTTTACCTTCTCTTCCTGTATCAAAATCAATCGATTCTGATGATAGAAAAACCACAAGTGATATTACTAAAGAACCTGCCAAAAAAACAATGAAAAAGAAAGGTGGAATATTTGCCAAAATTGTTAGCTTAGCAATCGATAGAAAAACTAACGAGGTAACTATCAGTGATACCTCACCAATCCCAAAAAATATTGAAGAAGAAACGGATAGCGATTTTCATGAGGTACTACCAAGAGAAATTTTAAACATGAACGATATGGATGTGTCTATAAAGGTTGAAAGTAAAAGGAAAACAATCGTGGATTTTAAAACTGAGGGAAATATAGATAAAGACCAATTTAATACTTATGATAATCTAAAAAAAACAGAGTCACTTCCAGAAAATGGAGTTGGCCTAGAAGAAAAAAATAACGATAAAAGAAAAGCAAGTCTCAGTGGTTCCGAAAGACATAAATCTAATTCTAGCAAACAATCTAAGCAAAATGATTCATTGGGGTCGGCTATACCAGTTATAACAATTAGCAATACAGAAAGCGATGACGAAGTATTGCAACCTCCAAATATAACTAATACCCAACATGAAATGAAAATACAGCACAGTAATAAAGGGGAAAAGGTGAAAGGTCACCTAAAAAGAAATAAAAAATGTTCCGATCTTAAATCCTTAAAACGTCAAAGTAGTGTCGATAGTATAAATGAAAACAAAAGTCCAAAGGAAAAGGATAATGAAAGCGCTGGACACAATCTTTTGCAAACAATATTAGGTTCATTGTTGTCACGACTAAAAAATCGTTATGAATATCTACCAGAAGAAATAAACGCTGACTTAACAACGGAAGCTTTATATGATTTCGAATTCATAAGTGTAAGCCCTGAAGAAAGAGTTGAACGTTTTTTACATACAAATGAAGGCTTGGAAAATGTACAGTATGATAAGGAAACTCAAACTAATGAGTTTGATACCACAAATTATATGAAAAATCCAGTAACCAATAATGACTTCCTATATAACACAAATTACATTTCAACTGATCCTTTCTTTATTTCGAGAGCGACAATCGCCTCTACTTTTGAATCAAGAGTAACCTCTATTAAAACTAGAAGGACTTTTCCAACTCATTTTCCTACATTACAAAAAAGCAAAAACAAGGGTAACCAGAAAGAAACCGTACAATAA

Protein sequence:

>DPOGS203957-PA
MDAKGSYDRPPGSPQPSVDRWSVTWADPGSMTSSGVNLFHLPADTPASNEVKFSPTAADGSEDEDYSSLSAVLKQRRASVRRSRKGRSRRPSSPFLPDENRSRRRSSVFTTSSGDTAISIDEQPVVTQEQIFENIHLHKEVLGSVKQQPLGMRRKLKIVHQAKGYVKRHEGQLQERLAQSKSTRDIYARFNILLASKWQQMKREAANTSNLLIPWELRIKEIESHFGSVVASYFTFLRWLFWVNLVIGFILLVFVIIPEYLTANPLQDGERKIIMDDEYRNATNLLTLWEFEGVLKYSPIFYGYYSNVERDQYRMPLAYFLTGLVVYIYSFVAILRKMAENSRMSKLSEKEDECIFSWKLFTGWDFMIGNAETAHNRIASVILGFKEALLEEAEKKKNLRNWRIISLRAVVNICVIILLAVSAYAVVTVVYRSDDNAKIRSWWRENETTIVVTVISITFPLLFELLGLLEHYHPRKQLRLQLARIMLLNLLNLYSLIFALFSKIEGMSKELISLQPTLNLNATIFSENTLNMSETLIGTLPITCFEIAVPFYDTVFFDKYHRKMYNLTMGTRKKLLKLCWETMFGQELVKLTMMDLVFVLLGTLFTDFFRALFVRYMNRCWCWDLEKKCPEYGDFKIAENILHLINNQGMVWMGMFFSPGLVVLNVVKLMIMMYLRSWAVMTCNVPHEVVFRVSKSNNFYLALLLTMLFLCVLPVGYTIVWVTPSWHCGPFSEYDKIYKILTNNIYKILPTSLNFTLEYIASPAIVIPLLVLLILIIYYLTSLTNSLREANNDLKIQLRRERTEERRKMFQLADTRRRGGSSSIDNTPFSRWKKALPSLPVSKSIDSDDRKTTSDITKEPAKKTMKKKGGIFAKIVSLAIDRKTNEVTISDTSPIPKNIEEETDSDFHEVLPREILNMNDMDVSIKVESKRKTIVDFKTEGNIDKDQFNTYDNLKKTESLPENGVGLEEKNNDKRKASLSGSERHKSNSSKQSKQNDSLGSAIPVITISNTESDDEVLQPPNITNTQHEMKIQHSNKGEKVKGHLKRNKKCSDLKSLKRQSSVDSINENKSPKEKDNESAGHNLLQTILGSLLSRLKNRYEYLPEEINADLTTEALYDFEFISVSPEERVERFLHTNEGLENVQYDKETQTNEFDTTNYMKNPVTNNDFLYNTNYISTDPFFISRATIASTFESRVTSIKTRRTFPTHFPTLQKSKNKGNQKETVQ-