Monarch geneset OGS2.0

DPOGS213614
Transcript: DPOGS213614-TA, 3396 bp
Protein: DPOGS213614-PA, 1131 aa
Genomic position: DPSCF300033 + 866797-873411
RNAseq coverage: 110x (Rank: top 59%)
Annotation
Heliconius: HMEL013690; 0.0; 75.79%
Bombyx: BGIBMGA011676-TA; 0.0; 63.17%
Drosophila: CG12272-PA; 0.0; 36.87%
EBI UniRef50: UniRef50_Q12768; 0.0; 41.59%; WASH complex subunit strumpellin n=83 Tax=Opisthokonta RepID=STRUM_HUMAN
NCBI RefSeq: XP_002431289.1; 0.0; 43.28%; conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242021716; 0.0; 43.28%; conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|242021716; 0.0; 43.31%; conserved hypothetical protein [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain: [1-1132] IPR019393; 0; WASH complex, subunit strumpellin
Orthology group: MCL13349; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213614-TA
ATGAGGGTATTTTTGGCAGAAGATAATCTTTGCGCTCAAAATTTATTGAAGCTTGTTTCGCACGGGAATGCGATACTTGCCGAGATATTAAGACTTAAAGATCACGTCCCCAAAATATTTTTATTAGAAAGCAAGGAGATTCAGCAAAATTATCAAGATGTTATTATGGATTTTAGTTACTTTAAAATCTCAGAAATGCAAGAAAAGAAAATTAATGCCAATTCAAAACTCCAAGACCTGGATGATGACTTGAAAGAAAAGTATCTGGAATTAATCACTCGGTTTTATCTGCTATTTGAGAATATCTATCAGTATATAATAGATCTCAATACCTTTGTAGAACAACTGCATGATGGAGCATTTATACAGCAAAGTATGGAGACTGTTATGAAAGATGTGGAAGGGAAACAATTACTGTGTGAATCCCTGTACTTATATGGGGCAATGCTTTTGTTATGTGATTTATATATTCCGGGGAATGCAAGGGAAAAGCTCTTAGTAGCATTCTATCGCTACAGTACCAATCAGTCGCAGTCTAACGTTGATGATGTTTGTAAATTATTAAGAGAAACCGGGTACAATCAGCAGCATGGGAAGCGCCCCTACGATTATCCTGTGGAATACTTTAGTCGTGTGCCCATTCATCCAGATTTTTTAGAGAAGATAATTGGAAAATTAAGATCAGAGGATATTTACAACCAGCTCACAGTGTTCACGATACCAGAACACCAGTCGTCTGCACTCGCTACTCAGGCAAGTATGCTTGTTGTGTGTTTATTCTTCACACCCCACTATTTACACTCCGATACAACCAGAATGAGGGAGATTGTTGATAAATTTTTCCCCTGCAACTGGATCATACCCGTCTATATGGGTGTTACGATGAACATAATCGATTACTGGGATGGGTTTAAAGCAGCGAAAAATTCACTCAATAATACATGTAATACGAAAAACGTCAAAGAAATCTATTCGAAGAGAGGCAACTCGGTGCAGCTGCTGATAAACAAAAGTCAGCTGCTGTTGAAGGAAGGCACCTTGACTGATGACTTTGTTCTCGATAACATCAGTAAGATCATAAACGTTATATTGAATTCAAATCACGTGTTGCGTTGGTTGCTGTTACACAACAGTGACGTCATATTTTTCGATAACAACAAGAAGAGTAAACAACTCAAAGATCTGGTTATCAAGGAATCCGGGTACGATCCGGTGAAGACACTCGAGCTCCTCATATGTACAGCCGAACTAGAGCTGAAGATTCGTGAATTCTTAAAGAAACTGCTAGATTCCAGAAGCGAAACGTGGAATAGAAATAAAAATATAGCATTGGACGCCGTTAACAATTTATCAGAGTTATTTTCTGGTAGCATCCCGAAATTTACTAAAATTGATGAAAACGAACAATTGAAGCAGTGGTTTGAAAATATAGCGAAACAAATATCGTCTCTGAACGAACCCATTACATCGAAAACAATAAAAAAAGTCACCCAGCTGTTACAAGTGCTTGATAATGTTGAAGAGTTCCATGGAATCAAAAGTACTTCGACCGTGGTACAGTTAATATCTGAGAGCAAAGACGCCCTGAAGAATGTTCTGAGAGCTGCCTCATTGAAAGAAGACTCGCTCGTGACCTTGGAGACTGTAGCAGACTTCAGTTACGCATGGTGTACAATAGATCTGTACACGGTTCACATGCAGGATAGCATCAAAGAGAATCCCGCTGTCACGAGTCGTTTACGAGCTCTGTTCTTGAAATTGGCGAGTGCAATGGAAATACCACTTTTGCGTATTAATCAAGCGCAAAGCGACGACCTAGTCTCCGTCTCGCAGTATTACAGCAGCGAACTTATAAAATATATACAAAAAGTTCTTCAAATTATTCCCGAGATGGTATTTAAGATTGTCGAAAAAATTGTCGACCTCCAGACATGGAAGATCACGGAGGTACCAACCAGGATAGATAAAGAGAAGCTAAGAGACTACGCTCAACTAGATGA
TAGGATGGAGGTCGCGAAGTTGACGCATTCAGCTTCTATGTTCACTACCGGAATCCTAGACATGAGATCAACCCTGGTCGGCGTCATAAGAGTGGATCCGGCGGAATTACTGGAAGAGGGTCTATTGAGAGAACTAGACGGCCACATCAATAAGAAGTTTTTTGAATTCATTGAGCCGCAAGCGAAAAAAAATAACAGTTTGATGAATAGACTGCAGAAGCTTGCGGAGAGTATGGAGGGTTACAAGAGGTCTCTGGAGTACATCCAAGACTATATAAATATCCATGGATTGAGAATATTACAAAAGCAGTCTGTGATGGCATATGCCTATCTCTTAAGGATAACATACTTTATCCATTTTACAACTTTTAAAAATTGGCATTATTACATAAATACGGTACTAAGTCTTATCAAAATAAATTACTTCGTTTTCAAGCCCACATCAGTATCTGTAAATTACTTTTTCAATCATTTAAATCGGGGAATATGTGTCTATATAAACATATGCTCATCCTGGTTTGATGTAAAGTCTCACAATGAGGTTGTGAACACTAAAACATTTACTAAGTTAAAAGAGGCCATCGGTGTTGTTGGCCTCCATGGCTTGGACACCCTGTATGGCTACATGATAAAGAACCAGCTACAGCATGTTCAGAATATATTTAGAAGTCATCCGGAAAGAAATGTAGTTAGCACGGATATGAAAGATTTCGACCAGTTTGTGCTCAAGGGACAGAAGTCGTATCAGCAGTTAGCAGACACTGTCGTACAGATTGGTACCTTGCAACTACTGAGAAAACATATAGCGTATCAATTGAACACCACTTGTAAATTTGATTCTGCACAATTGGAAGCGTCCTTGAGAACTATGAACGAATCAGTTTTAAACGAAATCAAAACAGGAAGTAAATCAGGTTTTAAAACAGTTCCTTTAGCTTTGATGCAAAACTTAGAAGAGTATCTCAGCCGGTGTGGAATTTCAGAACCTTTTGATAAAATATATCTAAAGAATGCTAATGAGTTGGTAAATATTGATATGGGACGATTGATGGCTACTGTGTTGATATCGCAGTTAGCGAGGCTGCAGCTGTGTCAAACTACAGGTGACTTAATATCGCGGAGAACCGGCGACAATATCGACGGTTATCCTCTGTTGGTCGGGGCTTATACTTTATTGCGTCAATCCAAGACTGAGAGCATTGATGTGTTTGTTGATTTCTTATGTCAATACTCCAAGGTCGCTGCTGCGAACAGACATAAGGGTGGTGAAACTACAAATGACGGGGGACTTACCACACAAATACTAAGACTTTTCTGCGAGACATTCAATTACTCCTTCAATAAATTAGAAGATAAGCTACCACTGGCTTTACTATCACAATGCCCTCAAAAATGA

Protein sequence:

>DPOGS213614-PA
MRVFLAEDNLCAQNLLKLVSHGNAILAEILRLKDHVPKIFLLESKEIQQNYQDVIMDFSYFKISEMQEKKINANSKLQDLDDDLKEKYLELITRFYLLFENIYQYIIDLNTFVEQLHDGAFIQQSMETVMKDVEGKQLLCESLYLYGAMLLLCDLYIPGNAREKLLVAFYRYSTNQSQSNVDDVCKLLRETGYNQQHGKRPYDYPVEYFSRVPIHPDFLEKIIGKLRSEDIYNQLTVFTIPEHQSSALATQASMLVVCLFFTPHYLHSDTTRMREIVDKFFPCNWIIPVYMGVTMNIIDYWDGFKAAKNSLNNTCNTKNVKEIYSKRGNSVQLLINKSQLLLKEGTLTDDFVLDNISKIINVILNSNHVLRWLLLHNSDVIFFDNNKKSKQLKDLVIKESGYDPVKTLELLICTAELELKIREFLKKLLDSRSETWNRNKNIALDAVNNLSELFSGSIPKFTKIDENEQLKQWFENIAKQISSLNEPITSKTIKKVTQLLQVLDNVEEFHGIKSTSTVVQLISESKDALKNVLRAASLKEDSLVTLETVADFSYAWCTIDLYTVHMQDSIKENPAVTSRLRALFLKLASAMEIPLLRINQAQSDDLVSVSQYYSSELIKYIQKVLQIIPEMVFKIVEKIVDLQTWKITEVPTRIDKEKLRDYAQLDDRMEVAKLTHSASMFTTGILDMRSTLVGVIRVDPAELLEEGLLRELDGHINKKFFEFIEPQAKKNNSLMNRLQKLAESMEGYKRSLEYIQDYINIHGLRILQKQSVMAYAYLLRITYFIHFTTFKNWHYYINTVLSLIKINYFVFKPTSVSVNYFFNHLNRGICVYINICSSWFDVKSHNEVVNTKTFTKLKEAIGVVGLHGLDTLYGYMIKNQLQHVQNIFRSHPERNVVSTDMKDFDQFVLKGQKSYQQLADTVVQIGTLQLLRKHIAYQLNTTCKFDSAQLEASLRTMNESVLNEIKTGSKSGFKTVPLALMQNLEEYLSRCGISEPFDKIYLKNANELVNIDMGRLMATVLISQLARLQLCQTTGDLISRRTGDNIDGYPLLVGAYTLLRQSKTESIDVFVDFLCQYSKVAAANRHKGGETTNDGGLTTQILRLFCETFNYSFNKLEDKLPLALLSQCPQK-
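
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. It assumes the transcript is a pure coding sequence ending in a stop codon (the nucleotide sequence starts with ATG and ends with TGA, and the protein sequence ends with "-") and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def expected_protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Amino acids encoded by a coding sequence of cds_bp nucleotides.

    Assumes the sequence is coding only (no UTRs); the final codon is
    treated as a stop codon when has_stop is True.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop else codons


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3396
protein_aa = 1131
span_bp = genomic_span(866797, 873411)

print(expected_protein_length(transcript_bp) == protein_aa)  # True
print(span_bp - transcript_bp)  # genomic bases not present in the transcript
```

Under these assumptions, 3396 bp gives 1132 codons, which is 1131 amino acids plus the stop codon, matching the reported protein length. The genomic interval is longer than the transcript, which would be consistent with introns in the gene model.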