Monarch geneset OGS2.0

DPOGS200350
TranscriptDPOGS200350-TA1197 bp
ProteinDPOGS200350-PA398 aa
Genomic positionDPSCF300026 + 557372-568780
RNAseq coverage925x (Rank: top 14%)
Annotation
HeliconiusHMEL0000362e-11472.12% 
BombyxBGIBMGA005643-TA6e-9661.40% 
Drosophilawash-PA1e-1927.08% 
EBI UniRef50UniRef50_D0AB845e-11272.12%Putative WAS protein family homologue 1 n=2 Tax=Nymphalidae RepID=D0AB84_9NEOP
NCBI RefSeqXP_968173.11e-4632.29%PREDICTED: similar to open reading frame 19 [Tribolium castaneum]
NCBI nr blastpgi|2613359462e-11172.12%putative WAS protein family homologue 1 [Heliconius melpomene]
NCBI nr blastxgi|2613359461e-13558.72%putative WAS protein family homologue 1 [Heliconius melpomene]
Group
KEGG pathway 
InterPro domain[192-260] IPR0218541.4e-17WASH complex, subunit WASH
Orthology groupMCL10947 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200350-TA
ATGGAGGGTCTTTATAAAATTAATTTGATACCCAACGACCTTAGCGTCGAAGAAACCGTGTTACAAATAGCCGATACATTAGACAATCTGAATGGGATAGTTGACGATGTTTTTAAACGTATATCAAATAAAATTAAAATCAACGTTGAAAAGACGTCGAAACTTCAGGAGAGAATCAATGTATCCAGGACAAAAGTTGAAAAACTTGCCGGGACACAGAAGGCGATCAAAGTATTTTCAAGCGCTAAATACCCGTCGTCTATAACACACGAACATTACAAATCCATTTTCGAATCAAACGATTATAATTATGAACCCAAAAACGTTATACCAACCGGAAAATCCAACAGACAGACAAACGAAAAAGCCATCCAGGAGAAACTTCATTTCTTCCACGTGAAAGTCGCTGAACCTAAAAATAATAAAACCAGGAACGATTTCGATCTGAATACGGTTTTGAATTCAATAACATCAATCGGAGATCTCCTTATATACAAGAGCGACGAGAGCCCGTACTTTGGTAGTAAAACCAAAGGGCAGACTTACGTGCCCAAAGTAAACACGACCGTAGATAAGGGCTCGTTGGACGAAGCACCACCCTCTATTGTGAAAAAGAATCTGTTGAAGCGAGAAATCGACGAGTACATGTACGCTCCAGGAATGGGCTTGGTGCCAGAATTGGACATGCCCCTGGATCTTCCACATCTTCCCGGTATAGCCGGGGACGTTCAGTATTCGGTTACTGGGGATGGGTCTATAGCGCCATCAGCTGTAACATCACCCGTCGCCATCACAAACCCCATCCCCCGCCCCCGCCGCCACCACCCCCGCCGCCGATGGAGATCACACAAACGCCAAATAGCTAATTTTGGGTTCCTTGTCCACAGACGAGAGCAGCGTGAAGCTAGTGCTGCTGCCCCGCCTCCGCCCGTGGATGCCCACGCGAACCTGATGGCGGCCATCCGGCAGGCGGGCGGCGTCGGACGGGCGAAGCTGCGGCACGCTGACGACGTAACAACAGAAAAGGCGAGCAAACCTGTCGGTGGCGATCTGATGGCTGATCTTCACGCCAAGCTGTCGATGCGTCGTCGTGGCATATCGGGTGCTGAGGGTACCGTGCTTCATACGCTGGCGAGGGTTATACCGGAACCGGGGGAAACTACCGAGAGGTCCTCCAGCGACGACGAATGGGATTAA

Protein sequence:

>DPOGS200350-PA
MEGLYKINLIPNDLSVEETVLQIADTLDNLNGIVDDVFKRISNKIKINVEKTSKLQERINVSRTKVEKLAGTQKAIKVFSSAKYPSSITHEHYKSIFESNDYNYEPKNVIPTGKSNRQTNEKAIQEKLHFFHVKVAEPKNNKTRNDFDLNTVLNSITSIGDLLIYKSDESPYFGSKTKGQTYVPKVNTTVDKGSLDEAPPSIVKKNLLKREIDEYMYAPGMGLVPELDMPLDLPHLPGIAGDVQYSVTGDGSIAPSAVTSPVAITNPIPRPRRHHPRRRWRSHKRQIANFGFLVHRREQREASAAAPPPPVDAHANLMAAIRQAGGVGRAKLRHADDVTTEKASKPVGGDLMADLHAKLSMRRRGISGAEGTVLHTLARVIPEPGETTERSSSDDEWD-