Monarch geneset OGS2.0

DPOGS211085
TranscriptDPOGS211085-TA1770 bp
ProteinDPOGS211085-PA589 aa
Genomic positionDPSCF300007 - 1204514-1207032
RNAseq coverage102x (Rank: top 61%)
Annotation
HeliconiusHMEL0124910.086.59% 
BombyxBGIBMGA002965-TA0.091.17% 
DrosophilaCG7371-PA0.065.59% 
EBI UniRef50UniRef50_B0WGM00.065.08%Vacuolar protein sorting n=5 Tax=Culicidae RepID=B0WGM0_CULQU
NCBI RefSeqXP_973597.10.068.42%PREDICTED: similar to CG7371 CG7371-PA [Tribolium castaneum]
NCBI nr blastpgi|910832250.068.42%PREDICTED: similar to CG7371 CG7371-PA [Tribolium castaneum]
NCBI nr blastxgi|910832250.068.42%PREDICTED: similar to CG7371 CG7371-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[1-590] IPR0072580Vps52/Sac2
Orthology groupMCL10453 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211085-TA
ATGGAAAGCATGCTGTTGGTTTTTCAGAATGATTTGGGAAGCATAAGTAATGAAATTATTAGCTTGCAAAAACGTTCTGTTAATATGTCCGTTCAACTTTCCAACCGTCAAGCACTTAAAGGTCCACTGTCATCATTCATAGAGGATATTGTAGTCTCCGAAACTCTTATTTTTGGTATCAACAACGTTCCTGTAGTTGACAAAGAATTCATGATACAACTGGCTATACTGAACCAAAAACTTAACTTTGTAAAAGAACAAGAATTCAAAGAGACGAAAGCATGTCATGATGTTAAGGACATCTTGGAAAAGTTGAAAATCAAAGCTGTGGCTAAGATAAGGACATATATATTGGAACAGATATATAAATTTCGGAAACCGATGGCCAACTATCAAATCCCACAGAATGCAATGTTAAAATATAAATTCTTTTTTGAATTCATATTATCCAATGAGAGAAATGTCGCACAGGAAATTTGCAATGAATACATTGATACATTGAGCAAAGTCTACTACTCATACTTCAAGTCTTATGCTTCTAGATTAGATAAATTGAAATATGAAGAAGTTCCCACAAAAGATGATCTAATGGGCATAGAGGATGGCTCGAAAGGGGGTTTCTTTCAAAAATCCAATCTTAAAAATAAAAGCACTATATTCACTATTGGTAATAGAGGCGATGTGCTCGCTCAACAATTAGAGGCTCCTATAATTGTTCCTCATGTGCAACAAAAGACAAAGTATTCCTATGAAGCACTCTTCAGAAGCTTACAATATGCGTTAGTGGACAACAGTTGCAGGGAATACTTGTTTACAACGGAATTTTTCCATGTAAAAGGCAGTCATGCTCAGGAATTGTTCGACAGGATACTTGGCAGAACACTGTCTTTACTTGTGAAAAATGTTGAGAACTATGTGTTGGAGTGTTATGATTGTCTCGCGTTGTTTCTATGCATACAACTTATAAATAGATATCGATGGATGTGTCACAAGAGAGCTGTAGCCGCATTGGACAGTTACTGGGATTCCTTATTGGGGACACTTACACCCAGATTGGAATACATTCTTAAACTGAACATTCAAAGTGTCAGAGATTGTGATCCAGCCAAGTTATCAAATAAAGAGATGGGACCTCATTATATAACAAGGAGATACGCTGAATTTTCTGCGGCAATGCTCAGTTTGAGCGAGCAGTTTCCCAATGAAGAGCAAAGTAACCTTCTACTTGCAATGCAAGACGAAGTACATTGTTTCTTGTTAAAGATGGCGGCTGAATTCCCTCAGAGAATACAGCAATTGATATTTTTGATAAACAATTATGATATGGTCTTGAATATTTTAATGGAAAGAACCAGAGACAATACAAAGGAGGCGGAAAGTTTTAAGGAGCAATTACAAGCTAGAAGCTCAGAGTATGTCGAAGAAATACTCAGCCCACATTTTGGAGGTCTCATGCAGTTTGTTAAGGAAGGGGAACAATTACTTGAAAGTGATAAAAAGAATGAACTTGCGAATTTGGAAAAGAAATCCTTGTCGCTGGTCACATCTTTTACAACGAGTTGGAAGCAGAGTCTTGAAGAAATACACAGAGAAGTGCTGGTGTCATTCCCGAATCTAGTTACCGGCTCGGGTTTATTACAAATGGCTCTAACAAATTTTGTTCAGTATTATCATAAGTTTGTTAAACTCCTAACCCCTAATGCACGCACCCAACTTGTAAATATTCATGTTATAATGGTTGAAATCAAGAAATATAAAACAAATTATTGA

Protein sequence:

>DPOGS211085-PA
MESMLLVFQNDLGSISNEIISLQKRSVNMSVQLSNRQALKGPLSSFIEDIVVSETLIFGINNVPVVDKEFMIQLAILNQKLNFVKEQEFKETKACHDVKDILEKLKIKAVAKIRTYILEQIYKFRKPMANYQIPQNAMLKYKFFFEFILSNERNVAQEICNEYIDTLSKVYYSYFKSYASRLDKLKYEEVPTKDDLMGIEDGSKGGFFQKSNLKNKSTIFTIGNRGDVLAQQLEAPIIVPHVQQKTKYSYEALFRSLQYALVDNSCREYLFTTEFFHVKGSHAQELFDRILGRTLSLLVKNVENYVLECYDCLALFLCIQLINRYRWMCHKRAVAALDSYWDSLLGTLTPRLEYILKLNIQSVRDCDPAKLSNKEMGPHYITRRYAEFSAAMLSLSEQFPNEEQSNLLLAMQDEVHCFLLKMAAEFPQRIQQLIFLINNYDMVLNILMERTRDNTKEAESFKEQLQARSSEYVEEILSPHFGGLMQFVKEGEQLLESDKKNELANLEKKSLSLVTSFTTSWKQSLEEIHREVLVSFPNLVTGSGLLQMALTNFVQYYHKFVKLLTPNARTQLVNIHVIMVEIKKYKTNY-