Monarch geneset OGS2.0

DPOGS210180
Transcript: DPOGS210180-TA, 3051 bp
Protein: DPOGS210180-PA, 1016 aa
Genomic position: DPSCF300393 + 96406-118764
RNAseq coverage: 246x (Rank: top 42%)
Annotation
Heliconius: HMEL012752, E-value 0.0, identity 84.59%
Bombyx: BGIBMGA014194-TA, E-value 0.0, identity 79.38%
Drosophila: Vps35-PB, E-value 0.0, identity 58.08%
EBI UniRef50: UniRef50_E2BGH6, E-value 0.0, identity 64.60%, Vacuolar protein sorting-associated protein 35 n=7 Tax=Formicidae RepID=E2BGH6_HARSA
NCBI RefSeq: XP_392327.2, E-value 0.0, identity 64.65%, PREDICTED: similar to vacuolar protein sorting 35 isoform 1 [Apis mellifera]
NCBI nr blastp: gi|307207458, E-value 0.0, identity 64.60%, Vacuolar protein sorting-associated protein 35 [Harpegnathos saltator]
NCBI nr blastx: gi|307207458, E-value 0.0, identity 64.27%, Vacuolar protein sorting-associated protein 35 [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain: [106-1012] IPR005378 0 Vacuolar protein sorting-associated protein 35
Orthology group: MCL14243, Single-copy universal gene

Nucleotide sequence:

>DPOGS210180-TA
ATGGCCTCTTCGGGTTTGCCGACGTGTCCCGGTAGCTCAGCGACGAGGAGACGCAACGAGAACGGTACCAACGAGCCGCCCTCAAGTAGCCTGTCCGTAGAAGCTCCCGGAGCCCTGCCTCGTCCCGACTCACATGCTCCGCTCGGACCACACCACGGCTACCAGCTGATGACACACCAAGATACTTCACAGCTATCTCTTGAAGTGCATCAGCCTGGTTCGGGTCCAATTGTATCAAAGAATGTTGATGGATCTGGTAAGCTGGTAGCCGGGACTGCAGTTGCCGAACTCACAGTGGGTATTACATTAGCACAAATGACAAACCAGGCGTCGCCCGTAGAGGAGCAGGAGAAGCTTTTGGAGGAAGCGTTAAGCAATGTAAAATTTCAGGCATTTCAGATGAAAAGATGCCTGGACAAATCAAAACTTATGGATGCATTGAAACACGCTTCTACAATGCTGGGAGAGTTGAGGACATCACTTTTGTCACCAAAGAGTTATTATGAATTATACATGGCAATTACTGATGAGCTCCGCCACTTGGAACTGTATCTTCTAGAAGAATTCCAGAAGGGTCGTAAAGTTGCAGATTTGTATGAACTAGTACAGTATGCTGGGAATATTGTACCTCGTTTGTACCTATTAATAACAGTCGGATTAGTTTACATCAAAACTAATACAAATTTAAGGAGGGATTTGCTCAAGGATCTGGTTGAAATGTGCCGTGGCGTGCAGCATCCGTTACGCGGGCTGTTTCTCAGGAATTATCTCCTCCAGTGTACAAGGAATGTTCTGCCCGACACCGTGGAAGCGGAGAATGAAAATGAGGGGAATGTCAGAGACGCTATTGACTTTGTGCTGATGAATTTCGCCGAAATGAATAAGCTGTGGGTCAGGATGCAACACCAAGGACATTCGAGGGACAAAGAGCGTCGTGAGCGTGAAAGATCAGAACTTCGCATCCTGGTTGGCACCAATCTTGTCCGAGTGTCACAGCTGGAGTCTGTCAGTGAGGCGGATTATCGGAGGCTGGTGCTGCCAGCTATATTGGAACAGGTCGTCTTCCCCGATGAGTTTCACTTGGCAAACCTTCAGCCGTTCCTGAAGTCATGCGCTGAACTGCAGCCCGGTGTTAACATAAAGAACATCATTATAGCGCTCATTGAACGACTTGCCGCCTACAGTCAGCTCCTAACAACAAAACCAGACATGGCAATTACTGATGAGCTCCGCCACTTGGAACTGTATCTTCTAGAAGAATTCCAGAAGGGTCGTAAAGTTGCAGATTTGTATGAACTAGTACAGTATGCTGGGAATATTGTACCTCGTTTGTACCTATTAATAACAGTCGGATTAGTTTACATCAAAACTAATACAAATTTAAGGAGGGATTTGCTCAAGGATCTGGTTGAAATGTGCCGTGGCGTGCAGCATCCGTTACGCGGGCTGTTTCTCAGGAATTATCTCCTCCAGTGTACAAGGAATGTTCTGCCCGACACCGTGGAAGCGGAGAATGAAAATGAGGGGAATGTCAGAGACGCTATCGACTTTGTGCTGATGAATTTCGCCGAAATGAATAAGCTGTGGGTCAGGATGCAACACCAAGGACATTCGAGGGACAAAGAGCGTCGTGAGCGTGAAAGATCAGAACTTCGCATCCTGGTTGGCACCAATCTTGTCCGAGTGTCACAGCTGGAGTCTGTCAGTGAGGCGGATTATCGGAGGCTGGTGCTGCCAGCTATATTGGAACAGGTCGTGAGCTGCAGGGATCCCATAGCACAGGAATATCTCATGGAGTGTATCATACAGGTCTTCCCCGATGAGTTTCACTTGGCAAACCTTCAGCCGTTCCTGAAGTCATGCGCTGAACTGCAGCCCGGTGTTAACATAAAGAACATCATTATAGCGCTCATTGAACGACTTGCCGCCTACAGTCAGAGGAACGAGGGGAATGTGAATCTGAGTGTTGTCCTTGATGATGGACAGGAACAAGAGGTGCA
ATTGTTCGAGGTGTTCTCTGATCAGGTCGCTGCCATCACTCAGAGTCGCACAGACATGCCGCCGGAGGACATGCTCTCTCTGCAGCTGGCGCTGTTGAAACTAGCACAGAAATGTCACCCTGACAAGCTGTCTTATGTGGACAGGGTGTTAGCTCACACCGACAGGATATGTGTAGACATACTACCATCAGGAAAACCATACTTGGAGCACAATACACCCGTGTTCAAAGAGCTCATGAAGATACTGAAGCTGCCAGCTGATCATTACAAGAACATACTCACATTGATCAAGCTCCAGAACTACGCTCCACTCATCAACAGGCTGAGCCAGCCCGGCAGGATGCTGATAGCTGTTCATCTTATCAACGACGTCCTCGAGAGCAATACAACTGTCTCCACACCAGAAGATTGGGCATTGAACGATGCGTCCCGAGCTCTTGACTGTCTGAAGAAAGCGGCCCGCGTCGCCCAGCAGTGTATGGACGGAGGTGTGCAGGCCCAGCTGTTGGCTGAGCTGCTGGGTCGGTACGCGCTTCTCAGGGAGAGGGGACACGCCAGCCTCACCGCGCCTCTCATACAAGCGGTAGGACTGATCCACCACTTCAAGTCGGACTCGGCCGACCAGCAGTACCTCATCTTGAGCACCGCCCGCCGCCTGCTGCAGGGCGGGGGCGCCGCCCGCATACAGCACACTTTCCCGCCGATAGTGTTCCACGCCTACTCGCTGGCATTCACCTACCACCAGCTCAAGGACCAGGATGAGATGTGGGAGAAGAAATGCCAGAAGATATTCCAGTTCTGTCATCAGACGATCAGTTTGCTGGTGAAGGCCGAGCTCGCTGAACTCCCACTAAGATTGTATCTCCAAGGAGCTCTCGCTATAAGCGAGATAGGTTTCGCTAACCACGAGACCATAGCCTACGAGTACTTATCACAGGCGTTCTCGTTGTATGAAGACGAAATATCGGACAGCAAAGCCCAACTGGCGGCCATCACGCTAATAATAGCAACGTTCGAACAAATCAATTGCTTCGGACAGATTTTAATTTGA

Protein sequence:

>DPOGS210180-PA
MASSGLPTCPGSSATRRRNENGTNEPPSSSLSVEAPGALPRPDSHAPLGPHHGYQLMTHQDTSQLSLEVHQPGSGPIVSKNVDGSGKLVAGTAVAELTVGITLAQMTNQASPVEEQEKLLEEALSNVKFQAFQMKRCLDKSKLMDALKHASTMLGELRTSLLSPKSYYELYMAITDELRHLELYLLEEFQKGRKVADLYELVQYAGNIVPRLYLLITVGLVYIKTNTNLRRDLLKDLVEMCRGVQHPLRGLFLRNYLLQCTRNVLPDTVEAENENEGNVRDAIDFVLMNFAEMNKLWVRMQHQGHSRDKERRERERSELRILVGTNLVRVSQLESVSEADYRRLVLPAILEQVVFPDEFHLANLQPFLKSCAELQPGVNIKNIIIALIERLAAYSQLLTTKPDMAITDELRHLELYLLEEFQKGRKVADLYELVQYAGNIVPRLYLLITVGLVYIKTNTNLRRDLLKDLVEMCRGVQHPLRGLFLRNYLLQCTRNVLPDTVEAENENEGNVRDAIDFVLMNFAEMNKLWVRMQHQGHSRDKERRERERSELRILVGTNLVRVSQLESVSEADYRRLVLPAILEQVVSCRDPIAQEYLMECIIQVFPDEFHLANLQPFLKSCAELQPGVNIKNIIIALIERLAAYSQRNEGNVNLSVVLDDGQEQEVQLFEVFSDQVAAITQSRTDMPPEDMLSLQLALLKLAQKCHPDKLSYVDRVLAHTDRICVDILPSGKPYLEHNTPVFKELMKILKLPADHYKNILTLIKLQNYAPLINRLSQPGRMLIAVHLINDVLESNTTVSTPEDWALNDASRALDCLKKAARVAQQCMDGGVQAQLLAELLGRYALLRERGHASLTAPLIQAVGLIHHFKSDSADQQYLILSTARRLLQGGGAARIQHTFPPIVFHAYSLAFTYHQLKDQDEMWEKKCQKIFQFCHQTISLLVKAELAELPLRLYLQGALAISEIGFANHETIAYEYLSQAFSLYEDEISDSKAQLAAITLIIATFEQINCFGQILI-