Monarch geneset OGS2.0

DPOGS206843
Transcript         DPOGS206843-TA   1662 bp
Protein            DPOGS206843-PA   553 aa
Genomic position   DPSCF300001 - 3083452-3087410
RNAseq coverage    78x (Rank: top 65%)
Annotation
Heliconius        HMEL013285         0.0      74.73%
Bombyx            BGIBMGA012797-TA   4e-108   83.48%
Drosophila        Vps45-PA           0.0      56.58%
EBI UniRef50      UniRef50_Q9NRW7    0.0      59.37%   Vacuolar protein sorting-associated protein 45 n=76 Tax=Metazoa RepID=VPS45_HUMAN
NCBI RefSeq       XP_970273.1        0.0      59.50%   PREDICTED: similar to Vacuolar protein sorting-associated protein 45 (mVps45) [Tribolium castaneum]
NCBI nr blastp    gi|340725880       0.0      62.13%   PREDICTED: vacuolar protein sorting-associated protein 45-like [Bombus terrestris]
NCBI nr blastx    gi|340725880       0.0      62.13%   PREDICTED: vacuolar protein sorting-associated protein 45-like [Bombus terrestris]
Group
Gene Ontology     GO:0006904   1.5e-262   vesicle docking involved in exocytosis
                  GO:0016192   1.5e-262   vesicle-mediated transport
KEGG pathway      tca:658823   0.0
                  K12479 (VPS45)   maps-> Endocytosis
InterPro domain   [1-550]   IPR001619   1.5e-262   Sec1-like protein
Orthology group   MCL11559   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206843-TA
ATGACGGAAGAGAGTGGACCTGGAATGAAAGTTATACTTATGGATAAAGAAACAACCAGCATAGTCAGTATGGTATTCAGCCAATCTGAAATACTCCAAAAGGAGGTTTACCTTTTTGAAAGGATTGACAGTCACTCGAAATGGGATGACTTAAAACACATGAAATGTATTGTATTCCTGAGACCTACATCCGAAAATATTGCATTACTTTCAAGGGAGTTGAAAAGCCCCAAATATGGTGCTTACTTTATATATTTTAGCAATGTTGTTTCCAAAGCTGACATCAAGACGTTAGCTGAGTGTGATGAGCAGGAAACAGTGCGTGAGGTTCAGGAGGTGTTTGCTGACTACCTGGCTGTGGACCGACATCTGTTTTCTTTCAACATTGTTAGTTGCTTGCATGGTCGTTCATGGAAACAGCATCACCTTCAGCGGTGTTCACAGGGTCTGTTGGCTCTACTGCTGTCCCTCAAGAGACGTCCAATCATCAGATATGAAGCTAGCTCTGAAGCCTGTGCACGTCTCGCAGAACGGGTCAAAGAACTGATAAGAAGAGAGGCAGTTTTGATGGACAATAACATACCATTCAATGGTGATATACCACCGCCGCAATTGCTGGTGCTGGACAGAAGAGATGATCCCGTTACACCATTGTTACATCAGTGGACATACCAGGCTATGGTCCACGAGTTGCTAACCATCGATAACAATCGCGTCAGTCTCGCCGGGGTTCAGGACGCACCCAAGGACTTGAAGGAGGTTGTACTATCATCAGAACAAGATGAGTTCTACGCTAAGAATTTATATTCTAATTTCGGTGAAATCGGTCAAACGATGAAATCTCTGATGGATGAGTTCCAAAAGAAGGCGAAGAACCACCAGAAGGTGGAGAGTATTGCTGACATGAAGAACTTCGTGGAAACTTATCCTCTGTTTAAGAAAATGTGCGGTACTGTTACGAAGCACGTGACAGTGGTAGGTCAGCTGTCATCAGTGGTGGGGTCACGTCGCCTGCTGCAGGTCTCCGAGCTGGAACAGGAGCTGGCGTGTCACGCGGACCACACCAGACATCTGCAGCGGCTCAAAGCCATGTTATCGGACGAGGCCATCGCCGGGACGGAGCTGGTGAAGCTCGTGTGTCTGTACGCGCTGCGCTATGAGAAACACGCGGCCAACGCGTTGCCGGCTCTCATAGACAGTCTCAAGGGTCGGGGGGCGGAGCACCGGGCGCCCGCGCTACTACTGGAGTACGGAGGGGCGCACGCTCGCCAGAGCGACCTGTTCGGGCTGCAGGATGCCGCCAAGATTACTAAGAGACTTTTCAAGGGTCTCAGTGGCGTGGAAAACATCTACACTCAGCACACGCCGCTGCTCAAGGACACTCTGGAAGACCTCATCAAAGGGAAACTGAGAGAGAACTTGTATCCTGCGGTAGGAGGAGAGCTTCTCAACAGGCGACCCCAAGACATCATAGTCTTCATAGTAGGAGGAACCACATACGAGGAGGCGCTGTGTGTCCATCAAATAAACCAGTCCTACCCTGGAGTGAACGTGGTGCTCGGTGGTACCACCATACACAACTCCACCACATTCCTTAATGAGGTGAAGGAGGCTATGCACGGACAACACAGGACACACACGAGGCATATTAGAAATTTATAA

Protein sequence:

>DPOGS206843-PA
MTEESGPGMKVILMDKETTSIVSMVFSQSEILQKEVYLFERIDSHSKWDDLKHMKCIVFLRPTSENIALLSRELKSPKYGAYFIYFSNVVSKADIKTLAECDEQETVREVQEVFADYLAVDRHLFSFNIVSCLHGRSWKQHHLQRCSQGLLALLLSLKRRPIIRYEASSEACARLAERVKELIRREAVLMDNNIPFNGDIPPPQLLVLDRRDDPVTPLLHQWTYQAMVHELLTIDNNRVSLAGVQDAPKDLKEVVLSSEQDEFYAKNLYSNFGEIGQTMKSLMDEFQKKAKNHQKVESIADMKNFVETYPLFKKMCGTVTKHVTVVGQLSSVVGSRRLLQVSELEQELACHADHTRHLQRLKAMLSDEAIAGTELVKLVCLYALRYEKHAANALPALIDSLKGRGAEHRAPALLLEYGGAHARQSDLFGLQDAAKITKRLFKGLSGVENIYTQHTPLLKDTLEDLIKGKLRENLYPAVGGELLNRRPQDIIVFIVGGTTYEEALCVHQINQSYPGVNVVLGGTTIHNSTTFLNEVKEAMHGQHRTHTRHIRNL-