Monarch geneset OGS2.0

DPOGS212057
Transcript        DPOGS212057-TA    1395 bp
Protein           DPOGS212057-PA    464 aa
Genomic position  DPSCF300960 + 295-5089
RNAseq coverage   381x (Rank: top 31%)
Annotation
Heliconius        HMEL005412        2e-167  84.37%
Bombyx            BGIBMGA010381-TA  0.0     84.04%
Drosophila        Slh-PD            6e-156  57.26%
EBI UniRef50      UniRef50_Q24179   8e-154  57.26%  Protein sly1 homolog n=26 Tax=Coelomata RepID=SLY1_DROME
NCBI RefSeq       XP_975318.1       1e-170  64.86%  PREDICTED: similar to vesicle protein sorting-associated [Tribolium castaneum]
NCBI nr blastp    gi|91080709       3e-169  64.86%  PREDICTED: similar to vesicle protein sorting-associated [Tribolium castaneum]
NCBI nr blastx    gi|91080709       2e-160  64.86%  PREDICTED: similar to vesicle protein sorting-associated [Tribolium castaneum]
Group
Gene Ontology     GO:0006904        2.7e-163  vesicle docking involved in exocytosis
                  GO:0016192        2.7e-163  vesicle-mediated transport
KEGG pathway      dme:Dmel_CG8228   1e-13
                  K12479 (VPS45)    maps-> Endocytosis
InterPro domain   [10-437] IPR001619  2.7e-163  Sec1-like protein
Orthology group   MCL15146          Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212057-TA
ATGAAGCATCAGAAATCTGATTCATTGTCATACTATGCAATTAATAAAGGAGATACTAAAGATACAGAAATGGAAGCAATCATGGATGAAATTGTAGAAAGTCTTTTTTCAGTTTTTGTCACACTCGGTAATGTTCCAATAATTAGATGTACAAAAGGCAATGCAGCGGAAATGGTGGCTAAGAAATTGGACAAAAAATTGCGGGAAAATCTCTGGGATGCTAGAAATAATTTATTCCATGGTCAGGCTGGAACTTTTAGCTACACGAGACCAATGTTGATATTGTTGGACAGAAATATTGATATGGCAACACCATTACATCACACATGGACCTATCAGGCTCTGGCTCATGATGTGTTGGATTTATCATTGAACAGAGCAGTCGTACCAGAAAACTCTGGGCCAGCAATGCCAGGGCAGAAAGTTAAAACACGAACATGCGACTTGGATTCTAAAGATCCTTTATGGTCTGAACACAAAGGGAGTCCGTTCCCCACAGTAGCAGAAGCTATTCAAGAAGATCTAGACAAATATAGGAGTTCAGAGGCGGAGGTCATGAAATTGAAGAACTCTATGGGACTGGACGCTGACAGCGACCTCGCCTTGAGTATGGTGAGCGACAACACTCAGCGACTCACCAGCGCTGTGAACTCATTGCCGCAGCTCATGGAGAAGAAGAGACTGATTGATATGCACACCACCATCGCTACAGCAATCCTAAACGCAATAAAATCAAGGAGGCTAGATTCGTTTTTCGAATTGGAAGAAAAGATAATGAGTAAGAGCAGCGGTGTTGAAAGTAAAGCTGTGATGGATCTGATAACTGACACGAGCGCTGGCACAGAGGAAGATAAGATGCGTCTATTTATTATTTATTATCTATGTACATCACAAGTACCAGAGGAGGAATATAAGAAGTTTGAAACTGCCTTACTCGCAGCAAATTGCGACATCAAGCCAATGACATACATGAAGAGATGGAAGGGTTTTACGAAAATGTCAAGCCAGGGTCAATACGAAGGCGGCGGAACAAAAACCGTATCAATGTTTTCAAAATTAGTATCACAAGGATCGTCGTTCGTTATGGAGGGAGTTAAGAATTTGGTTGTTAAGAAACATAAGCTGCCTGTCAGTCGCTCAGTGGAATCGGCGCTGACCAGTTCAGAGGGCGAGTTGTCGTGGTTGGATCCAAGAGCGGCTCGGGCTGACGCGACCAGACGCGCCAAGCACGCGCCGCCGACTGACGCCGTCGTCTTCGTGGTGGGCGGCGGCAGTTACATAGAATACCATAACCTGATAGACTTCGCAAAGGTATTTTGTATACATACAATTATATCTTACGATCCTAACATTGAAATATTAAAGATTTCATCTCTAATTGTTTTGTTTATATAA

Protein sequence:

>DPOGS212057-PA
MKHQKSDSLSYYAINKGDTKDTEMEAIMDEIVESLFSVFVTLGNVPIIRCTKGNAAEMVAKKLDKKLRENLWDARNNLFHGQAGTFSYTRPMLILLDRNIDMATPLHHTWTYQALAHDVLDLSLNRAVVPENSGPAMPGQKVKTRTCDLDSKDPLWSEHKGSPFPTVAEAIQEDLDKYRSSEAEVMKLKNSMGLDADSDLALSMVSDNTQRLTSAVNSLPQLMEKKRLIDMHTTIATAILNAIKSRRLDSFFELEEKIMSKSSGVESKAVMDLITDTSAGTEEDKMRLFIIYYLCTSQVPEEEYKKFETALLAANCDIKPMTYMKRWKGFTKMSSQGQYEGGGTKTVSMFSKLVSQGSSFVMEGVKNLVVKKHKLPVSRSVESALTSSEGELSWLDPRAARADATRRAKHAPPTDAVVFVVGGGSYIEYHNLIDFAKVFCIHTIISYDPNIEILKISSLIVLFI-