Monarch geneset OGS2.0

DPOGS206911
TranscriptDPOGS206911-TA1755 bp
ProteinDPOGS206911-PA584 aa
Genomic positionDPSCF300001 - 1532846-1537177
RNAseq coverage323x (Rank: top 35%)
Annotation
HeliconiusHMEL0160910.087.31% 
BombyxBGIBMGA008046-TA0.079.11% 
Drosophilacar-PA3e-13443.55% 
EBI UniRef50UniRef50_E2ANA60.055.54%Vacuolar protein sorting-associated protein 33A n=9 Tax=Endopterygota RepID=E2ANA6_CAMFO
NCBI RefSeqXP_001603366.10.057.82%PREDICTED: similar to ENSANGP00000014711 [Nasonia vitripennis]
NCBI nr blastpgi|1565543390.057.82%PREDICTED: vacuolar protein sorting-associated protein 33A-like [Nasonia vitripennis]
NCBI nr blastxgi|1565543391e-18057.58%PREDICTED: vacuolar protein sorting-associated protein 33A-like [Nasonia vitripennis]
Group
Gene OntologyGO:00069044.7e-201vesicle docking involved in exocytosis
GO:00161924.7e-201vesicle-mediated transport
KEGG pathway 
InterPro domain[1-583] IPR0016194.7e-201Sec1-like protein
Orthology groupMCL12068 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206911-TA
ATGAGAAAGGAACTGTTAAATCTTTTGCATATTTGCGAAGGATCTAAACTGATAATATGGGATGAATGGCTCGCTAACCCTGTAGGTCTTGTGGCCCAATACTCTTTGTTGATGGAACATGAAGTGGCTGATATATTTCCCCTAAAACCTGGAACTTTACCTACAATTTCTGTGAAACACATTATATTTATAGCACGGCCTAAGCTGGTATTAATGGATTTAGTTGCTGATTATATACTTTCCCTCAGAAACAAACAAAGTGCTACAGTAGAGTTCCACCTATTCTTTGTACCAAGGAGGAGCGAGCTGTGTGAAAAACATCTCAAAAATCGAGGTGTTCTGGGAACACTTTCTATTGAAGAGTTTAGATGTGATATATTTCCCTTTGATAGTGATGTAATGTCACTGGAACTACAAAATGATTTTAGAGAAAATTACTTGGAAGGTGATCCGACTTGTCTGTATAATGCAGCTCAAGCTATACGCACCATCCAGCAATTATACGGAATCATACCAAGAGTTTATGGAAAAGGAGAAGCTGCTAAACAAGTATGGGATCTTCTCTGTAGATTGAATAAGGAAGAACAGGGTAGTCCGCGAGGTGCCTCAACTTCCTGTATCGATCAGCTATTGTTGCTCGATAGAGCCATTGACTTCACCAGTGTCATGGCCACTCAACTCACATATGAAGGACTTATTGATGAGTTGTACGGTATAAGGAGTTCTACTGTCACATTTCCGGGTCACAAGTTTGTTAGCCCGGATGACGCTAGACCGGAAGTAACAGCCAGAGAAAAAAAGAGAATCATACTCAACTCCAGCGAAGAACTGTTTGCTGAGCTGAGAGATTGTAACTTCACATCGGTGGGAGCACAATTGTCAAAGAAAGCAAGAATTATCAAGACCCAATTGGACGAAAGACACAATGACAAATCAGTTCAGGAGATAAAACAGTTTGTCTCCCGTCTGCCACAGATGTTAGCAAACAAACAATCATTGGCAACGCATACAGCAATAGCTGAATACATTAAAGAGACCACAGATTCCTTCGAATTCCATGACACCATCCAGTGTGAAGAAGACTTTATAAATGGCATCGACAACGATAAAGTGTGCCCGTTCATAGAGGATCTCATAGCACAGAAGGTGCCATTGACAAAAGTACTTCGTCTGATCTGCCTCCAATGTGTGACCGGTTCTGGTTTGAAGCCGAAAGTCCTGGAATACTATAAACGTGAACTGGTCCAGGTATACGGGCTCAGTACCTGGTTAACGCTGTGCAACCTTGAAAAATGTGGACTTTTAAAGGCACAGACAGGACCCCGACATTACACTGTACTAAGGAAGACTTTACATCTAACTATGGAGGAATTTGAGTTGGAAGCGAAAGAGAAGAGTTATCTATCCAGCAAATACATCCCGCTGACTGTCAGACTCAGTGAACATATATCCAAGAACAAAGGATGGTCAGCTATCGCTGATGTTCTGGGTTCGCTCCCGGGACCTCTTTTAGAGGAACTACAAAGTCTACAGCCGCGTATGACTCGTCGGAACTCAATCTCCAGCGAAAATTCTTCGTCAATTGAAAACGCTAGGGTGGTTTTGGTTTTCTTTATCGGTGGTTGCACCTACCACGAGATCTCAGCTTTAAGAAGTATCAGCCAGCAAGAAGAATCTAATGTGGAATTTGTAATACTTACTACAAAATTGATTAATGGCAACACTTTTATCGAGTCCCTGGTGGAAGAACATTAA

Protein sequence:

>DPOGS206911-PA
MRKELLNLLHICEGSKLIIWDEWLANPVGLVAQYSLLMEHEVADIFPLKPGTLPTISVKHIIFIARPKLVLMDLVADYILSLRNKQSATVEFHLFFVPRRSELCEKHLKNRGVLGTLSIEEFRCDIFPFDSDVMSLELQNDFRENYLEGDPTCLYNAAQAIRTIQQLYGIIPRVYGKGEAAKQVWDLLCRLNKEEQGSPRGASTSCIDQLLLLDRAIDFTSVMATQLTYEGLIDELYGIRSSTVTFPGHKFVSPDDARPEVTAREKKRIILNSSEELFAELRDCNFTSVGAQLSKKARIIKTQLDERHNDKSVQEIKQFVSRLPQMLANKQSLATHTAIAEYIKETTDSFEFHDTIQCEEDFINGIDNDKVCPFIEDLIAQKVPLTKVLRLICLQCVTGSGLKPKVLEYYKRELVQVYGLSTWLTLCNLEKCGLLKAQTGPRHYTVLRKTLHLTMEEFELEAKEKSYLSSKYIPLTVRLSEHISKNKGWSAIADVLGSLPGPLLEELQSLQPRMTRRNSISSENSSSIENARVVLVFFIGGCTYHEISALRSISQQEESNVEFVILTTKLINGNTFIESLVEEH-