Monarch geneset OGS2.0

DPOGS201067
TranscriptDPOGS201067-TA1929 bp
ProteinDPOGS201067-PA642 aa
Genomic positionDPSCF300185 - 334805-336830
RNAseq coverage136x (Rank: top 55%)
Annotation
HeliconiusHMEL0210700.076.59% 
BombyxBGIBMGA007200-TA2e-12571.71% 
DrosophilaVps33B-PA4e-9635.03% 
EBI UniRef50UniRef50_Q7PUM32e-10134.34%AGAP001463-PA n=4 Tax=Culicidae RepID=Q7PUM3_ANOGA
NCBI RefSeqXP_001657113.15e-11837.72%vacuolar protein sorting (vps33) [Aedes aegypti]
NCBI nr blastpgi|1571376491e-11637.72%vacuolar protein sorting (vps33) [Aedes aegypti]
NCBI nr blastxgi|1571376498e-11537.44%vacuolar protein sorting (vps33) [Aedes aegypti]
Group
Gene OntologyGO:00069042.6e-111vesicle docking involved in exocytosis
GO:00161922.6e-111vesicle-mediated transport
KEGG pathway 
InterPro domain[1-642] IPR0016192.6e-111Sec1-like protein
Orthology groupMCL15848 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201067-TA
ATGGATCAACCACTCTCTTTAAAGTTGGCTTCTTTGTGTCAAATATCCCAGTATAAATTGGAGCACATCCTGTCCCAATGCGGGGAAAAAGCCGATTTAATCATTGATCCTGTTCTTATAAAGCCACTGGAGAGAATATGTGGAGTGACTTGGCTCAGGCAACATGGTATAGATAAAATATATAAAATGGACCCACAACTTGGATCTACATCTAATACGAGAAGGATATACTTTATACCAGCTTGTATCAAGAAATACAAATGTGTGTTGGATCAGATATCATCACTTCTCAGCCTAAACCCAGCTCAAGCCGATGCCAAGTGTTTTAATATTATTATAGTGCCTAAATTCGTATGTACTTTCGATTCCATATTAGAGAATAAGGGTTTATATGAAATAGTCAAGCTACATTCATTCTCATGGGAATTGATGAGGTTGGATGATCAGCTCTTGAGCCTGGAGGTGCCTTTCCTTTATAAACAATTGTATGTGGAAAAAAATCAATCATTGTTGTCGAGTATTGCTATGTCTTTGTGGAGCTTATTTCATGTTATCGGCAAACCTAAATGTTTGATGTCATTGGGAAAGATGTCATCAAGCGTTTTAGACCTACTTGAAGTATATAACGAGACCTTCACTAGAGACTTTGTTAGTAACAATACTGATGATATCGGAGCAGTTCTCGTTATAGATAGAGATCAGGATTATTCTTCCAGTCTACTAACACCGGCTACTTATAGTGGACTGTTAAGTGAAATTTTCAATATAAACTGTGGTCATCTAGAGCTTAATGTGAAAGAAACAAAGTTGAAGAAGGGCAAGCTTAATTTTACCACATCTAAAGATGAAACAGCCAAAACCGCAGTCATGTCGTTAGACAGCTCTATTGATCATTTGTACGGTGAAATAAAGCATAGGCATTTCTCTGAAGTTTTAAGCGTGTTGAGTTCAAAGGCGAAGCTTCTCAAGAACGAGGATATCAAGGCTCTCGGAATACAAGAAATGAAACAGTTTGTTGCGACTAAATTGCAACAAGTTACACTGTTCAAGCAGAATCTAGTGAATCACGTGTTGGCCAGCGAGACGATAATATCAGAAATGAACAATAAATTTGAAAACTTAACATTAACGGAGAGTGAAATGCTCAATAATAGAAATAAAAGGGCCAATTTCACTTACATCGATGAACACTTTGGCACGGATGTCCACATGCATAACTCCTTAAGACTAATGTGCTTGTTAAGTTTAACCCAAGGTCTGACCTACGAGGAGTACAATTCGCTTGTACACAAGTACCTATTAGCATTCGGCTACAAGTTTTTATATGTTTTCAACAATTTGATAACATCCGGTTTACTCGTACAACCTTCCAGCCCAAAATTGTCTTTGTCCAATCTGAGCAATTTAAGCGACAGATTGCCTAAATGGCAGACTGAGTTCCAAACGGTGGCCAACAAATTGAAACAATTGCCGACCAAGTCAGATAGTGCGAAGTCTCCGAGCTACGTGTTCAACGGCGGCTACATACCTCTCGTAGCTGTTATATGTAACGCACTATTGACATCGGAATCCTTAGCTGAACTCCTGACGAAACTATCAGCGCTCGATGATCTAAAAATCGGCGGCAGTTCAGTAGACAAAGCGAAACGAGGTATGGAGAGCTTAAACGAAAAAATAACGAATTCAAACACGAAAACCGATTCGGAAAATGGCTACAGGGATGCTAAGACCCTGTTGAAGACGCTGAAGGGAGACAGCAGAACTATGTTTACTTTGAAGCCTAGAACAGTGCTGGTTTATATGATCGGGGGGGTGACGTATGCGGAAGTAGCCGCCTGTGACGTCATTCAGTCCGTAACAGGAAGCAGGATCATTATAGCCAGCGACTGTATAGCGAGCGGCAGCGACCTGATCGCCGCTAATATGTAA

Protein sequence:

>DPOGS201067-PA
MDQPLSLKLASLCQISQYKLEHILSQCGEKADLIIDPVLIKPLERICGVTWLRQHGIDKIYKMDPQLGSTSNTRRIYFIPACIKKYKCVLDQISSLLSLNPAQADAKCFNIIIVPKFVCTFDSILENKGLYEIVKLHSFSWELMRLDDQLLSLEVPFLYKQLYVEKNQSLLSSIAMSLWSLFHVIGKPKCLMSLGKMSSSVLDLLEVYNETFTRDFVSNNTDDIGAVLVIDRDQDYSSSLLTPATYSGLLSEIFNINCGHLELNVKETKLKKGKLNFTTSKDETAKTAVMSLDSSIDHLYGEIKHRHFSEVLSVLSSKAKLLKNEDIKALGIQEMKQFVATKLQQVTLFKQNLVNHVLASETIISEMNNKFENLTLTESEMLNNRNKRANFTYIDEHFGTDVHMHNSLRLMCLLSLTQGLTYEEYNSLVHKYLLAFGYKFLYVFNNLITSGLLVQPSSPKLSLSNLSNLSDRLPKWQTEFQTVANKLKQLPTKSDSAKSPSYVFNGGYIPLVAVICNALLTSESLAELLTKLSALDDLKIGGSSVDKAKRGMESLNEKITNSNTKTDSENGYRDAKTLLKTLKGDSRTMFTLKPRTVLVYMIGGVTYAEVAACDVIQSVTGSRIIIASDCIASGSDLIAANM-