Monarch geneset OGS2.0

DPOGS204445
TranscriptDPOGS204445-TA957 bp
ProteinDPOGS204445-PA318 aa
Genomic positionDPSCF300002 + 54456-57929
RNAseq coverage361x (Rank: top 33%)
Annotation
HeliconiusHMEL0174061e-5876.40% 
BombyxBGIBMGA013642-TA1e-10863.44% 
DrosophilaCG15221-PB3e-3028.75% 
EBI UniRef50UniRef50_Q16HW13e-3934.01%Synaptic vesicle protein n=4 Tax=Culicidae RepID=Q16HW1_AEDAE
NCBI RefSeqXP_001664089.16e-4034.01%synaptic vesicle protein [Aedes aegypti]
NCBI nr blastpgi|1571379461e-3834.01%synaptic vesicle protein [Aedes aegypti]
NCBI nr blastxgi|1571379468e-4134.01%synaptic vesicle protein [Aedes aegypti]
Group
Gene OntologyGO:00550853.3e-10transmembrane transport
GO:00160213.3e-10integral to membrane
GO:00228574.3e-05transmembrane transporter activity
KEGG pathwaytca:6629823e-21 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[4-314] IPR0161969.7e-25Major facilitator superfamily domain, general substrate transporter
[173-308] IPR0117013.3e-10Major facilitator superfamily
Orthology groupMCL22329 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204445-TA
ATGGTTCCGTGGCGGGTGTTCATATGGTCATGGTGTGCGCCGGGGCTGGCCGCGGCCCTGGCGTTACTCTGGCTCCCAGAGAGTCCACGATACATCCTTGCATCCAGTGGACCAGAGAAAGCGTTACCCATCCTTGCTAAGATGTTCGCGTACAACAAAAATTCTAAAATTTCAAATTATCCGGTGGAGAGAATAGTAACGGGCAGCATAGAATCCTCGAAGTCGAGCGGGTTCGTGGGCGCTTTGAAGAACATCACCTTGCTATTCCGACCCCCCCTTCTGAGATGTGTCCTGATCTCACACACCCTTATGTTCGCTGTTTTTATGCTGTCCAGCGGTCTTTACGTCTGGGTTCCCGACATATTGAACAGCATGCTCCGTAGCACGAGCAGTGAGAGCATCACCATCTGTCAAGTCATAAAGGACAAGTTTGAAGCTGCCAAAAACGTCACACTTCTATCACAAGTGGGCTGTAAATCAGAAGTGGCGGTCCGCGTTTTTCCTATATCCATGGGTATGGGCGCTGTGTTCGCGGTCACTTACCTAGCCATTGGCGTGTTCATTGGCAAGGTTGGCAAGAAAACGTTGTACTCGTGTATAATGGTGGTGTGTGGTGTTGCAACTGTGGGTGCAGCGTTCGTACCACAGGGCGGTGCTGCCACGGCGTTGCTGATCCTGGCTCTGTGTTCTGGATGTGCTGCTTCAATACTCGCTGCTATAGCTGTAGATGTATTCCCGACGTGCTTACGGGCTATGGCGATGTGTGTGATGTACATGGTCGGTCGTACTGGCGCTGCCGTGGGCTCCAATCTTCTGGGAGCAACCCTCAACCTGCACTGCGAACAGGCCTTCTGTATCTTTGGAGCCTTCACTGCTGGATCGACTCTACTTATTATATTCTGGCCAAAACCTGAAAAAGTTAGAGAGGAATTGGAGCAGCGAGGGCTGATCTACTGA

Protein sequence:

>DPOGS204445-PA
MVPWRVFIWSWCAPGLAAALALLWLPESPRYILASSGPEKALPILAKMFAYNKNSKISNYPVERIVTGSIESSKSSGFVGALKNITLLFRPPLLRCVLISHTLMFAVFMLSSGLYVWVPDILNSMLRSTSSESITICQVIKDKFEAAKNVTLLSQVGCKSEVAVRVFPISMGMGAVFAVTYLAIGVFIGKVGKKTLYSCIMVVCGVATVGAAFVPQGGAATALLILALCSGCAASILAAIAVDVFPTCLRAMAMCVMYMVGRTGAAVGSNLLGATLNLHCEQAFCIFGAFTAGSTLLIIFWPKPEKVREELEQRGLIY-