Monarch geneset OGS2.0

DPOGS208030
TranscriptDPOGS208030-TA1686 bp
ProteinDPOGS208030-PA561 aa
Genomic positionDPSCF300203 - 94756-103739
RNAseq coverage1110x (Rank: top 11%)
Annotation
HeliconiusHMEL0040450.089.35% 
BombyxBGIBMGA001498-TA0.088.27% 
DrosophilaCG3168-PB0.066.86% 
EBI UniRef50UniRef50_Q6U1H10.063.11%SV2-like protein 2 n=1 Tax=Ctenocephalides felis RepID=Q6U1H1_CTEFE
NCBI RefSeqXP_001651533.10.070.76%synaptic vesicle protein [Aedes aegypti]
NCBI nr blastpgi|1571694700.070.76%synaptic vesicle protein [Aedes aegypti]
NCBI nr blastxgi|910761700.071.35%PREDICTED: similar to synaptic vesicle protein [Tribolium castaneum]
Group
Gene OntologyGO:00550853e-31transmembrane transport
GO:00160213e-31integral to membrane
KEGG pathwayaag:AaeL_AAEL0058490.0 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[49-553] IPR0161964.5e-69Major facilitator superfamily domain, general substrate transporter
[93-373] IPR0117013e-31Major facilitator superfamily
Orthology groupMCL16135 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208030-TA
ATGCCGAGAAGAAAGAAGCTCCCCGTTGGAAGCGACACCAGAAACTGTGATGTAACAAGTGACCACCAACTAAACGGACTAGGTTCCAAAACTGTTGACCTCGGCGTCGTGCCTTCGCTCAAGTCACCAGCAAAACTCGACCCGGAAAAGGGATCGAATTCAGAAAAGGCTGACTTCGAACGGGCCATCGAACTAACGGGCTATGGCCGCTTCCACTACATGCTTCTCGCCGTTTGCGGCCTCGTCAGCACATCCGAGGAGATGGATGTCATCTCTATGTCTTTCATCCTACCTTCAGCGCAGTGCGACCTAGAGCTCACCACACAAACTAAAGGATGGCTGAACAGCATTATATTCATCGGTATGATGGTGGGAGCGTACGCCTGGGGCTCCGTTGCGGACTCGCTGGGTCGCAAGCGTGTCCTGATAGCGATCAGTATCGTGAACGCGTTAGCTATCGTCGCCTCTTCCTTCAGTCAGAACTATGAACTCTTTATGCTATTCCGATTTATTAATGGCGCTGCGTTGGGTGGGTCAGGTCCAGTTATCTGGTCCTACTTCGCAGAGTTCCAGCCGAAGAAGAGGCGAGGTGCTATGCTGAGTTTTATGGCTGCTTTCTGGACTTTGGGCAACTTGTTCGTGGCCGGGCTAGCTTGGGTTATCATTCCCAGTGAAATCGGAGGTGCGTACGGTGGTTTCGTTTACAACTCATGGCGGATTTTCTTATTGGTTATGTCAATTCCATCATTTCTCGTCGCCGCTCTACTCTTCCTTCTACCCGAATCACCAAAGTTCCTCATAACGACCGGACGCCACGACAAAGCATTGGAAGTCTTCAAAGGCATATACATGATGAACACTGGAAAGGACAAGGAATTGTATCCAGTGAAGCAGATACTCGTCGACGATCCCATACACGTGAGGCCAGAGAAGCAAGTCGATCTTGAAACTAAAGAACAGAAATCCAAGCTGAGAAGAATGTTGAGCGATATCGTTGAACACAGCAAGGAACTGTTCGTACCACCGATATTGAAATTCACTGCGATCTCAATTACAATCAACTTCACATTCCATATTGGTTACTACGGTCTCATGATGTGGTTCCCCGAGATGTTCAATCGTTTCGACGAGTGGTCCCGAACCCACGACAACGCGGAGGCCGACATCTGTCAGGTCACCGCGTATGTAACGAGACAAGGCTCGCACTCGGACGAGGCGATGTGCGATTCACATATACGCGGCGATGTGTTCATGGACTCATTGATAACTGTGGCTGCGGCGCTACCATCTAATATATTCGCCGTGCTTGGGATGGATAAATTGGGAAGAAAATTTTTCTTAGTGTTTGCAACATTCTCCGCGGGTCTATGTTCGGGTGCTATGTACTTCGTGTACAACAAGACCAACAACCTCATAGTGAGCGCCGTCTTCAGTTCAGTCATTTCGTGCGGAAACGCCTCGCTTGACTGTCTCATCACCGAAGTCTTCCCAACTAATTTACGTGCGACGGGCGTGGCGATATCTATGGTGGCAGCTCGTCTGGGCGGTATCATAGGTAACGTCGTGATAGCGGCGTTGTTAGACTCCTACTGTCCCGCGCCCACCTTCATAGTGGCAGTGCTCCTGGCCGGGGGCGGTCTCATGTGTATCTTCCTACCGAACACAACTAGAACAGCTTTGAAATAA

Protein sequence:

>DPOGS208030-PA
MPRRKKLPVGSDTRNCDVTSDHQLNGLGSKTVDLGVVPSLKSPAKLDPEKGSNSEKADFERAIELTGYGRFHYMLLAVCGLVSTSEEMDVISMSFILPSAQCDLELTTQTKGWLNSIIFIGMMVGAYAWGSVADSLGRKRVLIAISIVNALAIVASSFSQNYELFMLFRFINGAALGGSGPVIWSYFAEFQPKKRRGAMLSFMAAFWTLGNLFVAGLAWVIIPSEIGGAYGGFVYNSWRIFLLVMSIPSFLVAALLFLLPESPKFLITTGRHDKALEVFKGIYMMNTGKDKELYPVKQILVDDPIHVRPEKQVDLETKEQKSKLRRMLSDIVEHSKELFVPPILKFTAISITINFTFHIGYYGLMMWFPEMFNRFDEWSRTHDNAEADICQVTAYVTRQGSHSDEAMCDSHIRGDVFMDSLITVAAALPSNIFAVLGMDKLGRKFFLVFATFSAGLCSGAMYFVYNKTNNLIVSAVFSSVISCGNASLDCLITEVFPTNLRATGVAISMVAARLGGIIGNVVIAALLDSYCPAPTFIVAVLLAGGGLMCIFLPNTTRTALK-