Monarch geneset OGS2.0

DPOGS210577
TranscriptDPOGS210577-TA1494 bp
ProteinDPOGS210577-PA497 aa
Genomic positionDPSCF300168 - 628107-633679
RNAseq coverage45x (Rank: top 71%)
Annotation
HeliconiusHMEL0129102e-16563.80% 
BombyxBGIBMGA013629-TA4e-7533.14% 
DrosophilaCG31106-PB3e-4726.99% 
EBI UniRef50UniRef50_B3M2831e-5229.98%GF17899 n=4 Tax=Drosophila RepID=B3M283_DROAN
NCBI RefSeqXP_001850917.13e-5429.61%synaptic vesicle glycoprotein 2B [Culex quinquefasciatus]
NCBI nr blastpgi|1700467587e-5329.61%synaptic vesicle glycoprotein 2B [Culex quinquefasciatus]
NCBI nr blastxgi|1700467582e-5629.17%synaptic vesicle glycoprotein 2B [Culex quinquefasciatus]
Group
Gene OntologyGO:00550854e-22transmembrane transport
GO:00160214e-22integral to membrane
KEGG pathwaytca:6601559e-41 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[1-490] IPR0161963.4e-55Major facilitator superfamily domain, general substrate transporter
[45-317] IPR0117014e-22Major facilitator superfamily
Orthology groupMCL34840 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210577-TA
ATGAAACCGGAAAAGGAAGAGTTGCCCTCACAAAAAAATATCCAGAACAAATATATGGATGTGGATCTAGATAGAGCCTTGGACCTCTCTGGCGCTGGAAGGTACCAGGTATTCCATTGTGCAATGATGTTGGCCGTTATTAGTACTGTCATCGTTGAAATGATTGGAAACGCATTTATTTTACCAGCTGCGGTGTGTGACCTTGGATTGTCTAATAGTCTACGGGGTTTACTTACATCTGTTCCCAATATCGGCGTCATATTAACTGCACCCGTTTGGGGCAGGGCAGCTGACGCGCTCGGTAGGAAACCAGTGCTGCTGTTTTCTACAGCAGTTTCAGGAATCTTTGCACTTGTGGCAGCTTTCATGCCGAATCTGATAAGTTTTGCATTATTCAAGTTTGCAAGTTCGTTATTCCTTTCCTGTCCATCGTCACTGGGTTTCGCGTATGCAGGAGAATTAATGCCACGACAAAAGAGAGATCTAGCAGTTATGATTTGCAATGCACTCTCAATGCTGACATCCACTTTTTGTCCCCTCCTTGCATGGGGCATTCTTTCATATAATTGGGACAGTAGCCCAATACCGCTACGTCCCTGGCGAGTGTTGACAGTTGTATACGCCGCTCCTGTTATATGTGCTGCTATATGGGTCACCCAAGCCAAAGAGAGCCCCAAATTTCTTATGGCTAAAGGCAGACATGAAGAGGCGCTGGATGTCTTGAAGCACATCTACGCTTTTAATACTGGATTAGACAAAGAAAATTATTGTGTAAAATCATTGAAGATTTGTTCCGACCATACAATCGAATATAACAGGGACTCGCCGAGCAGTGAAGAGGATTCATTATCTTTATTAAAACCGCCGCATTTAAAGTGGCTCGTCTTGATCGGTTTCCTTATGTTCGGATTATTTTCACTGTTAAACGGGTTGTTTTTGTTTGTGCCTTTCACTTTAAACAAAGCGGTGCTGTCAGATAAACCACGAACTGTATGCGAATTAATAAACCAGCCTCAGAATCAGACGTCGGCTGTTTGCTCTGACACTATCTCCTATGGGACCTTCCAAGTCACAGTAATCTGTTCTCTCGTCTATGGAACCCTTGTGATGTTGGTCAGCCTCAGCTCCGTCTCCAAGAGGACCCAGCTCATCATCATGTACAGCCTCGTGGGCGTCACTTGCATCATATCTGCCTTGACTACCAATAAGTTACTGGCAGGCATATCAATGTCGGCGCTTCAAATAACCGCTCTGGGAATCGGTCCACTAACAGCTTACGCCGTAGAGATGTTCCCAACCACTTTAAGGGGAACAGCTGTGGGTGCTGTCCTTATGTTTGGACGTGTGGGTTCCGTAGTCGGAGCCAACTTGGCTGGGGTCTTCCTCGCTGGATCCTGTACAGCCACCTTCTATGGCTTCTCAGGACTGTTATTCCTGTGTGCATTTTTGAGTGCCTTCTTACAACGAGAACAAACTACCAACCAAGTGACATGA

Protein sequence:

>DPOGS210577-PA
MKPEKEELPSQKNIQNKYMDVDLDRALDLSGAGRYQVFHCAMMLAVISTVIVEMIGNAFILPAAVCDLGLSNSLRGLLTSVPNIGVILTAPVWGRAADALGRKPVLLFSTAVSGIFALVAAFMPNLISFALFKFASSLFLSCPSSLGFAYAGELMPRQKRDLAVMICNALSMLTSTFCPLLAWGILSYNWDSSPIPLRPWRVLTVVYAAPVICAAIWVTQAKESPKFLMAKGRHEEALDVLKHIYAFNTGLDKENYCVKSLKICSDHTIEYNRDSPSSEEDSLSLLKPPHLKWLVLIGFLMFGLFSLLNGLFLFVPFTLNKAVLSDKPRTVCELINQPQNQTSAVCSDTISYGTFQVTVICSLVYGTLVMLVSLSSVSKRTQLIIMYSLVGVTCIISALTTNKLLAGISMSALQITALGIGPLTAYAVEMFPTTLRGTAVGAVLMFGRVGSVVGANLAGVFLAGSCTATFYGFSGLLFLCAFLSAFLQREQTTNQVT-