Monarch geneset OGS2.0

DPOGS204443
TranscriptDPOGS204443-TA1704 bp
ProteinDPOGS204443-PA567 aa
Genomic positionDPSCF300002 + 16311-25589
RNAseq coverage324x (Rank: top 35%)
Annotation
HeliconiusHMEL0174100.076.84% 
BombyxBGIBMGA013641-TA2e-11062.85% 
DrosophilaCG3168-PB7e-4827.79% 
EBI UniRef50UniRef50_E2BMY63e-5627.13%Synaptic vesicle glycoprotein 2B n=7 Tax=Formicidae RepID=E2BMY6_HARSA
NCBI RefSeqXP_001120285.12e-6129.42%PREDICTED: similar to CG3168-PA, isoform A, partial [Apis mellifera]
NCBI nr blastpgi|3320211438e-6930.54%Synaptic vesicle glycoprotein 2B [Acromyrmex echinatior]
NCBI nr blastxgi|3407129226e-6929.48%PREDICTED: synaptic vesicle glycoprotein 2B-like [Bombus terrestris]
Group
Gene OntologyGO:00550859.8e-18transmembrane transport
GO:00160219.8e-18integral to membrane
GO:00228579.8e-18transmembrane transporter activity
KEGG pathwayame:7244295e-61 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[9-548] IPR0161963.6e-48Major facilitator superfamily domain, general substrate transporter
[65-288] IPR0058289.8e-18General substrate transporter
Orthology groupMCL26103 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204443-TA
ATGGTTTGTCGAGTGGCGCCGAAAGAGACGCCCTCCCAGAGGGATGGCGCTCAGGAGCAACAAGTAGCAGACTACGAGACCGCCTTGGACGCAGCTGGGTATGGCCGTTTCAGCCGTAGTGTCCTGGGTGCGTGTGCATGCGCGTTCTTCACTACTGGCGTCGAGAACTGTGTCATGTCTTATGTACTTCCGGCAGCGAGGTGCGAGCTGAATCTCACCACCTACCACACAGGACTGATCAATATGGCCTTCATGAGTGGGGGTGTAGCAAGTGCATTTTTTTGGGGTATAGTTGCTGACGTATTTGGAAGAAAGAATATCCTGAGTATGACCTTAATAGTCGACTCTACAATACTCCTCGCTCAGAGCACGGTATCAGACTACAGATTGCTCATAGCTGCTAGAGGTATCAATGGATTTCTCATCGGTGGACCATCAACGCTAGTTTTCACGTATCTATCAGACCTGGTGGGATTCAAACGTCGCCAATTTTATCTAGATCTCATCGGAATGAGTTTCGTGGTAGCTTGGCTTTTTTTACCAGCAATCGCTTGGTTAGTGATCCCCATAAAACTAACCTATACCACCATACTCCCCATATACTCCTGGAGACTCTTTCTCGCCCTTGGCAGCATCCCGGGATTCATGGCCGGTATATGGGTGCTGATCCTCCCTGAGAGCCCTAGACTGCTATCAGATACAAATAGAAGTGATAAAGCCTTGAAAATAATTGGCAAAATCCACAAAAGTAACAAAGAAATGACTGAGTTTAAGATTAAGAAACTAATACAAGACAATATATTGATGACGAAATCTATGAACGGGGAACAAAACAGGAGCAAAGTCCTCTTCATGGGCGTTATGAAGGATTTGAAGATGTTCGTATCTAAGATGTACGCTAAGAAGTCAACTCTGATACTCTTTGTATTCTTCGCCAACATGGCCGCAGGATTCGGTCTAAACTTCTGGATACCCGAACTACTTCTCCGCACTAAAGGCAAGAGTTGTTCGTCAAATATAAGTGTGCCAGAAAACGCTAAATTATTAAACGCTAGTATACCTTGGGATGATCTGCCCAAGGTACTGGAGCTGCGCGGAGGAACTGATGATGAGTACAACGCCACAGTGTTTGATGCCACTGGTGGAGATTGCAGTGAAGATATTGATGATTCGATATACACATCGGGCCTGATAGTTGGCGCATGTTGCGTGATCGGTAACGCTGCCTGTGCTCTCATCTCATCTCGCGGCGCTGTCGGCGTGCGTCGTGCTGCAGTACTCTGTGCATTGGCCTGTGCTGCCGCGTGTGTCGTGCTGGCTGCCTGCGTCTGTGCCTGTGCCGCGACAGAACATATAGCGGTTGCCGCTGCCGCTGCCTTAAACGCTGCGTCATTAAACGGGAATGTTCTGCTGATACGATTACTATTGCACGCTCTTCCTGCTAAGTTGAGTGGTCTTGGTGTTTGTTGGGGAGCATGGTGGGGTCGAGCTGGTGGCGTTGCGACCAACTTCGCTGTGGGCGCACTACTTGACTACTCATGTCCGGCACCTTTCATAGCTGTGGCGGCCCTCTTAGCACTCTCAATCGGTGGTATAATGATGATAAAAATTGATCAAGACAAAACAGCGGACGAAGATAAGAATGGGAGCGAGAGCAGCGGGAAAAATATATACATAGACAGATACATATCAACCCACATGTAG

Protein sequence:

>DPOGS204443-PA
MVCRVAPKETPSQRDGAQEQQVADYETALDAAGYGRFSRSVLGACACAFFTTGVENCVMSYVLPAARCELNLTTYHTGLINMAFMSGGVASAFFWGIVADVFGRKNILSMTLIVDSTILLAQSTVSDYRLLIAARGINGFLIGGPSTLVFTYLSDLVGFKRRQFYLDLIGMSFVVAWLFLPAIAWLVIPIKLTYTTILPIYSWRLFLALGSIPGFMAGIWVLILPESPRLLSDTNRSDKALKIIGKIHKSNKEMTEFKIKKLIQDNILMTKSMNGEQNRSKVLFMGVMKDLKMFVSKMYAKKSTLILFVFFANMAAGFGLNFWIPELLLRTKGKSCSSNISVPENAKLLNASIPWDDLPKVLELRGGTDDEYNATVFDATGGDCSEDIDDSIYTSGLIVGACCVIGNAACALISSRGAVGVRRAAVLCALACAAACVVLAACVCACAATEHIAVAAAAALNAASLNGNVLLIRLLLHALPAKLSGLGVCWGAWWGRAGGVATNFAVGALLDYSCPAPFIAVAALLALSIGGIMMIKIDQDKTADEDKNGSESSGKNIYIDRYISTHM-