Monarch geneset OGS2.0

DPOGS208957
Transcript: DPOGS208957-TA (1560 bp)
Protein: DPOGS208957-PA (519 aa)
Genomic position: DPSCF300009 + 760924-767266
RNAseq coverage: 827x (Rank: top 16%)
Annotation
Heliconius: HMEL015773  0.0  77.43%
Bombyx: BGIBMGA002430-TA  0.0  92.86%
Drosophila: CG3168-PB  8e-101  37.89%
EBI UniRef50: UniRef50_E2B477  0.0  64.93%  Synaptic vesicle glycoprotein 2B n=14 Tax=Endopterygota RepID=E2B477_HARSA
NCBI RefSeq: XP_974142.2  0.0  66.86%  PREDICTED: similar to synaptic vesicle protein [Tribolium castaneum]
NCBI nr blastp: gi|270001549  0.0  68.58%  hypothetical protein TcasGA2_TC000393 [Tribolium castaneum]
NCBI nr blastx: gi|270001549  0.0  68.58%  hypothetical protein TcasGA2_TC000393 [Tribolium castaneum]
Group:
Gene Ontology: GO:0055085  6.4e-27  transmembrane transport
    GO:0016021  6.4e-27  integral to membrane
    GO:0022857  6.4e-27  transmembrane transporter activity
KEGG pathway: tca:662982  0.0
    K06258 (SV2) maps-> ECM-receptor interaction
InterPro domain: [1-512]  IPR016196  1.6e-63  Major facilitator superfamily domain, general substrate transporter
    [58-286]  IPR005828  6.4e-27  General substrate transporter
Orthology group: MCL16305  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208957-TA
ATGGGTTTAGACTTCTTAGAACATGGAGCGGACTTCGAAGCCGCGATATCCGCAACTGGTTTTGGCCGTTTCCATTTCTGTCTGTTAGCGGTAACAGGTCTAATTTATGCTAATACAGCGATAGGTATTACCATTATTTCCTTTGTACTCCCTGCGGCGACATGCGATTTCAAGATGACATCCGCCGACAAGGGCTGGCTTACAGCTGCTCCTATGTTAGGTATGGTTATCGGATCTTATTTCTGGGGTTGCTTGGCTGATACTAAAGGCCGTAAAGTGGTCTTAGTATCCACCTTGCTAATCGACGGTTTTGTTGGAATACTTTCGTCGTTCGTCCAAATTTTACCAATCTTCATGGCATGCAGATTTATCAACGGATTTTCTGTTGCTGGAGCTATGGGCATTTGTTTCCCATACTTAGGAGAATTCCAACAAACCAAGTACAGAGAAAAGATCCTTTGTTGGATGGAAATGTTCTGGACTCTGGGAGTTATCGTATTGCCTCTGATCGCGTGGGGCATCATTCCGATCCAGGGAATACGCATCGAAAGTGGAAGTTTCTCCTACGATTCCTGGAATTGGTTCGTTGCTGCTTGTGGAATACCCTCTCTGCTCCTGGGCTTCTGGCTCTTTACATTCCCTGAGAGCCCCAAGTTCATGATGGAGTGCGGTGACTACGATGATGCTCTCGCCTGCCTCAAAAAAATTTACAAACAGAACACCGGCGATGATCCCGACAATTATCCCATCAAGAGCTTGAAAGAAAAGGTACGTACTATATCGGTAGCTTCCCAGAGCTCTCAAAAATCCGTCCGAAGTTTGAGCATGCGCAAGCCTAAAGATCTGAGACGACTATTTTCCGAAATTTGGGCTCAAACTAAAGCACTCTGCAAAGCACCATACTTGAAGTACACGATACTCACCTGTCTCATTCAATTTGGATTAACTACTAGCTACTATACCCTCATGATCTGGTTCCCTGAATTGTTCAACCGCTACGAAGAATTCGAACATCGCTTCCCCGGTAAATCTGCTTCAGTTTGTGACGTCTCCAGTATTGTGGTTAAGGACGAAAACAGCAAAGATCCATTCGACTGCGAGAAAATCATCGATCAGAGTGTATACACTCACACATTAATTGTCGGTTTGGCTTGTATACCAACATCCCTATGGTTGCCGCTTTGCGTGCACAAACTTGGTGCGAAGTTCTTCCTCATTTTCTCGCTGGGTTTCGCTGGTGTAGTTACGGTGGGACTTTACTTCGTACAGAACTCTACTCAGAACTTAGTACTTTCTTGTATATTCGAAGCCCTCACCTCGTTAGCCATCAGTCTGGTCTTCTGTGTGCTTGTAGACCTCTTCCCTACAAACCTTAGGGTAATGGCGGCTGCACTCTCACTGACCGCTGGCCGAGGTGGTGGCCTTATCGGTAACTTGTCCTTTGGATACCTCATTGATATCAACTGCGTGGTGCCCATCGTGCTATTCTCAGCATTCCTGTTCATTGCTGCAATACTTTGCTTCTTCTTGCCAAAGACGGGACAAGAAGCACTCGACTAG

Protein sequence:

>DPOGS208957-PA
MGLDFLEHGADFEAAISATGFGRFHFCLLAVTGLIYANTAIGITIISFVLPAATCDFKMTSADKGWLTAAPMLGMVIGSYFWGCLADTKGRKVVLVSTLLIDGFVGILSSFVQILPIFMACRFINGFSVAGAMGICFPYLGEFQQTKYREKILCWMEMFWTLGVIVLPLIAWGIIPIQGIRIESGSFSYDSWNWFVAACGIPSLLLGFWLFTFPESPKFMMECGDYDDALACLKKIYKQNTGDDPDNYPIKSLKEKVRTISVASQSSQKSVRSLSMRKPKDLRRLFSEIWAQTKALCKAPYLKYTILTCLIQFGLTTSYYTLMIWFPELFNRYEEFEHRFPGKSASVCDVSSIVVKDENSKDPFDCEKIIDQSVYTHTLIVGLACIPTSLWLPLCVHKLGAKFFLIFSLGFAGVVTVGLYFVQNSTQNLVLSCIFEALTSLAISLVFCVLVDLFPTNLRVMAAALSLTAGRGGGLIGNLSFGYLIDINCVVPIVLFSAFLFIAAILCFFLPKTGQEALD-
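
Consistency check (illustrative):

The stated lengths and the two sequences above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record. It assumes the transcript as listed is coding sequence only, which fits its ATG start, its TAG end and a length divisible by 3. The helper names `expected_protein_length` and `translate` are hypothetical, and the codon table is deliberately partial, covering only the codons in the first 24 nucleotides.

```python
# Lengths as stated in the record (transcript 1560 bp, protein 519 aa).
TRANSCRIPT_BP = 1560
PROTEIN_AA = 519


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp nucleotides that ends in a stop codon.

    Assumption: the whole sequence is coding (no UTRs).
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the final codon is the stop, not a residue


# Partial standard genetic code: only the codons needed for this check.
CODONS = {
    "ATG": "M", "GGT": "G", "TTA": "L", "GAC": "D",
    "TTC": "F", "GAA": "E", "CAT": "H",
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string using the partial table above."""
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 24 nucleotides of DPOGS208957-TA, copied from the record.
first_codons = "ATGGGTTTAGACTTCTTAGAACAT"

print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
print(translate(first_codons))
```

With the values from this record, 1560 bp gives 520 codons, that is 519 residues plus the stop codon, which the protein sequence shows as a trailing "-". For full-length translation a complete codon table or an established library would be needed.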