Monarch geneset OGS2.0

DPOGS200214
TranscriptDPOGS200214-TA3429 bp
ProteinDPOGS200214-PA1142 aa
Genomic positionDPSCF300328 - 10150-110116
RNAseq coverage8x (Rank: top 85%)
Annotation
HeliconiusHMEL0107731e-12455.38% 
BombyxBGIBMGA010096-TA1e-6364.06% 
DrosophilaCG3690-PA5e-6030.45% 
EBI UniRef50UniRef50_E2ADZ19e-6930.78%Synaptic vesicle glycoprotein 2B n=6 Tax=Formicidae RepID=E2ADZ1_CAMFO
NCBI RefSeqXP_001603639.15e-7331.11%PREDICTED: similar to synaptic vesicle protein [Nasonia vitripennis]
NCBI nr blastpgi|3838561189e-7331.88%PREDICTED: synaptic vesicle glycoprotein 2C-like [Megachile rotundata]
NCBI nr blastxgi|3838561182e-7631.88%PREDICTED: synaptic vesicle glycoprotein 2C-like [Megachile rotundata]
Group
Gene OntologyGO:00550851.3e-24transmembrane transport
GO:00160211.3e-24integral to membrane
GO:00228571.3e-24transmembrane transporter activity
KEGG pathwaydme:Dmel_CG31684e-52 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[586-1138] IPR0161967.5e-55Major facilitator superfamily domain, general substrate transporter
[663-847] IPR0058281.3e-24General substrate transporter
[96-202] IPR0137831.6e-09Immunoglobulin-like fold
[96-177] IPR0131064.9e-09Immunoglobulin V-set
[195-286] IPR0035996.4e-06Immunoglobulin subtype
Orthology groupMCL30723 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200214-TA
ATGCACCACATCGCACTGCTCTTGATGCTTCTCATGAGGGCAAGTGCCCATTTGAATATGACGGCTTCGCCTACTTCTGATTCTACATATCTTCTCAGAAGTACATTGCCACCAACTGTATTTACTTTGTCAGTGACAAAGGCGAGACGGATTTCATCTAGACGGGGAAAGTCGGATTCACCGATGTTGAATTACATTTTTGATTCATATGCGACAAATAACAAACATTTTCATGACAATTTCAAAGCACCGTCATTCGAAGATGACATAAACTCTGACAAAATTATCAACACAGACACAGGTGCGACAGTTTTATTTGATTGTCGAGTTTCGTCATTAAGAGATAAAACAGTATCCTGGCTGAGGGTATCCAACGACACAAATCTAGAACTGTTGACAGTTGATTTAGAGACTCATACGACAGATCCGAGATATAAAGTGGACGTTGCTGGTGAAACATGGAAGCTGTCTCTCACGGACGCTAAGGTTCAAGATTCTGGAATATACTGCTGTCATGTATCAACGCATCCGCCCATGTTACGTCGATTTCGCTTAGTTGTTCATCCGCCCGAAATAAAAATGAACAATGAAGCTTTCCTGGAAAGCGGCGAGACTCTATCGCTCAAGTGCGCTGTATTGCACTTAAATCCCGGAGAAGCGAGTGAATTACACTGGTACCGAGGAAACAATACCACGCCGCTTGATGAGATACGGTCTGGTGTACTAGTAGAGACAGATCTTTTAACGCTTACAAGTCATCTTCAGGTAGCTCATCTTAAAGTGGAAGATGCTGGAAACTACACCTGCGCCCTTGCATCGCCTATAACGATGAGAGCCGTAGCAAGGGTTCATGTGTTACAAGGTGAAAAACATAATTTGGTTAAAGCTATAAAAATTTTAGAAGAATCTTTAGTATTTTGTAAATCTGGTAAATTTCATATCAAGCTATTAGTGGCTTCAATGTGTGGAATATTTGCCACAATGACGATAATCACAACTTCTTATATTTTGCGAGAAGCAGAATGTGATTTAAATATGAATATTATGCAAAAAGGTCTCTTAAATGCTATGCCATTTTTTGATAAAACTAAAAACACAGATGATTCAATGAAAATATTAGAGGATGCACTAGTACTTTGCAGATTTGGCAAATTTCATATCAGGTTACTAGCCGCTTCATTATGTGCTGCATTTGCCGTTATGATGGTAACAACGACCTCTTCTTATATTCTGCCGGTAGCAGAATGCGATTTGAATATGAATATAATGTATAAAGGGCTCCTAAATGCAATGCCATTTTTTGAATTACATGCATGGAATATTTACTTATACGTCTGTTCAATATGGAGTTTCATGGGAACAATTCTATTCTATAACCTGCCTGAGAGTCCCAAGTACTTGCTATCACATGGTCAGGAAAAAGAAGCTTTGGAAGTAGTACGAATAATATACTCCGAAAACACAGGAAACGCAAAGGACACGTTCCCCGTAACGTCATTCAATGTTTCGTGCAATTCAAATCCTTCGAATGAGATGAGCCTGAGGAGGCAATTAGTGAATGCTTTATATGAAGTCAAAGAACTATTTCGAAAGCCCTTAGTGTTTCACCTGTTACTATTTTCCATGATATCTTTTATCGCATTCTTGGGCTTCACATCCTTGCGGCTCTGGTATCCACAATTATCGACAATTGTTGAAAATTTTGAGAAACAAAATGGGGAAACAGCACGTTTTTGCGTCATGCTGACAGATTACATGCAAAACCTTAAAGTGAAGCACAGAAATACCACTTTACTTGAATTGACTGAACCTGATGTCTGTGTTCCTGTAAATAAAACTAAAAACACAGATGATTCAATGAAAATATTAGAGGATGCACTAGTACTTTGCAGATTTGGCAAATTTCATATCAGGTTACTAGCCGCTTCATTATGTGCTGCATTTGCCGTTATGATGGTAACAACGACATCTTCTTATATTCTGCCGGTAGCAGAATGCGATTTGAATATGAATATAATGTATAAAGGGCTCCTAAATGCAATGCCATTTTTTGGACAGATTGGTGCAAGTTTATTTACAGGATTCTTGATTGATGCTTTTGGTAGGAAAATATTTCTTGTGGGTGGAAACGCAGCCATTTTTGTCTGCACTCTAATCGAAGGGTCAAGCCAAAATTATTGGATGTTAATATTTATGAAATTACTGGAAGGAATTTCAATGAGTCTTAGCTTTAGTGCAATATCGACAAACTTAACGGAGTTCTGCCATAAGGACATAAGGGACAGAACGTTAATGTTATACTCAGGGTTTATGTCTCTATCCTTAATCGTCGCTGCTTTGGTATCATGGGCGATATTGCCCTTGAAAATTGATATTGTATTTGTGAAGGGATATTTTGAATTACATGCATGGAATATTTACTTATACGTCTGTTCAATATGGAGTTTCATGGGAACAATTCTATTCTATAACCTGCCTGAGAGTCCCAAGTACTTGCTATCACATGGTCAGGAAAAAGAAGCTTTGGAAGTAGTACGAATAATATACTCCGAAAACACAGGAAACGCAAAGGACACGTTCCCCGTACGATATATCTACCTTTTATTTTCTCTCTTAATGGTAACGTCATTCAATGTTTCGTGCAATTCAAATCCTTCGAATGAGATGAGCCTGAGGAGGCAATTAGTGAATGCTTTATATGAAGTCAAAGAACTATTTCGAAAGCCCTTAGTGTTTCACCTGTTACTATTTTCCATGATATCTTTTATCGCATTCTTGGGCTTCACATCCTTGCGGCTCTGGTATCCACAATTATCGACAATTGTTGAAAATTTTGAGAAACAAAATGGGGAAACAGCACGTTTTTGCGTCATGCTGACAGATTACATGCAAAACCTTAAAGTGAAGCACAGAAATACCACTTTACTTGAATTGACTGAACCTGATGTCTGCGTTCCTAAATTGAGCGGATCCGAAACCTACATTAATGGAATGATCTTGGGATTTGTTTCCCTAATATTTGTTGCTATAACCTGTTATTTGGTAAAATACGTGTCGCAGAAGGTCCTAATGTTTATCTTTCTAATAACGTGCTCAATGACGTCAGCTGCAATGTATTGGACTAGCACTTCTATTCAAATAGCCCTATTAGTTTCCTGTACGTGTGCTTTTATACAAACAGCATTCAGTTTACAACAAAATCTTTTTGTTCGTGTATTTCCAACAACTCTAAGAGCTCTTGCCTTTTCAATTATCATGGTCTTGGGCCGATTAGGATCGGTCGTGGGGAATATAATCTTTCCAATTTTATTAGAAACTGGATGTATGGCTCCATTCATTTTAACATCTACTATAACATTATGTATATCGGGAGTAGTGTACTTCTTACCCGGTGTGAATAAAGAAAATAAAGAAGTCGGAGACAAATGA

Protein sequence:

>DPOGS200214-PA
MHHIALLLMLLMRASAHLNMTASPTSDSTYLLRSTLPPTVFTLSVTKARRISSRRGKSDSPMLNYIFDSYATNNKHFHDNFKAPSFEDDINSDKIINTDTGATVLFDCRVSSLRDKTVSWLRVSNDTNLELLTVDLETHTTDPRYKVDVAGETWKLSLTDAKVQDSGIYCCHVSTHPPMLRRFRLVVHPPEIKMNNEAFLESGETLSLKCAVLHLNPGEASELHWYRGNNTTPLDEIRSGVLVETDLLTLTSHLQVAHLKVEDAGNYTCALASPITMRAVARVHVLQGEKHNLVKAIKILEESLVFCKSGKFHIKLLVASMCGIFATMTIITTSYILREAECDLNMNIMQKGLLNAMPFFDKTKNTDDSMKILEDALVLCRFGKFHIRLLAASLCAAFAVMMVTTTSSYILPVAECDLNMNIMYKGLLNAMPFFELHAWNIYLYVCSIWSFMGTILFYNLPESPKYLLSHGQEKEALEVVRIIYSENTGNAKDTFPVTSFNVSCNSNPSNEMSLRRQLVNALYEVKELFRKPLVFHLLLFSMISFIAFLGFTSLRLWYPQLSTIVENFEKQNGETARFCVMLTDYMQNLKVKHRNTTLLELTEPDVCVPVNKTKNTDDSMKILEDALVLCRFGKFHIRLLAASLCAAFAVMMVTTTSSYILPVAECDLNMNIMYKGLLNAMPFFGQIGASLFTGFLIDAFGRKIFLVGGNAAIFVCTLIEGSSQNYWMLIFMKLLEGISMSLSFSAISTNLTEFCHKDIRDRTLMLYSGFMSLSLIVAALVSWAILPLKIDIVFVKGYFELHAWNIYLYVCSIWSFMGTILFYNLPESPKYLLSHGQEKEALEVVRIIYSENTGNAKDTFPVRYIYLLFSLLMVTSFNVSCNSNPSNEMSLRRQLVNALYEVKELFRKPLVFHLLLFSMISFIAFLGFTSLRLWYPQLSTIVENFEKQNGETARFCVMLTDYMQNLKVKHRNTTLLELTEPDVCVPKLSGSETYINGMILGFVSLIFVAITCYLVKYVSQKVLMFIFLITCSMTSAAMYWTSTSIQIALLVSCTCAFIQTAFSLQQNLFVRVFPTTLRALAFSIIMVLGRLGSVVGNIIFPILLETGCMAPFILTSTITLCISGVVYFLPGVNKENKEVGDK-