Monarch geneset OGS2.0

DPOGS213878
TranscriptDPOGS213878-TA1575 bp
ProteinDPOGS213878-PA524 aa
Genomic positionDPSCF300141 + 41326-61138
RNAseq coverage1366x (Rank: top 9%)
Annotation
HeliconiusHMEL0040456e-9433.46% 
BombyxBGIBMGA013477-TA0.077.78% 
DrosophilaCG3168-PB4e-9435.88% 
EBI UniRef50UniRef50_B0WWU32e-11542.88%Synaptic vesicle protein n=3 Tax=Diptera RepID=B0WWU3_CULQU
NCBI RefSeqXP_973450.22e-14148.83%PREDICTED: similar to synaptic vesicle protein [Tribolium castaneum]
NCBI nr blastpgi|1892343334e-14048.83%PREDICTED: similar to synaptic vesicle protein [Tribolium castaneum]
NCBI nr blastxgi|1892343333e-13848.83%PREDICTED: similar to synaptic vesicle protein [Tribolium castaneum]
Group
Gene OntologyGO:00550851.7e-24transmembrane transport
GO:00160211.7e-24integral to membrane
GO:00228571.7e-24transmembrane transporter activity
KEGG pathwaytca:6629823e-120 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[1-522] IPR0161966.4e-64Major facilitator superfamily domain, general substrate transporter
[62-332] IPR0058281.7e-24General substrate transporter
Orthology groupMCL31017 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213878-TA
ATGGAAGAACAGGCTGAGAGGAAGGAGCGTCTTCAGCGGGCTCCCTTCGAGACTGCCCTCCATCACGCCGGTTACGGCTGTTTCCAATGGCTCTTGTTGATGTCGTGTGGTGCTGTGTACGCCGTTTGCGCGCTCAGCACCACCACTCTCAGCTTCGTGTTGCCGGCCGCCGAGTGTGACTTCCTCCTTTCCTCTAGCGACAAGGGTCGACTCACTGCCACGCCGCTGATAGGCATGTGTGTGGGATCGTACTTCTGGGGTAATCTAGCTGATGCGCGAGGTCGCAGGAAGGCCATCATCGGAGCTCTACTATTGGATGCGCTCGCTGCCTTCCTCTCTAGCCTGGTGCAGTCCTTCCCAGCTTTTTTGGCCTGCAGATTCTTCTCGGGTTTCGGCATCATCGGTGCTACTGGGCTCGTCTTCCCTTATTTCGGAGACTTCCTCTCGCTGCGGCACCGTGATGTGATGCTGTGTCGGCTTGAAGTGTTTTGGACTTTGGGGACTATATTATTGCCCGGAATCGCTTGGCTCATTATCCAAGATCCACGGTTCAATTTAACCGTTGAAGGTGCAGACTTCCAGTACACCTCCTGGAGAGTGTTCGTCGCCGCTTGTGGTATCCCAAGTCTCTTGGTGGTGTTATTGCTGCTGCCATTTCCCGAGAGCCCCAGATATTTACTGTACGCTAACAAAGCCGATCAAGCACTTCAAGTTCTGCAAAGGATATTTGTTGTTAATACGCGTTTGTCTGAGAAAGACTTTCCAGTGGAATCTATGATAAGTGCAGTAGAAGGTGAGGAGGAGGCCGAAGTGGTGTCTGCGGACGACAACGGTAACGAGACCAAGTCGCCTCTGACATGGGGTCAACGTTGGCACGCCTTCTGGCTCAGGAAGAAGATGTTGGTGACGCCGCCCTACATCGGACTACTGGTGCTGTGCTGCTTCGTTGACTTCGGACTGATGTTTAGTTACTACACGTTGATGATGTGGTTCCCGGACCTGTTCGAGAGGTTCAGCCAGTTCTCCTCGCTCTTCCCCGGAGTGTCGGCCGGCGTGTGTGAGGTGTCTGCTAACGTCACTAAGAGCGAGCTATTTTACAACCAGGAGTGTCCTCAGGTCATCGACGATCGTGTCTACATCCACACCCTGGTGGTGGCGCTGAGCTGTGTCCCGACCGCGCTGTGGGTGGGTCTCACCATCAATATGGTGGGCAAAAAACTAATGTTGGTGATAATGTTGATAAGCTCCGGCCTGGCTGCCCTGGGTCTCAACCTGGTGCGCTCCTCTCTGGAGAATCTGCTCCTGTCCTGCGTCTTCGAGGCGATCGTCAGTTGTACTGAGGCTGTGCTTTTCTGTGTTATTTGTGAGATATTTCCGACCAAAGTCGCTGCTACAGCGATGGCGGTGACTGTGATGTGCGGTCGTGTGGGGGCCATTGTAGGTAACGTGATATTCGGGGCGCTCGTCGATCAACACTGCGTCGTACCAATTTATATGTTCGGCTCTCTGCTCATCACGAGCGGTCTACTGTGTGTGATCCTCCCCAAGAGCACAAGACCCGAGAGACCTCAGTGA

Protein sequence:

>DPOGS213878-PA
MEEQAERKERLQRAPFETALHHAGYGCFQWLLLMSCGAVYAVCALSTTTLSFVLPAAECDFLLSSSDKGRLTATPLIGMCVGSYFWGNLADARGRRKAIIGALLLDALAAFLSSLVQSFPAFLACRFFSGFGIIGATGLVFPYFGDFLSLRHRDVMLCRLEVFWTLGTILLPGIAWLIIQDPRFNLTVEGADFQYTSWRVFVAACGIPSLLVVLLLLPFPESPRYLLYANKADQALQVLQRIFVVNTRLSEKDFPVESMISAVEGEEEAEVVSADDNGNETKSPLTWGQRWHAFWLRKKMLVTPPYIGLLVLCCFVDFGLMFSYYTLMMWFPDLFERFSQFSSLFPGVSAGVCEVSANVTKSELFYNQECPQVIDDRVYIHTLVVALSCVPTALWVGLTINMVGKKLMLVIMLISSGLAALGLNLVRSSLENLLLSCVFEAIVSCTEAVLFCVICEIFPTKVAATAMAVTVMCGRVGAIVGNVIFGALVDQHCVVPIYMFGSLLITSGLLCVILPKSTRPERPQ-