Monarch geneset OGS2.0

DPOGS213879
TranscriptDPOGS213879-TA2694 bp
ProteinDPOGS213879-PA897 aa
Genomic positionDPSCF300141 + 64319-74672
RNAseq coverage137x (Rank: top 55%)
Annotation
HeliconiusHMEL0034690.074.89% 
BombyxBGIBMGA013435-TA3e-10344.75% 
DrosophilaCG15221-PB2e-3826.26% 
EBI UniRef50UniRef50_D6WB981e-5230.15%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeqXP_972569.12e-5330.15%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastpgi|910778684e-5230.15%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastxgi|910778689e-5429.77%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
Group
Gene OntologyGO:00550855.5e-27transmembrane transport
GO:00160215.5e-27integral to membrane
KEGG pathwaycqu:CpipJ_CPIJ0115422e-36 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[10-504] IPR0161964e-44Major facilitator superfamily domain, general substrate transporter
[42-463] IPR0117015.5e-27Major facilitator superfamily
Orthology groupMCL26077 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213879-TA
ATGGATAGGAACCGTGGTGATATGGAAATGAGTGATGGTGTGAAGGAGGTTGTGTACGGATACGAGGAGGCGTTGGAATTGACAGGTCATGGGAGATACAACGTTGGACTGTTGTTTGCGTGTTTCATGGTGATCGTGGCGATGGGTATCGACATCTTTGGACTCGGCATCATCGTTACAGCGGCCACCTGTGACCTGGGGATGAATTTGCAGCAGATTGGAGTGCTCTCGTCTATGCCATTTGCTGGTATAATCGTCATGTCATATCCATGGGGTTACCTCTCCGACACTCGCGGCAGGCGACTGGTGTTGTTGTGGGCGATGGGGGGTAGTTTCGTGAGTGCTATGCTCAGTAGTTTGGCGCCCACGTGGCAGGTGCTGGCGGCCTTGAGGTTTATTAGTTCTGCATTGTCGAGTGCCGCAGAGTCAGCGACATATGCGTTACTTGGCGAGTGCTGCACGTCCCAGCACCGAGCGAGATACATGCTGCTCATGACCAGCGCGCTCATGCTGACACCCACTCTCTATTATGTATGGGGATACTTCATAACGAAATTGAATTTTACATTCTACATCCTGGGCATCGCGTACCGTCCTTGGCGTCTTCTGACCATGGTGATGGCGCTGCCGCTGGGACTCGGAGCTTTATTGTTGTACTTCTTCAACGAGAGTCCCAAATTCTTAGCGAATATCGGACGAACAAAGGAGGCTGTTGACGTGATGAAACGAATTCATGACATAAATAAAAGAGCAGTTGAAAAGTATCCGGTGAAAGACATCTATCTGTCTGAAGACTGTCAGGAGTCTTCGTCCTCAGTGACTCTACAGTCCCTGTGGCGTCAGCTTGCGCCACTGTTCAAGCCTCCTCTACTCAAAAGGACTCTTTTATTATATTATCTCACCTTTGTCATCTATATAGCTAACAATAGCTTCGCTATTATTTTGCCGACTATCTTCAACGTATTCTTCACGTCGTACGCCAGCAGGGAAGCAGCCGACTCATTCTGCAACCTTCTGTCGACGAATGGGACAGTAGCTTCCGTAGAAAACGACGTAGTTCCTGAATGCTCGGGTATTGAGGTTAACACAGTATGGGCGGGTTGCGCCCACGGACTCGCCTTTTTCGTACTCAACGCTCTGCTGTCACAGGCAGCACACAAGAGAAAGGTGCTCACAATTAGCATTCTGCTGGTAGCGGGTGTGTCAGCTGTCCTGGCGGACCTGACGGGTGAAGCCCTCTCGGGCCTGGTGTTGTTCTACCTCTACCTGACCACCGCCATGGTGTTTGGGATCGTGTCATCTTATTTCGTCACTCTCTACCCCACCTCCTACAGGGGTATGGTGGCCTGTATCGGTATGATGGTGGCTCGGGTGAGTGCGTTCGCTGGTACCAATCTGGTGTCCGCTGCGATCTCCCTTCACTGCGCTCCCACTCTCTACGGAGCCGCTGGCCTCGTGTTCACCGTCTGCTCGGAGTACACAAAATTTTCACCTGTTAAGTATTTCCAACAATTCCGTAAAAGTACAAAGGCTGTACTCGGTCGTACTGAGAGAGGTACCCGGGCATCCACGGCCAGTAGTGTTTGGTGGTCGAGTGCCGCAGAGTCAGCGACATATGCGTTACTTGGCGAGTGCTGCACGTCCCAGCACCGAGCGAGATACATGCTGCTCATGACCAGCGCGCTCATGCTGACACCCACTCTCTATTATGTATGGGGATACTTCATAACGAAATTGAATTTTACATTCTACATCCTGGGCATCGCGTACCGTCCTTGGCGTCTTCTGACCATGGTGATGGCGCTGCCGCTGGGACTCGGAGCTTTATTGTTGTACTTCTTCAACGAGAGTCCCAAATTCTTAGCGAATATCGGACGAACAAAGGAGGCTGTTGACGTGTTGAAACGAATTCATGACATAAATAAAAGAGCAGTTGAAAAGTATCCGGTGAACGACATCTATCTGTCTGAAGACTGTCAGGAGTCCTCGTCCTCAGTGACTCTACAGTCCCTGTGGCGTCAGCTTGCGCCACTGTTCAAGCCTCCTCTACTCAAAAAGACTCTTTTATTATATTATCTCACCTTTGTCATCTATATAGCTAACAATAGCTTCGCTATTATTTTGCCGACTATCTTCAACGTATTCTTCACGTCGTACGCCAGCAGGGAAGCAGCCGACTCATTCTGCAACCTTCTGTCGACAAATGGGACAGCAGCTTCCGTAGAAAACGACGTAGTTCCTGAATGCTCGGGTATTGAGGTTAACACAGTATGGGCGGGTTGCGCCCACGGACTCGCCTTTTTCGTACTCAACGCTCTGCTGTCACAGGCAGCACACAAGAGAAAGGTGCTCACAATTAGCATTCTGCTGGTAGCGGGTGTGTCAGCTGTCCTGGCGGACCTGACGGGTGAAGCCCTCTCGGGCCTGGTGTTGTTCTACCTCTACCTGACCACCGCCATGGTGTTTGGGATCGTGTCATCTTATTTCGTCACTCTCTACCCCACCTCCTACAGGGGTATGGTGGCTTGTATTGGTATGATGGTGGCTCGGGTGAGCGCGTTCGCTGGTACCAATCTGGTGTCCGCTGCGATCTCGCTTCACTGCGCCCCCACTCTCTACGGAGCCGCTGGCCTCGTGTTCACTGGAGCCGCGGCAGCGTGGTTCCTGCCAGCGGACACTGACAATAATACCTAA

Protein sequence:

>DPOGS213879-PA
MDRNRGDMEMSDGVKEVVYGYEEALELTGHGRYNVGLLFACFMVIVAMGIDIFGLGIIVTAATCDLGMNLQQIGVLSSMPFAGIIVMSYPWGYLSDTRGRRLVLLWAMGGSFVSAMLSSLAPTWQVLAALRFISSALSSAAESATYALLGECCTSQHRARYMLLMTSALMLTPTLYYVWGYFITKLNFTFYILGIAYRPWRLLTMVMALPLGLGALLLYFFNESPKFLANIGRTKEAVDVMKRIHDINKRAVEKYPVKDIYLSEDCQESSSSVTLQSLWRQLAPLFKPPLLKRTLLLYYLTFVIYIANNSFAIILPTIFNVFFTSYASREAADSFCNLLSTNGTVASVENDVVPECSGIEVNTVWAGCAHGLAFFVLNALLSQAAHKRKVLTISILLVAGVSAVLADLTGEALSGLVLFYLYLTTAMVFGIVSSYFVTLYPTSYRGMVACIGMMVARVSAFAGTNLVSAAISLHCAPTLYGAAGLVFTVCSEYTKFSPVKYFQQFRKSTKAVLGRTERGTRASTASSVWWSSAAESATYALLGECCTSQHRARYMLLMTSALMLTPTLYYVWGYFITKLNFTFYILGIAYRPWRLLTMVMALPLGLGALLLYFFNESPKFLANIGRTKEAVDVLKRIHDINKRAVEKYPVNDIYLSEDCQESSSSVTLQSLWRQLAPLFKPPLLKKTLLLYYLTFVIYIANNSFAIILPTIFNVFFTSYASREAADSFCNLLSTNGTAASVENDVVPECSGIEVNTVWAGCAHGLAFFVLNALLSQAAHKRKVLTISILLVAGVSAVLADLTGEALSGLVLFYLYLTTAMVFGIVSSYFVTLYPTSYRGMVACIGMMVARVSAFAGTNLVSAAISLHCAPTLYGAAGLVFTGAAAAWFLPADTDNNT-