Monarch geneset OGS2.0

DPOGS210578
TranscriptDPOGS210578-TA1566 bp
ProteinDPOGS210578-PA521 aa
Genomic positionDPSCF300168 - 605337-617322
RNAseq coverage210x (Rank: top 46%)
Annotation
HeliconiusHMEL0129090.070.94% 
BombyxBGIBMGA013629-TA0.068.82% 
DrosophilaCG15221-PB1e-7130.71% 
EBI UniRef50UniRef50_D6WB983e-9637.19%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeqXP_972569.15e-9737.19%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastpgi|371429382e-9939.44%SV2-like protein 1 [Ctenocephalides felis]
NCBI nr blastxgi|371429389e-10139.32%SV2-like protein 1 [Ctenocephalides felis]
Group
Gene OntologyGO:00550857.1e-24transmembrane transport
GO:00160217.1e-24integral to membrane
KEGG pathwaytca:6629825e-64 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[1-497] IPR0161962.9e-58Major facilitator superfamily domain, general substrate transporter
[43-326] IPR0117017.1e-24Major facilitator superfamily
Orthology groupMCL17619 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210578-TA
ATGACAGCGAACGGAAAAATTGATAAAGGGAATAAAAATGACTCGGTGGCAACACTGGACGAGGCAATGTTACACACAGGCTTCGGTATATACAATATTGTTCATATGCTACTGAGCGGTATGATCTTGATGGGGGTCATTGTCCAGAGCTTACTCATGGGATATGTGATGCCGGCTGCCCAGTGTGACTTGGGACTTTCGCTTCAACAACGAGGCTGGCTGGCTGCTATACCGTTTTTAGCTATCATCCTGACGTCGTACATGTGGGGTTGGCTGGCTGATACTCGCGGCCGTCGCTTCGTGATGCTGCTCTCCATGGTGCTGTCAGCGTTATTCGGCATCGTATCCAGTTTCGCTCCAAACGTACTGACCTTCGGACTGCTGTCTTTTCTAGCATCAGTCTTCATGTCCGGCCCATCAGCTGTGGTTTACACTTATCTGGGAGAGTTCACAAACCTTCGACACAGAGATAAAATGGTGGCATTCGGATCATCTTTCGTTGGCATCGGAACTGTTGTGTTACCTGCCGTGTCTTGGCTTATTCTACCGCTGGAATTCTCCTACCCGATTTCGTTTCTGGATATCTCATATCGACCCTGGAGGCTGTTGGTTGTGGCGTGCTGCATTCCATTCATGATCGGTACCATCTTTCTCATCCTTGCACCGGAGAGCCCGAAATTTCTCAGCGCATCCGGAAACTCGGAAGCAGCTTTGAAAGTTTTGAAGAAAATATATTCTATTAACAATAGAGTGCCCGAAGACACGTTTCCGGTTAAAAGTCTAGTGTCGGAAGGTCAAGGCGGCAAGTCTTCCACCGGCTTGCACGGTGTGCTGGTTTCAATGCGAGACCAGACCCTGCCGCTGTTCCGAGCACCGCTGTTGCCTTGGACGCTTCTTGCCTGTTTCGTGCAATTTGGAATATTTGCCAACACTAACGGTTTCTACGTGTGGTTCCCTACAATATTGAACTCTCTGTTGACACACGACGGTGACGAGTCTAGGATCTGTGACGTTCTTCAATCAGGACACAATTTTCCTAACAATAACACGGAAGTCGTCTGTGATGATACGATCAATGTGGCGACATTCGAGATGTCCATCTACATCGGCCTCGTGTTTAGTTCCATGTATATCATCGTGGGCTTTCTGGTGGACTTCGTGGGCAAGAAGAGCATCTTGCTGGTGCTCTTACCTTCCACCGGCCTCTGCGGTATCGGAGCCCATCTCGCGTCCAGCAAGAAAATCGCAGTAGTACTGTTCGCAATATTCCAGATGTGCGGTGCATGTATCGGCCTCATGAACGCGGTAGCAGTGGAACTGTTTCCCACAAAAAATAGAGCTATGGCTATTTGCCTGTCGATGATGATGGGCCGAGTGGGATCGATGGTGGGCTCCAACCTCATCGGATTCTTCCTCTCGACCAACTGCGGACTCAGCTTTTATCTTTTTGGAGGAGTTTTGATAATTTGTGCTTTGTGCTGCTTCGCTCTGCCGGGAAGGAGTGCGAGCGACTCCGCCAAAATCTGTAAGGCAAAGAAAACTGAACAGACAAATGAAACAGCCTGA

Protein sequence:

>DPOGS210578-PA
MTANGKIDKGNKNDSVATLDEAMLHTGFGIYNIVHMLLSGMILMGVIVQSLLMGYVMPAAQCDLGLSLQQRGWLAAIPFLAIILTSYMWGWLADTRGRRFVMLLSMVLSALFGIVSSFAPNVLTFGLLSFLASVFMSGPSAVVYTYLGEFTNLRHRDKMVAFGSSFVGIGTVVLPAVSWLILPLEFSYPISFLDISYRPWRLLVVACCIPFMIGTIFLILAPESPKFLSASGNSEAALKVLKKIYSINNRVPEDTFPVKSLVSEGQGGKSSTGLHGVLVSMRDQTLPLFRAPLLPWTLLACFVQFGIFANTNGFYVWFPTILNSLLTHDGDESRICDVLQSGHNFPNNNTEVVCDDTINVATFEMSIYIGLVFSSMYIIVGFLVDFVGKKSILLVLLPSTGLCGIGAHLASSKKIAVVLFAIFQMCGACIGLMNAVAVELFPTKNRAMAICLSMMMGRVGSMVGSNLIGFFLSTNCGLSFYLFGGVLIICALCCFALPGRSASDSAKICKAKKTEQTNETA-