Monarch geneset OGS2.0

DPOGS210579
TranscriptDPOGS210579-TA1548 bp
ProteinDPOGS210579-PA515 aa
Genomic positionDPSCF300168 - 591904-597502
RNAseq coverage38x (Rank: top 73%)
Annotation
HeliconiusHMEL0082950.070.22% 
BombyxBGIBMGA013630-TA8e-15865.84% 
DrosophilaCG15221-PB7e-4927.67% 
EBI UniRef50UniRef50_D6WB985e-5428.06%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeqXP_972569.19e-5528.06%PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastpgi|371429384e-5929.49%SV2-like protein 1 [Ctenocephalides felis]
NCBI nr blastxgi|371429382e-6129.74%SV2-like protein 1 [Ctenocephalides felis]
Group
Gene OntologyGO:00550853.3e-20transmembrane transport
GO:00160213.3e-20integral to membrane
KEGG pathwaytca:6601556e-44 
 K06258 (SV2)maps-> ECM-receptor interaction
InterPro domain[5-500] IPR0161962.9e-45Major facilitator superfamily domain, general substrate transporter
[35-456] IPR0117013.3e-20Major facilitator superfamily
Orthology groupMCL26101 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210579-TA
ATGCCAGGAGCAAATGAAGATGTTCCTAGCAAATGTTCTTACGAGGACGCGGTGAATTTGAGTGGTATCGGCAAATACAACGTGTGTCTCCTGATGATCTGCTGCCTGCTCATATTAGCTATGTACATCGATATATTTGGCTTCTCTGTTGCGTTACCGATAGTTGCCTGCGACATGGGACTCTCTACATCCGAGCAAGGCCTACTCTCCGCTATGCCATTGATTGGCGTGATGTTGTCTTCGTACGCGTGGGGTCTCACAGCAGACATTCTGGGTCGACGGAAAACATTATTAATCGCAATGCCCATTGGAATGATTTTGAATATCGGCGCTAGCATGGCCCCGACTTACATAGCACTGGCTATTTTAAAGTTTTTCTCTGCTGCATTTACATCTTCAGCAAACGCCGCCGCTTTTGTGCTGTTAGGTGAAAGTATGCCCAGTAAATATCGTGGTCGGTCTATGTTCCTGATGGCCAGCGCTACCATGTACTGCCAATTTATTATTTGCATCATTGCCTTACCAGTTATTAAACTGTCGTTTTCTGTGGACATTTCTTGGCTATTACTCACATACAGACCATGGCGATTGTTGCTGCAAGCTATCAGTCTTCCTGGTATTATTGGTGTGATCGGATTATTATTCACTCTCGAAAGCCCCAAGTTCCTCCTAAGTAAAAACAAAGATGTCGCCGCTCTCGAGGTCCTAACAAGGATTTATACGATTAATAAGGGATTGCCAAAAGAAACTTATCCGGTAAAAATTATAATTCTAGATGAAATAACAATGCCGGACATTAAGGGTAATGAATCGTTTTTACGAAAAATGTGGAATCAGACCGCACCTCTCTTCAAACCTCCGCTTCTTAAGAACTCACTCATCATTTATTACATTTTACTTTGTGCTTATATGACGTCAACCGGTTACACTATGTGGGTACCGACAATAACAAATGCATTTTTCGACGGGGAGGAGAGCTGGGGGAAAACATTTTGTGAAGTCGCCAGCACGTCAGCCTCCTCCAGCAACAACACCATAACCGATTGCGATGATTTAGTGAAGCCAATGACTTTGTATGCCGTCATGTGTTACTCCGGCATTTCTGGAACTCTGAACATATTCTTGACATTTTTGGTCGGACCTCTCGGCAAACGACGTTTGACAATGCTGGTCTTCGTCGTCGCTATAGTTTGCGGAATAATTCTACTCTTCATAAGGATACCATTAATGAGCATCGCGCTGTTTTACTTCTTCCTCTACGTGGCGCTTATTTTGGGCAACGCAAATACTTATCTCGTTGAACTTAATCCAACTTATTTGAGGGGTATGGCTACGTGCCTGTCTGTGGTTGTGGCCCGAGGGTTCGGCTTTATCAGTGTTCAGCTCATCGGAAGATTGCTCGCAGACCACTGCACTTCAACAGTAGCTGGTTACATCGGCCTCATCTCTACTGGTCTGATTGTTTCTTTCTTCCTACCCAAAGATAAATCAATGAAGGACGATGTCAGTTTTACAACTATGGCTGAAGAAGATGGAACAAAGTTGTGA

Protein sequence:

>DPOGS210579-PA
MPGANEDVPSKCSYEDAVNLSGIGKYNVCLLMICCLLILAMYIDIFGFSVALPIVACDMGLSTSEQGLLSAMPLIGVMLSSYAWGLTADILGRRKTLLIAMPIGMILNIGASMAPTYIALAILKFFSAAFTSSANAAAFVLLGESMPSKYRGRSMFLMASATMYCQFIICIIALPVIKLSFSVDISWLLLTYRPWRLLLQAISLPGIIGVIGLLFTLESPKFLLSKNKDVAALEVLTRIYTINKGLPKETYPVKIIILDEITMPDIKGNESFLRKMWNQTAPLFKPPLLKNSLIIYYILLCAYMTSTGYTMWVPTITNAFFDGEESWGKTFCEVASTSASSSNNTITDCDDLVKPMTLYAVMCYSGISGTLNIFLTFLVGPLGKRRLTMLVFVVAIVCGIILLFIRIPLMSIALFYFFLYVALILGNANTYLVELNPTYLRGMATCLSVVVARGFGFISVQLIGRLLADHCTSTVAGYIGLISTGLIVSFFLPKDKSMKDDVSFTTMAEEDGTKL-