Monarch geneset OGS2.0

DPOGS210633
Transcript: DPOGS210633-TA (2184 bp)
Protein: DPOGS210633-PA (727 aa)
Genomic position: DPSCF300168 + 643683-656749
RNAseq coverage: 6x (Rank: top 87%)
Annotation
Heliconius: HMEL012912, E-value 0.0, identity 76.72%
Bombyx: BGIBMGA013585-TA, E-value 7e-140, identity 65.46%
Drosophila: CG15221-PB, E-value 3e-37, identity 28.31%
EBI UniRef50: UniRef50_D6WB98, E-value 2e-52, identity 32.17%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WB98_TRICA
NCBI RefSeq: XP_972569.1, E-value 4e-53, identity 32.17%, PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastp: gi|91077868, E-value 9e-52, identity 32.17%, PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
NCBI nr blastx: gi|91077868, E-value 9e-54, identity 30.13%, PREDICTED: similar to SV2-like protein 1 [Tribolium castaneum]
Group
Gene Ontology: GO:0055085, E-value 7e-17, transmembrane transport
 GO:0016021, E-value 7e-17, integral to membrane
 GO:0022857, E-value 7e-17, transmembrane transporter activity
KEGG pathway: api:100167336, E-value 1e-39
 K06258 (SV2) maps-> ECM-receptor interaction
InterPro domain: [16-445] IPR016196, E-value 5.9e-32, Major facilitator superfamily domain, general substrate transporter
 [65-312] IPR005828, E-value 7e-17, General substrate transporter
 [461-535] IPR011701, E-value 1.7e-08, Major facilitator superfamily
Orthology group: MCL26095, Lepidoptera specific

Nucleotide sequence:

>DPOGS210633-TA
ATGCATTTAAAAAATCAACTTAAAGAAAAAGGATATGACACTGAAAAGGACAAGGCGCTTCATTCGGCTACTATTGAAGAAGCGATCACTGCTACAGGTTTCGGCAAATACAACCTTGGCCTCATGCTGGTGTGTAGCTGGACGTTGCAGGCTATGGGTATGGATTTGTTCGGCACCAGCTTCGTGGTTGCGGCGGCGGTCTGCGACCTGGAGCTGAGTATGCAACAGAGAGCTTTGTTGACGGCTACTCCACTTATAGGCGTGGTTCTCGGAGCACAACTTTGGGGTTACGTCTCCGACACCAAGGGTCGCCGGCTAACCCTTGTACTCTCTATGTCTATTGGCTTCGTGTTCGCAGCCCTCAGTAGCTTTGCTCCGAATTGGAAGATTATGGCACTGCTTAAACTAGTATCTTCTACCTTTACCTCAGCTAGTAACTCCGCTTCCTACACTCTGCTTGGAGAGAGTTGTCCTGAATTCTATCGCGGTCGCACAATGCTCCTCTGCAACTGTTTTCTGATGTGTTCACAGGCTGTTGTTGCTTTATTTGCCTATCCAATACTTCCATTGGAGTTCGTGTATTGGATTGATTTCCTTAGCATCAAGTACCGATCCTGGCGTCTCCTAGCTCTTGTCATGTCTCTGCCCTGCGCAGCCACGGCCTGTTTACTGCAACTGTTCCACGAAAGTCCCAAATTTTTGGTCTCAATAGGAAAAAACGAAGAAGCCATAGAAGTACTAAAGAAAATATACGCATGTAATAGTGGCAATAAAGCTAACACCTTTAATGTCAAGAAAATCGTGGACAAAGCCGAGCAATCCTCAGAGAAAGAATCATTCTTTAAGACTATGTGGCATCAGACAGCCGCCCTGTTTAAACCTCCTCTACTTAAAGTTACTCTAAAGCTCTTTTATTTAGTCGCCATCATTTACATGACTGGGAGTGGCTTCATTCTTTGGCTGCCATACATTATGAACAATCTATTTTCGGTTCTGGAAGCCGGAGGAGGACAAGGCATGAATCTTTGCACTATCATCAGATATTCAACTGAAAGTGTTGGCGTCTCAGGAAATGACACGATGATCCAGGAAGTATGTAACGACACAATTCAGGACACGACCCTCTTCTCTGCTATGACGTACGGTGCCTTGGCGAGTTCTAGTAACCTAGTGCTCTCTCTCACTTGCGGTTCAAGAAAGCGCTTTGCTATGATCTGCATCGTAACAACATCGGCTGTAGCTGCTGTACTCCTGAACGTCATTGCTGCTCCCATCGCAGGTGGCATCTTCTTCTTCTTCTTCCTTCTCTGTGCCCTCTCTATGGGAATTTTAAGTTTTGGGCTGTATAGCGTGGTCCTCCTCAGCCTGTCTGGTCTTATTATAATATCTCTGGTTTGTATAGCTTACGCCAGCACTATTATAGTTCCAGCTAGCGCTTGCGAGCTGGAAACTACGACCTCGCAAAAGGGACTTTTAGCGGCTGTTCCTGTTATTGCTTTACTACTGGGGGCTGTGCCATGGGGCTACCTCACAGATATCTACGGACGCAAGAGAATGCTGATCATCTTACTTTCCTCATCAGCCGTTTTTAACGGCTTAGCTTCTATATCCATGATCCAGGAAGTATGTAACGACACAATTCAGGACACGACCCTCTTCTCTGCTATGACGTACGGTGCCTTGGCGAGTTCTAGTAACCTAGTGCTCTCTCTCACTTGCGGTTCAAGAAAGCGCTTTGCTATGATCTGCATCGTAACAACATCGGCTGTAGCTGCTGTACTCCTGAACGTCATTGCTGCTCCCATCGCAGGTGGCATCTTCTTCTTCTTCTTCCTTCTCTGTGCCCTCTCTATGGGAATTTTAAGTGTGTATTTTGTGGAATTATACCCTACTTCATTGAGAGGAATGGCGTCCTGTTTGTCGGTGATGCTAGGAAGATCTAGCGCGTTTCTTGGTGTGAATGCGATTGGAGCTCTTATCTCTGTCAATTGTGAGGCGAC
GTTCTACGGCTGGGCTGTTTTATTACTTAAATGCTGCCCACGAGACAGTAAAAGAAACAGAGATAGGCAGACTAAGAGATGGGAAGGTAATCTCAAGAAAATAGCGGGTCCAATGTGGAGCCGAACAGCAAGGAATAGGACAACATGGAAATCATCAGAGGCCTTAGGAAGACAAGTTGAATAA

Protein sequence:

>DPOGS210633-PA
MHLKNQLKEKGYDTEKDKALHSATIEEAITATGFGKYNLGLMLVCSWTLQAMGMDLFGTSFVVAAAVCDLELSMQQRALLTATPLIGVVLGAQLWGYVSDTKGRRLTLVLSMSIGFVFAALSSFAPNWKIMALLKLVSSTFTSASNSASYTLLGESCPEFYRGRTMLLCNCFLMCSQAVVALFAYPILPLEFVYWIDFLSIKYRSWRLLALVMSLPCAATACLLQLFHESPKFLVSIGKNEEAIEVLKKIYACNSGNKANTFNVKKIVDKAEQSSEKESFFKTMWHQTAALFKPPLLKVTLKLFYLVAIIYMTGSGFILWLPYIMNNLFSVLEAGGGQGMNLCTIIRYSTESVGVSGNDTMIQEVCNDTIQDTTLFSAMTYGALASSSNLVLSLTCGSRKRFAMICIVTTSAVAAVLLNVIAAPIAGGIFFFFFLLCALSMGILSFGLYSVVLLSLSGLIIISLVCIAYASTIIVPASACELETTTSQKGLLAAVPVIALLLGAVPWGYLTDIYGRKRMLIILLSSSAVFNGLASISMIQEVCNDTIQDTTLFSAMTYGALASSSNLVLSLTCGSRKRFAMICIVTTSAVAAVLLNVIAAPIAGGIFFFFFLLCALSMGILSVYFVELYPTSLRGMASCLSVMLGRSSAFLGVNAIGALISVNCEATFYGWAVLLLKCCPRDSKRNRDRQTKRWEGNLKKIAGPMWSRTARNRTTWKSSEALGRQVE-
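
The transcript and protein fields of this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code, and it assumes that the trailing "-" in the protein sequence marks the stop codon (the code prints a stop as "*"). The helper names `translate` and `expected_cds_length` are hypothetical. Only the first and last few codons of DPOGS210633-TA are copied in, not the full sequence.

```python
# Standard genetic code, built in TCAG order (assumed to apply to this transcript).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; a stop codon becomes '*'."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_cds_length(protein_aa: int) -> int:
    """Coding length in bp implied by a protein length, plus one stop codon."""
    return 3 * (protein_aa + 1)


# Lengths taken from the Transcript and Protein fields of the record.
transcript_bp = 2184
protein_aa = 727
lengths_agree = expected_cds_length(protein_aa) == transcript_bp

# First 18 and last 15 nucleotides of DPOGS210633-TA, copied from the record.
first_codons = "ATGCATTTAAAAAATCAA"
last_codons = "AGACAAGTTGAATAA"

print(lengths_agree)            # 727 aa plus a stop codon versus 2184 bp
print(translate(first_codons))  # compare with the start of DPOGS210633-PA
print(translate(last_codons))   # compare with the end of DPOGS210633-PA
```

Translating the complete transcript the same way and comparing it with DPOGS210633-PA (after replacing the final "-" with "*") would extend this check to the whole record.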