Monarch geneset OGS2.0

DPOGS200340
Transcript: DPOGS200340-TA; 3681 bp
Protein: DPOGS200340-PA; 1226 aa
Genomic position: DPSCF300026 + 478336-493234
RNAseq coverage: 167x (Rank: top 51%)
Annotation
Heliconius: HMEL021863; 0.0; 64.26%
Bombyx: BGIBMGA005633-TA; 0.0; 47.69%
Drosophila: Ppn-PE; 1e-05; 23.24%
EBI UniRef50: UniRef50_D0ABA0; 0.0; 56.76%; HM00052 protein n=1 Tax=Heliconius melpomene RepID=D0ABA0_9NEOP
NCBI RefSeq: XP_001943228.1; 1e-118; 29.60%; PREDICTED: similar to thrombospondin, type I, domain containing 7A [Acyrthosiphon pisum]
NCBI nr blastp: gi|261335916; 0.0; 56.76%; HM00052 [Heliconius melpomene]
NCBI nr blastx: gi|261335916; 0.0; 58.32%; HM00052 [Heliconius melpomene]
Group
KEGG pathway 
InterPro domain: [355-419] IPR000884; 7.2e-12; Thrombospondin, type 1 repeat
Orthology group: MCL18127 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200340-TA
ATGTCTGTGAGGCAGCCGGTGGTCATTGAAGGGGAGTACACGGTGTATGTGGGCACATGGACAGAATGTAGTACTGGCGACGGAGATCAAGCTAAGTCTTTCGACAGATATAACGCTCGTCTGGAATCCGATTCTCTGGTGCACACTCCACGTCTGGGACTTCAGAGACGTCAAGTGCAATGTCGGAGAAAAGATGGCAGATTTGTAGAAGCTCTGTACTGCGGGATTGCCATAGCAAACATCGGCACAACTCGCGTGTGCGTGATGCGTGAGGACTGTTCCTTGGCTGAATGGTTGCCATGGCGACCAAGATCTGATGGAGCTCTAGTAAGAACAAGACGTTTACGAAGACTGTCACAAGGCGGAGGCAAAGAATGCGACGTGGTCGAAGAAGTACGACCTACTGTTTTGGAGACTACGGCACATTGGACTCCGGGTCCTTGGGGACCATGCCGTGTTGCTGTGGAACAAGCGGCCACTGCTGCACCAACTGATGATGACGATGACGCGGATGACAATGACGCGATTTACGACGAGGATGACGAGAGTGACGAAAGCGAGCAGTCAAGCTGTGGGGGCGGAGTGCAGCGACGTGCTGCGACCTGCGTCCGGGCAGACGGGCGAGCGTTGCACGACGCGCAGTGTGCTCATGCAGTTATGCCGACACTCGTGCAACCTTGCGAGGTACCTTGCCCACGTGATTGTGAAGTTGGAGAATGGAGTGAATGGGGTGCCTGTCAGCCTACTGACGGATGTCCTCTTTACCCAGTACAACAACTCACAACTACTGGGTACAGCGTACGTCGTCGAAGAGTAACAGCAGCAGCATCCGGTGGAGGGGCGCCATGTCCTCCTCTAGAGGAAAAACGTACTTGCACTACACCAAGATGTGCTGCGTGGAAAGCACTGCCGTGGGGCCCCTGTGTATTGACTCAACCCCATACTAGTTGTGGACCAGGCCGACGTACCAGAGAACTTAGATGCATGGGACACGATGGAAAGGAAGCTCAACGAGCGTGGTGTAACACTGGCGCTCCACCACGCAGTGAACGATGTCGTATCGCTTGTCCGGGAGACTGTGTAGTGTCTGCATGGGCGGAGTGGTCACCTTGTTCAGCGAGTTGCGTCGCCCCCGGACATGCACGACCCACGCGTACTAGACGCAGACATATACTTGCCCACGCTGCACCAAATGGCTGGCCCTGCCCATCTGAAGACCAACTGATTCAAAACGAGACATGCAACACCCACGCTTGTGCCACTTATTCCTGGCTTGCAACGCCTTGGGGACCTTGTGAGCGTCGCCGGCAAGATTTCATACCGGCTACCAATTACACAGATCTTCTGGATAATGAACCGTTCAATGAAAGCGATGACGAAGAACCTTGTATTGAAGAAGGTGAAATGAGCAGAGACGTTATGTGTGTTCAGAATAATGCTGACGTTGTCAGAGAAGCTCTATGTGCTCCATTACGTCGCCCAGCGTCCCGTCGGGCGTGTACTGTAAGATGTCGACGAGGTTGCAGGGTTGAAGCTTGGATGCCCTGGTCTCCATGTCCTGATACTTGTGACCCTGGTAAGCAGGTCCGCGTTCGCACCGTCCGAGGTGGTCCGAACTGCGGTCCGTTGCAGGAGACACGCGACTGTCCCGTTTCGAGGTCGTGTCGTTCCCGTGAGGCTGTCTGGGTCGCCGGGGAGTGGAGTACCTGTAGATTACCGCCAGGACAACGCTGTGGAGTTGGCTATAGGATTAGAAGTATCTGGTGCGGCTCGGACTCTCACCGCGTCGAGGCCGGCGCATGTGCTGGTGCCCGGGTGCCGCCCGCTGCAGCAGCCTGCAGCGTCACATGCGACACCATCGTACCACTCACTTGTGATATCATATGTTCAGATCCCCTAAAATACTTGGATGCCTCTGACCCCGACGTACCCTCATGCGTCTGCAAGAATGTCTCATTGGAACTGTTACCCGCTGATTCAGACTGTATTCTTCCACCTGG
AATTGAATGCGGTGAAGGGAGATCACTGCGGGCAGCTCGTTGTTTAGTTGGAAGACGTGATGTACCCATGGATGTTTGTAGGAAATACCATCCCCTTACAGGACCCCGTCGCGTTCGTGAAGCAGCAACAGACGGCTTCACATATGATGAGGAATTCACATCTTTATTACGCGGTGCATGTAGCGTGCGGTGTGCGAGGGACTGTGCGGTCGGGGCGTGGGCTGCCTGGGGACCGTGTGCTGCTGAGCCGGGTTCCAGAGCTGCTTTCAGGTTCCGCACCAGGGAAGTAATAGAGGAAGGTTCGGCTGGTGGTCGTGAATGTGGCGCCACATTGCAGCGCTCTACGTGCGTTGTGACTGAGCCACGATGGATACTGGGCGAGTGGTCTGTGTGCGCTCCGAGACGAGCTCTATGTGGACGAGCCATTATCAATAGGACTGTTATGTGCATAGATGCGGATGGGAATAAATTGGAGGACACACAGTGTGAGGCGGCCGGCGCTGGTCCTGCGCCCTCTCGCGATGCGACATGTCGGGCTCCGTGCCCTTCTGACTGTGTTGTCAGCTCTTGGTCAGACTGGAGTCCATGTGAACAGACGAAATGGGGCGGTCGTCGTGATAGGACTCGTGTGGTTCTCCGCGCGGCTGCTGAGGGCGGGACTGCCTGCCCTCACCTGGTGGCTGCGGAGCCTTGTTCACCGCACGCCTACTCCTGGCACGTGGCACCCTGGGATGACTGTCAACCGCTGGGTGGGTCTCCGTGTGGGGAAGGAACAAAGAGAAGAGCTGTACGGTGCCTTCGCAGCGATGGTGTTTTCGTAAATGATTCATTCTGTCCGAACGCAACGGCATCCGAGGCTCGGGAGTCATGGTGCTACGTTCCATGTGGCGTAGACTGTGAGGTTGGAGAGTGGGGACCTTGGGACGCCTCCGCCTGCTCCTGCGGGGACGCAGTCACAGCACGCCACATGAGACGGATACGTCAACACTTGACGGCAGCTGTATGGCCGGGTCGCGCGTGCCCTCCCACTGAGCAACGAGCTCCCTGCCCGCGAGAACCATGCTTGAGACTCGTCGCTAGACCGCTATTAGGATGTCATGTACAAACGTCATCAGGAGAAGAAGCTGATAATGCATGCGGATGGGGAGTGAAGTTATCTCATGCAAGATGTGAACTGACTAGCATCAACGATGAACCGTCATCAGGAGCCTTCTTACAACCCTGGAGATGTGCCTCCGCTCTACCGGGACGTATCGTTACACCGCCAATGCATCATCAGGAGGACGAGGAGTGTGAGGTCGAATGTGGATGCCAGGAATCTGAGCTGGGGCAGCCGGGTCCGTGGGGCGCTTGGGGCGGCTGCCGTGGTGGGGCACGTTCGAGGACACGTACACTACTGGTACCACCCCGAAGAGCCTGCAGAACATCCTCCAGATACATAACAATCGAGTGGTCGAACTGCACCGAGGAGGCTTCGGAGGCGACAGCTGGTGGTGACGGAACGCGAGGCGCCTGGCTTTCAGAACACTACCATGACGGATATATAGAGGGAAGTACTTCAGTGTTGGCGGTAGTGTGGACTGCGACCATAATACTCAGCTTGTATGGCGCGTTCATGCTCTATCGTGGACTTCTAAGATGCATCAGAAGCAGAAAAATGAAGAGCATCACTAAAGTGTAA

Protein sequence:

>DPOGS200340-PA
MSVRQPVVIEGEYTVYVGTWTECSTGDGDQAKSFDRYNARLESDSLVHTPRLGLQRRQVQCRRKDGRFVEALYCGIAIANIGTTRVCVMREDCSLAEWLPWRPRSDGALVRTRRLRRLSQGGGKECDVVEEVRPTVLETTAHWTPGPWGPCRVAVEQAATAAPTDDDDDADDNDAIYDEDDESDESEQSSCGGGVQRRAATCVRADGRALHDAQCAHAVMPTLVQPCEVPCPRDCEVGEWSEWGACQPTDGCPLYPVQQLTTTGYSVRRRRVTAAASGGGAPCPPLEEKRTCTTPRCAAWKALPWGPCVLTQPHTSCGPGRRTRELRCMGHDGKEAQRAWCNTGAPPRSERCRIACPGDCVVSAWAEWSPCSASCVAPGHARPTRTRRRHILAHAAPNGWPCPSEDQLIQNETCNTHACATYSWLATPWGPCERRRQDFIPATNYTDLLDNEPFNESDDEEPCIEEGEMSRDVMCVQNNADVVREALCAPLRRPASRRACTVRCRRGCRVEAWMPWSPCPDTCDPGKQVRVRTVRGGPNCGPLQETRDCPVSRSCRSREAVWVAGEWSTCRLPPGQRCGVGYRIRSIWCGSDSHRVEAGACAGARVPPAAAACSVTCDTIVPLTCDIICSDPLKYLDASDPDVPSCVCKNVSLELLPADSDCILPPGIECGEGRSLRAARCLVGRRDVPMDVCRKYHPLTGPRRVREAATDGFTYDEEFTSLLRGACSVRCARDCAVGAWAAWGPCAAEPGSRAAFRFRTREVIEEGSAGGRECGATLQRSTCVVTEPRWILGEWSVCAPRRALCGRAIINRTVMCIDADGNKLEDTQCEAAGAGPAPSRDATCRAPCPSDCVVSSWSDWSPCEQTKWGGRRDRTRVVLRAAAEGGTACPHLVAAEPCSPHAYSWHVAPWDDCQPLGGSPCGEGTKRRAVRCLRSDGVFVNDSFCPNATASEARESWCYVPCGVDCEVGEWGPWDASACSCGDAVTARHMRRIRQHLTAAVWPGRACPPTEQRAPCPREPCLRLVARPLLGCHVQTSSGEEADNACGWGVKLSHARCELTSINDEPSSGAFLQPWRCASALPGRIVTPPMHHQEDEECEVECGCQESELGQPGPWGAWGGCRGGARSRTRTLLVPPRRACRTSSRYITIEWSNCTEEASEATAGGDGTRGAWLSEHYHDGYIEGSTSVLAVVWTATIILSLYGAFMLYRGLLRCIRSRKMKSITKV-