Monarch geneset OGS2.0

DPOGS215576
Transcript: DPOGS215576-TA (4995 bp)
Protein: DPOGS215576-PA (1664 aa)
Genomic position: DPSCF300097 - 8831-26706
RNAseq coverage: 40x (Rank: top 73%)
Annotation
Source            Hit                 E-value   Identity   Description
Heliconius        HMEL014765          0.0       76.26%
Bombyx            BGIBMGA010978-TA    0.0       66.02%
Drosophila        gogo-PA             7e-84     41.53%
EBI UniRef50      UniRef50_D6WC40     9e-93     47.28%     Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WC40_TRICA
NCBI RefSeq       XP_002048129.1      3e-87     43.92%     GJ11514 [Drosophila virilis]
NCBI nr blastp    gi|270002924        3e-92     47.28%     hypothetical protein TcasGA2_TC005237 [Tribolium castaneum]
NCBI nr blastx    gi|270002924        2e-98     38.01%     hypothetical protein TcasGA2_TC005237 [Tribolium castaneum]
Group
KEGG pathway:
InterPro domain: [381-431] IPR000884, 7.5e-08, Thrombospondin, type 1 repeat
Orthology group: MCL15690, Insect specific

Nucleotide sequence:

>DPOGS215576-TA
ATGCGCCGCGAGCACATCGCCGCTCTCCTGCGGCTGCTTTTTATAGCAGGTGGTGCAAGTTTTGTGAATCCATACGCAAACGAGGAGAGGGCGGAGGTAACAGACCTGAGCCAGGTATGGGTGGAGATCCCCAGGAGTGTGGTCGCTCTGGGGGGAGATGTCCGTGTGTCAGTACACGGCGTCAGAGACACCGGCGGCTTGAGGGTACGCCTCGCGCGAGAAGACGACGAAGATCGAACCCTCCTCGCCACCATGCCACTAGCTTTAGACTCCGAGAGCCGCATGACACTACCGTGCGGCTACTTCCCTAGAGGCGGCAGCTACTATCTAGAAATCGTAGCAGATAAGGAAACCGTCCTTGACTATGACAACACCACTGTCAGAGTTAGGAGGGACTTAGGTCAAGGTGGAATTATGGAAACTGGAGATAGTGTGGTGAAGTCCTGGAAATTTGACGTCCTGTGGCCTAGCGCTAACCTGGACGTGACCCCAGAACAGATTCAAACATACCCGGAGAGACAGGTGACGGCGATCCTAGAGTTTCCGAAGGTAGTTTGTACACCCTTGGAAGATGGCGCGGACTTTTGGTTGGAACTATTGTATTGTGGACACTCGAGCGGAGGCGCAGTGCTTTGTGATGGGAAGAATAGTAGCTCCCATGCTCACGTCCTGTATTCAGAACAGATGCATGGCTTCCCCGGTCGACGTACAATGACACTCAGGTGCGAACTATTCGGTCAAGCGGGGGACTACGCCCTCACTCTGAGACCAGCTGCCTCTGCGGGTCCGCAAGACTTGGCGGTGGCTTTTGTTAAGGCGGATTGGTCAGAGCAGTTCGTGTTGAACGTTCACGGAGGGTCAGTGTTCCCGTGCGGTGATGGAGTCAGGGTGTTGTTCCAATACCCCGAGTGTGTGCTGGAGGGGGCGGACCGAGTGCGAGTGTTCGGACGAGAAGCGGAAAGATTAAGATACGTCGCCGAGAGGAGAGTCAGGAGAGGTCAACACACAGCTTCCTTCGATTGTAGACTGTTCAGTGAACGTTATCCAGAGTACTGTTTCGTGTACGTCTCCCAGGCGGTTACCGGCGCGGTCGCTGATGTTAGGATGGATTGTTTACCCACACTACCACTATCAGATGGCGGCGCTGGAGGATGGGGTGAGTGGTCTCCATGGTCGGCCTGCCCGGCACCAGCTCAGTGCTCGACCCGCACGAGGAGCAGACACCGCTTCTGTGATTCACCGCCGCCCGCGTACGGAGCCAAGTTCTGTGAGGTAACAGAGTGTAATCGAAAAATAGCAAGACTTAGAAGCGCTGTTGATGACTATGAACAAAAAAATGAAGAACTACAAAAAGCATACGATAAACTTGACGAAGACAGGGCAGATATTATAGCATATTTGAAAAAAACTCTTAATCTAAAGAATGAGGAAAATAGTGAGTTAAAAGAAAAAGTCAAGGGATTAGAAGAGACTAGAGAAATAGAAACTGCACAGTTTAAAGAAACAGTTGCAGATTTGGAAAAGAACTTTACAATAATGAAAGACCAATTGACTTCTGAAAACAAACTACTTGCGGGTAAATTAAATACCTTAGAAGAATTCAGAGCTATAAGGGATGATCTTATGAGAAAGTTTGATAACCAAGAACAAGCATTTAAAGACCAAGAAATGAAATATAAAAGAATTATTTATGATGCTGAAAAGAAATTTGTCATAGGAAAGGACAAATTGAAAAAAGAAATGGAGGCGCGCTTATTACAACTTGCTCAAGACTTCCAGGATGCGACAGAGTTACGTATAGCTGCATCCACTCATAGAGTGATTCGAGAGAACATTGCTATCAATAATGAGTTGGATAGTATTTTATCAACACAAGCTAAATTAGCTGACCAAAATGAGAAATATAAAGAGAGTGAAAGAGCTGCACGGGTCGCTTTAGAGTTAGCAGAGGAAGAAAGAGATAAAGCTATCAATAAAAATGTTATTCAACTCAAAGTCAT
TGACCAATTGACAACAGCATTCCAGGATATTAAAAAAGAAAAAGGAATTTATGACAAACGTATCTTGGATTTTGAAACTCTTCAAGTTAAAGTTCAAAAACTTGTAAAAGAAAATGAAAATTTGTCGCTACAAGTACGCATATTGGAACAAAACTTGCATGTTCGTATGAATGATCAAAATAAGGCTTCTGTAGAAGCGTCAAAAGTTTTTAAAGAATGTGCCAGACTAAAACGTATACTTAAAGAAGCTGCAATTGCCGTGCAGGCAGCACTAAAACTGGATGATTGGGCTACTAGTGATCCTACTAAGGAAGTTATGGATAGACATGTCTTACTTTCGCGTTTGCTTTCTATTCTAGCCCAGTATCGAGAAATGCAACGAACAGAATCTCTAGAAACTTTAGCCTCATTCAGTAAAATATACGAGGAAGGAGACCTCGGTTTTATACCAAAACCGCTCCAAAGAAAGTCTATAATTTCTGCTACTGTGTCTTTAACCTCTAAAGAATCTTCCAAATCAGATCAAGGGACATCAGAAATTTCACTTGTCTCGTCATCACTTGGAAGTGTTAAGACTATTCCTAGTATAAAACTCATTCCACCTTCAACAGAACCCCCACCTCCAGAAAAAGAAAGTTTGCAATCCTTTGTAACCTCGACTAAATCTACTATAAGCGGCAGTGAAGAAGAATTAGAGGAAACACCTGAAAGTGAAATGAATATAGAACAAAAATTAGAAGCTAGTAAACTTGAGATTCAAAAATCAATACTTAAAGATCTACAATATTCACAAATGATATCGCAATCTCAGCTGGCAAGTGATTTGAAACATGGATCTAAATCAGTCATAATTGAAGAACCAGTTGAAGAGGCAGAAGAAGCAGACGGAGAAGAAAAGACAAAGAAAGAAGAAGAAGCAGTTGGAGAAGAGGAAGGAAAACACGAGGCGGGTGAAGAGACAGCAGAAATAAAATCAGAAAAGAGTGTTGGTCAGGCGGTAGAGCTGGCGGCTTGCGAGTGTTTGGCGCGTCCCTGGGCTGGAGACTCGTCGGTTGTGGTGGCGGAGGTGGGCGAGCGATGTCGCTGTGGTTGCGTGCTTCACCTGGCGCCCGCCGCGCGTCTCCTGGCTGGATCAGCCCGAGCCTGCCCCGCTAGGTCCTTTTGGTTGCTGCAGGCAGAAGAAGGTTTGGCCGTGAGAGTCAGTTTCGAGTCTGTAAAGCTCCCGTGCAGCGGCCAGGCTCTGAGAGCGAGGGATGGAGACTCCCTGGGCTCGCCTCTCTTGGCCTCGTGGGACGGCCCGGACACCTCAGCTGTGGTAGTGCTGGTTGAAGAGTACGAGACTACCACCACCAGCGGTGTGGTGGAGGTAGTAGGTGGTAGACACGTTCTGGTGGAGCTGCGTTCAGGGGACCCGGGAGATGGGGGGGACCTGTGTGCCGGAGGGTTCCTCGCACACGCTACACAGATAGAGCCGATCCGTAACGCGTCGGCTGCTCGGTCTCCCCTGAGGTCCTGGTGGGCTGGCGGCGGTGCTGCACGAGCGGCGGCCGCGGCACTGGCTGCGGCGGCGGCTCTCGCTGCCTTACTGCTGGCCGCCCACTCCGCGCATCGCACACGCGCTTACCAGCGAGCCGCTGATAAGGAATCACTCACCGATAGTGACGCATGTTCTCTGTCCCTGGAGCTGTGTGCGAGTGCGTCACGTAGCACTCTAATGTCGGAGGCTGGCGGGGCATCTCTACGACGTCTGTTGCCTACCTCAAGGGAGAAGGGTCGCGGAGAGGCACTCAGGGATGAAGAAGACCAGACATCAGATGCAGAGGTCGAGGGTGACAATGTGAGTGTTGGCAGCGCCATCAGTGCGACCAGTCAGGCGACCATCACGCCAGAGAACGTGTCGTCTAGCAGCGCCCCGGCTGGCCCTCCCCCGGATACACAGCATGCCACTACGTCTCTGAGACGATCTCGTACCGTGTCCTCTAACAACGTTCAAGTTGCCA
GCATGTCATTACCTCAAGAACCTTCAACGAGTCAACTGACTGCGGAGAGTACGAGCATCACCAGCATAAGTGAGAGGGATAGCTGCAGCGAAAAAGACAAAAACAGTGCGTGGGACAAGGATAGAAGTGTTAGCAGAGCTTCATCTTCCACTTCTGCTGCCACTCTTACTAATGGTTGGTACTCCCCTGCCCTGAGCGGCGTATCATCAGCTCGAGTGCGTGACAACCCTAGCCGCAACCAGCACGCGACTGCCTCCAACCCTGGTTCTAAAGCCAGCAAGGACTGTCGTAACCGCCGTCTCCTCCGCTCTGACCTCAGCCTCGCCTCACACGCTGAAATGGAAATAGACTACTACGACTATGAGGTCAACAATGCTGGTTCCGTTCCAGGCTCTTACTTAGGAATGGATCCAGCATTCCTAGTTTGGATTCCACCTTTAGATGAAGGAGACATAATTAAGGAGATTGACGACAATATGCCTATATATGAGGAAATTCTACCTAAGGAATTGCACCCGGATCCGGGAAGCAATACGGAATCACCAGATGAAGGACCTGCATTAAGTACTTTGAAGAGTAACTCAGACAATAGATCGAACAAATCAAACGAATCAGCAAAATCGGTCCGAAAACACGAAAGAGGCAACATCATTCACCCCTTAAACTTCAGCATAAGCGATCTGCAAAATGCACCAAAACTCGCTAAAATTAATAAAAAGAAAAAAGATGATGTCTTCATTCCAATGAGAGATCTCACTCTCACTATGTCACCGGTGAAAGTTCACAGAAGAGAAAAAAAATCTGAATACGATAATTCATCGACGCTCAAAAGAGTCAATGACTCCAGAGAAAAAGACGACACTCTGAAACGAACAACGAACTTCAATGACATCAAGTTTGCGGATGAAAATTCTGATTCATCCAACGAAATACAGTTCGCGGACGAGTCTCCAGACAGCAACAGATTGGCTGACAAATACGGAGTAAGAGCATAA

Protein sequence:

>DPOGS215576-PA
MRREHIAALLRLLFIAGGASFVNPYANEERAEVTDLSQVWVEIPRSVVALGGDVRVSVHGVRDTGGLRVRLAREDDEDRTLLATMPLALDSESRMTLPCGYFPRGGSYYLEIVADKETVLDYDNTTVRVRRDLGQGGIMETGDSVVKSWKFDVLWPSANLDVTPEQIQTYPERQVTAILEFPKVVCTPLEDGADFWLELLYCGHSSGGAVLCDGKNSSSHAHVLYSEQMHGFPGRRTMTLRCELFGQAGDYALTLRPAASAGPQDLAVAFVKADWSEQFVLNVHGGSVFPCGDGVRVLFQYPECVLEGADRVRVFGREAERLRYVAERRVRRGQHTASFDCRLFSERYPEYCFVYVSQAVTGAVADVRMDCLPTLPLSDGGAGGWGEWSPWSACPAPAQCSTRTRSRHRFCDSPPPAYGAKFCEVTECNRKIARLRSAVDDYEQKNEELQKAYDKLDEDRADIIAYLKKTLNLKNEENSELKEKVKGLEETREIETAQFKETVADLEKNFTIMKDQLTSENKLLAGKLNTLEEFRAIRDDLMRKFDNQEQAFKDQEMKYKRIIYDAEKKFVIGKDKLKKEMEARLLQLAQDFQDATELRIAASTHRVIRENIAINNELDSILSTQAKLADQNEKYKESERAARVALELAEEERDKAINKNVIQLKVIDQLTTAFQDIKKEKGIYDKRILDFETLQVKVQKLVKENENLSLQVRILEQNLHVRMNDQNKASVEASKVFKECARLKRILKEAAIAVQAALKLDDWATSDPTKEVMDRHVLLSRLLSILAQYREMQRTESLETLASFSKIYEEGDLGFIPKPLQRKSIISATVSLTSKESSKSDQGTSEISLVSSSLGSVKTIPSIKLIPPSTEPPPPEKESLQSFVTSTKSTISGSEEELEETPESEMNIEQKLEASKLEIQKSILKDLQYSQMISQSQLASDLKHGSKSVIIEEPVEEAEEADGEEKTKKEEEAVGEEEGKHEAGEETAEIKSEKSVGQAVELAACECLARPWAGDSSVVVAEVGERCRCGCVLHLAPAARLLAGSARACPARSFWLLQAEEGLAVRVSFESVKLPCSGQALRARDGDSLGSPLLASWDGPDTSAVVVLVEEYETTTTSGVVEVVGGRHVLVELRSGDPGDGGDLCAGGFLAHATQIEPIRNASAARSPLRSWWAGGGAARAAAAALAAAAALAALLLAAHSAHRTRAYQRAADKESLTDSDACSLSLELCASASRSTLMSEAGGASLRRLLPTSREKGRGEALRDEEDQTSDAEVEGDNVSVGSAISATSQATITPENVSSSSAPAGPPPDTQHATTSLRRSRTVSSNNVQVASMSLPQEPSTSQLTAESTSITSISERDSCSEKDKNSAWDKDRSVSRASSSTSAATLTNGWYSPALSGVSSARVRDNPSRNQHATASNPGSKASKDCRNRRLLRSDLSLASHAEMEIDYYDYEVNNAGSVPGSYLGMDPAFLVWIPPLDEGDIIKEIDDNMPIYEEILPKELHPDPGSNTESPDEGPALSTLKSNSDNRSNKSNESAKSVRKHERGNIIHPLNFSISDLQNAPKLAKINKKKKDDVFIPMRDLTLTMSPVKVHRREKKSEYDNSSTLKRVNDSREKDDTLKRTTNFNDIKFADENSDSSNEIQFADESPDSNRLADKYGVRA-
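
The transcript and protein lengths reported in this record can be cross-checked with simple codon arithmetic. The sketch below is illustrative only: the function name `expected_cds_length` is hypothetical, and it assumes the transcript consists solely of coding sequence plus one stop codon (no UTRs) and that the trailing `-` in the protein record marks that stop codon.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the nucleotide count needed to encode `protein_aa` residues.

    Each residue takes one codon (3 nt); an optional stop codon adds 3 more.
    """
    return 3 * protein_aa + (3 if include_stop else 0)


# Lengths as reported in this record.
transcript_bp = 4995  # DPOGS215576-TA
protein_aa = 1664     # DPOGS215576-PA

cds_bp = expected_cds_length(protein_aa)

# Assumption: the transcript is CDS-only, so the two numbers should match.
is_consistent = cds_bp == transcript_bp
print(cds_bp, is_consistent)
```

Under these assumptions the two reported lengths agree, which fits the nucleotide sequence starting with ATG and ending in the stop codon TAA.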