Monarch geneset OGS2.0

DPOGS213113
TranscriptDPOGS213113-TA6216 bp
ProteinDPOGS213113-PA2071 aa
Genomic positionDPSCF300016 + 270239-279090
RNAseq coverage2863x (Rank: top 4%)
Annotation
HeliconiusHMEL0097900.073.75% 
BombyxBGIBMGA007875-TA0.060.05% 
Drosophilanocte-PB1e-1756.18% 
EBI UniRef50UniRef50_E2AK756e-6032.45%Large proline-rich protein BAT2 n=2 Tax=Formicidae RepID=E2AK75_CAMFO
NCBI RefSeqXP_002425652.13e-5234.74%hypothetical protein Phum_PHUM213210 [Pediculus humanus corporis]
NCBI nr blastpgi|3227958654e-6132.54%hypothetical protein SINV_80390 [Solenopsis invicta]
NCBI nr blastxgi|2420097633e-13726.52%hypothetical protein Phum_PHUM213210 [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain[15-166] IPR0097383.5e-20BAT2, N-terminal
Orthology groupMCL25401 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213113-TA
ATGTCTGCACTCTCGACGCCGGGCGGCGGAGGCACTACGGCGCAGAGCAAGCCGACGTCCGGGAAGCAAAAATATCAGAAATTGGACATCAATAGTCTTTACTGCGCTAATAGGAATGAAAGCTCTGAACCGTCATCAGTGAAATCTCAACTTAGCCGTAAACATGGAATGCAAAGTCTTGGAAAAGTACCTTCAGCTAGACGACCACCAGCAAACCTGCCTTCTTTAAAAACTGAAACTGGCCAAGATCCTAACGCTAATACTGTATCTACTGTAACTACTCCTGCAACTACTCAAGCGACATGCACATCTCAGACTATAACCACAACTTCAAGCAGCAGTGCTGTTGCGTCAGGATGGGTGGCTCTCCCCCCTCCGTCCTCACCTCATTTTCGCACAGAATTTCCTTCACTAGAGGCTGCTGCACAACCTTCACATCGTTCTACAGATCACACTGTGCCTCAAACACAGCTGAGACCACAAACGGAAGGCAGCTGGACGTGCGGTGGCACGGCCGGCGTTCGGCAAGAAACTACATCGTCCACCGTCGCCGCTCCCGCCTCCACGCCCGCTTCACAGCAAACACCCGCTTTCCGTGCTATTCTGCCACCCTTCCTGATGAAAGGTAGCAATGGCACAGGATTGGGCTTGGGAACATTGACATCACTTCGTGATTCTAGAAACATTGGCGGTGGCGGCGGAAGCGGTAATTCCAATAGTAACATGGCTGGTCCAAGGCAATCCCAGCGACCGTCCGCTACGCCTCGAGCCGTTGAAGTTTTAACGGCAAGGCCCATTCTTCGTGAAGAACAGATATCTTCTCTTGATGATATATCTCGTGATGCTGGATGGGCGCAACATGACGACATAGATTATGACCAAAAGTTAGACTTCTCAGATGGTGAGTCTTCAGCTCCTACTGTGAAAGTTAGCAACAAGACTAATAATAACCGGACCAGTATTGAAAATGATCGCATAGACTCAAAAATACAAGACCCTGGTGATGAAGACCAGTTATGGGCAGAACGGCGGCAGAAGCAAAGTAATGAAGTTGCTCAAGCTGTTGCTCGTGCAAGACAACGGAAGGAAGAGGAACAGCGACGTCAAATGCGGGATTCACCAGCGTCACAGCAGTCCTCTAAAGATAACCGTGATAAAAATGATAGAAATGATCGTGATCGCAACGATAGAGAAAGAGATTATGAACACAGAGACAAGGATAGGGCTGAGAGCAAAGAAAAGGAAAGGTGTGACAATAAAGACAGAGAAAGAAACGATTTACGTGACAGGGACAAGGACCGATTAGAGGGTAGGGATCGTGATCGTATGGAAAACAGAGATCGTTTTGATAATCGTGATAGAGAACGTGACCGCGACCGTGAAAGAGATCGAGCAATGGATAACAAGGATAGGGGAAGGCAAGACAATAGAGATCGGAATGATAGCAATAAAGATAGGGACTTGCGTGACAGGGATCGTAATGACAATAGAGATCGCAATGAAAACCGCGATCGTAATGAAGCTAGGGATCGAGGAGATAGAGACCGCAATGATAATCGTGAACGTGAAAGATTTGATAGAGAAAGAGACCGAAATGACCGGGATCGCGATCGCGATCGAGAAAGGTCTGATCGGGACTTCAGGGATCGCGACAGAGATTTTGCTCGTGAGCGCGATCGGGACCAGTCAAATCCATTCTCGAAGATATTTCAAGCCAACATACCTCCTAGGTTTTTAAAGCAACAGCAGAACAGACAGCAGGAAGACCAACATAAGGGATGGGCCTTTGCTGGAAAACCTGCACCAAGAAGACAAGAAGCGCAATCATACAACGCACCGAGACATGCACCAAATAATGGCCGACGTTCATACTCCAGAGATTATTCAGACCGTGAAGATGAGGTACGGAGAGATAAAGATGGACCTCAATGGCGTAATGAAATGCAAGACTATGATAGATCTGGTCGAGAGTTAGACAGGTCTAACTCTAAGGAATCTTATAAAGACTTTAACGATTACAAAGATAGACGATCTGATACTGAACGAAAACCTTTAGATACAATTGAACATAGCAGCAGAACAGCAGCCGATAAGTTGACTGAAGTCTTTGAAAGAAAAAGCATAGGTGTTGCTGAACCCTTTGTTTCGACAGAATCTTCTATACAAGCACAAGCTCATGAAAAAAGATCTTCACCGCCTTCAAACACACCACAAAGGGAATTTACCGAAAAGGATCTTATTGAAACCTCTTGGGCTGATATGGGACAGGAAGAACAAAGTAACAGCGCTATTTCAGGGGACAAAAAAAATTCAGCCAAAATGTTCGATCAAAAGGAGAATGAATCGAACGCCCCACAAGCTGACCCCATCAAGACGTTACAAGAAACAAATGATGCAAAAATTATTAGTTCGAACATGTCTCAATCTCATAACAATAATTACAATCAGACATCTAACAACACTTCTAATGCTTCTGTACAAAATCAACACTCTAATATGAATATTGCGGGAAATATTCCAAGTTCAATAATGCCATCTGGTATACAATCTCAAACCCCTTCATTTACCGATTCTCACTTTACTTCTAAGTCTACTTCTAATCAAGTACTTCCTAAACCTATTAGCTCAAATGATATACTTTCTAATTCCTCACAACCACCAAAGCAGTTTGTTCCGGAACAGTCAAATGTTGTTACAAATACTGCCAAACAAACAGTATCCTTAAAATCAGAAAAAGATCTGTCAGAATCTGAATATATTGCCTCTAAATCCAGCAGCGATCCTTCCGGAATTATAGGCAATAAACTGACTGATTCAAATTCCATTGAAAATAATCAAGACTCTCAAAAGCGACCACAGGATGATAAAAAAAGTGAAAGATATACTAATGATCAAGGAAACAAAATGCCGCCTAAAGAACCTATTGAACGTAAATCAAGTGGGTCAGGATCTGAAAAGAAAGGACGAGGGTATGGTGTTGGGGGAGGGTATAACGTCTATGGCAGAGGTTGGGGTTCAAGAGAGTCACGTGGTCGGCGTTCACATAGAAGCTCTCGGTCAAATAATAGAGCCAGTGAGTCAGATGGTTCAACAGATGGCACTAATATTGATCGAAGGGAACGACGAAGAGCACCTCGTAGCCCACGTGGCCCTAAAAAGCAGGAAAGACTCGAGGATTCAAGTCACGTTATTGACCAGGGATCCCAATTTACAGATGGTTTAGAAAATAGAGAACCATTTGCCCCTCGTGGTCAACCTTCGCGACGAGGCCGTGGTGGCTTCCATGGCACAAACAGACCTCCAGCACCGGCTAAAAGAGTAACTGGATATGGACCGCCGAATACAAAAAGTCCTTTTAGTCAAGCTAACAGAGCAAACAAAGACAATGAGGATTGTAAGGACAACGTTCAAAGTGACAAAGACAAAAATACCAATAGACCGCATACCGGTTCTTCAGGAGGCAAAGGTAGAGATCGGCGTTCTAAAGGAGGTCTCAGTGGTGAAGACGAAAATTGGGAAACTACCTCTGAACATTCTGAAGGCGGTGGCGGACCACGCCGATCTGGTAGCAGACAATTAAATCAATCTCAAAAGGGACAAGGTGGCAGTCGAAATCATATTTCTAACAATCGACAAAACAATGGTCGGAATCAGCAGGGTATTAGGAAAGAAGGTGACAATAAAAGTACGGACGCATTAGAGGCTATAGGTGACCCTAAAATACCGACTGCCAAGAAAGATGAAGAAGCTATTGATGACGGATTCCAAGAAGTGCGTAATAAAAAAAATTCAAAAGATATCAGAGGTCCCCTTAATGCAAAAGAAGAAAAGCAACCTAGATCTCGTTCAAACCAAGGCGGAGGTGGCAGAAATGGTTCATCTACAAGAAATTCAAATGACAAATCTAACTCTAGAGGTTCTGGACCTGTTCCCGGTAAATCAAATGTACCTTACGATAACAGACCGCGTCAGGCTAACTTGGCACCTCGCTTTGTTAAACAAAGACAAAAACAACAAATGGGGTTGGTACCCAACTTTGGCACGGATACCGGTGCTGCTCCCCCTCCGCCACCAGTAAACGCGTGGGACAAACCTATTGCACAAACTTTGCGTGGCAATGTTGAAGACCAACCCGAAGTTGTTGAAAAATCAAACCAGTCTAGCCAACGCAGTACTCCAGGAGACACAACTGCTGAAAGTAATAAACAACCTCCTCCTTCATGTGCTGTAGTTGCTGATAAAGCGGGCGTCTTAGATGGGACTACACCGCCTGTGGAAACTATAATATTTGAAAACACCAATTATAAAACTGTGCCACCTGCGGAGGCTCTTAAACAAAAATATCAACCTAGTGCAGTTACAGCTAAACCCCAAGGAGAAGAAGTTCAAAATGAAGTGGATTCTAGGGCAATGCCATTCAATGGAGACGTGAGGTCGCGTCCTAGATCAATACAAGAGCTTATGGCTGAAAGCGGTAGACCTGTGTCAGAGGCCGAGGGGTCTCTCGGTCTTCAAATGGCTTTTGATACATCACAGAAGAACGAGGATTCTTCTGACATGAAACTTGATTTTGCATTTGACTCTGACCTTGGGCAGCTCACTGATGAAAAATCAGCTAAAGCTTTAGGATTGCCTCGTGGCACCCACATGAGTACGTCGAACACCATATCACCGTTAGCAGCCGATCTCAATTTAAAAATAGCGAGTGTTAAGAAAGTATGGGAGATGCACGCTGTGGCTGAGGGTAGTGAGGAGCTACAATTCACTACAAATTTCGAAGAGAACAATACAGAAACTGGCGCACCTCCGAACGTGTGTAAAGTTAAACCAACTCAGCAACTACAATCCCCGCCGCCTCAACATTATAACCATGTAACATACCAGGGCGGTTATGGAGGCCTGTCAGTCCCATCGCCACCAGCAGTTCTGTACAACTCGACTCAACAATTATTAGGTTCATCTCAGCAACTACAACAACAAGGTGGATTATACGGAGCCTTTTTAGATCAAACACGGGGACAATTTGGAGGCTTCCCTGGAACTCCTTATGGCGCGGGTTCTGCTACTCCATATAATTACCAACCACCACCGGATATGTTCCAGAGCCTGCCAAGTCAATACCGAATGGCCGCTGCGGCAGGAGGTGGCGCTGCCTTCGGCCAGTCTGGTCAATTAGGAAACAGTCCAAGCACTGTACTTATTTCAAGTACGTCAAACTCGCTCATGTCGGCTACAGTAAAGCCATCGACTCAACAGATCGGAGCTATTGGTAGTAAAGGTGGAGGCGTGGGCGGAGTCGGTGGGGTCGGGGCTGTTGGTGGTGTGAATACGTTCCAGCAGCAGTACTTGGGGTACTCGGGACCCGTGGGTGATTCTCCATATTCACTGCCTGGGCTGCTGCCTCGGCCTGCCCCTCCTTCCACTTCATACTACTCCCCCTACCAGCCGCCCGCCGCGCCCGCTCCTACATATCCGCTACAGTTCACTCAGCCAGCACAGTCCAGCGCGTTCAGTTCACAGTTCCTCTCCTCACAGCTGCATGTCGCCGCCGCCGCGGCCGTCCAACAGATGCAGCAACAATATCGGGCACCTCTACAACAACAGTATGCTCCGCCTCAGCCCCGACCTCCGCCTCAGCAACAACTCAAGAGTCCACTGCACGAACATCCTAACGGATTCGCCCCTTTGTGTGACTCGGCCTCACCGACGCCCAAAGGAGCAACAAAACAGCAGAAACCGCCTCATTCGCCGCCTCAACACAAGTACCACGCGCCGCCGCCACAACATCAACAGCACCCGCCTCACCCCCCGCCAGCACACACACCGCACCAGCACCATCAGCAGCACCAGCAACACCAACAGCAAATGGTCGGCGGCGGAAACAACGGACGTGGCGGGAACGGCGGCAACGGAAACGGCAACGCCATGAACCGCGGCGGCATGGTGACGTCACGCTACCCAGCACCCATACAGAGGCCGCACGCGCCCGCGCCTCCCATGTACCGCGCGCCGCCCTCGCAGCCGCCGCGACCACACCACGCGCAACACGTACAACATATGAGACCCAACCTCTACTACCATCACCACCAACGCAACGGCGGGGGAGGTTCGGAGCGTGTGACCGAGGGTGGGGAAGTATCAGCTACCATGGAGGAGGTCGGGGAGACGGTGACCGCAGGCGAGACGCCTTCCCCCGCTGAGGTGAAGGCCGAGTGA

Protein sequence:

>DPOGS213113-PA
MSALSTPGGGGTTAQSKPTSGKQKYQKLDINSLYCANRNESSEPSSVKSQLSRKHGMQSLGKVPSARRPPANLPSLKTETGQDPNANTVSTVTTPATTQATCTSQTITTTSSSSAVASGWVALPPPSSPHFRTEFPSLEAAAQPSHRSTDHTVPQTQLRPQTEGSWTCGGTAGVRQETTSSTVAAPASTPASQQTPAFRAILPPFLMKGSNGTGLGLGTLTSLRDSRNIGGGGGSGNSNSNMAGPRQSQRPSATPRAVEVLTARPILREEQISSLDDISRDAGWAQHDDIDYDQKLDFSDGESSAPTVKVSNKTNNNRTSIENDRIDSKIQDPGDEDQLWAERRQKQSNEVAQAVARARQRKEEEQRRQMRDSPASQQSSKDNRDKNDRNDRDRNDRERDYEHRDKDRAESKEKERCDNKDRERNDLRDRDKDRLEGRDRDRMENRDRFDNRDRERDRDRERDRAMDNKDRGRQDNRDRNDSNKDRDLRDRDRNDNRDRNENRDRNEARDRGDRDRNDNRERERFDRERDRNDRDRDRDRERSDRDFRDRDRDFARERDRDQSNPFSKIFQANIPPRFLKQQQNRQQEDQHKGWAFAGKPAPRRQEAQSYNAPRHAPNNGRRSYSRDYSDREDEVRRDKDGPQWRNEMQDYDRSGRELDRSNSKESYKDFNDYKDRRSDTERKPLDTIEHSSRTAADKLTEVFERKSIGVAEPFVSTESSIQAQAHEKRSSPPSNTPQREFTEKDLIETSWADMGQEEQSNSAISGDKKNSAKMFDQKENESNAPQADPIKTLQETNDAKIISSNMSQSHNNNYNQTSNNTSNASVQNQHSNMNIAGNIPSSIMPSGIQSQTPSFTDSHFTSKSTSNQVLPKPISSNDILSNSSQPPKQFVPEQSNVVTNTAKQTVSLKSEKDLSESEYIASKSSSDPSGIIGNKLTDSNSIENNQDSQKRPQDDKKSERYTNDQGNKMPPKEPIERKSSGSGSEKKGRGYGVGGGYNVYGRGWGSRESRGRRSHRSSRSNNRASESDGSTDGTNIDRRERRRAPRSPRGPKKQERLEDSSHVIDQGSQFTDGLENREPFAPRGQPSRRGRGGFHGTNRPPAPAKRVTGYGPPNTKSPFSQANRANKDNEDCKDNVQSDKDKNTNRPHTGSSGGKGRDRRSKGGLSGEDENWETTSEHSEGGGGPRRSGSRQLNQSQKGQGGSRNHISNNRQNNGRNQQGIRKEGDNKSTDALEAIGDPKIPTAKKDEEAIDDGFQEVRNKKNSKDIRGPLNAKEEKQPRSRSNQGGGGRNGSSTRNSNDKSNSRGSGPVPGKSNVPYDNRPRQANLAPRFVKQRQKQQMGLVPNFGTDTGAAPPPPPVNAWDKPIAQTLRGNVEDQPEVVEKSNQSSQRSTPGDTTAESNKQPPPSCAVVADKAGVLDGTTPPVETIIFENTNYKTVPPAEALKQKYQPSAVTAKPQGEEVQNEVDSRAMPFNGDVRSRPRSIQELMAESGRPVSEAEGSLGLQMAFDTSQKNEDSSDMKLDFAFDSDLGQLTDEKSAKALGLPRGTHMSTSNTISPLAADLNLKIASVKKVWEMHAVAEGSEELQFTTNFEENNTETGAPPNVCKVKPTQQLQSPPPQHYNHVTYQGGYGGLSVPSPPAVLYNSTQQLLGSSQQLQQQGGLYGAFLDQTRGQFGGFPGTPYGAGSATPYNYQPPPDMFQSLPSQYRMAAAAGGGAAFGQSGQLGNSPSTVLISSTSNSLMSATVKPSTQQIGAIGSKGGGVGGVGGVGAVGGVNTFQQQYLGYSGPVGDSPYSLPGLLPRPAPPSTSYYSPYQPPAAPAPTYPLQFTQPAQSSAFSSQFLSSQLHVAAAAAVQQMQQQYRAPLQQQYAPPQPRPPPQQQLKSPLHEHPNGFAPLCDSASPTPKGATKQQKPPHSPPQHKYHAPPPQHQQHPPHPPPAHTPHQHHQQHQQHQQQMVGGGNNGRGGNGGNGNGNAMNRGGMVTSRYPAPIQRPHAPAPPMYRAPPSQPPRPHHAQHVQHMRPNLYYHHHQRNGGGGSERVTEGGEVSATMEEVGETVTAGETPSPAEVKAE-