Monarch geneset OGS2.0

DPOGS214330
TranscriptDPOGS214330-TA2472 bp
ProteinDPOGS214330-PA823 aa
Genomic positionDPSCF300020 - 574753-579308
RNAseq coverage848x (Rank: top 15%)
Annotation
HeliconiusHMEL0142380.073.91% 
BombyxBGIBMGA003999-TA0.068.61% 
Drosophilasbb-PG1e-5345.96% 
EBI UniRef50UniRef50_D6WTE64e-10440.65%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WTE6_TRICA
NCBI RefSeqXP_973606.17e-10841.68%PREDICTED: similar to AGAP005030-PA [Tribolium castaneum]
NCBI nr blastpgi|910862951e-10641.68%PREDICTED: similar to AGAP005030-PA [Tribolium castaneum]
NCBI nr blastxgi|910862959e-15942.16%PREDICTED: similar to AGAP005030-PA [Tribolium castaneum]
Group
KEGG pathwayspu:5829418e-42 
 K10847 (XPA)maps-> Nucleotide excision repair
Orthology groupMCL17340 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214330-TA
ATGGGAGGAGCACTCTCACCCTTTATGCTAATGAACGGTCCCGTTATAAATTGGATCGCTTTGCGAATAACTCGATTGCGCCCATACCGGCTTGTCTGCCGTCACTGGAACTTTTCGACATCTTTGACCATATTCACAGGAGTACTGGTGGTGAATGTGACGTGGCGAGGGAAGACTTACGTCGGGACACTGTTGGACTGTACGAGACACGACTGGGCGCCGCCACGATTTTGCGATTCTCCTACGGAAGAATTGGACGCCCGGACTCCAAAAGGGCGGGCGAAACGCGGTCGTGGCGCCGCGCCCACAGACATGACGAACTTCACAGAGACGCGTTCCTCGGTTCATAGCAAATTAAGGAACGGCGGTGCCAAGGGACGACGGGCGCTGCCCTCACCCACACCGTTCACACCTCCAAGACCGGACTCCAAAAGAAAACGGAACTCCGATGCCGAGGAGAGACCCCCCACTCCCAAAGCGAAACGTCCACCGCCCACGACATCCTCTCCACCTCCTGACCCGGTGCTGCTCGAGTGCCCAGAGCCGAACTGCTCCAAGAAATACAAGCATGCGAACGGTTTGAAGTACCACAGATCGCATGCACACGGGTCGCCTGATGAAGAAGACAAAGACGGCTCCAGCTCAGAGCAAGAGGAGACTGCAACGGAGCCCGCGTCACCGGCGCGGCCGCCTTCAGAACCGGCCACCCCTGCGACCCCGGTGAAATCTCCTACACCCACTAAGTCACCTGAAGAGAAGAGCGATAAGTCACCAGAAAAGTCTCCGGAAAAAACGGAACCAGTTGAATCACCCCCACAACCCCGGTTCGAGGAGTTCGCGTCGACACCGGAGCCTCCTCCAGTGAATCGGCCGCCTTCAGAACCGGCCACTCCTGCGACCCCGGTGAAATCCCCTACTCCCACTAAGTCCCCTGAAGAGAAAAGCGATAAGTCGCCAGAAAAGTCTCCTGAAAAAACGGAACCAGTTGAATCACCCCCACAACCCCGATTCGAGGAGTTCGCGTCAACACCGGAGCCTCCTCCAGTGAACGAGCGACCGCCGACACCTTCCGCTCCGAGTACTCCCCCGCCAGAATCTCATCCTCCACAACAGCCGACAACACAGTTGACACAGTTCAAGGTGAAGCCACGGTCGGCACTAATGCCGTCTGAGGAGCGAAAGTCGACGGAGTCTCCAGACGGGTCGCCGGGGAAGCGTCGTCGTCGTTCCCCGACCCCGGGCGTGCGGTCGCCCGCCTACTCCGACATCTCCGACGATGCGGCACCCCCGGAGGGCGCCGCTGACGCTGAACACAGAACCTTCCCGGTCTATCATCAGTACTACGGACAGTCGCCATACCTGCCGCCGTCGCATCCTGCCACCGCGCCTCCCACGGATAAGGGTAAAATAAATTTTCTCATAAACATCAGCTACACGATGATACAATTGGACGGTAAACCCGAAGGTGGTTCGCAGAAGGTGTTGCCTCCCCACTTCTATCCCTACAACTATGTCCCAAATTTTCCCTACAACGTGGAATCCGGCCCCCCTCCTGGAGCACCCCTAGATGACAAATCCAAAGAATCAGATCGTTCGAAAACAACTCCCAGTCCGTTAGACAAGAGTAAACAATCGAGGAACTCGGATGCAAAGGATCCGTCGAGGCCGAATGAAAACCATCAAATATTGAAAGAGAGTATAGAAATGAAAGCTCAAATGGGACCCTACGCCTTCCAACGACCGCCGCACGCGAGGGAAGAGGAATTAAGGAGGTATTACATGAGTCTCGACCAACGTCGTAAGGAAGGTGGCGATGGGAAGCAGGTGGGTCCCGGTGCGGCCTCAGGGGGGGGAGTGAGACCTCAACCAGCACACAAACCCAAGGACAAGGATGACAAGCCCAAAGAGGAAGTTAAAGTGAAGCAGGAAGGCCAGAAGCCGACGACGGAGACGCAAGGTCCGCCTCCGCCCCCTACCTCCCAGTACTATCTCCCGCCATACATGCAACCGCCGCACTACGGAGCTTTGCCCTTCGACCCCGTGTACCGCGCGCCACTCAGCCCGATGCTGGTGCCGGGCTTCGGCGGCGGCTGGACCGTGCCTCGCTATCATGCGCCCGAGGACCTGTCTCGGCCGGGGGCACCGGCTAAATTGGAACTGCTACCTGGCCACGGGGCGCAGTACTACGGGCCGCACGCTCCTCACGCCCCTCACGCTCCGCACGCGCCTCACGCGCCGCCACATAAGATACACGAACTGCAGGAACACGCGAAGTCTCCTCAGGGCAAGCCGCCGCGGCCGGAGCCGCCCAAGGACGCCCCTCGGTCTCCGCCCCCGCAGCGCCACGTCCACACGCATCACCACACGCACGTAGGACTCGGCTACCCGATATACCCGCCGCCATTCCCTGCGGCAGCCGTACTGGCGAGTACCCAGGCGGTAGTCAACTCCTTCCCGACGCCGCCAAAATGA

Protein sequence:

>DPOGS214330-PA
MGGALSPFMLMNGPVINWIALRITRLRPYRLVCRHWNFSTSLTIFTGVLVVNVTWRGKTYVGTLLDCTRHDWAPPRFCDSPTEELDARTPKGRAKRGRGAAPTDMTNFTETRSSVHSKLRNGGAKGRRALPSPTPFTPPRPDSKRKRNSDAEERPPTPKAKRPPPTTSSPPPDPVLLECPEPNCSKKYKHANGLKYHRSHAHGSPDEEDKDGSSSEQEETATEPASPARPPSEPATPATPVKSPTPTKSPEEKSDKSPEKSPEKTEPVESPPQPRFEEFASTPEPPPVNRPPSEPATPATPVKSPTPTKSPEEKSDKSPEKSPEKTEPVESPPQPRFEEFASTPEPPPVNERPPTPSAPSTPPPESHPPQQPTTQLTQFKVKPRSALMPSEERKSTESPDGSPGKRRRRSPTPGVRSPAYSDISDDAAPPEGAADAEHRTFPVYHQYYGQSPYLPPSHPATAPPTDKGKINFLINISYTMIQLDGKPEGGSQKVLPPHFYPYNYVPNFPYNVESGPPPGAPLDDKSKESDRSKTTPSPLDKSKQSRNSDAKDPSRPNENHQILKESIEMKAQMGPYAFQRPPHAREEELRRYYMSLDQRRKEGGDGKQVGPGAASGGGVRPQPAHKPKDKDDKPKEEVKVKQEGQKPTTETQGPPPPPTSQYYLPPYMQPPHYGALPFDPVYRAPLSPMLVPGFGGGWTVPRYHAPEDLSRPGAPAKLELLPGHGAQYYGPHAPHAPHAPHAPHAPPHKIHELQEHAKSPQGKPPRPEPPKDAPRSPPPQRHVHTHHHTHVGLGYPIYPPPFPAAAVLASTQAVVNSFPTPPK-