Monarch geneset OGS2.0

DPOGS210375
TranscriptDPOGS210375-TA2340 bp
ProteinDPOGS210375-PA779 aa
Genomic positionDPSCF300025 + 688571-695219
RNAseq coverage123x (Rank: top 57%)
Annotation
HeliconiusHMEL0089060.082.01% 
BombyxBGIBMGA011941-TA0.065.22% 
DrosophilaCG2938-PB0.048.71% 
EBI UniRef50UniRef50_D6WF470.051.62%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WF47_TRICA
NCBI RefSeqXP_321732.40.053.82%AGAP001402-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479658210.052.90%AGAP001402-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479658210.052.90%AGAP001402-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[364-601] IPR0124192.1e-87Cas1p-like
Orthology groupMCL15199 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210375-TA
ATGGGTACTTACCCAAAAAAAAGTTCTGCAGAATGGTTCATAGACCAATTAAATGTAGAAAATGCTAAATTACTGGCATTTGCACTAGTCATAGGATTTATTGGGTACCACGGTATACTACATTTACGTTATGGACCTGACTCATGCACTTGGTTGTTATCATCAGGAAGGTACAAAGGAGATCATGAATGGCAACCTTATGGATGTATGTTGCATAAATATTCTAAAACGGATGCAAGAAGATGTTTGCGGTATTTAGCGTTTTGGGGCAAATATAACAGTTTTGCATTCATCGGGGACTCGAGATTGGAACAGTTATATGAGTATTTTATTGGTGTGCTCAGGACTCGCCTGAAGCTAGATTCTTCATACTCAACTATAGATCACCATCAGCCGAACTACACATACGTTGATAATAAATTAAAGTTGTCAGTAACATTCATATGGAGCGAAGATGTCTCCAGGACTATGGTGGAACAGTTTAGGAGCTGGCAATATTCCGACAGGCCTCCATCTGTGATAGTGGCTAGCATCGGTCTCAATTTGGTGAAGATCCACAACGCTACGGAACCTATACTAGAGGAATATAAACGGAACCTCACACAGCTGGTCCAGCCGATAGATTCCTTATCGGGGAGAGGAACCCAGGTGTTGTGGAAGTTGTTGGAGGACGTTGATCAGAAGACGGTCAAGATCAGCAACAGTGATATAGACGCTTACAACAGAGCTGCCATGGAGATCCTCCAACACAGCGCCACCAAGATATGGAACTCCGCCCGCCTGGCCGGAGCCCCGGGCGCCGGGCCGGGGCTGCAGCACACGGCTCAGATCCTCCTCAACATGTTCTGCAACGACCACATGAACTTCAACGACGGCACTTGCTGCGCCCAACCCGAGCCCTGCACACAACTACAGTTACTCACATTTGCGTTGTTCCTGCTCTGCGCAGTACTGGCCTGCGGACGATGGTTGTGGAAGTGGTCGCAGGGCATCAAGCAGCGCATGGAAGGTTACGCTCTCGTCAACGCAGTACACAACGAAACGCCATCAGCCATGGTGGCTATGGCGAAGTTGGGCATGATTATGGCCTACTTCTATCTATGTGATAGAACTAATTTCTTCATGAAGGAAAACAAATATTATTCTGAATGGAGTTTTTGGCTACCCGTCGGCTATGTGTTCGCGTTGGGGCTGTTCTTTACCGATGAATCTAGATCCAGCAGTCATTCGAGGGTGCTCCATCGCGAACAGACGAACGAATGGAAGGGCTGGATGCAGCTGGTGATTCTGGTGTACCAGGTCACAGGTGCCAGCAAGGTCCTGCCGATATACATGATGGTGAGGGCGCTCGTGTCCTCGTACCTGTTCCTGACCGGATATGGTCACTTCTACTACACGTGGAAGACCGGGGACACGGGCCTCGTGAGATACTTCAGGGTTATCTTCAGACTGAACTTCCTGACCGTGGTCCTCTGTCTGACCATGAACAGACCGTATCAGTTCTACAGCTTCATACCGCTGGTGTCGTTTTGGTACACCTTGATGTTCGCGATTTTCTCGCTGCCTCCTCAACTCTCTCCGCCTCATACCCTGGAGCCTTACCAGCCTGTGTACACAGTTATAAAGACCCTGGGCCTGCTGGCGATGGTGACCGTGCTGTACATGAGCGAAGTGTTCTTCCAGAAGATCTTCCTCATGAGACCCTGGAAGGCGCTGTTTGTGAACTCCGACGACGACATCCGACAGTGGTGGCTGGACTGGAAACAGGACCGCTACTCGATGGCGTACGGCATAATATTCGCGGCGGCTTACCTTTTAGCGCAGAAGTATAGCTTACTGGACGACAACAACCACAGCAACCTGTTCACGCCGGGCATCGCGTTGACCGCTACCCTGCTGGCGTTCATCGCGCTCGGAAGTTACATAACGTTCACATTTTTCTGCACCAACACATTCGACTGCAACGAGATACACTCCTACGTGACCTTTCTGCCCATCATCGGGTACATCATATTGAGGAACGTGTCCGGCGTGCTCCGCACGAGACATTCGAGTCTTTTCGCGTGGTTTGGGACCATAACGCTCGAACTGTTCGCCAGTCAGTCCCATATCTGGTTGGCCGCCGATACTCACGGCGTGTTGGTCCTAGTTCCCGGCGTGCCCGTCTTCAATCTGATCCTGACCTCGTATATTTTCATATTCACCGCCCACGAAATACATAAATTAACAGGAATCATTCTCCCCTACGCCGTTCCGGACGACTGGCGGCTAGTTTTAAGGAATTTTGCTATTTTCCTAGCGATTTTGGTACCAATTGGCATCCACGATGGTATGTTTTAA

Protein sequence:

>DPOGS210375-PA
MGTYPKKSSAEWFIDQLNVENAKLLAFALVIGFIGYHGILHLRYGPDSCTWLLSSGRYKGDHEWQPYGCMLHKYSKTDARRCLRYLAFWGKYNSFAFIGDSRLEQLYEYFIGVLRTRLKLDSSYSTIDHHQPNYTYVDNKLKLSVTFIWSEDVSRTMVEQFRSWQYSDRPPSVIVASIGLNLVKIHNATEPILEEYKRNLTQLVQPIDSLSGRGTQVLWKLLEDVDQKTVKISNSDIDAYNRAAMEILQHSATKIWNSARLAGAPGAGPGLQHTAQILLNMFCNDHMNFNDGTCCAQPEPCTQLQLLTFALFLLCAVLACGRWLWKWSQGIKQRMEGYALVNAVHNETPSAMVAMAKLGMIMAYFYLCDRTNFFMKENKYYSEWSFWLPVGYVFALGLFFTDESRSSSHSRVLHREQTNEWKGWMQLVILVYQVTGASKVLPIYMMVRALVSSYLFLTGYGHFYYTWKTGDTGLVRYFRVIFRLNFLTVVLCLTMNRPYQFYSFIPLVSFWYTLMFAIFSLPPQLSPPHTLEPYQPVYTVIKTLGLLAMVTVLYMSEVFFQKIFLMRPWKALFVNSDDDIRQWWLDWKQDRYSMAYGIIFAAAYLLAQKYSLLDDNNHSNLFTPGIALTATLLAFIALGSYITFTFFCTNTFDCNEIHSYVTFLPIIGYIILRNVSGVLRTRHSSLFAWFGTITLELFASQSHIWLAADTHGVLVLVPGVPVFNLILTSYIFIFTAHEIHKLTGIILPYAVPDDWRLVLRNFAIFLAILVPIGIHDGMF-