Monarch geneset OGS2.0

DPOGS214011
TranscriptDPOGS214011-TA1953 bp
ProteinDPOGS214011-PA650 aa
Genomic positionDPSCF300313 + 16825-19114
RNAseq coverage17x (Rank: top 81%)
Annotation
HeliconiusHMEL0083792e-16249.33% 
BombyxBGIBMGA014395-TA1e-7235.34% 
Drosophila% 
EBI UniRef50UniRef50_UPI000192758B4e-7436.52%UPI000192758B related cluster n=2 Tax=unknown RepID=UPI000192758B
NCBI RefSeqXP_002163940.11e-8036.29%PREDICTED: similar to hCG32740, partial [Hydra magnipapillata]
NCBI nr blastpgi|2211174342e-7936.29%PREDICTED: similar to hCG32740, partial [Hydra magnipapillata]
NCBI nr blastxgi|2211174344e-8136.35%PREDICTED: similar to hCG32740, partial [Hydra magnipapillata]
Group
KEGG pathway 
Orthology groupMCL21019 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214011-TA
ATGGCGGCCCGGCGTAGCTCATCTGTGTTGACTAAGCAAACGCTACTAACTCGAGGTGGTAGAATATTAGCTCTTGTGCCTGCTGAAAATGCACCCTCTGACGATAGCGAGCGCTCAGATGTCGAAGAGGAATTTCGTGTACGCACCCCGCTTTCTTCATTTTCTTCTCCAGCACCTTCTATTCATTCTTCCTTGGAAAGACTAAACATTGTAGATGATGACGAACATTACAACAGCGACAATGTTCCACTTACGCCTATTTTTCAAACAGTATATTCAAGTCCGTCTTTAGAACCAAATTGCAATAAAGTACCGGTGCTGAGTGATATTCCGTCACTCCCAACAACTCCTCTTACTCCAGTAAATCCACCAAATCCAAAAACCAGATCCCGACGTCGTCAGCCCACCGTTCCTGTCCTGAAAAGACCGAGACTAATGAAAAAATTTACTTTGAACTATCAGTGGAAAAAAGCCGTTTTTCGACACAGAGCTATTATAGAAGAAGACTCCAATTATACAGATGTTCCGGATATATCAGCGATGGATTATTTTTATAAATTTTTTTCGCCAGACATTCTAACAGATATCGTAGAGCAAACAAATTCTTACTCAATAGGAAACACTGGACGTTCACTTGGTTTGACGGAAGATGAGCTTCGCGATTTTCTAGCCATACATATTATAATGGGGGTCGTGAATATGCCAGCATATACGGACTTTTGGAGTTTACGATACAGGTATAACTTAGTAGCAGATTTGATGACACTAAAACGATATCAACAAATACGTCGCAATTTGCACTTCGTTGACAATAATATACAGGATTCCGACCGATATTACAAAGTGAGGCCATTTGTGCAGAAAGTAAGACAAAATTGTCTAGCACAGGAAAATGAACGGAAGTTTAGTATAGACGAGATGATGATTCCGTATAAAGGAACCAAAGCTGGAAAACGCCGTCAATATATGAAGGACAAACCCAACAAGTGGGGTTTCAAAAATTATGTCCGCGCCGGAGTCTCAGGCATGGTATATGATTTTCTTCTTTATGGTGGCGAAGATACATTCAGATTTTACACGTTTTCAGATAAGGAAGCTTCTATAGGTTTCGGAGGACGAATTGTAATCGCTTTATGTCAAAGCATAAAATTCAAACCGGCTTTTGTTTTCTGTGACAATTTTTTTTCCTGCCCTGAGCTGTTTTACATTCTACGCGAAGAATACGGTATATTTGGTTTGGGGACTATAAGAAACAATAGACTTAGAGGAGCAGAAAAAGTGTTGCCTTCAGAGAAAGAGATGAAAAAGAAGCCTAGAGGAAGTTATTCACAAGTAGTGTGCAACAAAAATAGATTATCGGTAGTGAGATGGAACGATAACAAACCTGTTACCCTAATTAGTTCATATGTAGGCGTAGAACCTGTCGAAAAAATGAAAAGATACTGTAAGGAGACAAAAAGCAAAATCGATGTCGACTGTCCCCAAATAGTCAAGGAGTATAATAAACATATGGGGGGAGTCGACTTAGCCGACATGCTTATATCTCTCTATAAAACACCATTTAAAAGTAGGCGTTGGTACATAGGTATTTTTGTTCAAATACTTGACATATGCATAAATAATGCCTGGCTTTTATATAGGCGGGATCAAGCTCAATCAAGCAAGCAATATATACCTCTAAAAGATTTTAGGTACGAAATATACGAAGGCCTGAAAAAATTTGGTCGTGCTGATAAAAATGACAAAGCCAGAAAAGGTGTGCCATCAGCTAACCCAGTGGAATCGTTACGATACGATGCCATTGGTCATTTTATGATAATGACCACTCAAGGCAGGTGCAAGCTATGTCAGAAGCTTACCACTGTATTGTGCATGAAATGCAAAGTGCGGCTATGTTTCGTAACAGGGAAGAATCCCAGAAACTGTCAATTAGATTACCATGTAAAGGCGTAA

Protein sequence:

>DPOGS214011-PA
MAARRSSSVLTKQTLLTRGGRILALVPAENAPSDDSERSDVEEEFRVRTPLSSFSSPAPSIHSSLERLNIVDDDEHYNSDNVPLTPIFQTVYSSPSLEPNCNKVPVLSDIPSLPTTPLTPVNPPNPKTRSRRRQPTVPVLKRPRLMKKFTLNYQWKKAVFRHRAIIEEDSNYTDVPDISAMDYFYKFFSPDILTDIVEQTNSYSIGNTGRSLGLTEDELRDFLAIHIIMGVVNMPAYTDFWSLRYRYNLVADLMTLKRYQQIRRNLHFVDNNIQDSDRYYKVRPFVQKVRQNCLAQENERKFSIDEMMIPYKGTKAGKRRQYMKDKPNKWGFKNYVRAGVSGMVYDFLLYGGEDTFRFYTFSDKEASIGFGGRIVIALCQSIKFKPAFVFCDNFFSCPELFYILREEYGIFGLGTIRNNRLRGAEKVLPSEKEMKKKPRGSYSQVVCNKNRLSVVRWNDNKPVTLISSYVGVEPVEKMKRYCKETKSKIDVDCPQIVKEYNKHMGGVDLADMLISLYKTPFKSRRWYIGIFVQILDICINNAWLLYRRDQAQSSKQYIPLKDFRYEIYEGLKKFGRADKNDKARKGVPSANPVESLRYDAIGHFMIMTTQGRCKLCQKLTTVLCMKCKVRLCFVTGKNPRNCQLDYHVKA-