Monarch geneset OGS2.0

DPOGS207262
Transcript        DPOGS207262-TA  1275 bp
Protein           DPOGS207262-PA  424 aa
Genomic position  DPSCF300008 - 706110-709087
RNAseq coverage   447x (Rank: top 27%)
Annotation
Heliconius        HMEL002178        0.0     89.18%
Bombyx            BGIBMGA012127-TA  0.0     85.34%
Drosophila        l(2)09851-PA      2e-163  63.64%
EBI UniRef50      UniRef50_B4J5Q6   4e-164  63.24%  GH20229 n=5 Tax=Coelomata RepID=B4J5Q6_DROGR
NCBI RefSeq       XP_974924.1       2e-170  66.04%  PREDICTED: similar to GA11814-PA [Tribolium castaneum]
NCBI nr blastp    gi|91079028       4e-169  66.04%  PREDICTED: similar to GA11814-PA [Tribolium castaneum]
NCBI nr blastx    gi|91079028       7e-175  66.67%  PREDICTED: similar to GA11814-PA [Tribolium castaneum]
Group
Gene Ontology     GO:0005515  7.5e-50  protein binding
KEGG pathway
InterPro domain   [101-411]  IPR015943  7.5e-50  WD40/YVTN repeat-like-containing domain
                  [77-415]   IPR011046  1.4e-46  WD40 repeat-like-containing domain
                  [24-93]    IPR022052  8.4e-17  Histone-binding protein RBBP4
                  [238-271]  IPR019781  3.2e-11  WD40 repeat, subgroup
                  [323-363]  IPR001680  2e-08    WD40 repeat
                  [258-272]  IPR020472  1.1e-06  G-protein beta WD-40 repeat
Orthology group   MCL11751  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207262-TA
ATGGAAGAAGGCGACCAAACCGAAGAAGATTCACGGCCTAAGACCTATCTTCCCGACCAGCCTTTACAAGAAGATGAGCATTTAATTTGTGATCAATCGGCTTATGTTATGTTGCACCAAGCACAAACTGGTGCTCCATGCCTTAGTTTTGATATAATCACAGACAACCTAGGCAATGATCGTCAACAATTTCCCATGACAGCTTACTTGGTTGCCGGTACACAAGCCTCGAGCGCTCACTTAAATAATGTTTTAGTTGTAAAAATGTCAAATTTACACACCACCGCAAAACCAGAGGATGAAGAAGAATCCGATGAGGACGATGATGACGAAGAGGAAGATGAAGAAAAGAAACCACAAATGACTTTTGCATTTATTAAACACCAGGGCTGTGTGAATAGAATAAGAACCACAAACTATAAAAACTCAGTTTTGGCAGCAACATGGTCTGAATTAGGAAGGGTGGATGTGTGGAATATTACTCAGCAATTACAAGCAGTTGATGAACCAGCATTACTTGAGAGATACAATCTTGACACCGTGTCTAATCCAGTGAAACCATTATATTCATTCAATGGACACCAACAAGAAGGATTTGGCATGGACTGGTGTCCAACTGAGCCAGGAGTATTAGCAACAGGTGATTGCAGAAGAGACATTCATATATGGAAGCCGAATGAGGCTGGTACTTGGACAGTGGACCAAAGACCCTTAGTTGGACACACAAGTTCAGTGGAAGATATCCAATGGTCACCTAATGAAAAAAATGTCCTGGCTACCTGCTCAGTTGATAGAACTATCAGAATATGGGACACAAGAGCACCACCACACAAAGCGTGTATGTTGACAGCTGAAAATGCTCACGAGAGAGATATTAATGTTATATCTTGGAATAGAAAAGAACCATTTATAGCTAGCGGTGGCGATGATGGTTTTCTCCACATATGGGATCTCCGACAATTCACTCGCAGTACGCCTGTTGGTACTTTCAAACATCATACTGCGCCGATCACGTCAGTTGAGTGGCACTGGACAGAGCCCAGTGTGCTTGCTTCAGCAGGAGAGGATAACCAAGTCGCTCTGTGGGACCTTGCTGTTGAAAGAGATGATGAAGAAGTAGTGGAAGAAGAGTTAAAGAATTTACCACCACAATTGCTTTTTATTCATCAAGGACAAACAGATATTAAGGAACTTCATTGGCACAAGCAAATTCCTGGCGTCATAGTGACAACCGCACATACAGGATTCAATATATTTAAAACTATAAGTGTATAA

Protein sequence:

>DPOGS207262-PA
MEEGDQTEEDSRPKTYLPDQPLQEDEHLICDQSAYVMLHQAQTGAPCLSFDIITDNLGNDRQQFPMTAYLVAGTQASSAHLNNVLVVKMSNLHTTAKPEDEEESDEDDDDEEEDEEKKPQMTFAFIKHQGCVNRIRTTNYKNSVLAATWSELGRVDVWNITQQLQAVDEPALLERYNLDTVSNPVKPLYSFNGHQQEGFGMDWCPTEPGVLATGDCRRDIHIWKPNEAGTWTVDQRPLVGHTSSVEDIQWSPNEKNVLATCSVDRTIRIWDTRAPPHKACMLTAENAHERDINVISWNRKEPFIASGGDDGFLHIWDLRQFTRSTPVGTFKHHTAPITSVEWHWTEPSVLASAGEDNQVALWDLAVERDDEEVVEEELKNLPPQLLFIHQGQTDIKELHWHKQIPGVIVTTAHTGFNIFKTISV-