Monarch geneset OGS2.0

DPOGS204694
Transcript: DPOGS204694-TA (1710 bp)
Protein: DPOGS204694-PA (569 aa)
Genomic position: DPSCF300170 + 432784-442223
RNAseq coverage: 1226x (Rank: top 10%)
Annotation
Heliconius      HMEL008251        4e-80   47.86%
Bombyx          BGIBMGA007473-TA  1e-148  48.04%
Drosophila      CG7082-PA         5e-73   33.47%
EBI UniRef50    UniRef50_B0X0X8   8e-72   32.80%  Tudor and KH domain-containing protein n=3 Tax=Culicidae RepID=B0X0X8_CULQU
NCBI RefSeq     XP_001663280.1    1e-73   35.01%  hypothetical protein AaeL_AAEL013072 [Aedes aegypti]
NCBI nr blastp  gi|157168001      2e-72   35.01%  hypothetical protein AaeL_AAEL013072 [Aedes aegypti]
NCBI nr blastx  gi|91089625       3e-72   36.81%  PREDICTED: similar to tudor and KH domain-containing protein [Tribolium castaneum]
Group:
Gene Ontology:
  GO:0003723  8.9e-15  RNA binding
  GO:0003676  1.4e-11  nucleic acid binding
KEGG pathway:
InterPro domain:
  [218-346]  IPR008191  4.9e-30  Maternal tudor protein
  [44-158]   IPR004087  8.9e-15  K Homology
  [48-108]   IPR018111  7.6e-14  K Homology, type 1, subgroup
  [269-333]  IPR002999  1.4e-11  Tudor domain
Orthology group: MCL13021  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204694-TA
ATGGCTCTTAACACCAAATTGTTATTGCCCGCTCTATTGGGCTTCTCATTGGTGACAGTTAGTGCATTTGTTGTGTACTATGTGTTCAAACGAGAAGATGAAGCGGAAGATAAACCAATTCGATCAACTAAGCTTAATATGATTGAAGTTAAAGTACCTAAAAGTATTGTCCCAGCTCTCATCGGTAGGGGCGGGACAAATATTAAACAAATCGAGGAGAAGACGGGTGCAACGATACACTTTAAAAAGTTCAGCGATAAGCAATACGATGTTTGCATTCTACGCGGAAGATCTCACTCTACGCAGGTAGCTGAGACATTGATACATGACTTCATCAAACAACAGCCGTTGATAGAAAACGACACTATGGAGGTTGAGGGGGAGAGAGGAGACAACGGTCAGCGGCACGTCAGTCTGAAGGGCACCGCGGAACAGATCGAAGCGGCCAAGAAAATGATCGCTGAATGTGTGGAATCGGAGAGATGTCGTCGCGAGATAGAACAGTCTAGACGTTTACCGCGCGTGCCGACCCCGCCAACCGGGCAGCCAGCCCAGCCGGTGGCGGTGCAGGCAGAGCTGCCGGCAGCTCAGCCCAACAGACCGGACACGCCAGGTGACACCGCCAAGCATGTGAAATACAGGCGTGAGGCGTCTTCGTCAATAGAGGTGTACGTGTCAGCGGTGTCGTCTCCGTCACGTTTCTGGGTTCAGTTCGTCGGGCCTCAGGTCTCCCAGCTGGATGAACTGGTTTCACACATGACGGAATATTACAATAAGAAAGACAACAGAGCGAACCACGCGTTAAGTCACGTGAGCGTGGGCCAGGTCGTGGCGGCTGTGTTCAGACACGACGGTCGCTGGTACAGGGCGCGGGTCAACGACATCAGGCCGAACGAATTTGATCACACCCAGCAAGTGGCGGATGTGTTTTTCCTGGACTACGGCGACAGTGAATACGTGGCTACCCACGAGCTCTGCGAGCTGAGGGCCGATCTGCTAAGACTGAGGTTCCAGGCGATGGAGTGTTTCCTGGCCGGCGTGGAGCCCGCGAGGCAGAGGAACGAGGCCGGCGCGGCTCGCTGGCATCCCAACGCTATTGAAAGATTCGAGGAACTGACACAGGTGGCCAAGTGGAAGCCTTTGATGTCCCGCACGTGTAGCTACAGACGTAGCGCTGTGGTCGGCGGCCGGGAGAGAGAAGTTCCCGGGATAAAGTTATATGACGTCACGGACGGGAATTACTTGGACGTGTGTGAGGTGCTCATAGCTGAGGGCTGGGCGGTGCCGGCGGCCTGCGACACGTTCCGCGGGGCCTTGGGCGGCGGGGGAACACCGGGGGGGCCGTACGGAGACTTGGGACACTCACGCGTTCTGGGTATGGTGACCGGAAATCGAGCGTCATCAATGCCGGCTGAGGCCAAGGACAAGCCGAGGACACCGGGACCGGAAACGTTCAGAGACAACCCGATGTCAAAGTCACTGGCTTCCGGTTTGGAAAATTCAGATAAAATCAAATCCGTATCTAACTTCGATCTCTCCTACTCCGAGACTGTTCCGAAGTCGAACGGCACGAGCGACTTCCTCGACTCCGAGATAACGGCCAACGAGAAACAACTCGAAGTCAAATCGAAGTCACTCAACGACGAGAAAGCGAAGATGAATAGGATCGACTCGCATCATAGTAATTTAGAATCTCTCGGAAGGATTTGA

Protein sequence:

>DPOGS204694-PA
MALNTKLLLPALLGFSLVTVSAFVVYYVFKREDEAEDKPIRSTKLNMIEVKVPKSIVPALIGRGGTNIKQIEEKTGATIHFKKFSDKQYDVCILRGRSHSTQVAETLIHDFIKQQPLIENDTMEVEGERGDNGQRHVSLKGTAEQIEAAKKMIAECVESERCRREIEQSRRLPRVPTPPTGQPAQPVAVQAELPAAQPNRPDTPGDTAKHVKYRREASSSIEVYVSAVSSPSRFWVQFVGPQVSQLDELVSHMTEYYNKKDNRANHALSHVSVGQVVAAVFRHDGRWYRARVNDIRPNEFDHTQQVADVFFLDYGDSEYVATHELCELRADLLRLRFQAMECFLAGVEPARQRNEAGAARWHPNAIERFEELTQVAKWKPLMSRTCSYRRSAVVGGREREVPGIKLYDVTDGNYLDVCEVLIAEGWAVPAACDTFRGALGGGGTPGGPYGDLGHSRVLGMVTGNRASSMPAEAKDKPRTPGPETFRDNPMSKSLASGLENSDKIKSVSNFDLSYSETVPKSNGTSDFLDSEITANEKQLEVKSKSLNDEKAKMNRIDSHHSNLESLGRI-
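
The lengths and domain coordinates in this record can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration and is not part of the database: it assumes the 1710 bp transcript is entirely coding sequence ending in a stop codon (the sequence above starts with ATG and ends with TGA), and that the InterPro ranges are 1-based inclusive protein positions. The helper names are hypothetical.

```python
# Values copied from the record above; helper names are hypothetical.
TRANSCRIPT_BP = 1710
PROTEIN_AA = 569

# InterPro hits as (start, end, accession); coordinates are assumed to be
# 1-based inclusive positions on the protein.
INTERPRO_HITS = [
    (218, 346, "IPR008191"),
    (44, 158, "IPR004087"),
    (48, 108, "IPR018111"),
    (269, 333, "IPR002999"),
]


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a CDS of cds_bp bases that ends in a stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and does not encode a residue.
    return cds_bp // 3 - 1


def hits_outside_protein(hits, protein_len):
    """Return the hits whose coordinates do not fit within 1..protein_len."""
    return [h for h in hits if not (1 <= h[0] <= h[1] <= protein_len)]


if __name__ == "__main__":
    # 1710 bp = 570 codons = 569 residues plus one stop codon.
    print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
    print(hits_outside_protein(INTERPRO_HITS, PROTEIN_AA))
```

Under those assumptions, 1710 bp corresponds to 570 codons, that is 569 residues plus the stop, which matches the listed protein length, and all four InterPro ranges fall inside the 569 aa protein.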