Monarch geneset OGS2.0

DPOGS208624
TranscriptDPOGS208624-TA2658 bp
ProteinDPOGS208624-PA885 aa
Genomic positionDPSCF300052 + 1037500-1045157
RNAseq coverage1917x (Rank: top 6%)
Annotation
HeliconiusHMEL0158510.067.11% 
BombyxBGIBMGA005733-TA2e-12193.09% 
DrosophilaCG31716-PH2e-9750.13% 
EBI UniRef50UniRef50_E2BJE23e-9848.56%CCR4-NOT transcription complex subunit 4 n=6 Tax=Eumetazoa RepID=E2BJE2_HARSA
NCBI RefSeqXP_972337.24e-9974.88%PREDICTED: similar to AGAP009827-PA [Tribolium castaneum]
NCBI nr blastpgi|1892384028e-9874.88%PREDICTED: similar to AGAP009827-PA [Tribolium castaneum]
NCBI nr blastxgi|1954736513e-10730.81%GE18937 [Drosophila yakuba]
Group
Gene OntologyGO:00036766.3e-19nucleic acid binding
GO:00001664.5e-09nucleotide binding
KEGG pathwaytca:6610551e-98 
 K10643 (CNOT4, NOT4, MOT2)maps-> RNA degradation
InterPro domain[75-154] IPR0039546.3e-19RNA recognition motif domain, eukaryote
[72-158] IPR0126774.5e-09Nucleotide-binding, alpha-beta plait
[1-47] IPR0130836.1e-06Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL30368 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208624-TA
CAGATATGCAGGTTTTGTTGGAATCGAATCCGGGAAGGGGAGAACGGTCTATGTCCAGCCTGCAGGAAGGCCTACCCCGAAAACCCTGCAGACTTCACCCCGCTCAGTCAAGAGCAGGTGGCCGCTATAAAGACGGAGAAGAAGGCCAGGGAACAGAAGCGTCGCAACAAAACTTTGGAGTCACGACGAGCTCTGGCCAACGTGAGGGTTGTTCAGAACAATCTCGTGTTCGTTGTGGGACTTCCGGTCAGGCTGGCGGATCCAGAGATACTAAAACGGCAAGAGTACTTTGGGAAGTATGGAAAAATTCACAAAGTAGTTATAAATCAAAGTAGTTCATATGCCGGGTCACAGAGTCCATTGGCGTCCGCGTACGTAACTTACGTGTCGCCCGCGGACGCGTTACGTGCGATCCAGGGGGTGAATAACGTTACGTTGGACGGTCGGGTGTTGAAGGGCTCGCTAGGAACTACCAAGTATTGCGCTAACTTTATGAAAAACCAACCCTGCCCTAAGCCAGACTGCATGTATCTACACGAACTGGGCGATCCAAAGGCGTCGTTCACAAAGGAGGAGATGCATGCGGGGCTTCACCAAGTGTACGAGCGGCGGCTACACCAACAGTTACTACAGGCGCAGAGGGATCGTCCAGACGATAGGCACTATAGCGACGGAAATCAACCAAATTTTATACCAACCACAGTGGTCACATCGTCTCAAATAAATGTAGTCAGCACATCCAAATCAAAAAAGGAACCAATGAACGGCATCATAAACGGCAGCGGCAGCAAGGAGGCGTGGCCGAGTCTCGGGGCTTCCCCGCCGGCAGACAGCCCCGCAAGGAAACAAAACAGCCCAAAACCCACAGTAAAGTCACTAGAAAACGGCATATCAGAATCCTCGAGTCCAACACAACTACAAACACAACAGTCTCAAACGAAAAACGTCAAAGAAAGTAACACGGACAGCAAAAATAGAAATAGTAGCAACACAAACGGTCACACGAGCAAAAAGAATAAAGACAGCAAATCCAAAAAGGGCAATAGCAATACGAGTGAATCCAGCGACCAGGGAGCCGACGAAAAGACGGAAAACAACGCTATCAGTATACAAACACAGGCGAACTCTGTGTTCGAAAACGACACACAGAGCTATCTATCAAACGAATTGGACGAACTGGAATCAGATAGGCACAGCTTGTTGGAAAACCACGATCTATTGGACAGTTCGGATCACAGTCTACTAGAGGACGACAACACACACAATCTGCTCAACGACGCCAGTAACGAAGTCATGATGATGCAGAGGCATAGAGAAATGCTCGCGGGTCTAGTTGATAATAACCACTTACTGCTGAAAAGTCTACCTATGAACGGTTACGGCTTGTCAGATAGAGATATTTACCTAGCTAGTGATAGACAGATTAGTCAAATTAATCATGGTATCATGGAAAAAGAATTGATGCTCAGAGACCGCAATAGAAGCAGTCTTCTGATGAGGCAGAATGAAATGATGTCGAGACATAGCATGGAACCTAAACATTATATGCCCACATCCAGTATAGGACTCGAATCAGCTAGTGAATTATTAGGTGCTAATTTTCATCCAAATCACAGTATGCTACCCCCGTTCGGTATGCGTGTAGACAACTCAATGGGCGCTAGCTCTTTAGGAAATACAATGACAACACTATCAGCGAACACATTAAACAATCCAATAACAAATTCACTATCAAATACTTTGGCGAGTTCCATGGCAACGAATTCGATGGCCGCGAACTCTATGGGCAGCTCTATGGGTTTGTCTATGGGAAGTTCCATGGGCAAGTCCATGGGGGGATTCATGCTGGCCGCCCAACAGTTACCGCAGATTCAGAACCAGGGACAGATGCAGAGCGGCCTCGTCAACGGTTTCGATTCCACACAAACTACTAGCGAGGGTCGCTACCAGGCTGAGACTATGGACAAATTCTTTACGGACTTCCACAAGGCACAGCAGATGCGGCAGATGAGAGACGAGCGGAGGGAGCCACCACCACACGCCCTTAGCGCTGAGAGGCTTGAGATGGAGCAAAAGCACAGGATGAACAACATGAGATCCAGTGAGACACTGAGCGGCCGGGCTGCCGGGGACGGTGATGATGATTTAGACTTTGATCCCTTCAAGGAGACGCAGAAAGCACTCGCTGAGATGATGGAGACCGAACTCATGTTGAACTCTATATCCAGCGGAGACAATATGGAGCGTGTCCGTCGCTCGCGACTCCCTCCACCAGGATTCAGTCACGTTAATACATTTGGTATCGGCGTACCGCGACACCAGACACACCATCAAGGCTACAGTTCCAACAACGCCATGTTCTCAGACTGGACACAAATGGACCCCGCTATAATGTCGACATCCGTTAACTTTGGCAAGAGTGCAACAAACGCCCCCGCCGGTTCTAGCGCAGCATTATCTCAACAACAGCAGGAGCTATTCGCGCGGTTCAACCAGCTCCAGGTGGCCGCGAATGCTCCTAACGGCGTTAAGCAGTCACAGCTCAACCTGAACTGGGCACCACCAAAACTTGGCTGGGGTCACTCTGTACCCCTCCCGCCCGGCTTCGCTCCGCCCAAGCCGTCCCAACACCCCGCTGAATGCATCGACGCCAAGTAG

Protein sequence:

>DPOGS208624-PA
QICRFCWNRIREGENGLCPACRKAYPENPADFTPLSQEQVAAIKTEKKAREQKRRNKTLESRRALANVRVVQNNLVFVVGLPVRLADPEILKRQEYFGKYGKIHKVVINQSSSYAGSQSPLASAYVTYVSPADALRAIQGVNNVTLDGRVLKGSLGTTKYCANFMKNQPCPKPDCMYLHELGDPKASFTKEEMHAGLHQVYERRLHQQLLQAQRDRPDDRHYSDGNQPNFIPTTVVTSSQINVVSTSKSKKEPMNGIINGSGSKEAWPSLGASPPADSPARKQNSPKPTVKSLENGISESSSPTQLQTQQSQTKNVKESNTDSKNRNSSNTNGHTSKKNKDSKSKKGNSNTSESSDQGADEKTENNAISIQTQANSVFENDTQSYLSNELDELESDRHSLLENHDLLDSSDHSLLEDDNTHNLLNDASNEVMMMQRHREMLAGLVDNNHLLLKSLPMNGYGLSDRDIYLASDRQISQINHGIMEKELMLRDRNRSSLLMRQNEMMSRHSMEPKHYMPTSSIGLESASELLGANFHPNHSMLPPFGMRVDNSMGASSLGNTMTTLSANTLNNPITNSLSNTLASSMATNSMAANSMGSSMGLSMGSSMGKSMGGFMLAAQQLPQIQNQGQMQSGLVNGFDSTQTTSEGRYQAETMDKFFTDFHKAQQMRQMRDERREPPPHALSAERLEMEQKHRMNNMRSSETLSGRAAGDGDDDLDFDPFKETQKALAEMMETELMLNSISSGDNMERVRRSRLPPPGFSHVNTFGIGVPRHQTHHQGYSSNNAMFSDWTQMDPAIMSTSVNFGKSATNAPAGSSAALSQQQQELFARFNQLQVAANAPNGVKQSQLNLNWAPPKLGWGHSVPLPPGFAPPKPSQHPAECIDAK-