Monarch geneset OGS2.0

DPOGS208376
TranscriptDPOGS208376-TA7155 bp
ProteinDPOGS208376-PA2384 aa
Genomic positionDPSCF300146 + 157925-171312
RNAseq coverage775x (Rank: top 17%)
Annotation
HeliconiusHMEL0072370.086.64% 
BombyxBGIBMGA012230-TA0.083.75% 
DrosophilaNot1-PG0.049.13% 
EBI UniRef50UniRef50_F4W6V50.059.74%CCR4-NOT transcription complex subunit 1 n=14 Tax=Endopterygota RepID=F4W6V5_ACREC
NCBI RefSeqXP_395830.20.060.78%PREDICTED: similar to CCR4-NOT transcription complex, subunit 1 isoform a [Apis mellifera]
NCBI nr blastpgi|3838584230.061.13%PREDICTED: CCR4-NOT transcription complex subunit 1 isoform 1 [Megachile rotundata]
NCBI nr blastxgi|3838584230.061.29%PREDICTED: CCR4-NOT transcription complex subunit 1 isoform 1 [Megachile rotundata]
Group
KEGG pathwayame:4123710.0 
 K12604 (CNOT1, NOT1)maps-> RNA degradation
InterPro domain[2010-2369] IPR0071961.2e-140CCR4-Not complex component, Not1
Orthology groupMCL12100 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208376-TA
ATGAACCTGGACCCTCTAACGTTTAGTTTATCACAAATAAATTATCTTGTTGTAAATTTAAATAAGAAGAACTTTAAACAGACAAGTCAAGAGTTGTCCCAGATTGTCAGTCTCTATGGCTTGGAAGCAGAGAATCAACTGCTGAGGTGTCTGCTCAGTGAGGCGGCTAAAACGTGGGACGAGCGTACGGCATCCAGCGTGCATGCGTCACTCCTCGCCCAGCACTTGGCCTGTCTACTCAACCACCCCGCAAAGTCTACGGTGATCTGCCAAGCTGTTGATCAACCCACACGCTCACTGCAAAAGGTTTTAAAACCAACGAATTCACTACTTAGTCGACTAGCAAGGTTGCTGAAATTTACGACAGCCCAAGATGTTGCCTTCACATTGGTGCTGAGAAGAAATTCTTCGAAACCTGACATTGTTTCTCTTGCCAAACAGCATTTAAAGAAAAGATTTTTGGACTTTGTCCAGTGTTATCTTGACGCAGAGCGTGGTCATCAAGTTGAGAGAGCGGGACTTCAGGAATGCAGCCCTGAAGTTTTGCAAACTCTTCTAACCAGTCTCGCCTACGAGAACTTCCGGCTTGCAGCCGTCACGAAGGATTTGTTTTTGAAACGTCTGCGTATAGACTTTCCCCGCGAGGTTGTTCCTATTGTACTTGCGCCCCTGCTGTACCCCGACGACACACAAACTCCTTTGGAGGAGATGACAACGTCTGATGATATGACAGCAGCAATGATGGATAATACTTTAGCGGAAATCATTCGAGACATCGGCTATGCCTTCACGGCTTCCGTCGAAGACTGTAAAAATAATATGGTCAATTTTGGTGCCAGGGAGCCCACAGCAATTGACGTTGCCAGAATCATATCTACTATGATCAAGTATCATGCAACTATACAAGAAGCTCCACATGTCCAAACTCCAGGGAATTTCTGGATGAATCATGAGGCTAAAAAGGAGGCCATGGCCCATGGGCACGTCGGAGAAACGTGGAACCCAGAGGTATTTGTCCAGACACTCAAAGAACTTGCTTCAAATTTAAATTGGAAAGAAGTCATTCTACAATTAGATCATCCAGAATTCATTGTTCCCGACAGACAGGGTTTGAGCCTACTATTTACTATTTTGCGCCTAGGTCTCCAGAGCGCTGGATATCCTGCAAATATATTCCCCGTTGAATACCTTTGTCGTCGTTGGGCGAATTTGGAAGGTCAAATGAGTTTATTAACGAACATACTCAAACATCCGGATATATTCAGCTTTGCCGATCATCCTTTCCATCCAGTATCGATAGATCTGCTGAAATCGCCACCGGAAACAGATAACAAAGAAGTATCTACTTGGCGATGTCTATATTTGGTAGAGCTATTATTATATGCTTCAGAACGCGGCTATTATCTGCAAGTACACGAGCTATTTAAATATCCACTACAGCACTGTCCTGACATACTATTGTTGGCGTTGTTACAAATTAGTCCACCTATAACGGTGTTTAGACAAGAATTATTAACAACACTCATTCCTATATTTTTGGGCAACCATCCAAACTCGGGCATAGTTTTACAACATGCATGGCATTCACAAAATCCCAATATCAAGCCCATAATCATGCACGCAATGGCAGATTGGTACATACGAGGAGAATGTGATCAGTCGAAGCTATCAAGAATATTGGACGTTGCGCAGGACCTTAAGGCTCTATCCTTATTGTTGAACGTCCAATCTTTTCCATTTATAATCGATCTAGCTTGCCTAGCATCGCGTAGAGAGTACCTCAAACTTGACAAGTGGCTAACAGACAAAATACGTGATCATGGAGAGACATTTGTTACAGCCATGGTTAAATTTCTGCAACGACGATGTCCCCAAATAATTGGGAAAATACCGGAAGACCAATTGCCCAAAGCGGCACAGTTACCGCCGGAGACTGTGGGCACAATGTTGGCCTGTCTACAGTTGTGTATCCCCAATGTTCAGCAAGAATTACAGGAGGCGATATACAACTTAATGGCTAGCTGTCAAGCCTTAATTCTTACTAAAGCTAGACCGGGTATACCCGGTATTGCAAGACCTCATACACGGATTTTAGAAACTCCATTCAATCCTGCTGGCTTGGGCCCTCAGCTTTTTACTCCCCATGTTGACGCTATTGCAAATTTAGCACCGAATGTCGCCAATATGACATTGGGAGCACCGGCAAATACAGCTTTTGCAATGCCAGGTACTCTCGGACCATTAGTCGCAGCTCCAGGATCACCATCTCGTCTCCTAGGAGCTGGACCTAATAGTCCCTTTGCTATGATGCCCATGCAGCAACATGTCGCCAATGTGGCAAATATGGGAGCATTAGCCCGAATGCCTCCAACACCTATGGACAAGCCACGATTGCCAGATCCTATACATTTACCAGAAATGATTCACAATGTGTCCAAAGAAATAGAAGACGAAGCCAATGGTTATTTTCAAAGGATTTACAATCATCCTCCTCATCCTACATTATCAATAGATGAGGTACTAGAAATGCTTAAGAAATTCCAAGATTCGCCCAACAAAAGGGAACGTGATGTGTTCTCTTGTATGCTCCGGAACCTATTCGAAGAGTATAAATTTTTCCCACAATACCCCGATAAAGAGTTGCATATTACAGCGCAGTTATTCGGTGGTATCATTGAGAAGGGATTAGTTCCTAGTTATGTGTCACTAGGGCTGGCTCTAAGATTCGTCCTAGATGCTTTACGAAAGCCGGAGGGCTCTAAAATGTATTACTTTGGCATAGCGGCTTTAGATAGATTTAAGTCGCGATTAAAAGATTACCATAAATATTGCGAACACGTAAGAGCCATACCGCATTTTAATGAGTTCCCTCCACACTTAATCGAATACATTGAGTACGGTCTCCAGAGCCAAGAGCCGCCCACTAAACCACAGGGGGCAGTTTTACCTACGAGTCTAACCGCCATCTTGAATCAGACCGCCGTTATAACAGTTTCAGCACCTTACAGGGCAGTAATTTGCGCTCCCAGTGCCATCTCTGTCATCTCGAAAGTGTCAAATTGTATTGCGGGCGGTATAGGAAGTCGGCCGTCAATAGCCAACGCCACCAACATTGATACACTACTGACTGCTACCGACAGGGAAGAGAAGATAAACGCACCACCAGAGGCTATTCAAGATAAAACTGCTTTCATATTCAATAATCTTAGTCAATTGAACTTACAACCCAAATGTGAAGAGTTAAAAGAAATTATAACAGAAGAATATTTCCCATGGCTATCACAGTACCTAGTGATGAAAAGGGCGTCCATAGAACTAAATTTCCACGCTCTGTACTCAAATTTCCTAGACGTCCTAAAAATTCGTGAAATAAACAGGTTAGTTACTAAAGAAACTTATCGGAACATCAGAGTATTGTTGCGATCTGATAAAGGCATAGCTAACTTTTCTGATCGATCGTTACTCAAAAACCTCGGCCATTGGTTAGGCATGCTCACCTTAGCTCGCAATCAACCGATCCTCTACATCGACCTCGACCTCAAAGCACTCTTACTTGAAGCTTATCACAAAGGCCAGCAGGAGCTGTTATATGTCGTGCCGTTTGTTGCGAAGGTCTTGGAATCCTGCGCCAAGAACGTCGTATTTAAACCGCCGAACCCTTGGACAATGGCCCTAATGAACGTATTGGCTGAATTACATCAAGAACCAGACTTAAAATTAAATCTGAAGTTTGAAATAGAAGTGCTTTGTAAAAACTTGAGTTTAGACATAGCCGATCTTAAGCCATCTCTGTACCTGAAGGACCCAGAGAAAGTGAGGACGATAGAGTTCCAGCTCTCACAACCGAAACCGGTCAAAGAAACCCCCAACGTGATGCCAGTGAATCAGACATTAGTTCCGGCACCACAAATACAATTGATGCCACCACAGCCTCAGATGATACCCGTCGAAGATATGTCAGCTGCCGCGCCCACGCCCACCGCTGGGCTGGTCGCCAATGATCCAAACCTCATGGGCGTCCTAGGTTTGCCAGAGCCACGGTTCAACTACCTCGACGTCAACGTCTCATCCACCTCGGCCTTCGGACAGAAAATATGTTTCAATCCGCATATCATTCTGTTCCAAAACTACCCACACTTGAAACAATTTGTGAAACCTGCTATAGAAAGGTCGATTCAAGAATGGATACATCCAGTCGTCGATAGGTCCATCAAGTACGCTCTGACGACTTGTGAGCAGATAATAAGGAAAGACTTCTCCTTCGACCCCGACGAAGTACGTATGCGCACTTGCGCTCATCACATGATGAGGAATTTAACGGCCGGCATGGCTATGATAACCTGTCGGGAGCAGATCATCAGCACCATTAGCACAAACCTTAAGGCGGCGTTCATCACGGCTTTGATACCGACCACGCCGCAACAGAAGGATATCATAGAGAGTGCCGCAGCGGTGCTTGCTACTGAGAACATGGAACTTGCTTGTGCTTTCATCCAGAAGACAGCCGTTGAGAAGGCGCTCCCGGAACTCGACAAACGACTGATGAACGATTACGAAATGCGTAAAATTGCTCGGCAAGAGGGCAGGAGATACTACGATCCCATTGTCTTGACGTATCAGACAGAGAGGATACCGGAACGAGTCCGCCTACGCGTCGGAGGTCCAACGGACTTGCAGATCTCTGTCTACGAGGAGTTCGCGTGCAACATTCCAGGATTCATGCCTGTGAGAGACGCTGGAATGTTCATACCGAAACCGTCCGCCCAAGAACAAGTACCACAGATGACGTTTAATCAAGTAATGAATCCGCAACAGGTATATGGAACGGATGAGATGGGTACACTGATATCAGCTGCGGAGTTGTTCCTCAGCAACGCCCTGTCTGTTCCCTCGTTCGCGGTGCAAGCGACAAACATGCATACTTTACTCGAATGCCTCATCATCGCCAGACGGAATCGTGATATCGTTTCGGGCTACACTCTCCTACAACGAGCTGTTGAGGGTCTCCTAGATGGTCACATTGTACAGCCGGGCACGAACCCAGAACACGCTGAAATGATGACCCGTTATCGTGATATCCACCTGCGAGTACTGAAGCTGTTAGAAGACGCGAGGGTGTACGGCCACGCGTGGACAACTAAACAGATCACATACTGCGTATCCGAATGTAGGGATGAACTGAGATACAACCTGGAAGCTATCGACTGTCTCGTAAGGAACCACCTGATCAACATGCCACAGTACGATCTTGCGTTGGCACATTTGATGGACAACGGCAACAACTACGTCGCCGTGGCTTTCGCGATGCAAGTGGTTCAGTTATACCTTGTGGATGACAGGAACAACGTGTACGCAACGGAATCAGACCTCTACCACACTACTGACACCCTCGTTAGGATGATGTCACACTCGCGGCAGCCGCCGCCAGAGGGTCTTGCCACATTGATTGAAACTATCCGCATCAACCAGGACCCCAGCACATATCTTGGTGAACGTTCACCTCTTGGACCCACCGCTCACATTCACAATGGCATTTTGCAAGTGCGGGCCCGCGACTACGAGGATCCACCCGGTCTCCAAGAGAAGACGGAAAATCTGCTCCGCGAATGGAGGAACGTGCTCCTCAGTCCACTCACTGAAATAGAGATCGGACAGAACTTCAATATATACGTGCACAGGATGAACATGAATGGTATACTGAAATCTGATGACATGATCACACGTTTCTTCCGCATAGCCACTCAGATGTGCGTCGAGAATGTATACCAGCTGTTGAACGAGGACAGGATGAATCCTCCCCCCGTGCCGCCCAAGAGGGACAAGTATTACGCTATGTGCGACTCATTCATCAAGCTTGTGTCGCTGCTGATTAAGAATACGGCTGACGGAGGAAATCCAACACCGAAATTGAACTTATTGAACAAGATCCTGGGTATAATCGCGGGCTGTCTGCTGCAAGACCACGAGGAGCACGGCTCGAATTTCCAGCAGCTGCCGTACCACCGTCTCCTGCTGATACTGTTCCTAGACATGAACATGGCCGAACCCGTCCTTGAATCTATGAACTACCAGGTTCTAACAGCATTCTGCCACACCCTCCGCATCATACGCCCGAGTGTAGCTCCAGGGTTTTGTTACGCGTGGCTTGAAATAGTCGCCCACCGAGCATTCGTGAATCGTGTTCTGGCTGTGACGCCGCAACAGAAGGGTTGGGGGATGTATTCGACGCTGCTTATCGACCTTTTCAAGTTCCTCGATCCGTTCTTACGTAACACGGAGCTGGCGACGCCAGTCATGATGCTGTACAAGGGAACACTTAAAGTGTTGCTAGTATTGCTCCACGACTTTCCCGAGTTTTTGTGTGACTATCACTATGGCTTTTGCGATGAGATCCCACCGAATTGCATACAGATGAGGAATCTCATTCTGTCCGCGTTCCCGAGGAACATGCGTCTGCCGGATCCATTCACACCCAACTTGAAGGTGGATCTGTTGGCCGAGATCACTCTACCACCGCGTGCCGTTATCAACTACGCCAATATAATACCGGCGTCGCAGTTCAAAAAGGATCTGGACGCGTATATCAAGGCCAGGGCTCCGGTTACATTCCTATCGGAACTGCGCAGTAACATGCAGGTGGTGAACGAGCCAGGTCGCAGGTACAACAGCCAGCTGATGAACGCGGTGGTGCTTTACGTCGGGACGCAGGCGATCGCTTACATCCGTGCCAAAGGGCAGACGCCGAACATGTCGACGATAGCACATTCAGCTCACATGGATATATTCCAGAATTTCACCGTAGACTTTGACTATGAGGGCCGGTATCTATTCTTGAACGCTATCGCGAATCAGCTCCGTTATCCGAACAGTCACACGCACTACTTCAGTTGCTGCCTGCTGTATCTGTTCGCCGAGGCTAACACGGAGGCTGTTCAGGAACAGATAACGAGGATGCTCCTAGAAAGGTTGATAGTAAACCGACCACATCCCTGGGGGCTCCTCATCACATTCATCGAACTCATCAAAAATCCTATATATAAGTTCTGGACACACGAATTCGTACATTGCGCGCCCGAGATCGAAAAGTTGTTCGCGTCGGTCGCCCGCTCGTGCATCGCGGACAAGGCTGGGGGGGAAAGGGATATGACCGAGTAG

Protein sequence:

>DPOGS208376-PA
MNLDPLTFSLSQINYLVVNLNKKNFKQTSQELSQIVSLYGLEAENQLLRCLLSEAAKTWDERTASSVHASLLAQHLACLLNHPAKSTVICQAVDQPTRSLQKVLKPTNSLLSRLARLLKFTTAQDVAFTLVLRRNSSKPDIVSLAKQHLKKRFLDFVQCYLDAERGHQVERAGLQECSPEVLQTLLTSLAYENFRLAAVTKDLFLKRLRIDFPREVVPIVLAPLLYPDDTQTPLEEMTTSDDMTAAMMDNTLAEIIRDIGYAFTASVEDCKNNMVNFGAREPTAIDVARIISTMIKYHATIQEAPHVQTPGNFWMNHEAKKEAMAHGHVGETWNPEVFVQTLKELASNLNWKEVILQLDHPEFIVPDRQGLSLLFTILRLGLQSAGYPANIFPVEYLCRRWANLEGQMSLLTNILKHPDIFSFADHPFHPVSIDLLKSPPETDNKEVSTWRCLYLVELLLYASERGYYLQVHELFKYPLQHCPDILLLALLQISPPITVFRQELLTTLIPIFLGNHPNSGIVLQHAWHSQNPNIKPIIMHAMADWYIRGECDQSKLSRILDVAQDLKALSLLLNVQSFPFIIDLACLASRREYLKLDKWLTDKIRDHGETFVTAMVKFLQRRCPQIIGKIPEDQLPKAAQLPPETVGTMLACLQLCIPNVQQELQEAIYNLMASCQALILTKARPGIPGIARPHTRILETPFNPAGLGPQLFTPHVDAIANLAPNVANMTLGAPANTAFAMPGTLGPLVAAPGSPSRLLGAGPNSPFAMMPMQQHVANVANMGALARMPPTPMDKPRLPDPIHLPEMIHNVSKEIEDEANGYFQRIYNHPPHPTLSIDEVLEMLKKFQDSPNKRERDVFSCMLRNLFEEYKFFPQYPDKELHITAQLFGGIIEKGLVPSYVSLGLALRFVLDALRKPEGSKMYYFGIAALDRFKSRLKDYHKYCEHVRAIPHFNEFPPHLIEYIEYGLQSQEPPTKPQGAVLPTSLTAILNQTAVITVSAPYRAVICAPSAISVISKVSNCIAGGIGSRPSIANATNIDTLLTATDREEKINAPPEAIQDKTAFIFNNLSQLNLQPKCEELKEIITEEYFPWLSQYLVMKRASIELNFHALYSNFLDVLKIREINRLVTKETYRNIRVLLRSDKGIANFSDRSLLKNLGHWLGMLTLARNQPILYIDLDLKALLLEAYHKGQQELLYVVPFVAKVLESCAKNVVFKPPNPWTMALMNVLAELHQEPDLKLNLKFEIEVLCKNLSLDIADLKPSLYLKDPEKVRTIEFQLSQPKPVKETPNVMPVNQTLVPAPQIQLMPPQPQMIPVEDMSAAAPTPTAGLVANDPNLMGVLGLPEPRFNYLDVNVSSTSAFGQKICFNPHIILFQNYPHLKQFVKPAIERSIQEWIHPVVDRSIKYALTTCEQIIRKDFSFDPDEVRMRTCAHHMMRNLTAGMAMITCREQIISTISTNLKAAFITALIPTTPQQKDIIESAAAVLATENMELACAFIQKTAVEKALPELDKRLMNDYEMRKIARQEGRRYYDPIVLTYQTERIPERVRLRVGGPTDLQISVYEEFACNIPGFMPVRDAGMFIPKPSAQEQVPQMTFNQVMNPQQVYGTDEMGTLISAAELFLSNALSVPSFAVQATNMHTLLECLIIARRNRDIVSGYTLLQRAVEGLLDGHIVQPGTNPEHAEMMTRYRDIHLRVLKLLEDARVYGHAWTTKQITYCVSECRDELRYNLEAIDCLVRNHLINMPQYDLALAHLMDNGNNYVAVAFAMQVVQLYLVDDRNNVYATESDLYHTTDTLVRMMSHSRQPPPEGLATLIETIRINQDPSTYLGERSPLGPTAHIHNGILQVRARDYEDPPGLQEKTENLLREWRNVLLSPLTEIEIGQNFNIYVHRMNMNGILKSDDMITRFFRIATQMCVENVYQLLNEDRMNPPPVPPKRDKYYAMCDSFIKLVSLLIKNTADGGNPTPKLNLLNKILGIIAGCLLQDHEEHGSNFQQLPYHRLLLILFLDMNMAEPVLESMNYQVLTAFCHTLRIIRPSVAPGFCYAWLEIVAHRAFVNRVLAVTPQQKGWGMYSTLLIDLFKFLDPFLRNTELATPVMMLYKGTLKVLLVLLHDFPEFLCDYHYGFCDEIPPNCIQMRNLILSAFPRNMRLPDPFTPNLKVDLLAEITLPPRAVINYANIIPASQFKKDLDAYIKARAPVTFLSELRSNMQVVNEPGRRYNSQLMNAVVLYVGTQAIAYIRAKGQTPNMSTIAHSAHMDIFQNFTVDFDYEGRYLFLNAIANQLRYPNSHTHYFSCCLLYLFAEANTEAVQEQITRMLLERLIVNRPHPWGLLITFIELIKNPIYKFWTHEFVHCAPEIEKLFASVARSCIADKAGGERDMTE-