Monarch geneset OGS2.0

DPOGS208176
Transcript: DPOGS208176-TA  4974 bp
Protein: DPOGS208176-PA  1657 aa
Genomic position: DPSCF300207 - 127473-143125
RNAseq coverage: 550x (Rank: top 23%)
Annotation
Heliconius: HMEL006651  0.0  62.08%
Bombyx: BGIBMGA010263-TA  0.0  53.79%
Drosophila: cic-PD  8e-51  52.97%
EBI UniRef50: UniRef50_E2ADM3  3e-100  43.32%  Putative transcription factor capicua n=6 Tax=Formicidae RepID=E2ADM3_CAMFO
NCBI RefSeq: XP_002423691.1  7e-99  41.18%  capicua protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|307180359  1e-99  43.32%  Putative transcription factor capicua [Camponotus floridanus]
NCBI nr blastx: gi|242005681  0.0  34.83%  capicua protein, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0005515  1.1e-22  protein binding
               GO:0003677  2.6e-22  DNA binding
KEGG pathway:
InterPro domain: [919-1011]  IPR009071  1.1e-22  High mobility group, superfamily
                 [938-1017]  IPR000910  2.6e-22  High mobility group, HMG1/HMG2
Orthology group: MCL15552  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208176-TA
ATGCAGCGGCAGGCGGGCGGTGCGTCCTCGGCGCAGAGCCCGCGAGCCGTGTACCGCACGCACGCGTCGCACATATACGACATTGATGATCAGATACCAGCGTCTGTTATCAGTAGTGTGGGTACAGTGAGCCTGTCACAATCGACAACAAACCCAACGTCGAACACAACCATGGCCAGTCGCGGGAATACGATATCTAATAACAACACACCCCAAACACAAGCACCAGTACGGAACCTTCCCAAGAAGCGTAAATTTGATCCGTCTGAACTCGAGGAGATAGAACGTAATTGCGTCAACAGTATCGCTGAGAGGAACAGCCTGAATATACCCACAGCTGTCACGAATTCAATGGATTACACGTCAAGCTATCAGCCAATAGCTCAGCCATCCGTAGTACCGAGATCTTCCCCCCACGATACGAAACAATACATCCAATACCCTAACATAGATCTATCTGAGTGGCGTGATCACCGAGTGCTGGCTAAACATCGCGGGTTATACCTCCCAGGGGTGATAAGGCAGGCTGACGGCTGTAAGGTCACCGTGGAATTGGATGGACAGGATATTGAACCGATAGAGTACAGTGATTTGTTCGGTGTCAATAGATATGACGTCATAAGTGACGCGAGTCCTCAGCTCAGTCATTTACCGGTGGGGTCCGCCTGCGTGTTCAAGACCACAGACCCCGCCAGAGATGGAGGGCACAACGTGTTCGTCGAGGGTCAAGTCTTCGAAGTTAATAATTCTCCTATCAGGATTCGTGTTAAGGTTATAGAGGGCGACACTTGTAAGGAGGTGGTAGAGGTGAAGCGTGCTGATATCCGTCTTCGTCAACCGCCTTGGGCCGACGAGCTGGAGGACGCCGGGTCACACGCACCCGCTGTCCCTCATATGAGACAGCAGTGCGTCTCTTACTCTATGGGTGATCACTTCGCGACGTCGTCTCCTATGCCGGGCGCGTCTCACGTGTCTGTGGGGGCGCTGTCTGCTGGGTCGCGTCCCTTCGACGACTACGGCAGCGACGACGACCTGCCTAGCGAGAACATTATGTTCCCCACTGACGCCTCGCATATGGATTGTAATAACAGTAAGAGGAGCAGTCTACAGAGCAGAGGCAGCACATCCAGTCTGGTCGAGGGTAGCCTGACGCCTCGCTCCCAGCCACCTACACCTAGATCTCAAGCCGCCACACCTCATAAGTACAAGAAGGGTGACGTGGTGTCCACTCCCACTGGGATAAGGAAAAAGTTTAACGGGAAACAGTGGCGAAGGCTGTGCTCCAAGAACGGGTGCGGCAAGGAGAGTCAGAGGCGAGGGTACTGTTCACGACATCTCTCGCAGGGAGGCGTCAACAGGTCATCCAACACGCCGCTAACCCAGGGATCCACACATACTCCGCAGCAGAGAAGCAGCAGTAAGTCGTTATCATCGAACGGTACTGGTGTAGAGGGAGATGATACGTCCCGCGAGTCGGACACCACGCCGCCCAACTACCGCGTCACCGGCAGGTTCGACCAGGACGAGACAGAGGCCGCCAATATGCTTGTGTCACTCGGTAGTTCTCGTTCAGGCAGTCCGGGCGCGTCTCCAGTGAGCGGGTCGCCGGTGCTCCGCGGTAACGTGTTCGTGCCGATATCATCGCCACAGCCTCCGCTCAATAATCCTCCGCACAAGAACTACCATCACCACCTTATCAGACCTGAGTCACTACGTCCAGCCATAGTGAGTCCACCGGTGGGGGGCGTGGCCACTAGTGTCATAAGAGTCTCCCCCGCCCCCACCCATCACTATCAGGTAGATAATCGCAACGGACAAAATATTCAATCGAGCCAACCGAATATGATGGGACTACAAACGACGCCATACAACATACAGAGCAACATGCCATCCAATCTGAACGCGCCCACGACGATGCAATCGTCATTAAACTTCCCTACCATTATAAACAATCTGAATCAAAAATTTCAAACGTACGCCAGTTCAGTGCGACCGACCAA
AATAGAGGATTCGTTACACAACGTGGTCGTCCACCGCATGCCCAGCAACGGCACGGACTCGGACTACAGGAACAAAGCCTACCGCAGGAACGGTATACAGGAGCAGTTCAGGCGAGACGCGGATATGTCACCGCCTTTGAATAACTATGAAAATTTTCTGAATAGAAGAGTCTCGGATTACGATGAGGAGGACCACTCGGTCCCTCAGCCAGATAGCGGCCACTTAGAACTGTCCGAGGCTCGCTTGATAGACGACAAGAGGATCGTCAAGCCGGCGCCGCTACCTGGCCGGTACATCTCACTGGTCGACGACACCAAGGACACGCTGCGGAAACTGTACGTCATACCGCAGAACACCATCGACAAGAAGATAGTACTCATCAAGAACGAACCCACAGACATACAGATAGAACACAAGCCGCAGTCGCAGCAGCTGAACAGCAGCGACCAGGACATGGAGCATCGCAGCACGGACAACGGGGACACTGGTAATAAGCTCAACAACAGCGCCGTTATAGTACATCCAAGTCAACTACTGCCGGTGTTGCCGCCGCCTTCCTCGGCTATTATAGTGTCATCCAGCGGTGTGCCCAGCGTGTTCTCTTGGCAGTCACTGGTGCCTCTGCTAAGAGCGGCGTCTCCCCCGGCAGTGCCGCACTCGCCACGCACGCCACACACGCCACACACGCCCCACACGCCGCACACGCCGCACACGCCACATACACCACACACGCCACACGTCAAGACAGAGGATATCAAGACTGAGAATGAGTTATATGTCATAATGTATTATCTTCGGCAGAAGGAGCGTCGCATCCGCCGGCCGATGAACGCTTTCATGATATTTTCCAAGCGCCACCGCCAGATAGTCCACCAGCTGCACCCCAACCAGGACAACAGGACCGTCAGCAAGATACTGGGAGAGTGGTGGTACTCGCTCAAGCCCGACGAGAAGAAGAAATACAACGAACTGGCCAGCGAGGTGAAAGAGGCTCACTTCAAAGCGCATCCGGAGTGGAAGTGGTGTAATAAGGATCGCCGCAAGTCATCGAGCAGCAGAGATCCTACGGGCTCTACGCCGCAGAGTCCTCGAACTCCATCCGAGGGGCCAAATCCCATGATGGCCAGTGCGGACATGTCTGTGAACTCACAGACATACACACACATCGGCTCGCCGCAGCTCAGCGACGACGAGCCTATGCAAATTAGTCAAACAGTAGAAGAACCGTCGGCGCCGGCGCAGAACATCGAGATCGATCTCAAATGTGGCGAGAAAGTGACGGACTCGGACTCCGAGGGGATCGACGCGAGAGAGTATCTCACGCATCATGACACGAGGCGGCCAAAACCTATTAAAGCTAGGGCGGGATCGTCTGATAATCTGTTGGGTATAACAGCGTCCAGCCCGGGAGGCTTCAAGGTGTTCCAGCCGACGGGAGGAGCGTTTAAATCAACGCATGCTGATAGCGGTGATAACCATAGACAATGGACGGCGTTTACATCGGTAAATAAACCGAACATCAATCAGGATCTGAATTCGCCTCACCCTAACACTCAGAGCCTAACGAACAGCGTTCAGGGTATATCGATAAGCGCTCCGAATCTGTCGACACAGGCGGCCCTAGACAACGCGATCGCATCGATAATAAGTCCCACCACTAGTGGTGTGCAAGTTATATCCAGTGGTATATCGATGCCGCATACTATCTCCCAGTCGCAGGCTCCAACGTCCACCACTACAGCCCTGACGAATACTTTGTTGAAGAGTGTCACATTGGTGAAACGAAATATTGGAGACAATACTGCGGTTCCAATAACCCTGTCAGTTGATACATCCGGCAACATAGTTATAAAGGCGAGTCAAGCGAGCGACTCCCCCGCTACCAGCGACTCTCAGCCTCTACATTACGTACAATTACAGAGACTATATGTGTCATCGGTCAATACTGCAGAATCGGAACCAGCTAAGACACCCGTCTCGAACCCTCAAACCGGTCCAT
CTGTTATAGTGTCACAAAGTAACAACCACATATCACCCAGTAACGCAACAATGGAACCGATGGAGACCTGGGATACTCCGATGTATGAGGCCCGGCCATTCCCTCTTGCACCCACACCAGCGCAATTGGGACGGGCACCACTACAGAAGAGACTCAGTAGAGGTACGTCAACTGGTTCGACTGGTAGCAACGAGGCTACGATCCCTCGGTCGGAGAGCGGGCCCACCACGCCGTTGGACGTCGGCGAGGTGGGCGTACACTCACCCAAGAAAGAAAACCTGCCCAGTCCATCGCTGAAGAAAAGCCTCTTCAAGAAAGGCAACGAGGATGGAAGGGACAAAGTTCTAGAGACGGTGAACTTCTCAGAGAAGTTCAATACGTTGCCTCAGTTCAAACCGGAAGCGTGCAGTCCCAGTGCGATGGCGGTGCCGCGCTCACCGCAGCTCTACCTTAGAAAGAAACACCACAAAATCAGTATGGAGGAGGATCAGACGGTGGTGACGCCGCAGATTGAAAACGAAATCATGAATGGTAACGGTATGCCGACACCACACTCATACGGAACACCTCACTCTACCACCAAGCTAGTTGGTACCACCTTCTTCGGACCTGACTTCAATCCTGAGAATTTTAGAGTGCCATGTTCGGAGGCTTCAGAGGAGATGTCTCCCCGCACACCCTGTTCGGCTCGCGGCGAGGCTGGTCACCGGCGGGTGTTGGAGCAGAGACGACATCTGGTGATGAAGCTGTTCCACGACCACGGCATGTTCCCCTCCACACAGGCCACTACACACTTCCAGGCTGCTCATGCCGATATCTTCCCCAGCAAGGGCTCCCTGCAGCTGAAGATCCGTGAAGTCCGTCAGAAACTGATGGCTCAGTCCAACCTCACACCGCACTCCGATCTCAACACTCCCACTAATGTGAACTCCCCTATAGTATCGTCATTGCTACCGACCTCTACAGCCAGTTAG

Protein sequence:

>DPOGS208176-PA
MQRQAGGASSAQSPRAVYRTHASHIYDIDDQIPASVISSVGTVSLSQSTTNPTSNTTMASRGNTISNNNTPQTQAPVRNLPKKRKFDPSELEEIERNCVNSIAERNSLNIPTAVTNSMDYTSSYQPIAQPSVVPRSSPHDTKQYIQYPNIDLSEWRDHRVLAKHRGLYLPGVIRQADGCKVTVELDGQDIEPIEYSDLFGVNRYDVISDASPQLSHLPVGSACVFKTTDPARDGGHNVFVEGQVFEVNNSPIRIRVKVIEGDTCKEVVEVKRADIRLRQPPWADELEDAGSHAPAVPHMRQQCVSYSMGDHFATSSPMPGASHVSVGALSAGSRPFDDYGSDDDLPSENIMFPTDASHMDCNNSKRSSLQSRGSTSSLVEGSLTPRSQPPTPRSQAATPHKYKKGDVVSTPTGIRKKFNGKQWRRLCSKNGCGKESQRRGYCSRHLSQGGVNRSSNTPLTQGSTHTPQQRSSSKSLSSNGTGVEGDDTSRESDTTPPNYRVTGRFDQDETEAANMLVSLGSSRSGSPGASPVSGSPVLRGNVFVPISSPQPPLNNPPHKNYHHHLIRPESLRPAIVSPPVGGVATSVIRVSPAPTHHYQVDNRNGQNIQSSQPNMMGLQTTPYNIQSNMPSNLNAPTTMQSSLNFPTIINNLNQKFQTYASSVRPTKIEDSLHNVVVHRMPSNGTDSDYRNKAYRRNGIQEQFRRDADMSPPLNNYENFLNRRVSDYDEEDHSVPQPDSGHLELSEARLIDDKRIVKPAPLPGRYISLVDDTKDTLRKLYVIPQNTIDKKIVLIKNEPTDIQIEHKPQSQQLNSSDQDMEHRSTDNGDTGNKLNNSAVIVHPSQLLPVLPPPSSAIIVSSSGVPSVFSWQSLVPLLRAASPPAVPHSPRTPHTPHTPHTPHTPHTPHTPHTPHVKTEDIKTENELYVIMYYLRQKERRIRRPMNAFMIFSKRHRQIVHQLHPNQDNRTVSKILGEWWYSLKPDEKKKYNELASEVKEAHFKAHPEWKWCNKDRRKSSSSRDPTGSTPQSPRTPSEGPNPMMASADMSVNSQTYTHIGSPQLSDDEPMQISQTVEEPSAPAQNIEIDLKCGEKVTDSDSEGIDAREYLTHHDTRRPKPIKARAGSSDNLLGITASSPGGFKVFQPTGGAFKSTHADSGDNHRQWTAFTSVNKPNINQDLNSPHPNTQSLTNSVQGISISAPNLSTQAALDNAIASIISPTTSGVQVISSGISMPHTISQSQAPTSTTTALTNTLLKSVTLVKRNIGDNTAVPITLSVDTSGNIVIKASQASDSPATSDSQPLHYVQLQRLYVSSVNTAESEPAKTPVSNPQTGPSVIVSQSNNHISPSNATMEPMETWDTPMYEARPFPLAPTPAQLGRAPLQKRLSRGTSTGSTGSNEATIPRSESGPTTPLDVGEVGVHSPKKENLPSPSLKKSLFKKGNEDGRDKVLETVNFSEKFNTLPQFKPEACSPSAMAVPRSPQLYLRKKHHKISMEEDQTVVTPQIENEIMNGNGMPTPHSYGTPHSTTKLVGTTFFGPDFNPENFRVPCSEASEEMSPRTPCSARGEAGHRRVLEQRRHLVMKLFHDHGMFPSTQATTHFQAAHADIFPSKGSLQLKIREVRQKLMAQSNLTPHSDLNTPTNVNSPIVSSLLPTSTAS-
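
Consistency check:

The transcript and protein lengths quoted in this record can be cross-checked against each other. The sketch below is a minimal Python example, not part of the OGS2.0 tooling; the function names `read_fasta` and `cds_matches_protein` are hypothetical. It assumes the transcript is coding sequence only, which fits the sequence shown (it starts with ATG and ends with the stop codon TAG, and the protein sequence ends with "-" to mark the stop).

```python
def read_fasta(text):
    """Parse FASTA-formatted text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after '>'.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def cds_matches_protein(cds_length, protein_length):
    """True if a CDS of cds_length nt encodes protein_length residues plus one stop codon.

    Assumption: the transcript contains no UTRs, so every base is coding.
    """
    return cds_length == 3 * (protein_length + 1)


# Lengths quoted in this record: 4974 bp transcript, 1657 aa protein.
print(cds_matches_protein(4974, 1657))  # True: 3 * (1657 + 1) = 4974
```

To check the sequences themselves rather than the quoted lengths, pass the two FASTA records through `read_fasta` and drop the trailing "-" from the protein before comparing lengths.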