Monarch geneset OGS2.0

DPOGS208939
TranscriptDPOGS208939-TA6312 bp
ProteinDPOGS208939-PA2103 aa
Genomic positionDPSCF300009 + 167172-174442
RNAseq coverage232x (Rank: top 44%)
Annotation
HeliconiusHMEL0080080.065.13% 
BombyxBGIBMGA002412-TA0.060.09% 
DrosophilaCG14073-PB2e-5244.05% 
EBI UniRef50UniRef50_E2AM482e-15043.26%BCL-6 corepressor n=4 Tax=Formicidae RepID=E2AM48_CAMFO
NCBI RefSeqXP_391923.31e-14844.94%PREDICTED: similar to CG14073-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838491131e-15545.50%PREDICTED: uncharacterized protein LOC100877383 [Megachile rotundata]
NCBI nr blastxgi|3838491132e-18029.63%PREDICTED: uncharacterized protein LOC100877383 [Megachile rotundata]
Group
Gene OntologyGO:00055155.3e-06protein binding
KEGG pathway 
InterPro domain[1815-1936] IPR0206832.8e-34Ankyrin repeat-containing domain
[1859-1888] IPR0021105.3e-06Ankyrin repeat
Orthology groupMCL16841 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208939-TA
ATGTTTGTGGTGTGCGCGTGTGTGAATTTTTTTGAGAGGGCGAGAACCGCTCAAGGGCAACGAGCGGTTAATGTCCTCGTAATGAACCTTCAAGAGTGGTGGATTCAGGACCGGGGGCTGCATAGTATGGATGTCAGTTTTAACAATGTGCTGGAAGGGACAGTGCGCCAATACTTCATGGGATTACCAAAGCCGGCACAAGAGAGTACCGGTCAACAAATGTGGCCTTCGGGAGCTTCAACTGAGGAAGGAACCCCTGCGCTTTCGGGGGCTCCAACCCCAAGTTCAAGGCCACCTTCCGCTGCCGCTGCACCGCCATTAACGCCAGCGCCCACGGTAATTGCTGTGCCCCCACGACCACCCTCGGAGCCCCTTGAGCCAGAACCACATATCATTCACAAAGCTACAGTTATGCGCGAAGCAGCAGAAAAACGCCGAGAGCGAAACGAGATAGAGGATGAGGAGGAGAACGTCGGGGGATTGTCACCACCTTCTCATATTATACGGAAGCCTTCAACTCATACCTTAGCATCACCAGCACCGGCTTCGGCGGTGCCAAGTCCGTCCCCAGTTCGTCCACCTTCCGCCGTGTCTCACTTGTCCACGCCGGCAGCAGAACAAGCAGATCCATCCACTCAATATCGAGCGTCTAACCACAGGGTACAAACTCCCATTGGTAGACCAGCCGAATTTGCGAATAGTCGGCCTGCAGACCACCATTCTATCTCGCATCCTTCAGAAACTTACACGCGCTATCAGTACGAGTCAAGTGTAGTTGAGAGTGGAAGAAATTCACATGGGCCACCAATCGTCGTCCCTCAGGCTACAGCCCCCCGCCAAAGTCCTCAAACATATGCAAGGATATCAAATAAAACGCCGCAGCATGCCGTCAGCCGTTTGACTGAATCTTCTGCACATCATCTACAAGTACAGCGTTATAATGTACCAACACAAGTCGCAACTCCTCATAACGACAGGGTACCACCGATTCATCATCCACATTTACCATCTACAAAAAGACTTACTCCGTTGGAAAGTCGGGCGGCGCAAGTATATTCATCCTTACCACAACGACGAGAATCCGCATCATGTGTTCGACCATCATCACAATATGAAAGGCAAGGGGTTTTGCCGCAACGAAAATATGATCCACAACAAGTGTACCCTGGATATCGACCCACGTCTGCAGGGCCACAACATATTCCAAATAATAGAGTTGATTCACATCACCAAGTACAATATAAACATACACCCGTGGACGTGGTCAGCCCACATTACGCTATTCGTCCTCATGATCGAGTTCAGGGAACATCACAAATGCAAAGTCATGAAATTGCTAGACTCCCGCCGCGCATTGATTATATAGGAAAAAATAGATCTGTGGAGAGAAGTAACTATCCTTATGCTGCAGCATCGACGTCCTCTGCTTCGTCAGTCAGATATGCACAGAATGGAAATATTCCAGCCGTTCAAAATAACTCGACGGTGCAATCTAGATCTTCATATCGCAGCGCAGTGGCTCCAAAGACGAATTATGATTTTGCAACATCTAATCATGTACAAATATCTCCATCCCGTTCTACCGCTAAACCTGGTTATCCTAGTCAACAGGCGCGAATGACAGTGTCAGTTACATCTATAGTAAATCAAATGAACCAAACTAAACAGAAACGAGAATCACCTTTAGATTTATCGGTAAAAACTGTAAAAAATTCGGCGGATTCTTCGACCACACAAGACGATGCAATTGATTCGATATCGTTGAAAAATGATAATCTTGCTTTGCAAAGTAGACCAACATCAAGTACAAGAAATGGTTCAATCGGTTATATAGCATCACATAAAGTAGATTTTTCACCTGATTTTGCTCAATATAGGGAAAGAACATCAAGTCAGGGATATAATTCACTGCCAAATGTTCCTCCTCAAATGCGTTACAACGGTGCAAATGATTCGCCTCAGCCAGCTTACCCCGTGGAACCTCGCCACCAAGTATACGCTGAACGTATTAAATATAACGGTTCAATAAATAGACCAGATTATGTTCCACGCATAGATCTAACGAGGCCCAATAGCGAGGATCGATATAACGAGAGTCGTATACGTGATGATAGACATTTGATCATAGAAGAAAGAAAAAGACCATCTGGACCTATTGTTAGTAACATTCCCGAAAAAATCGTACGATATGAGGCTTGGCCTCAGTCTGATCGCATGGAGCGAATGAGCCAATCAGCTCGCGAACAGCACGAACTTATGAGTCGACCAGTGTACAGCTATACTAGTCATAAAAAATTTGAAGCATATCAAAGTGATCATAAAAGATCAATAACTTCCAATGTGTACCGTGAACATCATTCTGTTCATCACTATCCTAATGATTACCATACGCATGAGGCTAATGATTACCATACGCAGTATCACGAAAGGATGCCAAATAATCGTCATATTATTCAGAAGATTCCCAGCCAAACGCATAGAGATTTCCACCCCCACGTAAATCATAATAATGTACATCCTCCAGATAAAAAGAAAGTCTTGAGCATATTAAGAAATAGTTTAGAAACCAAACAAACGGGAGTCTCAGAAACATGTAAATCAACAAAAGGCATTCCTGATCTCATTGTTATTGACGATGTTGACGATTCAGTAATAGAAGTTACTGATTTGACTAAAGAAGTTGATGTCGATATAGCAACGAACTCAAGTCATTTAAGTAATCCATGCCAGTCAAGTGCTAGTTTAGACTATCACATAAAAATGCCTAAAGCAGTGGATTCATTGCCAAGAGATTCAGACTATCAACGGTTTGAAAGTCGTGATGCAAAAACGCATTCACCGGAGAGTGATGTCGCTAGAATAAGGACTAAAGCAGAACTTAAAGTTATGCCTCCAAGTCACGACAGTGCTTCAAAAATTGAAATGAAAGAAGATTTGTTACTATCCAAACATGAAAAATGTAAGCCTTTGCCCAAATCACAAAAACAACACCTTTTTAACCAAATACGGGAAGATAACTTACGATTGGAATCTGTTATTAAAAATGAAAAGCCAAGTGACTCAGCTCCAGAAGTGAAATCTGAGCCTATGAATATTGATGAAATTGAAAAAGATGCTATGCTACATATCAAATCTGAAACTAATGAACCAATTATAAACTCTGGTTTATTTCAAGCAAAAGCAAATATTCCTGTAGAAGAAGTAGATATAGATTGGGCTAATGCTTGCGATAATTTTATGGAACAATTAAAAGTTGGCTGTCACAAGAAAAAGGTAATAAGGAAACGTAATAGTACCTTAGAATCTCAGAGTGAAAATAAAACAGATCATCTAGAAGACGATGAGTCTATTGTTTCATCTCTCTATCCATCTGAAACAAGACCCTCCTTCACAGAATCAACTGTCATTCAAATTAAAAAAGAACCAGTGGATGAAGAGGAACTGAAGCTAGAAATTGAACAGTCAAATAAAAATGACTCTTCAATAAAAAACAAACAAAAGAGTAAGCCGGAAAAAAGCAAAAATTTGAAAAAGAACGATAAAGACAATGATAAGCCACTTAAGTCCAGATCTAATCAAAAAGCTAAAAATAAATTTCTGAATAATGGCACTTGTTCTACGAATAGCGTCTTAGTAAAAAAAGAAGCAGAAGATGAAGTAGTAATGAAGACCGTAATTAAAGAGGAGTTAGAGTCTACTGATGACGATGAACCTCTTATTAAGAGCAAAATATTCAAACAAAAACAACAGGAACAAAATGATAACTTAAATTATCAATTATTAAAAGACTTGTCTGAAAAATCTGCATATGTCAAACTAGAGTGCTGTGACGATCAAATGACAAGTTCACGAAATTCAATAGATACCTCGCCAAAAGAAGAAAAAGTAAGTAAAGCTAAGAAAACACGTCAAACAAAATACAAAGGGAAATCTTCCGAAAATACGAAGAAAATAAAAGTTTGCGACGATTTTAGCTCGGATAGTGACGATACTTTAGGTGTAGCTAACAGACTAAGAGTCCGAAAAAATGGTAACACTGATGAGGGTAAAACAAGCACAGCTACACAAAAATTATGTGTTAAATCTACACCTACCAAAAGGTTCGATAGTTCTAGTGCCCATAGTTCACCAATGCGTAAGCCAGGTTTTGGTGACGGTTCTGATTTTCATCCAGGTTGGGAGGAAGAATTATATAGATATAAGAGATCTCTTCGCATGCCTACGAGACTCATTGCTATACCAAGAGGGCGCTCTGGTGGACCGTTTGCTAAAGGATGTATGTCACAATTTCAACGTGGTTCTACGTCTCTCCCTGATTTAGATCCCGTTCCTCTTTCTCCAGCTCCATCATCGGCGCCCTCCGCCGCAACTGATGAATTATATAGAAGACCAGATAAACTTAACTTGGATAGTGACCTCGATTCTAATTCAAGCTGTTCAGCTTTTAATAAATTACATTATGATTCAGAAGCCTCAACGTCAACAGTCTTTTCTGCGACGAAAGCTAATAAAAACAGGAGCTCTATAGTCGACGTTCTAATACAAAAATGTGGTAAAAGAGAAGAATCTAAAAAAAAGAATAAAGATAAAGATGACAAAACACCGAAAGTTATGGCAAAATCTTCCAATGCTGCAGAACTTTTACCAACTCCAAGCTTAGGACTTTTGAAAAATGGAAATAAGGGTAATTTGAGTGCAAAGAAAGAAAAATTTATGGAGGATATTTTCTACTTAGGTGCTTTCAGAAAAGAAACTGTCACTATGTTTAGGACAGCATTTATTAAAGAGACTGATGGGCTTATCGGTGCTACGGAAGAATTTGCTCCGGTTGTTTTGAAGTCAAGGACACGAACTGAGAGTAGAGTATTGAAGCAACGAGCTACTATTAAAGAAATTTTTGGTGATGATAGACCAGCTTCAGCTCCACCTTCATCATGTAGGGAAGAATTGGCCCTTGAAAAGGAGGAAGAACAACCAAGTGTCGTAATAAAAAAAGAACCAGATACTAAGAACAATATGAAACCGAAGAAAGTAAAGGACAAGATGAAGCGGCGATCTAGTTCAATTAGAGATGGATTGAGGAGCACGAAGTCTTTGAAAACAAATGATGCTAAAGGAAGACTCATGCGTTTGAAGAAAAGAAACAGTTTAATGAAAAGTTTTGCTCATAAACGAAGAAAAGACATGACTAATAAATTGAAAAAAGACATTAGCTCTACAACCGATGAAAAAGATAATAGCAAAGAAGGAACTACTTCACCGTGTGCAACCGAGAGCAATACTAAAAGGCGCTTGAAACGTCTCTTTGGCAGGAGAAAATTTAGTTCTGGTTTTGATTACATCAGGAAGAAGAAGAAGATTATAAGACGCGAAGATAATGCTCCAAAGATACGTAGAGCTGCACCTAAACCTAGTCCAGAGTCCGTTCATGACATTCAAAAAGAAATAAAAAGTTGGTTCATTAACAAAAGTATCGGAGAAACCCATTTACATCGAGCCGCGAGACTCGGTTTTACCGACTGTGTAGCGTATTGCTTAGAGAAAATGGATTTGAATCCGTCAGCGAAAGACAATGCAGGATTCACTCCTTTACATGTGGCTTCAGCTAGAGGCCACGTTAGAATAGCCAGATTACTCTTACAATACGGTGCCAACGTTTCGGCCGCAGCCCAGGGCGGAATAAGACCGTTACACGAAGCGTGCGAGAACAGCCATGTTGAAATTATAAGATTGCTTCTGGCGTATGGGGCCGACCCGCTACTCGGAACGTATGCGGGCCAAACTCCGGAAGAGCTAGCGGAGGGACAGTCAGCCAAACTGTTACATCTCTATATAGCCGATGTTCAAGGACGGGCTATCGAGCCATGGAAGTTTCCGACGCCCGCAGCAATAATAGATCGTGAGGAATTAGGTTGCGACCCGTTGTCGTCCCCACCGCCGGCGTCTCCTCCCCCACCACCCGACACCACCATCGAGATACAATGCACGGAGGCTCCTCTGCCACCGTTTTACAGCTTACGTACAGCCACAGGACAACCGGCCGACGGGCTCTGGTGTCTGCTGCAGGACATCACTAACGTTCTCCAGATCAAATCAAAGGATAGTCTACTGAAGCAGATCCATTGCGGGTCGGGATCACCGCGGGAGTTGCTCCGGGAGATTCGCACGCAGGAGTTCCTAGAGCGCGCTCAGTGTCACCAGCTCCTCTGTGCGGGGGAAAAGGTCAACGTCCGCGCTTCCAAGGTTGCGCTGATCCGTGTCACCGACAAGCTGCGACAACTGCTCAAGATAGAGACCGTCCTTGTCAGCTGA

Protein sequence:

>DPOGS208939-PA
MFVVCACVNFFERARTAQGQRAVNVLVMNLQEWWIQDRGLHSMDVSFNNVLEGTVRQYFMGLPKPAQESTGQQMWPSGASTEEGTPALSGAPTPSSRPPSAAAAPPLTPAPTVIAVPPRPPSEPLEPEPHIIHKATVMREAAEKRRERNEIEDEEENVGGLSPPSHIIRKPSTHTLASPAPASAVPSPSPVRPPSAVSHLSTPAAEQADPSTQYRASNHRVQTPIGRPAEFANSRPADHHSISHPSETYTRYQYESSVVESGRNSHGPPIVVPQATAPRQSPQTYARISNKTPQHAVSRLTESSAHHLQVQRYNVPTQVATPHNDRVPPIHHPHLPSTKRLTPLESRAAQVYSSLPQRRESASCVRPSSQYERQGVLPQRKYDPQQVYPGYRPTSAGPQHIPNNRVDSHHQVQYKHTPVDVVSPHYAIRPHDRVQGTSQMQSHEIARLPPRIDYIGKNRSVERSNYPYAAASTSSASSVRYAQNGNIPAVQNNSTVQSRSSYRSAVAPKTNYDFATSNHVQISPSRSTAKPGYPSQQARMTVSVTSIVNQMNQTKQKRESPLDLSVKTVKNSADSSTTQDDAIDSISLKNDNLALQSRPTSSTRNGSIGYIASHKVDFSPDFAQYRERTSSQGYNSLPNVPPQMRYNGANDSPQPAYPVEPRHQVYAERIKYNGSINRPDYVPRIDLTRPNSEDRYNESRIRDDRHLIIEERKRPSGPIVSNIPEKIVRYEAWPQSDRMERMSQSAREQHELMSRPVYSYTSHKKFEAYQSDHKRSITSNVYREHHSVHHYPNDYHTHEANDYHTQYHERMPNNRHIIQKIPSQTHRDFHPHVNHNNVHPPDKKKVLSILRNSLETKQTGVSETCKSTKGIPDLIVIDDVDDSVIEVTDLTKEVDVDIATNSSHLSNPCQSSASLDYHIKMPKAVDSLPRDSDYQRFESRDAKTHSPESDVARIRTKAELKVMPPSHDSASKIEMKEDLLLSKHEKCKPLPKSQKQHLFNQIREDNLRLESVIKNEKPSDSAPEVKSEPMNIDEIEKDAMLHIKSETNEPIINSGLFQAKANIPVEEVDIDWANACDNFMEQLKVGCHKKKVIRKRNSTLESQSENKTDHLEDDESIVSSLYPSETRPSFTESTVIQIKKEPVDEEELKLEIEQSNKNDSSIKNKQKSKPEKSKNLKKNDKDNDKPLKSRSNQKAKNKFLNNGTCSTNSVLVKKEAEDEVVMKTVIKEELESTDDDEPLIKSKIFKQKQQEQNDNLNYQLLKDLSEKSAYVKLECCDDQMTSSRNSIDTSPKEEKVSKAKKTRQTKYKGKSSENTKKIKVCDDFSSDSDDTLGVANRLRVRKNGNTDEGKTSTATQKLCVKSTPTKRFDSSSAHSSPMRKPGFGDGSDFHPGWEEELYRYKRSLRMPTRLIAIPRGRSGGPFAKGCMSQFQRGSTSLPDLDPVPLSPAPSSAPSAATDELYRRPDKLNLDSDLDSNSSCSAFNKLHYDSEASTSTVFSATKANKNRSSIVDVLIQKCGKREESKKKNKDKDDKTPKVMAKSSNAAELLPTPSLGLLKNGNKGNLSAKKEKFMEDIFYLGAFRKETVTMFRTAFIKETDGLIGATEEFAPVVLKSRTRTESRVLKQRATIKEIFGDDRPASAPPSSCREELALEKEEEQPSVVIKKEPDTKNNMKPKKVKDKMKRRSSSIRDGLRSTKSLKTNDAKGRLMRLKKRNSLMKSFAHKRRKDMTNKLKKDISSTTDEKDNSKEGTTSPCATESNTKRRLKRLFGRRKFSSGFDYIRKKKKIIRREDNAPKIRRAAPKPSPESVHDIQKEIKSWFINKSIGETHLHRAARLGFTDCVAYCLEKMDLNPSAKDNAGFTPLHVASARGHVRIARLLLQYGANVSAAAQGGIRPLHEACENSHVEIIRLLLAYGADPLLGTYAGQTPEELAEGQSAKLLHLYIADVQGRAIEPWKFPTPAAIIDREELGCDPLSSPPPASPPPPPDTTIEIQCTEAPLPPFYSLRTATGQPADGLWCLLQDITNVLQIKSKDSLLKQIHCGSGSPRELLREIRTQEFLERAQCHQLLCAGEKVNVRASKVALIRVTDKLRQLLKIETVLVS-