Monarch geneset OGS2.0

DPOGS215957
TranscriptDPOGS215957-TA4929 bp
ProteinDPOGS215957-PA1642 aa
Genomic positionDPSCF300078 - 910651-931773
RNAseq coverage269x (Rank: top 40%)
Annotation
HeliconiusHMEL0164520.069.55% 
BombyxBGIBMGA001072-TA0.082.46% 
DrosophilaCG33291-PA0.045.43% 
EBI UniRef50UniRef50_D2A2100.050.31%Putative uncharacterized protein GLEAN_07059 n=3 Tax=Tribolium castaneum RepID=D2A210_TRICA
NCBI RefSeqXP_001662621.10.048.72%hypothetical protein AaeL_AAEL012496 [Aedes aegypti]
NCBI nr blastpgi|1571327080.048.72%hypothetical protein AaeL_AAEL012496 [Aedes aegypti]
NCBI nr blastxgi|2700050550.051.06%hypothetical protein TcasGA2_TC007059 [Tribolium castaneum]
Group
Gene OntologyGO:00055155.7e-18protein binding
GO:00036777.8e-18DNA binding
KEGG pathway 
InterPro domain[1318-1341] IPR0206838.8e-21Ankyrin repeat-containing domain
[1430-1564] IPR0113336.2e-20BTB/POZ fold
[1459-1564] IPR0002105.7e-18BTB/POZ-like
[783-884] IPR0090727.8e-18Histone-fold
[1454-1560] IPR0130697.6e-14BTB/POZ
[1099-1129] IPR0021105.6e-07Ankyrin repeat
Orthology groupMCL12053 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215957-TA
ATGCTGTGCTCTGCGCGTTGTTCTTGCTTTCTTTGCCGTGAATTACAATCGTGTCAAGCCGCAGACGGTCGGAAACCACCCGCGAAACACAACATGTGTGACAATTTCATACTTAAAAAAGCTGAAGCCTGGGAGCCTATGAAATTCATTCGTGGAGCTGCTAGATTTTCTGTGAGACATTGGCAGAACCGGGAACCGATCGCAGCTCCGAAAACTCGTGTTTCACCACACAAGTCGACAAGTTGTGAACAGTTATATCTCAAACCATCTGAGGATTCTGGTCCTTACGTATTGCGAAAAGCTCCAAGTTCAGATCGTGTTAAACAAATGGAGCGCAAACCAGTCTTTAAGCTTCTGGGCAATTCTCGTTCAAAAAAAATAACTAGTATTGAGGCGCCGGACTCAAATCTGAGAAGATCCGTTTTGAATTTTAGAAAAAATGAAATGACTAATTCCAAAAATATTCATGAAAAATATATTCCGTTAAGACGAGCCCCTTCTCCTTTGGAACACATTTATAGCGAACCAAAGCAATGGCCAACGCAACTTGACGATTGTTGTAGACCTCCAATATATCAAAGTGTTGAAAAGAAAATGTCCTCGAATAAAGAAGCTTCATGTGAAAAGAAAATGTATAACCATCGACCTCCCTGTTATGACGAATTAGCAGAAAGGCTATCCCATGAGAGATTGAAAAAAAATAGCAACCCTGTTCCTGATGGAGGGTTAAGTAATAAAATTACACAAAACAATAACAAAGTAGTTATATATTTTGGAGATTCCATTATTAGGAAACACAATGAACTGAAGAAAGAAGTTAATGAGCTTAAAGAAGATTCCGCAATTCGAAAAGAGGATATCAAGCATAAAGAAGAAGCAATTTATGCAAATAAACAGTCCATTGAAAGTGAAAACTTCCATTTAAGTCAGGAAATTGACAGAGGAGGGACAAAATTAGTTTCAGAAATTGAAGTTCCGGAAGGTATTTATTCTGAAATCAAAAATGACGATATTAATGAAGACACTAATGTAGTGACTATTAATGATGCCACTTACAAAAAAGACGTCTATGAGACTGTTAGTTCCGATGGATTTCTGGTTGTTGAGTTTGAAGGTAACTTCGAAAGAGCACGTCAATTAGTTGAACTTGTGCATGGAAACAATGTTGACCAAAATGAGGCGACCGTGCTCGATGAAGATGACCAATGTGACTGGAGTTTCATACAAGACTGGAGAAAACGGCCTTCGAGCGGATGTCGTCCTACTGAGGAGTGGTGCGGCGCGGGTGTGTCGGGAGCGCGTGCGTTGCGCGTGCGCGTGTCCAGCGCGGCGGCGGGCACGGCAGCACATGCGAGATGTCTCTCCCCTCCGCCACATGCACACCGGGCGACAACTCCACCACCTCCGCCGCCAGTGGATCAAACTCCAAAGACTAATACTGAGGAGGTAGTATCCCCGGACGCAAGTGTGGCCCAGTTGAAACCTGTGACCGCTGTGTCGGGGAGTACAGCTGCCAGCACGCCTGTACGCAGCGCGCCCGCGCAGAGTAGTCTAGCGTTAGAGAGACGCAAACACGAACATCGATCTGTGGACTCTTCACATATACAGCTGCAAAATGGTGTACGTGAGTGCAACAGCAGTTCAGATGAGAACAGGTCATCGGGACATGCTTCTATGTCGGATAGCGGTGGAGGTGACGCGGGGAGGGACGAGCCAAGACGGGCCAAGCAGCCGCCTCACGGACGAAATAAACATCGCCCGCATAATAAGTTACAATCCCCTTGGCCGGGTGGTGGTGGGTTAGAAGAAATCAGGTCAGCCATCAAGCAGTTGACTCTACGTTCTCGGGACAGCAGCAGTACAGCTACCAGCGGTGCATCAAGTGCGGGCGGTAGCAATAATGGTGCACCGGATGGCGTCAGTGCAGCCGAGGCACGTCGTCGTCGGGCGCCACTCGTTCGTCAGCCTTCACTAGACACCGTCTGTACAAACGTCACAAGCGCCGACGAGTTCGTATGGGTGGACTCACATAATAGACTAGTTGAACTTCGCTGCGTGCCGTGGACGGCTGGCGAGGTATCCCGGGCGATACAAGCAGGTCGATGTCGGGATATAGCACCCCGTATGGCTCCAGACTCACCTCCTAGACTGGCCTACTTATTACAGAGAGCTTTGGTTCGGATAGGGCGGGAAGCTCAGCGTCTGTCACAGAACTTTGGCTTCTGTTCCAAGCACGAGGTGGCTGGCGCTTTCCGAATCGTACTGAGTACACCGTTAGCTGATTCTTGTATAAAGGGTTGTCAACGTGCAGCGACTATGTATGCAACGTCAGTAAGCGCTGCAAGAAGACTAGGCTCGGCGGCTCGAGCGCGCACAGGTCTAGCACCTGGACGTTTTCAGCGCTGGATGTTGGACGTGCGCGTCGCGGCATTCGTACATGAATACGCAGCCATTTATCTTTGTGCGGGAATGGAGACTTTATTGGAAGAGATTGCTTTAATAGCGGCTGGCACTGGCACATCACCGATTACACCAGCCACCATCGATCACGCGGTTGCAAACTGCGCTGACCTATGGGGATTGTTACAACCATACAGCCATTTAAATGCGGGGAGAATCGCTTCAGGAGCATTATCGCTTTCTCGATGGGAGTCAATGAGTTCCTTGGGCTCTGGGTCGTCGTCTCACCGACAGCTGGGCATGAGCCATGATTCCGGAATTACTACTAATGGCTCTGGTGGATCTGGCACGTCGGCGGGTTCCAGTAGCGGATCGGTGTTGTTAACGACGTGTGCAGGCTCCTCTGGCGAACTGCGAGCTGTGCTGCGACAAGTCAGCCCCAGACAACCACCACTTAGTCCAGCCGCTGAACGCGCGCTCTATTACTTCATGAGATGCTCCCAGTTGGAGCATACATCTAGCGGTGCGTCGGGTGCTGGATGTGGCGCAGGTCTGTGGGAGGAGCGTGGTGCCGGCGCGCTGCCTCCTCTTGGTGAATGGGTGAGAGCAGCACGAGCTCATGCAGCAATGCGTCCTCCCCCCGCCATACCTGATGCCGACGATATATTACAAGCTGCCAGGCTACTATTACCGCATGCAGACTGCCCACCGAGACCTATTACTTTGGAGGAAGCGATTGAGCCTGCATGGTCACGTTGCTCACGTTCGCCGGATGAATTGGGTCGGGCAGCATTGGGGCTAGCTCAGCGGGCTCTGTTATCTGGTCGCCCCGAGCTATTAAACGGAGTCCGCGCTCTGCTACCGGCAGCTGGCATTGATGCTACTGACGCCTCTGGCCTCACCGCACTTATGAAGGCCTCCTTGGTTGGAGATGAACAAGTAGTTGCGATGCTATTAGAAGCTGGGGCTGACCCTAACATAGAGACAGCGGGAGCGGCGGCACAGCACTCAGCACTACTCTCACCGCGTTCTCCTACTCACCCACCCTCCTCAGCAGCTGCAACACCGGCACCAGGATATACACCGCCCACAGCGGGTTGGACAGCATTAGTATACGCGTGCGCTGGAGCGGGTGCGGGAGGTGGAACAGGATCAGGGGGAGCAGGGGGTGCGGGGGCCCTAGAAGTAGCAAGAAGACTGCTGGCAGCCGGCGCCAGAGTTGACGGCGCACCTGCCAGAGGAGATGACGTGTGCACACTTACTCCTTTACAGATGGCATGTGGAGTTGGGAACTTGGAACTAGTACAGTTGCTATTGTCACACGGAGCGGATCCGTTCTTGTCTACGCAACTTAATGATGCACTCTGCTATTCTGCTGCTGCGCAGTACGGCTGCTACAGTGCTGTAGCCGTTTGTTGCACGCATGGTCGTCGTAGCTGTCTACGTGCGGCTGTTCGAGGCGGAGCTCGAGGTGCAGGTGGGGGTGCAGTGCTGTCTCTGGAGGAAGTTCTTGCGGAGGGCGCACCACCCGCTGCTGGCCGGCCACCACCCTTCACCAAGCAACAACAACGCGCGTTACAGGATGCCATGTACTATGCCGCGGAAACTGATCATTTGGATATAACCCTAGAGCTGCGAGCACTGGGAGTCCCTTGGTCATTGCACGCTTGGACTCTTTCCTTGGCCGCCGCTGCTGAAGCCTCGCTAGACCATGTGATCGACCAATTATTACAGGACTTTTTACAAGTCTGTCCATCCGACGACAGTCACTACAGCAAGCAATTCATTTACGAGTGCCTCCCTCTGCTTTTCAATATTTTACGCTATAGCAAGAAAGAAGGCACAGTGTTGCTTCTAGCGGATATTTTATGCGCGTGTTACGGTTGGGAGCCAGTACCTCGGGTCGCACCGCCCGCGCCCGCTCCGCCGCTACCAGCGCGAGTTGACCCTTCATACGTTAATAACCCATCGCTCGCTGATGTCACTTTCAGAGTTGAAGGTCGTCTTTTCTACGGCCACAAGATTGTGTTAGTGTCGGAGTCAGCCCGCCTACGAGCTATGTTGGCGCCGCCCCGTTCTGGCGAACCCTTGGCAGGAGCTGCTCCACCACTCGTGCAGATAAACGACATCCGATATCACATATTTGAGCAAGTAATGAAGTATTTATACTCGGGTGGCTGCTCGGGCCTTGAAATTCCGGACGGTGATGTTCTGGAAGTGCTGGCAGCCGCGTCTTTCTTTCAGCTGCTTCCCTTACAGCGCTACTGTGAAGCACGCGCGGCACAATCCGTAGACCTGCACAATCTTGTATCGGTTTACATACACGCTAAAGTGTACGGTGCGACACAACTCCTGGAGTACTGTCAAGGATTTCTCCTGCAAAACATGGTCGCCTTACTTACTTACGATGATTCAGTTAAACGGCTGCTGTTTGGCAAGAGGCTCCCCGGACATAATGTGTTGGGGGCACTACTAACTACTTTGCAGAAAAGAATCGAATCAAGGAAAAACCAAGTCAAGAGCAGATAG

Protein sequence:

>DPOGS215957-PA
MLCSARCSCFLCRELQSCQAADGRKPPAKHNMCDNFILKKAEAWEPMKFIRGAARFSVRHWQNREPIAAPKTRVSPHKSTSCEQLYLKPSEDSGPYVLRKAPSSDRVKQMERKPVFKLLGNSRSKKITSIEAPDSNLRRSVLNFRKNEMTNSKNIHEKYIPLRRAPSPLEHIYSEPKQWPTQLDDCCRPPIYQSVEKKMSSNKEASCEKKMYNHRPPCYDELAERLSHERLKKNSNPVPDGGLSNKITQNNNKVVIYFGDSIIRKHNELKKEVNELKEDSAIRKEDIKHKEEAIYANKQSIESENFHLSQEIDRGGTKLVSEIEVPEGIYSEIKNDDINEDTNVVTINDATYKKDVYETVSSDGFLVVEFEGNFERARQLVELVHGNNVDQNEATVLDEDDQCDWSFIQDWRKRPSSGCRPTEEWCGAGVSGARALRVRVSSAAAGTAAHARCLSPPPHAHRATTPPPPPPVDQTPKTNTEEVVSPDASVAQLKPVTAVSGSTAASTPVRSAPAQSSLALERRKHEHRSVDSSHIQLQNGVRECNSSSDENRSSGHASMSDSGGGDAGRDEPRRAKQPPHGRNKHRPHNKLQSPWPGGGGLEEIRSAIKQLTLRSRDSSSTATSGASSAGGSNNGAPDGVSAAEARRRRAPLVRQPSLDTVCTNVTSADEFVWVDSHNRLVELRCVPWTAGEVSRAIQAGRCRDIAPRMAPDSPPRLAYLLQRALVRIGREAQRLSQNFGFCSKHEVAGAFRIVLSTPLADSCIKGCQRAATMYATSVSAARRLGSAARARTGLAPGRFQRWMLDVRVAAFVHEYAAIYLCAGMETLLEEIALIAAGTGTSPITPATIDHAVANCADLWGLLQPYSHLNAGRIASGALSLSRWESMSSLGSGSSSHRQLGMSHDSGITTNGSGGSGTSAGSSSGSVLLTTCAGSSGELRAVLRQVSPRQPPLSPAAERALYYFMRCSQLEHTSSGASGAGCGAGLWEERGAGALPPLGEWVRAARAHAAMRPPPAIPDADDILQAARLLLPHADCPPRPITLEEAIEPAWSRCSRSPDELGRAALGLAQRALLSGRPELLNGVRALLPAAGIDATDASGLTALMKASLVGDEQVVAMLLEAGADPNIETAGAAAQHSALLSPRSPTHPPSSAAATPAPGYTPPTAGWTALVYACAGAGAGGGTGSGGAGGAGALEVARRLLAAGARVDGAPARGDDVCTLTPLQMACGVGNLELVQLLLSHGADPFLSTQLNDALCYSAAAQYGCYSAVAVCCTHGRRSCLRAAVRGGARGAGGGAVLSLEEVLAEGAPPAAGRPPPFTKQQQRALQDAMYYAAETDHLDITLELRALGVPWSLHAWTLSLAAAAEASLDHVIDQLLQDFLQVCPSDDSHYSKQFIYECLPLLFNILRYSKKEGTVLLLADILCACYGWEPVPRVAPPAPAPPLPARVDPSYVNNPSLADVTFRVEGRLFYGHKIVLVSESARLRAMLAPPRSGEPLAGAAPPLVQINDIRYHIFEQVMKYLYSGGCSGLEIPDGDVLEVLAAASFFQLLPLQRYCEARAAQSVDLHNLVSVYIHAKVYGATQLLEYCQGFLLQNMVALLTYDDSVKRLLFGKRLPGHNVLGALLTTLQKRIESRKNQVKSR-