Monarch geneset OGS2.0

DPOGS207216
TranscriptDPOGS207216-TA5787 bp
ProteinDPOGS207216-PA1928 aa
Genomic positionDPSCF300001 + 6100254-6118747
RNAseq coverage539x (Rank: top 23%)
Annotation
HeliconiusHMEL0061860.079.53% 
BombyxBGIBMGA010711-TA0.077.46% 
Drosophilapolybromo-PA0.046.77% 
EBI UniRef50UniRef50_D6W9A60.054.83%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W9A6_TRICA
NCBI RefSeqXP_001808258.10.055.09%PREDICTED: similar to polybromo-1 [Tribolium castaneum]
NCBI nr blastpgi|2700017420.054.83%hypothetical protein TcasGA2_TC000618 [Tribolium castaneum]
NCBI nr blastxgi|2700017420.055.20%hypothetical protein TcasGA2_TC000618 [Tribolium castaneum]
Group
Gene OntologyGO:00055153e-33protein binding
GO:00036771.9e-27DNA binding
KEGG pathway 
InterPro domain[505-619] IPR0014873e-33Bromodomain
[912-1029] IPR0010251.9e-27Bromo adjacent homology (BAH) domain
[1333-1412] IPR0090717.5e-14High mobility group, superfamily
[1358-1412] IPR0009102e-11High mobility group, HMG1/HMG2
Orthology groupMCL12529 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207216-TA
ATGAGTAAAAGGCGCCGTGCATCATCATGCGCGAGTCGAGGACCCGACGGTGATGATAGTGATGATGGAGTCGAGCTGAGTAACCCTGGACCTCCGCCGTCCAGGAAAAAGAAAAAGATGGAACCGAGTGAAATTTGCCAACAATTGTATGATATTATTCGGTCATATAAAAAGGAAGACGGGACTCTCTTGTGTGACTCTTTCATAAGAGCTCCCAAAAGGCGGCAAGAACCTCAGTATTATGAAGTAGTATCCCAACCAATTGATTTATTAAGGGTACAACAAAAGTTAAAAACAGATACCTATGAAGATATAGAAGAATTGTCCGCTGATATAGAACTCCTAGTTAATAATGCTAAGGCTTTCTACAAACCCGATTCAGAGGAATATAAAGATGCTACGGATCTATGGAAGCTTTATAATAAACATAGACAGACATTGGAAAATGGTGATGACATCAAAACTCCTAAACTATCAAGAAATGGTTCATCAAGCAGGAGGTCTGACGTCGCCGAGGATACTTCGGAGACATCAACCAACAACGAGGAAGACAATGTGTTTGAGGAGCTCTTCAATGCTGTCATGACGGCAAATATAGGTGGACGACCTCTTTATCCGCCGTTTCAGTTTCTGCCGTCGGAAAGGAGATATCCTGAATATTACAGCGTTATAGACAACCCCATAGACTTGAAAACCATAGCACAGAAGATCCAGGCCAGCGAATATAACACCCTGAATGATCTGGAGAAGGATTTGTTACTCATGGTGTGGAATGCCTGCCTGTTCAATGAACCAGGTTCACAGTTGTATAAAGACGCCAGGGCATTGAAAAAGGTTATACAAGCTCGGAAGCAGGAAATAGATCAGCACGGACGGAGCGGTCCGGCCAAGACCTCTGAACGTATCCGCAGTAAGCGTACGTCACGAGTGGGGCCCGTGCCCTCCAGTAGGGCTCTCGCCGTCATGGAAGCTCCCGGTTCTGACACCGAACCGGATGTCAAGCATTCAGAAGACAGCGCTGGTGATAGCGACGAAGATAAAACAGATAATGAAGATTCTCCTCAATGGAAGCTTCTGGAAACCGTGAGAAATCACTTGGGACCAAGCGGAACACCGATGGCGGATCCGTTCTGGAAGCTTCCATCGCGTCGCGAATATCAAGACTACTACAAGGAAATAAAGAATCCGGTGTCTCTGAATCAGATCAAAAATAAAATCAGACGCGGGAGTTACGGCACACTGTCCGAGGTGGCCGGGGACATGAACATCATGTTCGAGAACGCTAAGCAATACAATCTGCCTACATCCAGGCTCTACAAGTACGCTGTGAAACTACAGAGGTTGATGCAGCAGCGGGTCCAAGAGTTATTGGATATAGTCCAAAGTTCATCGAGCGATGATGAAAGCCTTTCTTCAGTTAAGAATCAGACTCAAGCGAACACTCCCAGACCCAGAGGCCGGCCGCGTCTGAATCCCAATCCCGTATCGTCGCCGACGCCGGCCTCCCCGATAGTTGCAAAAACTAATTTACCGCTGAAGAAGAAGCTACACTACGTTTCCAAGCAACTGGTGGAGTTCACGTGCTCGGACGGCAGGCAGCCCATGCTGCTGTTCATGGAGAAGCCCAGCAAGAAGCTGTACCCGGAGTACTATAACGTGATCGAGCGGCCTATAGATATGTTGACTATAGAAGCTAATATTAAGAGCGATCGCTACAACACAATCGACGAGATGGTGTCCGACTTCCGGCTCATGTTCTCCAACTGCCGCCAGTTCAACGAGGAAGGTTCCATGATCTACGAGGACGCGAACCTCCTGGAACGCGTTCTAAACGAGAAGCTGAAGGAGCTGAACAGCAACTATGAGAGGAAAGTGCCCCCCAAGACGTTTAAGGCCGCCAAATCTAAACAGTTGACGCCGTTCGAACAGAAGCTACGTACTTTATATGACGCTATCAGAGATTACAGAGATCCTAAAGTGAACAGGCAATTAGCTCTAATCTTTATGAAGCTGCCGAGCAAAACAGAATATCCGGATTACTACGAACTCATCAAAAATCCGATAGATATGGAGAAGATTGCTCACAAACTGAANNNNNNNNNNNNNNNNNNNCTGGCCTCCGACTTCATACTGATGTTCGACAACGCCTGCAAATACAACGAGCCCGACTCGCAGATATACAAGGACGCGCTCATCCTGCAGAGAGTGTGTTTGCAGACCAAGCAGGAAGACGAGGATGCTGTACCGGATGTAGCCGGTGCTGTTCAGGATCTGTTGTTGACTCTCTTCACAGGAGTCTACAACCACCAAGACGAGGAAGGAAGATGCTATTCCGACAGCATGGCTGAGCTGCCGGAGCACGATGAAGTAGCGAATGGCGAGAAGGTCCGGGCGATATCACTAGACCTGGTGAAGCGACGCCTCGACAAGGGCCTCTACAAACGACTGGATCATTTTCAACAGGATATGTTTGCTGTTTTTGAACGCGCCCGCCGCCTCTCCCGCACGGACAGTCAGATATTCGAGGATTCAGTGGAACTTCAGACGTATTTCATCGACCAACGCGACCAGTTGTGTCGGAACACGCTGTCCTCGCCGGCGCTGGCCATCACCAGGGACACGATAGCCACCAGCGTAGAACTGGTGAAGCAGTGCAAGCTGTTACAGGAAAACGACGAGGAGGAGGAGACGAGATCAAGTACCGAGGATACAATATCCGGTGCCGCTCCGCCCTCGCAGTACGGTCGAGGCGACTTCGTGTACGCCCCCGCTAAGGGGAGCAAGGAGCCTTCGATACTACAGATAGAGAAAATCGCCACGAACAGTGACAATGTACCCGTTATATACGCCAACGTATACTACAGGCCTCACGAGACCTTCCACGTGCGCACGCGCAAGTTCCTCCAGCAGGAAGTGTTCAAGACGGACACTCACCGCACGGTTCCCTTGGACGCTATCATAGGGACCTGTTACGTCATGAACGTCAAGGAGTACTTCAAGTATAGACCGGAAGGTTACCTCGACAAAGATGTCTACGTCTGCGAGAGCCGTTACAACACGAAGCATCGATGGTTCAAGAAGATCAAAGTGTGGGAGGGCGCTGAGAAGGAGGCCACTCTAGTGCCCAGAGAGGTGCCGCTAGAACCGAACAGAACGGTTTCAGTGTTCCGTGAAAGAGTCGAGAAACACAAGGACGAGCTGGCTGAGCTAGAAGTGCTGGAAAATGTACAAGAAAAGGAACGACCTGACGTGGTCATGTACAATCCTCTGGGCACCGACGACGAGAACACTTATTACGAGCAATACAACACGGTGTGCTCGGGGGTCATCAAGACGGGGGATTTTGTGTATGTGGTGACGGACGGCGGCAAGCAGATACTGGCGCAGGTCGACACTATATGGGAGACAGGAGACAACAAATGTTATTTCCGCGGACCGTTTCTCATCTTCCCATCTGAAGTGTCGCACATCATAAACAAGCCATTCTACAAACAAGAGGTCCTATTGACCACAATGCACGATACTAGCCCACTCGTGGGAATAGTGGGCAAGTGTGCGGTGCTCGATTACGACGATTACCTTAAATGTCGGCCGACAGAAATAGCGGAAGCGGATGTGTACGTGTGCGAGTCGTTGTATGACGAGTCCAATAGACTGGCCAGGAAGTTGAAGTCGGGGCTGAGGAAGTTCGAACACACCAAAGACGTTACCGTTGATGAGGTTTATTACTTCCCCAAGCCGCTGGGCCCGCCGCCCCTCGCTTCGTCCCACGAGGTCCACACGTCCTTCACACAGAAGACCTTCAATCCGAACGTTAACGCCGATTCCCTGGACGGCAAGCCTCAGTTCACCAACTTACTGAACACAACCCTGGGCAGCCAGGACGTTGAACTGTTGTTGGAGAATTCATTGGACGACTCCTCACTAGCCTCACCGGCTACACCACTGTCTATAGGTGGCAACAGCAATCCATACAATCCATCGATGACGTCAAATCAAGAACGTTCCAGCACTACGACGGCAACACCGGCCAGCAGCAAGAAGAAGAAGGAGCAGAAACAGAAGATTGTCACCGGATACATACTGTACTCGAGCGAAGTTAGGAAGGCCATAGTAGCTAACAATCCTGAATCTACCTTCGGAGAGATATCTCGTATAGTTGGCAACGAGTGGCGCTCACTCCCCGCCTCCACCAAACAGAGCTGGGAGGAGAGAGCGGCGCGCTGTAACGAGGAGACATCCGCCAGGCTGGCCGAAGAGATGCGGGAGCTGTCGCAACATACCGATCTAANCGACGATTACCTTAAATGTCGGCCGACAGAAATAGCGGAAGCGGATGTGTACGTGTGCGAGTCGTTGTATGACGAGTCCAATAGACTGGCCAGGAAGTTGAAGTCGGGGCTGAGGAAGTTCGAACACACCAAAGACGTTACCGTTGATGAGGTTTATTACTTCCCCAAGCCGCTGGGCCCGCCGCCCCTCGCTTCGTCCCACGAGGTCCACACGTCCTTCACACAGAAGACCTTCAATCCGAACGTTAACGCCGATTCCCTGGACGGCAAGCCTCAGTTCACCAACTTACTGAACACAACCCTGGGCAGCCAGGACGTTGAACTGTTGTTGGAGAATTCATTGGACGACTCCTCACTAGCCTCACCGGCTACACCACTGTCTATAGGTGGCAACAGCAATCCATACAATCCATCGATGACGTCAAATCAAGAACGTTCCAGCACTACGACGGCAACACCGGCCAGCAGCAAGAAGAAGAAGGAGCAGAAACAGAAGATTGTCACTGGATACATACTGTACTCGAGCGAAGTTAGGAAGGCCATAGTAGCTAACAATCCTGAATCTACCTTCGGAGAGATATCTCGTATAGTTGGCAACGAGTGGCGCTCACTCCCCGCCTCCACCAAACAGAGCTGGGAGGAGAGAGCGGCGCGCTGTAATGAGGAGACATCCGCCAGGCTGGCCGAAGAGATGCGGGAGCTGTCGCAACATACCCCTATGGAGATGACGTACGAGTGTGCCTGGGACACGTGCGACTATCAGTTCGAAGATCTGTCAGATTGTATGGAACATTGTATCGGAGATGGGGGAAATACGGGTCACGTACAGCAGCATTACGTCCGCGGGGGCGGGGAGTTCCCGTGTCTGTGGAGGTCGTGCGCCAGGGTCAGGAAAGGTCAAGCGCCCTTCCCAAATCTCCCTAGACTCTTGAGGCACGTGCGTGACCTTCACGTTAATAAAGGCAACGGGAAAACTATGGCTGTACACGAGAGATCCAGAAATTTCATTCCCTCATCTAAGAAGCCGCCTAAGCCGTATTTGACCAGTTTCCGAAGCGGCCTGATGTCTCCAGGAGGCTCTCTGTCAGGGATGTCACCTATGGCGAGGAACACGCCTTCTCCCGGCCCCGAGTCGTCAAGGTCCGGCCTAGACCCGCTGTTCGTAGCTGCCCCGCCCCGGGCGCAGCGAGTCACGCACTCAGAGGCTTACATACGATACATCGAGGGTCTCCATTCCGAACAGAAGTATATAACGCCGTGGGAGAAATCCCTCACACCCATGCCGGTTAACCTCGACCCGTCGCAGTTTAACTTACACAAGCTGCCCACTCATTGGATGACGGACGAGGCGGTCAGCGGTTACTTACAACAAGACAAGAATCTCACAGAAACTGACATACAGAAGATGGATCAGTCTACTAAAGTATTAAAAGGGCTCTGTGCACTCAGAGACTTCATGATGAGGGACGCCTTAGCTGTATATAAGAGTTATTGA

Protein sequence:

>DPOGS207216-PA
MSKRRRASSCASRGPDGDDSDDGVELSNPGPPPSRKKKKMEPSEICQQLYDIIRSYKKEDGTLLCDSFIRAPKRRQEPQYYEVVSQPIDLLRVQQKLKTDTYEDIEELSADIELLVNNAKAFYKPDSEEYKDATDLWKLYNKHRQTLENGDDIKTPKLSRNGSSSRRSDVAEDTSETSTNNEEDNVFEELFNAVMTANIGGRPLYPPFQFLPSERRYPEYYSVIDNPIDLKTIAQKIQASEYNTLNDLEKDLLLMVWNACLFNEPGSQLYKDARALKKVIQARKQEIDQHGRSGPAKTSERIRSKRTSRVGPVPSSRALAVMEAPGSDTEPDVKHSEDSAGDSDEDKTDNEDSPQWKLLETVRNHLGPSGTPMADPFWKLPSRREYQDYYKEIKNPVSLNQIKNKIRRGSYGTLSEVAGDMNIMFENAKQYNLPTSRLYKYAVKLQRLMQQRVQELLDIVQSSSSDDESLSSVKNQTQANTPRPRGRPRLNPNPVSSPTPASPIVAKTNLPLKKKLHYVSKQLVEFTCSDGRQPMLLFMEKPSKKLYPEYYNVIERPIDMLTIEANIKSDRYNTIDEMVSDFRLMFSNCRQFNEEGSMIYEDANLLERVLNEKLKELNSNYERKVPPKTFKAAKSKQLTPFEQKLRTLYDAIRDYRDPKVNRQLALIFMKLPSKTEYPDYYELIKNPIDMEKIAHKLXXXXXXXLASDFILMFDNACKYNEPDSQIYKDALILQRVCLQTKQEDEDAVPDVAGAVQDLLLTLFTGVYNHQDEEGRCYSDSMAELPEHDEVANGEKVRAISLDLVKRRLDKGLYKRLDHFQQDMFAVFERARRLSRTDSQIFEDSVELQTYFIDQRDQLCRNTLSSPALAITRDTIATSVELVKQCKLLQENDEEEETRSSTEDTISGAAPPSQYGRGDFVYAPAKGSKEPSILQIEKIATNSDNVPVIYANVYYRPHETFHVRTRKFLQQEVFKTDTHRTVPLDAIIGTCYVMNVKEYFKYRPEGYLDKDVYVCESRYNTKHRWFKKIKVWEGAEKEATLVPREVPLEPNRTVSVFRERVEKHKDELAELEVLENVQEKERPDVVMYNPLGTDDENTYYEQYNTVCSGVIKTGDFVYVVTDGGKQILAQVDTIWETGDNKCYFRGPFLIFPSEVSHIINKPFYKQEVLLTTMHDTSPLVGIVGKCAVLDYDDYLKCRPTEIAEADVYVCESLYDESNRLARKLKSGLRKFEHTKDVTVDEVYYFPKPLGPPPLASSHEVHTSFTQKTFNPNVNADSLDGKPQFTNLLNTTLGSQDVELLLENSLDDSSLASPATPLSIGGNSNPYNPSMTSNQERSSTTTATPASSKKKKEQKQKIVTGYILYSSEVRKAIVANNPESTFGEISRIVGNEWRSLPASTKQSWEERAARCNEETSARLAEEMRELSQHTDLXDDYLKCRPTEIAEADVYVCESLYDESNRLARKLKSGLRKFEHTKDVTVDEVYYFPKPLGPPPLASSHEVHTSFTQKTFNPNVNADSLDGKPQFTNLLNTTLGSQDVELLLENSLDDSSLASPATPLSIGGNSNPYNPSMTSNQERSSTTTATPASSKKKKEQKQKIVTGYILYSSEVRKAIVANNPESTFGEISRIVGNEWRSLPASTKQSWEERAARCNEETSARLAEEMRELSQHTPMEMTYECAWDTCDYQFEDLSDCMEHCIGDGGNTGHVQQHYVRGGGEFPCLWRSCARVRKGQAPFPNLPRLLRHVRDLHVNKGNGKTMAVHERSRNFIPSSKKPPKPYLTSFRSGLMSPGGSLSGMSPMARNTPSPGPESSRSGLDPLFVAAPPRAQRVTHSEAYIRYIEGLHSEQKYITPWEKSLTPMPVNLDPSQFNLHKLPTHWMTDEAVSGYLQQDKNLTETDIQKMDQSTKVLKGLCALRDFMMRDALAVYKSY-