Monarch geneset OGS2.0

DPOGS202304
TranscriptDPOGS202304-TA5064 bp
ProteinDPOGS202304-PA1687 aa
Genomic positionDPSCF300032 + 253696-265676
RNAseq coverage301x (Rank: top 37%)
Annotation
HeliconiusHMEL0047310.070.29% 
BombyxBGIBMGA004976-TA0.088.36% 
DrosophilaCG34422-PA5e-13862.53% 
EBI UniRef50UniRef50_E2BZ091e-14860.39%AT-rich interactive domain-containing protein 4B n=2 Tax=Formicidae RepID=E2BZ09_HARSA
NCBI RefSeqXP_001687911.11e-15166.59%AGAP007503-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582855613e-15066.59%AGAP007503-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3320279720.033.18%AT-rich interactive domain-containing protein 4B [Acromyrmex echinatior]
Group
Gene OntologyGO:00036773e-35DNA binding
GO:00056223e-35intracellular
KEGG pathway 
InterPro domain[271-374] IPR0016063e-35ARID/BRIGHT DNA-binding domain
[629-715] IPR0161971e-13Chromo domain-like
[169-231] IPR0126031.2e-06RBB1NT
Orthology groupMCL11105 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202304-TA
ATGCAGGGTGATGATCCTCCTTTCCTACCAGTGGGTACGGATGTGAGCGCAAAGTACAAAGGTGCATTCTGTGAGGCCAAAATTAAGAAAGTTGTGCGTAACATTAAATGCAAGGTGACATTAAAAGCTGGTGGTATTACCACTGTCAATGATGATGTCATCAAGGGCACTCTGAGGGTCGGGAGCACTGTGGAAGTCAAACAGGACCCCAAGAAAGAAGCCATGGAAGCCGTTATTACGAAAATACAGGACTGTAGCCAATACACCGTCGTATTTGATGATGGCGACATTACAACATTACGACGTTCAGCGCTGTGCCTTAAGAGCGGGAGACATTTTAATGAAAGTGAAACCTTGGACCAACTGCCTCTTACACATCCAGAACATTTCTCTACACCAGTTATAGCCGGGAGGAGAGGTCGGAGAGGAAGAGCGCAGTCAGATGAAAGTGACGGTGAGGGTACGACCCCAGTGAAAGCGGATAGTGCTGAACGTGAACCCCACGTGGGCCGTGTGGTGCTGGTGGAAGCGGCGAGCGGTGCTGAGAGACGTCGACCTCACCAGCCGGCCTTCCCAGCACTTGTGGTGGCCCCGACGGCACAGATCAAAGTCAAAGAGGACTACCTCGTCAGATCCTTCAAGGATGGAAGATACTACACGGTGCCGAAGAAGGAGGCTCGTGAATTCCGCAAGGGCGCAGCACCCCTGGAGTGGTGCGGAGTGGAGGCGGCGCTGCAGTACTTGGCCCACGGCGACCTACCGCCTCACTGGGACCGGGACGCCCTCTTCAACGAGCCGAGGAACACCTCTGATGATAGCTCAGACGACGAACCGCGTGAAGAGAAGGATCACTTTGTAGCGCAGCTGTACAAGTTCATGGACGACCGAGGCACACCACTCAACAGGAACCCCACCATCGCCAACAGAGACATTGATCTATATAGGCTGTTCAGAGTGGTTCAAAAACTAGGCGGCTACAACCGCGTCACGAACCAAAACCAATGGAAGACTATAGCAGACAAAATGGGTTTCCACCCGGTCACCACCAGCATCACTAACCTGTGCAAGCAGGCTTATAAAAAGTTTCTCCATAGCTTCGAAGATTTCTACCGCAAGCTGGGTGTGACGCTAGTGGCTCACCCTCGCGGCGCGCGCACGCCTCCCGCCGGCCGATCCCTCATCAGGGACCGCGACAAACTACCGCCCTCCGCCGCCTCGCCCGCCTCGACTACTTCATCCACGCCCAGCACGCCCAGCCAGCGCAAAGACAAAGACTCGGACAAAAGCGAGACGGAAAAGAGCGATAAGAGCGACAAGAGCGACAAGAGTGAGAAGAGTGAGAGGGAAGATAAAGTAGAGAAACAGGAGAAGAAGGAGAAACCTCGCGCGAGCGACGAGGATGACAGCGCGGATAACCAGCCGCTGATAACCACTACACCCAAAATCGAAAAGGACAAGGAGAAGGAGAAGGAAAAAGAGAAAGAGAAAGAAAAGGATCGAGAAAAAGAAAAAGATATAGAAAAGGATAAAGAAAAAGAGAAAAGTGTACCAAGTGAAGATAAGAGCACAGTGAAGCCCAGATCACAATCCAAGACGCGGAGTCTGCCGCCGGTCAAGAGCGAGTCCCACGAAAAGAGAACTACGAAGAGAAAAACTATATCCTCTAAATGTGAGAGCAGCGGCAATACATTAAGGGCCTCTCGCAGACCTCACGTGTCCACCGACAGCGACAGTTCCGGGCGAGCCTCAAGATGTGGCCCAACAAAGAAGATGCAAAGTCGTCGCAGTCAGAGCGCCAATTCAGCGAGCAGCGGCAACACTATCGCATCAAACAGCAGCAAGAGACCCCGGAAAAGGAAGAACACTGAATCATCTAACAACGAACCAGCCAGATCGGTTGGTGCCAGTGTGAAAGCTCAAGTCGGTGACAAACTTAAAGTATACTACGGTCCCACGCAATCTGAATCAAAGGTAACCTATGAAGCTAAGGTCATAGAAATATCATCTGAGGGCATGCTCCGCGTTCACTACACGGGCTGGAACACCCGCTATGATGAATGGATCAAACCGCAGAGGATTGCTCTGAACGTCACACAACATGATCAGAGAAACAAGAAGGGAACTAATCTAAGCAGACGCTCTCGAAGCAAACGAACAGAGGAATCATCCGCGCGCTCCGACAGCGACACGGATTCTGACAGCGACGAGAGCGTTAAGAGACCATCAAAGAAATCCGAAGACAAATCTATAACTAAAACACCTTCGAGAACCAAAGACACGAAATCAAGCGACAGCAGCAGTTCCAGCAAACCGAGGAAGAGACCGATGAGGACTGTATCGACTCCAGTGATCACGTCTCCCGCTAAGAAACCAAGGATCGGCGTCTCCAGCCAGCACCAGGGACGAGACTACGACCTGAACGAGATAAGATCAGAACTGAAAGGACTTCATTCAGTGAAACAAGAAGCAGATGACGCTGGTAAAGCTGATATAGCGCAGGACTCTATAATGAATCCGATAACTCAGCCGCCAGAGGTCCCCGAGAAGCAGGCGGAGGACGTCTACGAGTTCAAAGAACCGGAACCCTTCGAGCTCGAGTTGCACGACGAGAAGAAGAAGCGAACTCATCGCATTTTCGATGACATCTCGCCCAGTAAATACACATCCACGCTGTCGAAGTCGCTGAGCGAGGAAATATCTGAGGAGCCGTTGAGGGCCAGGCCGTCCTCGTTCAGATCACCGTCCTTATCGCCGTTTAGAGATTTCGGGTCGAGTCGAGACGTTCCCAGCAGGCAGAGCCCGGAAGATGATTCCAATAATGCTCTATTCTCCCTCGACGATGATTCCTTCCCTGGGGAAGGCAGTTCGGGGCCGATCTTCGAAGGTTTCACCCCGGCGAAGAACCAAGAGACGTATTCTAAGAAGAGCAAGGTGTCCAAACTTCGGCAATTGATTGACGACTCACCGGACAGCCCGGCCGACGACGAGCAGTCCTCAGACGATGAACCCGAACCGGTCGTCAAGGAAGAGAGGCAGCCCAGCCCTGTCCTGAAAGTAACCGAAACAGTTAAACAGACGGAAGCGAACAAGGTGATTAAAGAGGAAATAAAAACTCCAGAAACAAAAAAAGAACAAGTGACAGTTGTGAAAGAAATCCCACTAGCACAGAGTATCCCAGAACCGCCAGGTACACCACCGCCGAAACCCAAACCAGAAGCTGCCAAACCAAAATTGGAACTTCCAAGCCTGATCATAACGGCTGCCACATCAAAAGATAAAGAAGATAAGAAGATTGAGAAAATAATAAAGGAGGAAGTAAACGAAAAAATTGTGAAGGAAACGATCATGAATATACCGTTACCGGAACCCAAGGAGCTGCCGGAAATAAAAGTGGATCCTGAATTATCCTCAATCATGGAACCACCCTCGAGCCCTCTGATAGATACGGAGGAAGACAAGTCCGAACCAGACAGTCCGGCGAGGATCGACGTTCTTCCTGAACCACCTCCGGGATTCCTGCTACAATCTGAAGGACCTAAAATAGCAGAGAAACTACTTAAAGCCATCAACAGCGCCAAAAGACTATCGATCTCGCCGCCTCCTGTGGACGACAGACCCGACACGCCCAAGAAAGATGTTGTTATAGAAGATAAAATATCACCCATACTTGAGAAACGGCCTCCAAGCAAACCGGAGCTAATGAAACCCTTGAAGCTTGATCCCGTCAAACGATCGTCTCCGGCCGAAGCTACCGACTCTATATTCGGCGAGCCGTCCAACCTGACGGACTTGAAACGAGATCTATCAGACATCAAGAAGATCAAACCCAAGGAAAACACGCCGCCTCGCTTACAGAGTCCACTTAATATATTGGAAAGGAGGAAAAGCGTCGCCGACCTGCCGTTGAGCGCTCCCGGGAAGAATAAGGTTCTCAGCGACACTATACAGAAACTCTCGAGTCAAATCAACCAGTCGGTGGCTGCGGCCAGCATACCGCTACCGCCGTTCCCACCCGAGGATAGAAGCGAGTCCAGCGACTCCGACGACTCCGACAGAAGGTTGATAATCGACAAGCTGTCGGTGGAGGAGTGGGCTGGTTCTAGTGGCGGCGGGGGCGGCAGCGGCGTCACCACCAACGTACCACTGGCGAGGACCCAGACCGCCATGAGGGCGCTTCACGCGGGGAAGTCCCCGGGCGAGTGGAGTGCCGGCGAGTCGCTGCTTATGCTCGAAGACGCTTGTAAAAACGAGCGGAAACACAGCGCAAGCGTGGTGGTGGCGGGTGGTACGCGGCCCTCGGGTTCCGCGTCCACCCCGGTGGTGGGCCCGGAGGAAGACAGCTGTGCCTTACTACTCTGCGAGGAGACCATCCCCGGGTCACCCGCGCCGGACACCGAGCCTGCGCCCCCAACACGAGCCCTCCACCTACCCTTCGCCTGCACCCCTCAACACCACCCGCAGAATACACACTCCCATAAAGCGGAGGAGCGCCGAGGGTCGGGCGGGTCAGGGTCTAGTGGTGCTGGTGTGTCAGGAGTGTCCGGTGTGTCAGGTGTGTCGGGAGTGTCGGGAGTGTCAGGTCCCGCCGGGGACGAATGGTCCCGCCGACGAGCGCTCCTCGACAACACGCCCCCCACCACGCCGGATAGCAGCCTCGACCTGTCGCCGAGGGAGCGACGCATTTCGGAGACGAGTCCGTCTGACAGAAAGGAGGACGACGAAGACGCCCCGGTACAAGACCCCTGCGCCGCGGACATCGACAAACCCCATAGCAGTGGTCGCTGTCGCAAGGCGTCGGAGTCGTCGGGCCGCACGAGGACGAGGCGGAGACGCGACACAGACGACGCCCACGCGCCGCCGGCACTCAAATACAACTTCTATGTGGACCTCGATCCGTCGTGGGATTGTCAGACCCGTATAAACGTTCTGTCGACGCGGCTGTCCGACCTGCGCAAGGCTTACCACTCGGTGAAGGCGGAGCTGGCGGCCATCGACAGGCGGAGGAAGAAACTACGGCGGAAGGAACGGGAAGCCATAAAAGCAGCCAAAGCTGCATGTTCCTGA

Protein sequence:

>DPOGS202304-PA
MQGDDPPFLPVGTDVSAKYKGAFCEAKIKKVVRNIKCKVTLKAGGITTVNDDVIKGTLRVGSTVEVKQDPKKEAMEAVITKIQDCSQYTVVFDDGDITTLRRSALCLKSGRHFNESETLDQLPLTHPEHFSTPVIAGRRGRRGRAQSDESDGEGTTPVKADSAEREPHVGRVVLVEAASGAERRRPHQPAFPALVVAPTAQIKVKEDYLVRSFKDGRYYTVPKKEAREFRKGAAPLEWCGVEAALQYLAHGDLPPHWDRDALFNEPRNTSDDSSDDEPREEKDHFVAQLYKFMDDRGTPLNRNPTIANRDIDLYRLFRVVQKLGGYNRVTNQNQWKTIADKMGFHPVTTSITNLCKQAYKKFLHSFEDFYRKLGVTLVAHPRGARTPPAGRSLIRDRDKLPPSAASPASTTSSTPSTPSQRKDKDSDKSETEKSDKSDKSDKSEKSEREDKVEKQEKKEKPRASDEDDSADNQPLITTTPKIEKDKEKEKEKEKEKEKDREKEKDIEKDKEKEKSVPSEDKSTVKPRSQSKTRSLPPVKSESHEKRTTKRKTISSKCESSGNTLRASRRPHVSTDSDSSGRASRCGPTKKMQSRRSQSANSASSGNTIASNSSKRPRKRKNTESSNNEPARSVGASVKAQVGDKLKVYYGPTQSESKVTYEAKVIEISSEGMLRVHYTGWNTRYDEWIKPQRIALNVTQHDQRNKKGTNLSRRSRSKRTEESSARSDSDTDSDSDESVKRPSKKSEDKSITKTPSRTKDTKSSDSSSSSKPRKRPMRTVSTPVITSPAKKPRIGVSSQHQGRDYDLNEIRSELKGLHSVKQEADDAGKADIAQDSIMNPITQPPEVPEKQAEDVYEFKEPEPFELELHDEKKKRTHRIFDDISPSKYTSTLSKSLSEEISEEPLRARPSSFRSPSLSPFRDFGSSRDVPSRQSPEDDSNNALFSLDDDSFPGEGSSGPIFEGFTPAKNQETYSKKSKVSKLRQLIDDSPDSPADDEQSSDDEPEPVVKEERQPSPVLKVTETVKQTEANKVIKEEIKTPETKKEQVTVVKEIPLAQSIPEPPGTPPPKPKPEAAKPKLELPSLIITAATSKDKEDKKIEKIIKEEVNEKIVKETIMNIPLPEPKELPEIKVDPELSSIMEPPSSPLIDTEEDKSEPDSPARIDVLPEPPPGFLLQSEGPKIAEKLLKAINSAKRLSISPPPVDDRPDTPKKDVVIEDKISPILEKRPPSKPELMKPLKLDPVKRSSPAEATDSIFGEPSNLTDLKRDLSDIKKIKPKENTPPRLQSPLNILERRKSVADLPLSAPGKNKVLSDTIQKLSSQINQSVAAASIPLPPFPPEDRSESSDSDDSDRRLIIDKLSVEEWAGSSGGGGGSGVTTNVPLARTQTAMRALHAGKSPGEWSAGESLLMLEDACKNERKHSASVVVAGGTRPSGSASTPVVGPEEDSCALLLCEETIPGSPAPDTEPAPPTRALHLPFACTPQHHPQNTHSHKAEERRGSGGSGSSGAGVSGVSGVSGVSGVSGVSGPAGDEWSRRRALLDNTPPTTPDSSLDLSPRERRISETSPSDRKEDDEDAPVQDPCAADIDKPHSSGRCRKASESSGRTRTRRRRDTDDAHAPPALKYNFYVDLDPSWDCQTRINVLSTRLSDLRKAYHSVKAELAAIDRRRKKLRRKEREAIKAAKAACS-