Monarch geneset OGS2.0

DPOGS212870
Transcript        DPOGS212870-TA  4263 bp
Protein           DPOGS212870-PA  1420 aa
Genomic position  DPSCF300086 + 471497-481746
RNAseq coverage   405x (Rank: top 30%)
Annotation
Heliconius      HMEL008184        0.0     75.96%
Bombyx          BGIBMGA000814-TA  0.0     65.78%
Drosophila      Acf1-PA           4e-146  31.21%
EBI UniRef50    UniRef50_E2BPI3   0.0     34.05%  Bromodomain adjacent to zinc finger domain protein 1A n=10 Tax=Formicidae RepID=E2BPI3_HARSA
NCBI RefSeq     XP_001604290.1    0.0     35.39%  PREDICTED: similar to zinc finger protein [Nasonia vitripennis]
NCBI nr blastp  gi|383863769      0.0     36.06%  PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like [Megachile rotundata]
NCBI nr blastx  gi|383863769      0.0     35.99%  PREDICTED: bromodomain adjacent to zinc finger domain protein 1A-like [Megachile rotundata]
Group
Gene Ontology    GO:0005515  3.5e-34  protein binding
                 GO:0008270  4.1e-11  zinc ion binding
KEGG pathway
InterPro domain  [1243-1368]  IPR001487  3.5e-34  Bromodomain
                 [22-122]     IPR013136  7.3e-32  WSTF/Acf1/Cbp146
                 [1027-1101]  IPR013083  2e-17    Zinc finger, RING/FYVE/PHD-type
                 [1029-1110]  IPR011011  2.2e-17  Zinc finger, FYVE/PHD-type
                 [1047-1093]  IPR019787  4.2e-12  Zinc finger, PHD-finger
                 [1046-1092]  IPR001965  4.1e-11  Zinc finger, PHD-type
                 [328-392]    IPR004022  2.2e-08  DDT domain
                 [327-392]    IPR018500  4.6e-07  DDT domain, subgroup
Orthology group  MCL12288  Single-copy universal gene

Nucleotide sequence:

>DPOGS212870-TA
ATGCCTCTATTGAAAAGAAAAGCCTTTGAGAAATCGAATGTGTCGGAATATCTAAGGGACGACGATGAGGTGTTTCACTGCGAGATCACCGACGAAATATTCAAGGATTATGAAGAATACTGCGAAAGGATCATTCTGGTCAATTCAATGGTGTGGTCATGTGAGATGACGGGAAAGAATAACCTCACGTATGCAGAGGCTTTGGAAAGTGAAAAAGCTGCCCGAAGATCACTAAAGGATTTCCCAATGGAACTCCGAATACCAATATTATATTTGGCTGCAAGAACCAAAAGGAGTTCATTTGCAGACATGTCAGAGGATGTATTTAATTTTGTAAGAGTGAGGTTCTTTGTGGGTGAAACGGTTGAGGCCTGCCTAGAGGGTGACATCTGGCGAGAGGCTCATATACTGTCAGTTACAGCCCCCAAACAACATCCGGACAGCCAGGCAATGCTCCCAGCATCCTCTTACTGTTATGAGGTGGAGCAGTTCAGTGAGGATCCGGAGACTGCGGGGCAGATCGGCACCGCGCCCTACGACAGAGTTAGGCGACGTAAGGGCGTCTACACACGGGACAAAAACAGACTGTTCTTGAAACAGTTTGTCACACCCGGAGCCGTCATTGGCATTAAGAAATCAGCAATAGAAAAATACAACATAGGAAAAGTGAACTTTGACCAGATATTCACAGGGAATCCCCCAGAATTCCCGTCATCCAAGAAGCTGTCAAAGAACAGTAAATCAAAAACCCCCATTGACCCTGAAGCCAGAAAAGTAGCCCAAGAAATGGCTGACAAAATGAGGAAAGCGGAGGACTTATTGAGGCAACATAAAGAAGAAGAGAAAGCGAGAAAGAAAGAGAAAAATGCCAGACTCATGGCATTTATGAAGGAATGGCATAAGGTTAAAGAAGACCAGGAGCTAGAAGATCACAAGATTATACCAAAGGGCACTCCGGTAGAGATAGAAGGAATACCACATAAAAATTTTGGTGACTTCCTCTCAGTGTTAGAATTTGTTCACAATTTCTCAGAACTACTCAAACTCAAAGACGTGTTTCACACCGAGTTTGACCTTGAAACATTTCGGAAATCATTGACCTCTAAAGAACACGCGTGGATATTCAGCGAGTTGGTACAGATGCTACTATCAGCAATATTCTCTTTGCAAGAAGATGAGGCAGAAGAATATAATGAGGGCAAAGGAATTCAGGAATCAGTAGAGGAGCCTCCGGCCATCTCGACAGGTATCGCGCGCGCCATAGAGCAAGCGACGCGAGCCGGAAAGTGGGCGCACACCTACCTCGGGACACCACTCAGCAAGCTGCCCCTGGACCCCACCACTGTGTCTGAAGTGCTGAGGTTGCACCTGCTGTCGTCGGGTGGCGTGGCTGGTTCCCGCTGCCTGGCGTGGCGGCTGCACCAGCGCGGTGGGTACTCCAGCGCCGATGACCCAGCACTGCGCCTGCGCACGTCACGTCCACATACTTTAAGAGCCTTACGGACGCAGCACGTCGCCGATCTACCCCTCGATGACAGACTCGCGATCCTTCAGTGCCTCATGAATCAAATCCTGAGTTACTCGACGGTGCGCGAGCAGGTCGAGGAGAAGATAGAGGAATATAAGAACTTAAAACAAGCCTTGCGAATATTACAAATAAACGAGCGCAAACGCGAACCGCAACTGTCGACCGCGCGGTCGGAGCTGAAGAGAGAGGCGGCTCAGAAGAAGGAGGAACTCAAGTTGACGGGGGATCGCGCGCGGGCTGCGGACGAACAGCTGAAGGCGGCCATAGACAAGCTGAACAAGGAGAGCGACGCCAAGAGACTGGAGTTCGAGAAGAAGTTGAAGGAGCTGCAGGCTCAGCTCTTTGACTACACCACTTACCTTGGGTCGGATCGCGCGTTCCGTCGTTATTGGATCAGTAGGCGGGTGGCGGGACTATTCGTGGAAGCGGGGCCGGAGCCCCGAGGTCCTTGCCGCAACAAGCCCCTGCCCCC
GCCGCCCGCTCCCCGCGACGACATCTTGTCTTACGTCACTGAATTATTCCACTCAGAGAGGGAGAGGGAGAGACAGAAAGAACAAGCTGGCAGCGACAAAGAGAACGAGTCCGGCGCCAACTCCCGCGGTGCCTCGCCAAAGAAACCTCTAACAAACCTCAACGGTTTGACCCAAGACAAGAAACCCATGGAGAGCACGGTGCAGCACTGCCGGGACATGCTGCTGTGCACTGCCGATGTCAACACGTGCTATGTACACGGGAAGGGTGATTACCGACCGCAATGGTGGGTGTACCACACCCGCGACCAGCTGCAGGCACTCATCGCGTCACTCAACAAGCGAGGCTTGAGAGAGAGTGAACTGAAGCAGGCCTTGGAGGTGGACAAGGAACACATAGCTGACTACATCACCAAGTGCCCTCTGAACCTCCTGACACCCGGCCCGGCTCCGCCGACGCCGAACGTGCCGTCGACGCGCCAGCGCCGCTTCCAGCCGTCTCTCACGGTACTACCGGACTGTTCGCTGGCGGATGCTCTGGAACTGACACTCAGAGACCACATCCTGGAACTCGAGGAGAAGATCTTCCACGGATGTCTGGGAGCATTGAAAGTTAAGGAGCGGTCGGCGTGGCGCGGGACTCTCATGGTGCGCGGCTACGACAAGCAGGCGCGGTCATTGACCTGGGGCCCGGACGGCCGGTTCAGGGACGACTGCCATTTACCAGACGGATTGCTGAAATTACCGCCAGATTTAGATGAGACCGAGTTGGAGGGCATCGTTGAGAACAGATACCGCGACCCGGGGCACTGTCTGGAGCCGCCCAGGGTGAATGGCATCAAGATAGAGAACGGAGGGGGAGAGGCGGCCGGGGCCGAGGACGCTGGGGTCGTCCGCTCACTAGCCAGCGCTCTGCTGCAGGTGGCGCAGGCCATACACCACAAGTACCTGAAGAGGCCGCTCGGCCTCGACGAGAAGGAGCGCAAGGATCGCGAAGCCAAGAACAAATCTCTGGAGCTGGAGGCGCTGCAGCGGTGGGAGGTGTCGCTGATGGAGTGCCGCAGCTTCGCCAGCGTGGCGCTGCACCTGCTGACGCTGGACAGCAGCGTGTGCTGGTCGGCCAGCGTGCTGCACGCCAGCTGCCGCCTGTGCCGCCGCCGCACCGACCCCGACAACATGCTGCTCTGCGACAGCTGCAACAAAGGACACCATCTCTACTGCCTCAAGCCAAAGCTCACGAAGGTGCCGGAGGGGGACTGGTTCTGTGATCAATGCAAACCGACAGAGAAGACGCCCAAGAAGCGAAGAAAACTATACACCGACCCCGACGACACGCTCGACGACAGGCGAGTACATGTGATACATATACAACATCAGAGCGGTTGTTTCGACAGTGTTCTTGAATGTTCTCATGTTAATAATTTCTCTCGCCCTCACAGCTCGGAGTCGTGTTCGAGCGCGCCGGTGGAGCTGTGCGCGTTGTGCGGCAGCGGCGGGCGGCTGGCGGCCTCGTGTCGCTCGTGCGGGAGACGCTTCCACGCCGAGTGCGCGCCTTCCGGGGGGCGGAGGGCCGTGTGCGGGGACTGCGCTAAACCAAACAGAGATTCCGAGGACAGCGAATATAACACGGCGCTGGTCAAACTGAAGACACGGCAGCAGAGAACCGAGGAACCCGCCAGGAGAGGCAGGAAATCTAAAGAAGTCGTCAACGGAAGTACAAACAGGAGATCGAAGTCATTCATGAATGGCGTCAATGGTGATGTAGTGTCGAGTCGTAAGCGCGGCCGCCAGGAGGAGGAGTTACTGCACGTGGAGTCGCTCACACAGCTGTTGAAGGAGTGCGGCAAACATCGGGATTGCTGGCCCTTTGATGAGCCGGTCTCGACGGAGGATGTGCCGGACTATCTCAGCGTGATCGAGCAGCCGATGGACTTCTACACAATCCGCGGCAAGCTGGAGAAAGGTTCCTACACCACCGACCAACAGATGCTGGACGACGTCG
CGCTCATCTTCAAAAACTGCTACACCTACAACCAAGACACACACCCTGTGGCCAAAGCGGGAGCGCGACTCGAAAAGTATATCATAAAGCGCTGTTCGGAACTCAATCTACCCGCGTTGCCCGCCACCTCGCTCGAAGACAACGAGGCTGAAGCGACGCAAGAGAATAACGAGCGAGAAGAGGTCGCGGAAGCTGCCGGGGAAGAGCTCGACTCGGACGACGAGGTTCTCGCGCCTCGAGCCAAGCGACCCAAGATACATTGA

Protein sequence:

>DPOGS212870-PA
MPLLKRKAFEKSNVSEYLRDDDEVFHCEITDEIFKDYEEYCERIILVNSMVWSCEMTGKNNLTYAEALESEKAARRSLKDFPMELRIPILYLAARTKRSSFADMSEDVFNFVRVRFFVGETVEACLEGDIWREAHILSVTAPKQHPDSQAMLPASSYCYEVEQFSEDPETAGQIGTAPYDRVRRRKGVYTRDKNRLFLKQFVTPGAVIGIKKSAIEKYNIGKVNFDQIFTGNPPEFPSSKKLSKNSKSKTPIDPEARKVAQEMADKMRKAEDLLRQHKEEEKARKKEKNARLMAFMKEWHKVKEDQELEDHKIIPKGTPVEIEGIPHKNFGDFLSVLEFVHNFSELLKLKDVFHTEFDLETFRKSLTSKEHAWIFSELVQMLLSAIFSLQEDEAEEYNEGKGIQESVEEPPAISTGIARAIEQATRAGKWAHTYLGTPLSKLPLDPTTVSEVLRLHLLSSGGVAGSRCLAWRLHQRGGYSSADDPALRLRTSRPHTLRALRTQHVADLPLDDRLAILQCLMNQILSYSTVREQVEEKIEEYKNLKQALRILQINERKREPQLSTARSELKREAAQKKEELKLTGDRARAADEQLKAAIDKLNKESDAKRLEFEKKLKELQAQLFDYTTYLGSDRAFRRYWISRRVAGLFVEAGPEPRGPCRNKPLPPPPAPRDDILSYVTELFHSERERERQKEQAGSDKENESGANSRGASPKKPLTNLNGLTQDKKPMESTVQHCRDMLLCTADVNTCYVHGKGDYRPQWWVYHTRDQLQALIASLNKRGLRESELKQALEVDKEHIADYITKCPLNLLTPGPAPPTPNVPSTRQRRFQPSLTVLPDCSLADALELTLRDHILELEEKIFHGCLGALKVKERSAWRGTLMVRGYDKQARSLTWGPDGRFRDDCHLPDGLLKLPPDLDETELEGIVENRYRDPGHCLEPPRVNGIKIENGGGEAAGAEDAGVVRSLASALLQVAQAIHHKYLKRPLGLDEKERKDREAKNKSLELEALQRWEVSLMECRSFASVALHLLTLDSSVCWSASVLHASCRLCRRRTDPDNMLLCDSCNKGHHLYCLKPKLTKVPEGDWFCDQCKPTEKTPKKRRKLYTDPDDTLDDRRVHVIHIQHQSGCFDSVLECSHVNNFSRPHSSESCSSAPVELCALCGSGGRLAASCRSCGRRFHAECAPSGGRRAVCGDCAKPNRDSEDSEYNTALVKLKTRQQRTEEPARRGRKSKEVVNGSTNRRSKSFMNGVNGDVVSSRKRGRQEEELLHVESLTQLLKECGKHRDCWPFDEPVSTEDVPDYLSVIEQPMDFYTIRGKLEKGSYTTDQQMLDDVALIFKNCYTYNQDTHPVAKAGARLEKYIIKRCSELNLPALPATSLEDNEAEATQENNEREEVAEAAGEELDSDDEVLAPRAKRPKIH-
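
Consistency check:

The lengths and sequences in this record can be cross-checked against one another. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1), that the transcript is entirely coding sequence, that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are inclusive. All function and variable names are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order. Using this table is an assumption.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Values copied from the record above.
TRANSCRIPT_BP = 4263
PROTEIN_AA = 1420
GENOMIC_START, GENOMIC_END = 471497, 481746

# 1420 codons plus one stop codon should account for the whole transcript.
coding_length_ok = TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)

# Genomic span on the scaffold, assuming inclusive coordinates.
genomic_span = GENOMIC_END - GENOMIC_START + 1

# First ten and last four codons of DPOGS212870-TA, copied from the record.
first_codons = "ATGCCTCTATTGAAAAGAAAAGCCTTTGAG"
last_codons = "AAGATACATTGA"

print(coding_length_ok)          # True
print(genomic_span)              # 10250
print(translate(first_codons))   # MPLLKRKAFE
print(translate(last_codons))    # KIH*
```

The translated ends match the start (MPLLKRKAFE) and end (KIH, then stop) of DPOGS212870-PA. Under the inclusive-coordinate assumption the genomic span is 10250 bp, longer than the 4263 bp transcript.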