Monarch geneset OGS2.0

DPOGS204520
TranscriptDPOGS204520-TA2949 bp
ProteinDPOGS204520-PA982 aa
Genomic positionDPSCF300205 + 120538-133076
RNAseq coverage444x (Rank: top 28%)
Annotation
HeliconiusHMEL0089023e-9374.83% 
BombyxBGIBMGA012455-TA0.065.49% 
DrosophilaCG15439-PA1e-12939.16% 
EBI UniRef50UniRef50_E2ARM42e-16046.96%PHD finger protein 14 n=7 Tax=Formicidae RepID=E2ARM4_CAMFO
NCBI RefSeqXP_970496.25e-16249.21%PREDICTED: similar to phd finger protein [Tribolium castaneum]
NCBI nr blastpgi|3407182560.043.12%PREDICTED: PHD finger protein 14-like [Bombus terrestris]
NCBI nr blastxgi|3407182560.043.62%PREDICTED: PHD finger protein 14-like [Bombus terrestris]
Group
Gene OntologyGO:00055152.2e-10protein binding
GO:00082701.9e-08zinc ion binding
KEGG pathway 
InterPro domain[872-973] IPR0110115.4e-12Zinc finger, FYVE/PHD-type
[515-581] IPR0130839.7e-12Zinc finger, RING/FYVE/PHD-type
[926-975] IPR0197872.2e-10Zinc finger, PHD-finger
[524-574] IPR0019651.9e-08Zinc finger, PHD-type
Orthology groupMCL14120 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204520-TA
ATGAGTAATCCTAGCAGAGGTCTTGCTAAACGCAAAGTTAAACCTGTAGAGCCGCAGTCGTTGTTGGATTTTGATCTCGGGGAGGGTGAGAGCTCTGATGATTCTGACTTCCGAATCGAAGATCATCCTGAAGAGAGTGACGATTATTCTATAAATACTGACGATGAAGAGAAAAAAAATGCTAAGAGTGAAAGTTCAGAAGAACAGTCTGGCTCAGATGATGAAGATGAATTCAAAAATACAACCAACAAGTTGGGAGAGGAAATGTGTGTGTCTGATTTACTAGAAAAAGCTAAACAGAATGAATTTAAGTTTCCCGAACTGGCCAATGTCATGATATGTGCTGGTTGTCTCGGCTCAAGGAGTGATGATATCAATGAGATTGTTGAATGTGATGGTTGTGGAGTCACAGTTCACGAAGGTTGTTATGGAGTGTCAGATGTTACCAGTGAGTCCAGTACAGTAAGCTCAGCCTCAACAGAGCCCTGGTTTTGTGAGGCTTGCAAAGCTGGCGTCACCGACCCTAGCTGTGAATTGTGCCCTAATAAAGGTGGAATATTTAAAGAAACAGAAGTAGGTGCATGGGTTCATCTGGTATGTGCATTGTATGTGCCGGGGGTGGCATTCTCAGAGAATGTTATAAAGTATTTAACTATGCTTTTCTTTGACAGCGGTCAACGTGAAGGTTTATTGGCGGAGGCGCATTCTGAAGAAGCTGAACAAGCGGATCCTTTCTACGCGCACTGTCGCTTACATTCAGACAAAACACTAGTCAAGAAACGGAAAAGAAATTGGCTGGCGTTACAGTTGAGAACAGAAAAAAGGAAAATGGAGCTGCAGAACAATCTTAGTACAGAAGAGAAGAAGAGGATACAGAGGAAATTAATCAAGTATAGAAAGAAGTACTCGCTGCAAAAAGAAAATAGAAATCCGCCATGGGTGCCGACTCAAAAGATGGCGCGCATGATATACAGCAGTGCCTCGGCTGTGCGGAAGTTCCAAGACAAGGCGCTCTGTATGGGCGTGGACACGCATGCGTTAGAGTTTAGAGATTCACAGATGGCAGCACTGAAGGACGTGTCTCGTAGATGGCACGTGCCGCCCGCGTTCTCAGTGGAGTTCGTCGGTTATTATTTGGAGAGGAACACTCGAGTGACGTCATTAAGGAAGTCCTTGGAACGACTGACGAAGGAGAACGAGATATTGGTAGCCGATGACGAGGATCTGCGGACGGAGTATGATAAGGCTTCAAAAGAGAACACAGACGCTATAGCCGAGTTGGCTTCAACACGGCTCGGTTTACAGAAGATGTATGACACTATAGTGTGTTTGTGCCCTAAGAGGTCAACGCCCGCCATATTAGAAGACCGACCGCTGGTCATAGCTCCACCGAGATCCACGCCTAAGGTCACGCCTCAGCAGTTACAAAAACGGTCGATATCTGTGCCCACCGCGGCTGCACTTAAGATGGGCGTTGGTTTTCCTCTTAGCGACAATCCGGACGCTCGCCACGGGAAAGTTCTCTCTACGTCTATGGAAGCGAGCGCTGACGGGGCGTTAGCTGCTCGTGCTTGTTTCGCGTGTGGTCGGGCCAGCGAGCGCCATCTGATGGCGGCCTGCGACACTTGCAGACACCACTACCACTTGCATTGCCTGCGACCACCCCTGCAAAGACCACCCAAAAAGACGAAGCTGTACGGATGGCAATGTTCAGAATGCGACAAGACTTCAGATTCGGAACCGGAAGTGCTCGAGAAGAAAGTGCCTCGTCGTTCACGTATACGTTACAGCAAAGACGGAGCCATAGTATCGGAACCACTGAGTCCGGGTTCCGTACCTAATTCACCACCACCCAAACCTAAAATCGAGAAGACCTTGAAGGTCGAGAAAAAGATGAGCCTCTCGTCAGAGAACATATCTCCGATAAAAGTCACAATAAAACCGTTCGAGTTTAACAACGACTGTGGAGAAGGAGGCGAGGTCAAGGTGAAGAAGGAGAAGAAGTCGAAATCGAAAAAGGACTACTCCTCGACATCCGGCGGAGAGAGTGAGATATCAGCCAAAAAGATACACAAAAGAAGCTTCACGTCACCCATACTGACGAACACGCCGCTTATGTCGATAACGCCCATAGTGGCGGACAGTCCGAGCGATTCTCACAACGATCACTCAAACGACTCCACAAACGTGCCGCCGAAAGAACCGAGCTTCTTTTCCCAGAACCTGTCATTCTCGGCTCTGTTGAACGAGCCCAAGGAGAGGGATAGCAAAACCATAGAGAGCTCGATAGAGAACACGCTAGCGAATCTGTCCTCCGATATAGCGATGTACAAAGCCAATAGAAAGAGAAGGAAGGAGAAACACAGGTCTAGATATTCGCCTGATCTGTTACGATCACCGACGAAATCTCACAAACACAAGAGGAAGAAGAAGACTCAGGACATGGAGAATCCTGACACACCACATCCGAGGATTACTATCAAGATCAAACCAATACCTAAACCTGACGGCTCTTTAGATACACAGATGTTCTACGTACCCACGGACAGTAACGACGGACCACCGCCCGCCGTTATAAGGAAGATCTCCAAACAATGTGAGCCTGAGCCTCCCCCCTCCCCGCCCCAGGCTCTGTATCCACTGCTAACTCAAGAGGAAAAACCAGTAGAGGTCGTACCTACTGTCTCGACAAAGCCGAAGCGCTCACGTGAGAGCCGGGCTCGTGGTTCGATGTCGTCTCGTCCACCGCGAGCTGCCGTCACACCTCTCACACACTGCGATGTATGTTCAGAACCGGGTGATGGTACCAACCTCGTCAGATGCGACGAATGCAGCAAGAGGTACCACTTTACTTGTTTGGAGCCGCCGCTGAACAAGAATCCGAAGAAACGCGGCTATTCGTGGCACTGTGCCGATTGCGATCCAACTGACTTGGAAGAAAATAACTAA

Protein sequence:

>DPOGS204520-PA
MSNPSRGLAKRKVKPVEPQSLLDFDLGEGESSDDSDFRIEDHPEESDDYSINTDDEEKKNAKSESSEEQSGSDDEDEFKNTTNKLGEEMCVSDLLEKAKQNEFKFPELANVMICAGCLGSRSDDINEIVECDGCGVTVHEGCYGVSDVTSESSTVSSASTEPWFCEACKAGVTDPSCELCPNKGGIFKETEVGAWVHLVCALYVPGVAFSENVIKYLTMLFFDSGQREGLLAEAHSEEAEQADPFYAHCRLHSDKTLVKKRKRNWLALQLRTEKRKMELQNNLSTEEKKRIQRKLIKYRKKYSLQKENRNPPWVPTQKMARMIYSSASAVRKFQDKALCMGVDTHALEFRDSQMAALKDVSRRWHVPPAFSVEFVGYYLERNTRVTSLRKSLERLTKENEILVADDEDLRTEYDKASKENTDAIAELASTRLGLQKMYDTIVCLCPKRSTPAILEDRPLVIAPPRSTPKVTPQQLQKRSISVPTAAALKMGVGFPLSDNPDARHGKVLSTSMEASADGALAARACFACGRASERHLMAACDTCRHHYHLHCLRPPLQRPPKKTKLYGWQCSECDKTSDSEPEVLEKKVPRRSRIRYSKDGAIVSEPLSPGSVPNSPPPKPKIEKTLKVEKKMSLSSENISPIKVTIKPFEFNNDCGEGGEVKVKKEKKSKSKKDYSSTSGGESEISAKKIHKRSFTSPILTNTPLMSITPIVADSPSDSHNDHSNDSTNVPPKEPSFFSQNLSFSALLNEPKERDSKTIESSIENTLANLSSDIAMYKANRKRRKEKHRSRYSPDLLRSPTKSHKHKRKKKTQDMENPDTPHPRITIKIKPIPKPDGSLDTQMFYVPTDSNDGPPPAVIRKISKQCEPEPPPSPPQALYPLLTQEEKPVEVVPTVSTKPKRSRESRARGSMSSRPPRAAVTPLTHCDVCSEPGDGTNLVRCDECSKRYHFTCLEPPLNKNPKKRGYSWHCADCDPTDLEENN-