Monarch geneset OGS2.0

DPOGS215007
TranscriptDPOGS215007-TA2652 bp
ProteinDPOGS215007-PA883 aa
Genomic positionDPSCF300256 + 138226-155906
RNAseq coverage365x (Rank: top 32%)
Annotation
HeliconiusHMEL0148280.054.62% 
BombyxBGIBMGA012158-TA5e-9143.65% 
Drosophila% 
EBI UniRef50UniRef50_D6WBS23e-9144.69%Serine protease H6 n=1 Tax=Tribolium castaneum RepID=D6WBS2_TRICA
NCBI RefSeqXP_968114.26e-9244.69%PREDICTED: similar to AT-rich interactive domain-containing protein 5B (ARID domain-containing protein 5B) (Mrf1-like) (Modulator recognition factor 2) (MRF-2) [Tribolium castaneum]
NCBI nr blastpgi|1892349041e-9044.69%PREDICTED: similar to AT-rich interactive domain-containing protein 5B (ARID domain-containing protein 5B) (Mrf1-like) (Modulator recognition factor 2) (MRF-2) [Tribolium castaneum]
NCBI nr blastxgi|1892349043e-11235.01%PREDICTED: similar to AT-rich interactive domain-containing protein 5B (ARID domain-containing protein 5B) (Mrf1-like) (Modulator recognition factor 2) (MRF-2) [Tribolium castaneum]
Group
Gene OntologyGO:00036773.8e-13DNA binding
GO:00056223.8e-13intracellular
KEGG pathway 
InterPro domain[296-385] IPR0016063.8e-13ARID/BRIGHT DNA-binding domain
Orthology groupMCL22290 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215007-TA
ATGGTTTATATTAAGTGTCACGGTGTCTCCTCTCAGCTGGTGGGTGCTCCGTGTGGACATCACGGACAGTACACCTTCTACAAGGCGATCCGTCTGACGGGACCCCGGGACAGGATCGTCGCTATCGGGGACTTCTTCTTCGTCAGGATCTGGCAGGACTCAGAACTGGTCTCCATAGGCGAGCTCCAGTTGTTGTGGACGGACCGCGTGTCGGACCAGACCCTGGTGTCTCTGAGGCTGTACTTCCTCCCGGAGAACACGCCCGACGGAAGAAACACACATGGGGAGGACGAGGTGCTCGCTATCAATGATAAGGTGGTCTTGAGGGCGGAGGAGCTCCTCAGCTGGGTGTGCAGCGGCGCGGGCTGGCGGTGGGGGCTGCGGGCCGTGTGGCGAGGGGCCTGCGCGCCGCCCGCGGAGCCCCGGCACTCCGCCCCCCTGCACCACACCAAGCTGGACTTCAGCGACGTCGAGAGAGAGAAGAGCTCCATCACGGTGGACGTGGACGAGCCTGGCGTGGTGGTGTTCTCGTACCCCCGGTACTGCCGGTACCGGGCCCTGGTCGCTCGCCTGGAGGGCATCCAGGCCGCCTGGCTCAGAGACTCGCTGGTGGCTGCGCTGGGCGGATACGCCGCGCCCACCAGAAACACTAGGATACTGTACTGTAAGGACACATTCGAGTACCCGGAACTGGAAGGTCATGAGTTCGTCTGTAACCACCTAGCTCCTAAACTGAAAGGTCGTCCGCGGGGTCGGCGGAGGCGGGCGGCGCGGTCGCGCGACCGGTCGCCCGACCAGTCGCCCGACTCGCGCTCCAGTGACAGTGACGCTGTCGAGACCCGGACACCTCGGAGGATATCACTCCGGAACGGTCCAGAGAAGATCAGCGAAGACGAGGACTCGGAGAAGGACCAGCAGTTCATGAAACAACTGAAGGAGTTCCTCAAGGAGAAGAATGAAACTGTGAAAGTCCCACACAGCTATAAAAATGTATCTCTCCGGTCGCTGTACTCCTGGGTGTGGTCATCCGGGGGGTTTGCGGCGGCGTGTCGCGCGGGCGCGTGGCGGGAAAGATACCGGGAAAACGCGCCCGCACTACGACGGATATACGAGAGATATCTCCTCCAATACGAAAACCACGAGCGGTGGAACGGCAGGAAGTATCCGAAGATGAACGGCATCATAGACGGAGCTAAGACCATAGACACCATCGACGTCACGGACTCCCCGGCGAGGGACACGCCCGGGCACACGGAGCTGCCCGCCAAGACGCTCCGCACACCCTCGCCCAGACCAGAGAACCTGGTGTTGGACAACGAAACTGGAGAGATCACTAAAGAACTGAACATTACGTCCAAACCCGCCGAGGAGCTGAACAGGGAGTTCCTGGACTCGCTGCCCAAAGAAGAGAAACCGGCCAAGATCTTCGTCAAGCCCGTGGAGAAGCTGATAGAGCCTGGACTGCAGAACAAGATGGGCCTGGACAATGACGGCGTGGGGTCCGCGTTCTTTAACGAGCTGGCGCAGAAGTTAAACTTGGGTAACTCCGACACCCGCTTCCTGCAACAGCTGTCGGCGCCGGACAGCCTCACGAGCCTGTCCTCGCTAGGAGATAAATACACGAACGGGCACATCAACAGCGACCAGAAACCGCGCAGTTCCCTCCGCGCTGTTCGCGTGAAGACCACCCGAGCACCCCCCACCGCCCCCACCACCCCCAACCCCGTGCCCGCGGAGAGCTCGTCGCCGCCCTCCATCACCAGCGTCGTCAACAACTTCGGGATCCACCATCCGCCCACGCCGACCGCCAACGACGACGACATCGTAGAGGTGCCCTACAAGCCCAAGAGTCCGGAGATCATAGACCTGGACGAGTACCCGGAGAGTCCCCAGGCCATCAAGAACAAGAAGCTGGACATCCTCAAGGAGCGCGGCCTGGAGGTGACGGCCGTGCCCCCGGGGCCCGCCTGGCCGCCCGCGCCGCTGCTCCTCAACCCCGTGCAGCAGATCATGGGCCAGGCGTCGCTGTTCCAGATGTACAACATCATCCCCAGCTACCCCAACGGCGCGCCCGCGCCGAAGGTCATCCAGGCGTCCTCGGCGTTCGGCTCGTGCGGGCCGGAGAAGACGGTGTACGGCAACCCCAAGGACCCCTTCATGCCGCCGCCGCACGTGCTGCAGGGCACGCCCGTCAAGCCGCAGCGCAGCGTGGCGCCCGCGCCCGCCGCGCCGCTGGACGTGCTGGACCTCACCTGCAAGACGCGGCCCGGACACAAGCCGGCCGTGGAGATCGTGCGCCTGCCGCCCGCCCCCCGCCCGCAGAGCCTCGCCAGCAGCTACTCGCTGGTGGACGGCAAGGCGGTCGTGGGCTCCAACCTGGAGATCACGCTCGTCAACAAGTCGCACACGCCGCCCCGGAGGCCGCAGAAGAGGTCCTCCAACGGCAAGTTCGTGTCGTCGAAGACTCCGCCGCAGGAGTCGCCGCCCAAGAAGCCGTCGCCGGCGCGGCCCGAGCCGCCGGTGGAGCCCTACAGCCTGTTCCTGCGGGGCGCGCCGGGCCTGGACCCGCGCCAGCTGGCGCTGTACCGCGACCTCGTGGCCGGCCAGCTGCGCTACCCGGGCCTGCTCAGCACGCCCACCACCAAGAACTAA

Protein sequence:

>DPOGS215007-PA
MVYIKCHGVSSQLVGAPCGHHGQYTFYKAIRLTGPRDRIVAIGDFFFVRIWQDSELVSIGELQLLWTDRVSDQTLVSLRLYFLPENTPDGRNTHGEDEVLAINDKVVLRAEELLSWVCSGAGWRWGLRAVWRGACAPPAEPRHSAPLHHTKLDFSDVEREKSSITVDVDEPGVVVFSYPRYCRYRALVARLEGIQAAWLRDSLVAALGGYAAPTRNTRILYCKDTFEYPELEGHEFVCNHLAPKLKGRPRGRRRRAARSRDRSPDQSPDSRSSDSDAVETRTPRRISLRNGPEKISEDEDSEKDQQFMKQLKEFLKEKNETVKVPHSYKNVSLRSLYSWVWSSGGFAAACRAGAWRERYRENAPALRRIYERYLLQYENHERWNGRKYPKMNGIIDGAKTIDTIDVTDSPARDTPGHTELPAKTLRTPSPRPENLVLDNETGEITKELNITSKPAEELNREFLDSLPKEEKPAKIFVKPVEKLIEPGLQNKMGLDNDGVGSAFFNELAQKLNLGNSDTRFLQQLSAPDSLTSLSSLGDKYTNGHINSDQKPRSSLRAVRVKTTRAPPTAPTTPNPVPAESSSPPSITSVVNNFGIHHPPTPTANDDDIVEVPYKPKSPEIIDLDEYPESPQAIKNKKLDILKERGLEVTAVPPGPAWPPAPLLLNPVQQIMGQASLFQMYNIIPSYPNGAPAPKVIQASSAFGSCGPEKTVYGNPKDPFMPPPHVLQGTPVKPQRSVAPAPAAPLDVLDLTCKTRPGHKPAVEIVRLPPAPRPQSLASSYSLVDGKAVVGSNLEITLVNKSHTPPRRPQKRSSNGKFVSSKTPPQESPPKKPSPARPEPPVEPYSLFLRGAPGLDPRQLALYRDLVAGQLRYPGLLSTPTTKN-