Monarch geneset OGS2.0

DPOGS207088
TranscriptDPOGS207088-TA2733 bp
ProteinDPOGS207088-PA910 aa
Genomic positionDPSCF300001 + 2815122-2818441
RNAseq coverage270x (Rank: top 40%)
Annotation
HeliconiusHMEL0061490.060.70% 
BombyxBGIBMGA013053-TA4e-15857.03% 
DrosophilaCG7879-PA3e-2045.36% 
EBI UniRef50UniRef50_E2BZF71e-4140.66%RNA-binding protein 12 n=1 Tax=Harpegnathos saltator RepID=E2BZF7_HARSA
NCBI RefSeqXP_001600826.14e-4139.93%PREDICTED: similar to ENSANGP00000015961 [Nasonia vitripennis]
NCBI nr blastpgi|3071977924e-4140.66%RNA-binding protein 12 [Harpegnathos saltator]
NCBI nr blastxgi|1571099233e-7326.73%heterogeneous nuclear ribonucleoprotein (hnrnp) [Aedes aegypti]
Group
Gene OntologyGO:00001661.2e-07nucleotide binding
GO:00036768.3e-07nucleic acid binding
KEGG pathway 
InterPro domain[637-716] IPR0126771.2e-07Nucleotide-binding, alpha-beta plait
[642-710] IPR0005048.3e-07RNA recognition motif domain
Orthology groupMCL26031 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207088-TA
ATGAGTATAAGGGCGTGTGTTACCGATGAAGACGCAAGACAAGCCATGATGTTAGACGGAGGAAAAATCAAAGAAATCCAAGTTAAGCTTCTTCTGAGTTCTCGGTCGGAGATGCATAAAGTGATAGAAGCGGCTCGACAAAACGTTCCCCTTCTTAATATCAATGCTCCGGCCGCTAGCCCTGCTCCAACTCCAGCTGCCCCTGCAGCGCCAGTTATTACACCGTTCTCAGCAGCTCTAGGAACTGGCATATCGACTTTTGGAATACCGGGTATCGGAAATCCGCAAGAGATACCACAACCTGCCGTCATAGAACCACCAGCGCCAATTATAAGCCCACCTGCTAAATCTCCTGTTGAAGAAGAAAAGGATGAGGATACGAAATCTGACAGAAAACGTAGTAAGGAGAAAGACAGACGCCGAACACGGTCACGGTCCAGGTCGAGAGATCGAAAGGATAGAAAACGAGATCGAAGAGACAGATCACGATCCAGAGACAGGAGACGACGGGATCGGAGTCGGAGCAGGGAGAGACGAGATCGGAAACGTGAAAGAAAGGACCGTAGTCGCTCCCGAGACAGGTCGCCTTCCAGACGATCACGCGATAAGCGGAACGGAGATCGTAAATCACCACAAGGTTCAATGGACAGATCCCAAGAAAGTAGTCTTGATAATTCACTGTCAAATTCTACACCTCCTTTTGGCGGTTTGCTGAATTCTAACGGTCCGCAAATGGGTATGATACTCCCAAATAGTGCAATGTCTAATCTGCAATTAGGTCAGGGTGATCTGACACAAACTCGTTTCAATGATCCATCTTTAGCTGAGGCATTCAATAAATTACAAGAATTAGGTAAAAAACGTAATCCCAATGCATTCCAAGGAGAGCAACAGAATGGTAGTAAATTTTCAAGTGGCAGAGGTGGAAGTAGTAGCTCTAGGAATCAATTCCGTCGTGAGGGTCGATCTACGAGATTTGAAGATCAGCAAAGAGACTGCTGCGTGGCTATAAGAAATGCGCCAAACCACACAAGTTATGGTGACGTTCGTCGTTTTTTCCCTTTTATGATCGATAAACGAGGAATAAAAATGATTAATGATAATATGGGTCGGCGAACTGGAAATATATTTGTTAGATTTTGCGACTCTCGAGCAAAACAGCTTGCCTTACAACGCAAACCAAATGAGTTAAAAGGAGCTCAAGTAATTGTAGAATCTTTGGACGATGATACTTACGACGCTGCTACAGACTCATTCCTTCCTTACCGTGAGGATAATGACGAAGAAGAATCTACATTGACAGTCTCAGATACAGGAGACGATAATAAAACTCAATTCAGTGTTCTCAAACTGATAGATCTCCCTCATTTTGTGCAAGAACAAGATATTATGAAAGCATTTAGCGATTTTTCACTTTTATCGATCCAACTTGTTGACTGTCGCCATAACCGTACTAAAAATGCATATGTTGAGTTTGTAAAACCAGATGATGCCAAAATAGCTTTAGAACGCAAAGATTCTTATGTTTTCGGAAGACGACATCCAGCTATCACTCCACTTACAGATGAAGAGTATAAAAACGATAAAAATGAAAATTCTGATGTGTCTGGAAAATCGCAATCCAGTAAAAATTCATTACAAGAACAGGCTGTGCCTCGAGATCCTCGTCAGCGACGGTTATTGGATAATGGTCTGGGAGGGCCACAAATGCATAACGCACAACAGCCTTTCTTTCCCAATACTGCATTTGCACAAAATTTTAGGTCACCCTTTCCCAATCCACAATTCGGTGGTTTTGGAGGTATGGATCATAGAGGTTTGATTCAGAATTGGGGCAACCGAATGAACTTCCCCAACAAGTCGGATGTTCAATCAAGTTCCCAGGCTATATCTCTCAACATTGATGAGGAATCCCTCGATTGCGTCCTCATGAAAGGTCTTCCTCGCGAAGCTACGGACAGAACTATCGTCAACTTTTTGTCAGACACGGGAGCCGTACCTGCGAGGATCCACCTCATGCTAGACAACAACGGTCTTCCTTCGGGAGATTGCTTTTGCGAGTTTAGAACCTCTCAAGAAGCTAGGCAAGCAAGCACTAAGCATGGCAGTCTTTTGGATGGTTGCCGCGTTACCGTCGATTTGGTTTTGAGAAGTGTCGTAGAGGAAGCTTTGGAAGGACCCAAGGACACAAATCAGGGGACTCAAGAGGGTCTACTCGGACCGCCACCTCCCTTCGTCAACGTACCTCGCATGCCATTTTTCCCGAATCGTGGACAGTTTCGAGGTCGAGGATTCGATAGAGGAGGATTCGACCGAGGCGGATTTATGAATCGTGGCGGGTTTGATCCCCGAGGGCGTGGCATGATGCGCGGTCGCGGTGGCTGGCCAGATCGTGGTCGCGGGTTCGACCCAAGAGGCCGTGGCCGAGGCTTCATGCGCGCGCCTGCACCTAGAGACGACGAGCCAGATCCAGCACTCGAAGATTTCGGCACGCCAGGTTGCGTGCTGTCTATGGAGAATGTACCTTTCAGAGCCACTATTGACGACATCCTCGCGTTCTTCAGTGACTTTGAGCTGACACAGGACGACGTTATCCGCCGCTACAACGAACGCGGTCAACCCACAGGAGATGCGCGTGTTTCGTTCCGCACTCCATTCGACGCTAAGCGTGCACAGTCGTCCCACAACCTTTCGTCCATCTTTGACCGCCGCATTACGCTTACTTTACTCTAG

Protein sequence:

>DPOGS207088-PA
MSIRACVTDEDARQAMMLDGGKIKEIQVKLLLSSRSEMHKVIEAARQNVPLLNINAPAASPAPTPAAPAAPVITPFSAALGTGISTFGIPGIGNPQEIPQPAVIEPPAPIISPPAKSPVEEEKDEDTKSDRKRSKEKDRRRTRSRSRSRDRKDRKRDRRDRSRSRDRRRRDRSRSRERRDRKRERKDRSRSRDRSPSRRSRDKRNGDRKSPQGSMDRSQESSLDNSLSNSTPPFGGLLNSNGPQMGMILPNSAMSNLQLGQGDLTQTRFNDPSLAEAFNKLQELGKKRNPNAFQGEQQNGSKFSSGRGGSSSSRNQFRREGRSTRFEDQQRDCCVAIRNAPNHTSYGDVRRFFPFMIDKRGIKMINDNMGRRTGNIFVRFCDSRAKQLALQRKPNELKGAQVIVESLDDDTYDAATDSFLPYREDNDEEESTLTVSDTGDDNKTQFSVLKLIDLPHFVQEQDIMKAFSDFSLLSIQLVDCRHNRTKNAYVEFVKPDDAKIALERKDSYVFGRRHPAITPLTDEEYKNDKNENSDVSGKSQSSKNSLQEQAVPRDPRQRRLLDNGLGGPQMHNAQQPFFPNTAFAQNFRSPFPNPQFGGFGGMDHRGLIQNWGNRMNFPNKSDVQSSSQAISLNIDEESLDCVLMKGLPREATDRTIVNFLSDTGAVPARIHLMLDNNGLPSGDCFCEFRTSQEARQASTKHGSLLDGCRVTVDLVLRSVVEEALEGPKDTNQGTQEGLLGPPPPFVNVPRMPFFPNRGQFRGRGFDRGGFDRGGFMNRGGFDPRGRGMMRGRGGWPDRGRGFDPRGRGRGFMRAPAPRDDEPDPALEDFGTPGCVLSMENVPFRATIDDILAFFSDFELTQDDVIRRYNERGQPTGDARVSFRTPFDAKRAQSSHNLSSIFDRRITLTLL-