Monarch geneset OGS2.0

DPOGS209677
TranscriptDPOGS209677-TA1863 bp
ProteinDPOGS209677-PA620 aa
Genomic positionDPSCF300134 - 9209-21949
RNAseq coverage3231x (Rank: top 4%)
Annotation
HeliconiusHMEL0139150.092.74% 
BombyxBGIBMGA000516-TA0.093.85% 
DrosophilaImp-PL0.065.96% 
EBI UniRef50UniRef50_Q174K30.075.33%Igf2 mRNA binding protein, putative n=6 Tax=Neoptera RepID=Q174K3_AEDAE
NCBI RefSeqXP_973939.20.072.14%PREDICTED: similar to igf2 mRNA binding protein, putative [Tribolium castaneum]
NCBI nr blastpgi|1892371540.072.14%PREDICTED: similar to igf2 mRNA binding protein, putative [Tribolium castaneum]
NCBI nr blastxgi|1892371540.067.71%PREDICTED: similar to igf2 mRNA binding protein, putative [Tribolium castaneum]
Group
Gene OntologyGO:00037231.3e-15RNA binding
GO:00001668.2e-15nucleotide binding
GO:00036763.2e-09nucleic acid binding
KEGG pathway 
InterPro domain[222-285] IPR0181111.3e-15K Homology, type 1, subgroup
[217-290] IPR0040878.2e-15K Homology
[29-126] IPR0126778.2e-15Nucleotide-binding, alpha-beta plait
[35-106] IPR0005043.2e-09RNA recognition motif domain
Orthology groupMCL10757 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209677-TA
ATGTCTAATTCTCTGGAACAACAATTTGGGGACCTTAACTTGTCACAGGAAGATCATGATCAAATTTTTGAACAAGAAGATCATCAAGATCAAAGCCGGTCAAGGATTCTCATCAGTGGGCTATCAATGCACGCTCGCTTCGACACCATCGAGCCACTACTATCACAGTATGGCAACGTGCAACAATGCGACAAAATCAACTCACGCGATGCCAACACCCAAGCGGTGTACATTACATTCGAGACCCCGGAGCAGGCGCAGCAAGCCATCAACGGTCTGAACGGATGCGAAGTAGAGGGCGCGCGTATAAAGGTAGAGGCGGCGGATGGTATGGCGAGGGGTGGACGGCGTGGTCGACCAGGCGGCGGTCGCGGCGGTGGCGGCGCTCTCGGAGGCGGCTCGCGTCCTACTGATTTTCCTCTTCGCCTCCTTGTTCAGAGTGACATGGTTGGAGCTATCATAGGCCGTCAGGGAAGCACTATCCGCCTCATCACTCAGCAGAGCCGTGCCCGCGTTGACGTTCACCGTAAAGATAACGTTGGCTCCCTAGAGAAAGCTATCACCATATATGGCAATCCAGATAATTGTACGAATGCCTGCAAAAGGATACTTGAAGTTATGCAACAGGAAGCTAACAACACTAATAAGGGTGAAATATGCCTTAAAATATTGGCTCATAATAACTTGATTGGTCGCATAATCGGCAAGGGTGGCAACACAATCAAGCGAATAATGCAGGAAACCGATACAAAGATTACTGTATCATCCATAAATGATATAAACAGTTTTAATTTGGAGCGAATCATAACTGTGAAAGGTACTATTGAGAACATGGCTAAAGCAGAGTCACAAATTTCGGCCAAACTGCGTCAAAGTTACGAAAACGATCTGCAGGTACTGGCGCCGCAGAGTATAATGTTCCCTGGGTTACACCCAATGGCAATGATGTCGACGGGGCGCGGATTTTGTGGTGCGCCGCCTCCTTTCCCACCGCCTATATATGCGCCGCTGGCCGGCCAGGGCGGCGCGCAACAAGGTGCTGGCGACTCGCAAGAAACGACCTATCTTTATATTCCGAATAATGCTGTGGGTGCAATAATTGGCACAAAGGGATCACATATACGCAACATAATTAGATTCAGCAATGCATCTGTAAAGATTGCTCCTCTTGAGCAAGATAAGGTTATTGAAGGCAGTGTGGCTGCACAACAGGAGAGGAAAGTGACTATCGTAGGAAGTCCTGAAGCACAGTGGAAGGCGCAGTATTTGATCTTCGAGAAGATGCGTGAGGAGGGATTCATGTCTGGCTCTGATGATGTGCGACTGACTGTGGCAATAGTGGTAGCATCGTCGCAGGTGGGCCGCATCATCGGGAAGGGTGGGCAGAATGTGCGTGAGCTACAGCGCGTCACCGGATCGCTTATTAAGCTTCCCGAGCAGCCGCAACCGCCGACCGCAGCTGTCCGCGCAGCGACGCATCCGCTCGATGGTAGCGCAGGCGTCGGCTCCGGGTCGCCACCGACGCGCTGCCCCGCCACCGCCACCGCCCGCGCAGCAGTAGGCTGTCACCTCGTCACCACCACCCGCCGCCACAACTCTGCCACCACCGCATCGTGTCACCTCGCCAACTGCCACCTGCCACCTGCCACTTCGCCGACCCTACCCCACCTCACCCCTGCTACGCCCCCCTGTCGTACGAATGCTTTATTACGGTTTTCCTGTATCGATGCGGCCGTGAGCGATCTCCACGGTAGCGACCTGCGCTGGCAGAGCTGGCGCATCGCGAAACGGCGGAGTCGTTCAGTTTCACGCCCCGATACAGTTTTATGCGTGTACGATTTGCGTTATAATTGTGCCAAGTGA

Protein sequence:

>DPOGS209677-PA
MSNSLEQQFGDLNLSQEDHDQIFEQEDHQDQSRSRILISGLSMHARFDTIEPLLSQYGNVQQCDKINSRDANTQAVYITFETPEQAQQAINGLNGCEVEGARIKVEAADGMARGGRRGRPGGGRGGGGALGGGSRPTDFPLRLLVQSDMVGAIIGRQGSTIRLITQQSRARVDVHRKDNVGSLEKAITIYGNPDNCTNACKRILEVMQQEANNTNKGEICLKILAHNNLIGRIIGKGGNTIKRIMQETDTKITVSSINDINSFNLERIITVKGTIENMAKAESQISAKLRQSYENDLQVLAPQSIMFPGLHPMAMMSTGRGFCGAPPPFPPPIYAPLAGQGGAQQGAGDSQETTYLYIPNNAVGAIIGTKGSHIRNIIRFSNASVKIAPLEQDKVIEGSVAAQQERKVTIVGSPEAQWKAQYLIFEKMREEGFMSGSDDVRLTVAIVVASSQVGRIIGKGGQNVRELQRVTGSLIKLPEQPQPPTAAVRAATHPLDGSAGVGSGSPPTRCPATATARAAVGCHLVTTTRRHNSATTASCHLANCHLPPATSPTLPHLTPATPPCRTNALLRFSCIDAAVSDLHGSDLRWQSWRIAKRRSRSVSRPDTVLCVYDLRYNCAK-