Monarch geneset OGS2.0

DPOGS214163
TranscriptDPOGS214163-TA1611 bp
ProteinDPOGS214163-PA536 aa
Genomic positionDPSCF300014 - 328744-332652
RNAseq coverage1020x (Rank: top 12%)
Annotation
HeliconiusHMEL0068010.093.72% 
BombyxBGIBMGA006208-TA0.094.12% 
DrosophilapUf68-PA4e-1344.59% 
EBI UniRef50UniRef50_Q144981e-14552.95%RNA-binding protein 39 n=111 Tax=Bilateria RepID=RBM39_HUMAN
NCBI RefSeqXP_002428613.15e-17765.17%RNA-binding region-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420159731e-17565.17%RNA-binding region-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|3071809600.067.33%RNA-binding protein 39 [Camponotus floridanus]
Group
Gene OntologyGO:00063972.3e-115mRNA processing
GO:00056342.3e-115nucleus
GO:00037232.3e-115RNA binding
GO:00036767e-30nucleic acid binding
GO:00001661e-26nucleotide binding
KEGG pathway 
InterPro domain[56-408] IPR0065092.3e-115Splicing factor, CC1-like
[281-354] IPR0005047e-30RNA recognition motif domain
[276-362] IPR0126771e-26Nucleotide-binding, alpha-beta plait
Orthology groupMCL11651 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214163-TA
ATGGCTGAAGATTTAGATGTTGAAGCAATGTTAGAAGCACCTTATAACAAATCGGATAACACTTCCTCTTCAAAGCATCGGCCGGAGAAGTATTCCGAAAAAAACAGAGACAAAGATAAGGACGGAAGTCGAAGGAGGAGTCGTTCAAAGGACCGGGAAAGAAGACGATCGCGCGATAAGGATTCTAGACGTTCTAGGGAAAGGGATGACAAACGAGAAAGAGACCGCGACAAGGATAGAGACAGAGATCGCAATCGTGACCGTGAACGTCGCGACCACGACAAGGATCGTAACCATAGAGATAAGGATAGGGATAAGGACAGGGATCGTGAAAGAGATAGGGAACGTGAAAGAGATAGAGAAAAGAGACGTAGTAAAGATAAAGTTAAGGAACGTAGTCGTGAAAGGGAAAGAAGCCGTGACCAACATCCTCCTAAAAGAGAAAAAAGTCGCGATAAGGAAAGAGAGAGAAGCGAATATAGATCAAAATCAAGAGGAATTGAACCCAAGTTGGATGACCTACCTCCAGAAGAAAGGGATTTGCGCACTGTATTTTGTATGCAATTATCTCAACGTATTAGGGCTAAGGATTTAGAAGAGTTTTTTTCCTCTGTTGGTAAAGTAAGAGATGTAAGACTTATCACATGCAATAAGACAAGAAGATTTAAGGGAATAGCCTATATTGAGTTTAAAGATGCTGAATCTGTTCCCTTGGCTTTGGGTTTAACAGGACAAAAACTACTGGGTGTTCCTATTATAGTTCAGCATACCCAAGCTGAGAAAAACAGGGTTGGCAACACCTTGCCCAACTTAGCACCAAAAACTAGTAATGGTCCTACTCGATTATATGTTGGCTCACTTCATTTTAATATTACAGAAGACATGCTTAGAGGAATCTTTGAACCATTTGGTAAAATAGATCATATACAACTAATGACCGATCCTGATACTGGAAAAAGTAAAGGCTATGGTTTTCTGACATTCCATCACGCAACAGATGCAAAGAAAGCAATGGAGCAGTTGAATGGATTTGAGCTTGCAGGTAGACCAATGAAGGTTGGAAATGTGACTGAGCGAGCAGATGGTGGTTCAAGCACACGGTTTGATGCTGATGAACTAGACCGAGCTGGAGTTGATCTGGGTGCTACCGGACGCCTTCAACTTATGTTCAAATTAGCTGAAGGCACAGGTTTACAGATACCTCCAGCAGCGGCATCAGTATTGATGGGCGCCGGCTCAACTTTAGTAGCGCCTCAACCACAAGTCGCACCTCCTATTGCTACACAATGCTTCATGCTCAACAACATGTTTGACCCATCATCGGAATCGAACCCCAGCTGGGACATTGAGATAAGGGATGACGTCATCAGCGAATGCAATAAACATGGCGGAGTATTACACGTGTACGTTGATAAGGCGTCTCCACAGGGAAATGTGTACTGCAAGTGTCCAACAATAGCAACAGCTGTGGCGTCTGTGAATTCATTGCACGGCCGTTGGTTTGCGGGGAGGGTCATAACCGCAGCTTACGTGCCCCTTGTCAATTATCATTCACTGTTCCCAGACGCGATGACTGCACTAACACTACTATTGCCTTCTAAACAACGATAA

Protein sequence:

>DPOGS214163-PA
MAEDLDVEAMLEAPYNKSDNTSSSKHRPEKYSEKNRDKDKDGSRRRSRSKDRERRRSRDKDSRRSRERDDKRERDRDKDRDRDRNRDRERRDHDKDRNHRDKDRDKDRDRERDRERERDREKRRSKDKVKERSRERERSRDQHPPKREKSRDKERERSEYRSKSRGIEPKLDDLPPEERDLRTVFCMQLSQRIRAKDLEEFFSSVGKVRDVRLITCNKTRRFKGIAYIEFKDAESVPLALGLTGQKLLGVPIIVQHTQAEKNRVGNTLPNLAPKTSNGPTRLYVGSLHFNITEDMLRGIFEPFGKIDHIQLMTDPDTGKSKGYGFLTFHHATDAKKAMEQLNGFELAGRPMKVGNVTERADGGSSTRFDADELDRAGVDLGATGRLQLMFKLAEGTGLQIPPAAASVLMGAGSTLVAPQPQVAPPIATQCFMLNNMFDPSSESNPSWDIEIRDDVISECNKHGGVLHVYVDKASPQGNVYCKCPTIATAVASVNSLHGRWFAGRVITAAYVPLVNYHSLFPDAMTALTLLLPSKQR-