Monarch geneset OGS2.0

DPOGS203773
Transcript        DPOGS203773-TA  1701 bp
Protein           DPOGS203773-PA  566 aa
Genomic position  DPSCF300010 + 666359-668059
RNAseq coverage   248x (Rank: top 42%)
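
The three lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive and that the transcript is a coding sequence ending in a single stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record. Reading the coordinates as 1-based and
# inclusive is an assumption, not something the record states.
START, END = 666359, 668059
TRANSCRIPT_BP = 1701
PROTEIN_AA = 566


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval."""
    return end - start + 1


def coding_residues(cds_bp: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


print(span_length(START, END))         # 1701
print(coding_residues(TRANSCRIPT_BP))  # 566
```

Under these assumptions the genomic span equals the transcript length, which would be consistent with a gene model that has no introns.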
Annotation
Heliconius      HMEL004217              0.0     64.27%
Bombyx          BGIBMGA013351-TA        2e-161  59.75%
Drosophila      CG13097-PA              7e-84   36.59%
EBI UniRef50    UniRef50_UPI00015B5066  2e-105  40.16%  UPI00015B5066 related cluster n=3 Tax=unknown RepID=UPI00015B5066
NCBI RefSeq     XP_001605032.1          3e-106  40.16%  PREDICTED: similar to U3 small nucleolar ribonucleoprotein protein mpp10 [Nasonia vitripennis]
NCBI nr blastp  gi|380030739            5e-109  42.35%  PREDICTED: LOW QUALITY PROTEIN: U3 small nucleolar ribonucleoprotein protein MPP10-like [Apis florea]
NCBI nr blastx  gi|383856076            2e-130  44.14%  PREDICTED: U3 small nucleolar ribonucleoprotein protein MPP10-like isoform 1 [Megachile rotundata]
Group
Gene Ontology  GO:0005634  2.5e-117  nucleus
               GO:0030529  2.5e-117  ribonucleoprotein complex
               GO:0006364  2.5e-117  rRNA processing
KEGG pathway   dpo:Dpse_GA12044  4e-82
               K06670 (SCC1, MCD1, RAD21) maps-> Cell cycle - yeast
               Cell cycle
InterPro domain  [23-512]  IPR007151  6.2e-124  Mpp10 protein
                 [1-560]   IPR012173  2.5e-117  U3 small nucleolar ribonucleoprotein complex, subunit Mpp10p
Orthology group  MCL11956  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203773-TA
ATGACGAGTGAAAAAATTGATGATATAATAGATAAATTCAGTGTGCTCACTGAGAAACCTGTGAAGTTTTTAACAGTTCAAGATGATATACAGGGAGATATCAGGAACTTGGTGAAGTCCATCTATGATCTTACTAAGTCACAAGAAAGCAGCAATAAGAAGAAAAAGGCGTTATCGAGTATGATAGTGAGTGATTGCGACGAGGAACAGATTTGGCAACAGATAGAGTTGCAGAATTCAGAGCGATGGGACGAACTGGTCTGGGATGTAGCTAATAGCGTGTCCAGTAAAAACGATCTCACGTTTCCTTTAGAGTTTCCAGAAGAGAAAGACGAAGAAGATATTAAAAATGAAGATGATGTTATGTCAGAAGAAGAACATCAAGAAGTAGAACAGTCTAATGTTAAAGTGGCCAAAACGAAACCTAGTAAAAAACAGTCAATTGTTGATGACGATTTCTTTAAATTACAAGATATGGAAAACTTTTTATTAAAAGAAGAAAAAATGGAGGGAAAAAACAAAAAGAGTGAAGACGACGAGGACTCGATAGACATGTTTGAAGATATAGATAGTGAGGGATCTGACGAAGAAGGCGGCAAAGATGTTAAATACTATGACTTTTTCAATGAAGACAATGAAAGTGGTCAGGAAGATAATGACGAGGAAGATGATGATGAACATGAAGAAAATAATGATTATAAGACCGAACCAAAAGAACATAAGAAAAGTGACAAAAAAGTCAGGTTTCTTGAGCCAGATTCTGAATCAGGTGATTCCGAGGACAGTGAAGAACAAAAATCTAAGCACATCAATGGCAGTGATAAGGAAAATAAATCAGAATTTGAACTTCGGCAAGAGCGGTTGCAGAAGCATATTTCGAGGCTGGAAGAGAAATCCATAAATGAAGCCCCGTGGCAACTGAAAGGTGAAGTCGATGCCATGAAAAGACCACAGAACTCGTTACTTGAAGAAGTTCTCGATTTCGATCTCACAACCCGACCACCTCCCGTCATAACAGAACAAACAACAGTCACATTAGAAGGCCTCATCAGACAACGCATCAAAGACAAGGCTTGGGACGACGTAACCAGGAAAGAAAAGCCAGTAGATGATCAGTTTCTGTTCAAAAAGCCCGAAGTTCTAGATCAGTCTAAGAGTAAAATGAGTTTAGCACAAGTCTATGAAGCAGAGTATCTTAAACAGAAACAAGCGTCGTCTGGTGAAGTTGATGATGAAAAGGAACCTGAAAGTCATACTGAAATTCGGGAAGCTATGAAAAATTTATTTTCCAAGCTCGATGCTCTGTGTCATTATCACTACACACCAAAACCACCTCAAGCTGAAGTTAAAATTGTTAGTAACACTCCGGCCATATCCATGGAAGAAGTGGCTCCAGTGGCAGTGAGTGATGCTACCCTATTAGCACCCGAAGAAGTTAAAAGGAAAACAAAAGGAGATCTTATGAGCAAGGAAGAGAGAACGCAGACAGATAAAAACAGAGAGAGGAGGAAAAAGAAGAAACTGCAAAGGAAAAAGGGTTCTGTTACGAAAGTCACAGATAATAGAAACACTAAGGCAGCCGTTGAAAGCAACGACAAGTCCCTCAAGACGTCCAAAGCTTTCTTCCAACAGCTAAATGATAATTCAACTAGTCTCATTAAATCTAAAACTAAGAAGCTTATTAAAAATAAGGGACAATAA

Protein sequence:

>DPOGS203773-PA
MTSEKIDDIIDKFSVLTEKPVKFLTVQDDIQGDIRNLVKSIYDLTKSQESSNKKKKALSSMIVSDCDEEQIWQQIELQNSERWDELVWDVANSVSSKNDLTFPLEFPEEKDEEDIKNEDDVMSEEEHQEVEQSNVKVAKTKPSKKQSIVDDDFFKLQDMENFLLKEEKMEGKNKKSEDDEDSIDMFEDIDSEGSDEEGGKDVKYYDFFNEDNESGQEDNDEEDDDEHEENNDYKTEPKEHKKSDKKVRFLEPDSESGDSEDSEEQKSKHINGSDKENKSEFELRQERLQKHISRLEEKSINEAPWQLKGEVDAMKRPQNSLLEEVLDFDLTTRPPPVITEQTTVTLEGLIRQRIKDKAWDDVTRKEKPVDDQFLFKKPEVLDQSKSKMSLAQVYEAEYLKQKQASSGEVDDEKEPESHTEIREAMKNLFSKLDALCHYHYTPKPPQAEVKIVSNTPAISMEEVAPVAVSDATLLAPEEVKRKTKGDLMSKEERTQTDKNRERRKKKKLQRKKGSVTKVTDNRNTKAAVESNDKSLKTSKAFFQQLNDNSTSLIKSKTKKLIKNKGQ-
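
The protein sequence can be re-derived from the nucleotide sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code and writes the stop codon as "-", as the protein record above does. The `translate` helper is hypothetical and is not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1, marking stops with stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First and last five codons of DPOGS203773-TA, copied from the record.
print(translate("ATGACGAGTGAAAAA"))  # MTSEK
print(translate("AATAAGGGACAATAA"))  # NKGQ-
```

The two fragments reproduce the start (MTSEK) and end (NKGQ-) of DPOGS203773-PA. Passing the full 1701 bp sequence in the same way would give a string to compare against the complete protein record.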