Monarch geneset OGS2.0

DPOGS208894
TranscriptDPOGS208894-TA2910 bp
ProteinDPOGS208894-PA969 aa
Genomic positionDPSCF300009 - 866187-872982
RNAseq coverage326x (Rank: top 35%)
Annotation
HeliconiusHMEL0157810.086.04% 
BombyxBGIBMGA002460-TA2e-16479.84% 
DrosophilaDis3-PA0.059.75% 
EBI UniRef50UniRef50_Q16PB60.058.48%Mitotic control protein dis3 n=7 Tax=Eukaryota RepID=Q16PB6_AEDAE
NCBI RefSeqXP_314089.30.060.98%AGAP005191-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1191138460.060.98%AGAP005191-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1191138460.060.00%AGAP005191-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00037233.3e-136RNA binding
GO:00045403.3e-136ribonuclease activity
KEGG pathwayaga:AgaP_AGAP0051910.0 
 K12585 (DIS3, RRP44)maps-> RNA degradation
InterPro domain[463-797] IPR0019003.3e-136Ribonuclease II/R
[64-185] IPR0065965.2e-17Nucleotide binding protein, PINc
Orthology groupMCL14283 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208894-TA
ATGCAGTGGACTACAAAAACTTTTTTAACAAAAACAAAACGCGGCAATGTTTTGAAAATTGTAAGGGAGCACTATTTACGTGATGATTTGCTATGTGGATCTGCTGTGTGTCACATTTGTCCCCATAAAGACGATGAGATTGTATTAGACACTAAACCAGAATCCATTTGCAAATTATTTGATTTTGAACATTACCTAGTTCTTGACACCAACGTAGTCCTTCATCAGATAGATGTTTTAGAAGAAAATGCTTTGAAAAATGTTATAATATTACAAACAGTTCTAGAGGAAGTAAAGCATCAGAATACTGCTATTTTTCAAAGACTGTTAGAAATAGTGGGAAATAAGAATAGGAAATTTTATTCCTTTGTGAATGAACATCATAAAGACACTTATGTGGAAAGAAATCCTCATGAAAAACAAAATGACCGTAATGATCGTGCAATCAGAAGGGCAGCATCCTGGTATGCATCTCACTTGTTGTTATCAAAGAGTGATATTGATGGTAAAATACCTAAAATTATTTTGCTCACCGATGATGAGAATAACAGAAAGATTGCTCAAAAGGAAGGAATTGTTTGTTGTACAGTCAAGGAATATGTTGAAAATATCACTGGATATCCTGGTTTGGTTGATAAATTATCTAAGAATGTTATGCCAGAAGCCTGTTCTAAGGATGCATTATATCCTGCTCACTTAACACCAAGTCAGATCCATTCTGGTATAAGGGGCGGGAAGTTATATCAAGGAACATTTCATGCATCCAGGGATAATTTCCTAGAGGGTACAGCAGTTGTCAGTTCTTTTGATAAGCCTATTCTTCTACAAGGTCACAGTGGTATCAACAGAGCTATAGACGGCGATATTGTTGCTATTGAAATTTTACCCAAAGAAGAGTGGAGAAAACCAAGTGACATTGTTCTGGAAGATAAGGCAGATGATCCGGGTGACACGCTCGAAGAAGAGACTCTTCTTAATACTAAGGTGGACAGTGAAGATGAAATAACACCAACTGGAAAAGTTGTTGGCATTATAAGAAGAAAATGGCGACAGTATTGTGGTATTCTGTTGCCGAGTAAATTTCCAGGAGCCACGAGACATCTGTTCACCCCAGCTGAAAAAAGAATTCCCAGAGTTCGAATAGAAACCCGTCAGAGTGATATACTTATTTCTCAGAGGATTCTGGTTGCATTGGATTCTTGGCCAAGGAACAGTAGATATCCGTTAGGACATTTCGTAAGATCGCTCGGCCCTATAGGTGACAAAGATGCTGAGAACGAGGTAATTTTACTGGAACATGATGTACCTCATGCAAGATTCAGTGAGGCGGTGCTGGCATGTCTGCCCCCAGATGACTTCAAAATACCGGAAGAGGAGATAAAGAAACGGGTTGACTTGCGTTCAATATGTATATGTTCAGTGGATCCTCCCGGCTGTACCGATATCGATGATGCTCTGCACGCGCGGCCGCTCCCAGGCTTGAACACTTACGAAGTTGGAGTTCATATCGCGGATGTCACTTACTTCGTGAGACCTAACACTGCCCTCGACAGGGAAGCAGCGGCCAGGTCCACTACAGTTTATTTAGTGGACAGGAGAATTGACATGGTCCCAGGTCTTTTGAGTTCGAACCTATGTTCTCTGCGCGGTGGTGAAGAGAGGTTGGCTTTTTCATGTGTGTGGGTTGTCGATGAGAACGCTAATGTTTTGTCAACCAAGTTCCATAAGAGCGTTATAAAGATGTCTTCATCGGCTATGACGTATGAGGAAGCGCAAATAGCAATCGATGACGCAACCAGATGTGACGAGATCGCCTCGTCACTGCGAACACTGAACTCTTTGGCTAAAAAGTTAAAGCAGAAGAGACTGGACAATGGCGCTCTCTTGCTGGCGTCGCCAGAAATACGTTTCCAGGTCGACTCCGAGACCCACGATCCCATAGAAGTCCAGGCAAAGAAGATAATGGACACTAACTCTATGGTGGAGGAATTCATGTTGTTGGCGAACGTGAGTGTGGCTGAACGTGTGGCGGCCGACTACCCGCGGTGTGCGCTACTCAGGAGACATCCGTCGCCACAGCTGCACAGCTTTGACACGTTCCTCAAGGCTGCCAGGCAACAGGGCTTCGAGTTGGATGTATCGACGAATAAGTCGTTCTCAAAGTCCCTCAACGAAGCTGTGATACCCGACCGTCCGTTCTTCAACACTCTCCTCCGTATAATGGCGACCCGCTGTATGCAGCAGGCCGTGTATTTCCCAAGCGGGACTCGAACCCAGGAAGAGTTCTACCACTATGGACTGGCCTGTCCTATTTACACACATTTCACATCGCCAATACGCAGATATGCAGATGTCATAGTACATCGTCTGCTGGCGGCCAGTATAGGCGCGGACGTGAGTCACGCTTCTCTACTCGACACTAAGGCGGCGGACGCACTGTGCGACAACCTCAACTACCGACACCGACAGGCGCAGTACGCGGGTCGCGCGTCTGTGGCGCTCAACACGCATATATTGTTCAAGAATCGCGAAGAAATCGAATCGGCGGTGGTGCTTGCTGTGAAACGAAATGCGCTGCAAGTTCTTATACCTAAATACGGACTTGAAGGACCCATATACCTCCCGTCCGATAAGTTCAGATACAACGAAGAGGAGCACGTCCAAATCTGCGGTGACGTCATCTTGCGTACGTTCGACGAACTAACCGTCCGTCTGACGTTAGACAGCACTAACCTACAACACAGGAAACTAGTTTTCCAACTGGTGAAGCCGAGCATACCGGGAGTAAGTTACACTGCTCAGGAGCAAGTGGAAAAAATGGAAATCGTCGAAATAGACAAAAAGGAAAACGAAAAGAAAAGAAAGGAAAGCAAAGGCGGCAAAAACAAGAAAAAGAAATACAAGAAATGA

Protein sequence:

>DPOGS208894-PA
MQWTTKTFLTKTKRGNVLKIVREHYLRDDLLCGSAVCHICPHKDDEIVLDTKPESICKLFDFEHYLVLDTNVVLHQIDVLEENALKNVIILQTVLEEVKHQNTAIFQRLLEIVGNKNRKFYSFVNEHHKDTYVERNPHEKQNDRNDRAIRRAASWYASHLLLSKSDIDGKIPKIILLTDDENNRKIAQKEGIVCCTVKEYVENITGYPGLVDKLSKNVMPEACSKDALYPAHLTPSQIHSGIRGGKLYQGTFHASRDNFLEGTAVVSSFDKPILLQGHSGINRAIDGDIVAIEILPKEEWRKPSDIVLEDKADDPGDTLEEETLLNTKVDSEDEITPTGKVVGIIRRKWRQYCGILLPSKFPGATRHLFTPAEKRIPRVRIETRQSDILISQRILVALDSWPRNSRYPLGHFVRSLGPIGDKDAENEVILLEHDVPHARFSEAVLACLPPDDFKIPEEEIKKRVDLRSICICSVDPPGCTDIDDALHARPLPGLNTYEVGVHIADVTYFVRPNTALDREAAARSTTVYLVDRRIDMVPGLLSSNLCSLRGGEERLAFSCVWVVDENANVLSTKFHKSVIKMSSSAMTYEEAQIAIDDATRCDEIASSLRTLNSLAKKLKQKRLDNGALLLASPEIRFQVDSETHDPIEVQAKKIMDTNSMVEEFMLLANVSVAERVAADYPRCALLRRHPSPQLHSFDTFLKAARQQGFELDVSTNKSFSKSLNEAVIPDRPFFNTLLRIMATRCMQQAVYFPSGTRTQEEFYHYGLACPIYTHFTSPIRRYADVIVHRLLAASIGADVSHASLLDTKAADALCDNLNYRHRQAQYAGRASVALNTHILFKNREEIESAVVLAVKRNALQVLIPKYGLEGPIYLPSDKFRYNEEEHVQICGDVILRTFDELTVRLTLDSTNLQHRKLVFQLVKPSIPGVSYTAQEQVEKMEIVEIDKKENEKKRKESKGGKNKKKKYKK-