Monarch geneset OGS2.0

DPOGS212310
TranscriptDPOGS212310-TA2169 bp
ProteinDPOGS212310-PA722 aa
Genomic positionDPSCF300549 + 10168-18248
RNAseq coverage186x (Rank: top 49%)
Annotation
HeliconiusHMEL0144110.081.00% 
BombyxBGIBMGA001833-TA2e-6945.89% 
DrosophilaDcr-1-PA9e-14160.58% 
EBI UniRef50UniRef50_B0W7S31e-14963.70%Endoribonuclease Dcr-1 n=3 Tax=Culicinae RepID=B0W7S3_CULQU
NCBI RefSeqXP_001844757.12e-15063.70%endoribonuclease Dcr-1 [Culex quinquefasciatus]
NCBI nr blastpgi|1700337865e-14963.70%endoribonuclease Dcr-1 [Culex quinquefasciatus]
NCBI nr blastxgi|3883304471e-14565.84%Dicer1 [Locusta migratoria]
Group
Gene OntologyGO:00063961.9e-45RNA processing
GO:00037231.9e-45RNA binding
GO:00045251.9e-45ribonuclease III activity
KEGG pathway 
InterPro domain[598-661] IPR0009991.9e-45Ribonuclease III
Orthology groupMCL11692 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212310-TA
ATGTCCGACAGTGATGATGACACCGACACTAGCGGCTTCTGCAGCAGTAAAAATACTAACGCGGCGATGGGAGTTAGGATAGAGTACAAAACCGCTCACGAAGCTGAAGCTGTGGACATAGAACGTCGAAGGCCTATCGCGCCGGCCCCTCAACCGGATGCACTTGATGACGCTCGTGACTTGAAGGAATGGGGACAGGTGTTGAGGGAAGGGACCGCCCCAGAAGAGTATGTGAAGAAGTTCAAGAACGCAGTATTGAAACACGAAGCTGATATAAAGGATAAAGATTATTTAATTGACAAGGACACACCTATAGTTGAGTTACTTCCTGAAACTACGAAAGATTTAGACAAAACTAATAGTGAAATATTGAAAGTAACTATTAAAAGTGATTCAAATTTAATGAACGGGGACGACACATCGACAATAGAAGTTTGTGATCCCGTTCATACACAGACGGAAGATGTACAAATAAAAGATAAAATAGATAAAGATAAAAATATAACAGATTTCATAGACGACATATTTCCTTACGGTCATCTATTGGAAGTCGAAAACGGTCAAATAACATTAGAGTCGATAGAGAGAAACAAAAGATTATTATTCGCCGAAATAAAGAATAGTTTGAGTGAAGAAGAGATGAAGAAGATGCAATGCTTCTCGATGAAAGACGTTGACATAGATTCGCCGGACTACGTTAATGAGAAAATAGTGAACGTTGGCTTCGATACCAACGAGAAATATATAGATAGGGTCAATGATTGGAGTGAACGGGATTTAAAACCCTATTACCTAGACAGTAATGGTATTAATGGCAAGGAATTCGATTTCGACTCTCAGCCAGACTTGGAGGGACATCCGGGGCCCAGCCCTAGTGTTATTCTCCAGGCTTTGACGATGTCGAACGCCAACGACGGCATCAACTTGGAGAGGTCATGTGTACAGGGTGACGTCAAGGACGGGGAAAATGAGCCACAAGTCGACAGCACCGGCTGTTTTATACCTTACAATCTTATAACACAGCATAGCATACCAGACAAATCGATAGCTGACTGTGTGGAGGCCTTGATAGGCGCGTATCTATTGGAGTGTGGTCCACGAGGCGCTCTCTTGTTCATGTCCTGGCTCGGTATCACGGTGCTGCCAACTCACACGGTGCCGTTGCCAGAGAATCACCCGTTTGTAGTCAAAATGAGAAATAGACAGGATGACAAAGAGACGGATATTAGTGATGAGGAGGACGATAAGTGGTGTTGGGACGGACGGCCGCCCGGGAGCCTTAAACCTCACAAGGACAGTGAAGGGCGTTGGGTGCAGACAATTTATGGCGCGTTGAAATCGCCTCCGTCCCCGCTGCTAAGGTACATCGAAGATCCCGAGGGAGAGCTGGAGCGGATGCTATCAGGTTACGATGCGTTGGAAAACACGCTGCAGTACCGCTTCCGCGACCGCTCGCTGCTACTGACCGCGCTCACGCACGCCTCGTGTCACAACAACACGCTCACGGACTGTTACCAGAGACTGGAGTTCTTGGGGGACGCTATACTCGACTATTTGATAACCCGTCACCTGTACGAGGACCCTCGCCGCCACTCTCCCGGCGCCCTCACGGACCTGAGGTCAGCTCTCGTCAACAACACCATCTTCGCGACGCTGGCGGCTAGACACGGCTTCCACAAGTACTTCCGCCATATGTCTCCAGGTCTCAACGAGGTGTTGACGAAGTATGTTAAGATTCAAGAGGAGAATGGACACTCCATCAGCGAGGAGCATTATTTGATACAAGAGGATGAAATGGAGCAAGCTGAAGATGTAGAGGTTCCGAAGGCTCTCGGGGACTTGTTCGAGTCAGTCGCTGGAGCTATATTCCTCGATTCTGGTATGTCCCTGGCGGCCGTGTGGCGCGCAGTCGGCGGTCTTCTGCGGGCGGAGCTAGACGCGTTCAGCGCGGCCGCTCCCAAGTCACCCGTCAGGGAGCTCTTGGAAGCGGAACCTGATACAGCCAAGTTTGGTAAACCGGAACGTCTCGCGGACGGCCGTCGCGTGCGCGTGTGTGTGGAGGTGTTCGGCCGCGGATCCTTCAAGGGAGTCGGTCGTAACTACCGCATCGCTAAGGGTACAGCCGCGAGGTGCGCGCTCCGGCACTTACGGTCCGTGAGACCGAGGTGA

Protein sequence:

>DPOGS212310-PA
MSDSDDDTDTSGFCSSKNTNAAMGVRIEYKTAHEAEAVDIERRRPIAPAPQPDALDDARDLKEWGQVLREGTAPEEYVKKFKNAVLKHEADIKDKDYLIDKDTPIVELLPETTKDLDKTNSEILKVTIKSDSNLMNGDDTSTIEVCDPVHTQTEDVQIKDKIDKDKNITDFIDDIFPYGHLLEVENGQITLESIERNKRLLFAEIKNSLSEEEMKKMQCFSMKDVDIDSPDYVNEKIVNVGFDTNEKYIDRVNDWSERDLKPYYLDSNGINGKEFDFDSQPDLEGHPGPSPSVILQALTMSNANDGINLERSCVQGDVKDGENEPQVDSTGCFIPYNLITQHSIPDKSIADCVEALIGAYLLECGPRGALLFMSWLGITVLPTHTVPLPENHPFVVKMRNRQDDKETDISDEEDDKWCWDGRPPGSLKPHKDSEGRWVQTIYGALKSPPSPLLRYIEDPEGELERMLSGYDALENTLQYRFRDRSLLLTALTHASCHNNTLTDCYQRLEFLGDAILDYLITRHLYEDPRRHSPGALTDLRSALVNNTIFATLAARHGFHKYFRHMSPGLNEVLTKYVKIQEENGHSISEEHYLIQEDEMEQAEDVEVPKALGDLFESVAGAIFLDSGMSLAAVWRAVGGLLRAELDAFSAAAPKSPVRELLEAEPDTAKFGKPERLADGRRVRVCVEVFGRGSFKGVGRNYRIAKGTAARCALRHLRSVRPR-