Monarch geneset OGS2.0

DPOGS209191
Transcript        DPOGS209191-TA  4611 bp
Protein           DPOGS209191-PA  1536 aa
Genomic position  DPSCF300061 + 541750-554968
RNAseq coverage   269x (Rank: top 40%)
Annotation
Heliconius      HMEL003256        0.0     59.42%
Bombyx          BGIBMGA011542-TA  0.0     46.43%
Drosophila      Dcr-2-PA          4e-142  28.99%
EBI UniRef50    UniRef50_D7UT11   0.0     57.82%  DICER-2 n=2 Tax=Bombyx mori RepID=D7UT11_BOMMO
NCBI RefSeq     NP_001180543.1    0.0     57.82%  dicer-2 [Bombyx mori]
NCBI nr blastp  gi|302318908      0.0     57.82%  dicer-2 [Bombyx mori]
NCBI nr blastx  gi|302318908      0.0     57.86%  dicer-2 [Bombyx mori]
Group
Gene Ontology
GO:0016891  1.1e-23  endoribonuclease activity, producing 5'-phosphomonoesters
GO:0006396  1.7e-19  RNA processing
GO:0003723  1.7e-19  RNA binding
GO:0004525  1.7e-19  ribonuclease III activity
GO:0003677  3e-18    DNA binding
GO:0005524  3e-18    ATP binding
GO:0005515  3e-18    protein binding
GO:0016787  3e-18    hydrolase activity
GO:0004386  9.1e-14  helicase activity
GO:0003676  9.1e-14  nucleic acid binding
KEGG pathway 
InterPro domain
[775-862]    IPR005034  1.1e-23  Dicer double-stranded RNA-binding fold
[200-406]    IPR014001  7.2e-21  DEAD-like helicase
[1379-1465]  IPR000999  1.7e-19  Ribonuclease III
[201-372]    IPR006935  3e-18    UvrABC complex, subunit B
[1060-1209]  IPR003100  3e-18    Argonaute/Dicer protein, PAZ
[642-698]    IPR001650  9.1e-14  Helicase, C-terminal
Orthology group  MCL15191  Insect specific

Nucleotide sequence:

>DPOGS209191-TA
ATGTCTAAACGTAAACTAAACAATTTCTCTTTAAGTGACAAACTAAAAATTGTGAGTGGTCTTGAATCCGGGAAATTAGGAAAGCAAATAATGAGTGAGTCTAGGTTAACAGAATCGGCCTACCGTAAAATTATAAATATATGTCAGTGGTCAGAGGGCCAGGGAAGTATCAAGAGAAGGCGGCTCTCTGAGTTTTCAGACGTTGAAAAGTGCTTATTCCCCGACATCACTCTCAATGTATCAACGCCAGTCTCCAAAGACTTGGTGCAGATTAGCGAAGACGTAACTACCTTCAGAAAGTACACCAGAGACGATAATGTCCAAAACTGTGTGATCTGGTGTGATGACGGAATGGACAGTGGAAATAAAGCCGGCAAATCACTCATTTTGGATAGAGAGTTGGAGAAAGTACCAAGTAAACAAGAGGCTTTGAAGGCATTAGAGACACTTCAAAAATTTTATAAAAATACGGCGTTTGATCCAACCATAATCAATAGAATTAATGAACTTGAAAAGTACACTCACAGATTACAAACAGCATCACAAAAGAAATTAATAAAATATCTCAAACCACATGATAAAATGGATGCGACCGACGACGAGATCCTTCCAAGATTGTATCAAACGCAGTTAGAAGAAATTGCAGTTAAAAAAAATACAATAATTTTTCTTCCGACAGGTTCGGGTAAAACTCACATTGCGATAAGTTTGATAAAGAGGCTGAGAGATACATTGCAGAAGCCCTGGGGTTTCGGCGGGAAGAGATGTTTTTTTCTTGTCAACACTGTGCCCTTAGTGGCTCAACAGAAAAAAACTATTGAACGATCATGTCCTGTGCACGGAGTGGCCGGTTTTAGCGGTGAAGACAACGTCGATTATTGGAATAAGGCGCAGTGGGACAGCGAACTTTCCAAATATCAGGTGATAGTGATGACGTGTCAGATCCTAAGCGACATGCTGACGCACCAGTACCTCCAGCTCAAGGACATCAGTCTTCTGATTTTCGATGAGTGTCATCACGGAGTCGAGGACCATCCTATGAGACTCATCATGAAGCACTTTCTCAACTGCCCCAAGAACGAACAACCCAGAGTTCTAGGTCTGACAGCGACACTGCTGAATAGTAACGTGAAGTTGAATAAGATACAAGAAACTATCCGGCACCTGGAGACAACCTTCCAGGCTTCCATAGCCACGGCCGACGACATGGGAGAGGTCGCCATGTATTCAACGAATCCCAGAGAACAATTACTATACTACCGCCGCCCCTCGGCGACTCCCGCGGCCTCTGAAGCAGTGGACTTGTTGAGGCGACTCCAACAGATAGTGCTGCAGTTCGAACTGCCGAACACCGTTCAGAAACACGACATCGAGTTGGAACCTCACCAGCGAGACATCACCGGGGATCCCAAGAAGATAATAAAAGCTGTGAAGAACATGATTGAGTCCATGATATTGTCTATTTGTGATCTGGGAGTGTACGTTGGAGCCCTAGGGCTCCTGGCGAACATCGTGGCCCTCGAGAGAAGGAAGCGGTGGGCCTCCAGTCAGGTCGAGGAACTCTTGTACACCGTCACCATAACGGCAGCGGTCGAAGCTCGCTCCGTTCTCTTGGATTCAATGAAAAATGAGAGCGGTTACGAAAAAATTATACGACACTCATCCGAGAAGACCGTCCAACTGCTGAATATATTGAAAGAATACAACCCGCGGACCCCCGTCAGGCGAGCGCTGAAGGTGAATCAGGAGAGGAACGCCCTCTGTGGGTTGATCCTGACCCAGCAGCGGTTCACAGCCAAGATCCTGTACAACCTACTGAAGGACGTCGCGGAGGTGAACAAAGAGGAGTTCGGCTTCCTGAAGCACGACTTCATAGTGGGCTTCAATATATCGCCGCTCTGCTCCACCAGGGAGCAGCACTATAACAAGAAGATCAGCGAACAAGCACTACTGAAGTTCAAGAACGGCGATCTGAACTGTCTGATCGCGACCAGTGTGGT
GGAGGAAGGACTGGACGTTCCTCAGTGCACTCTGGTGCTGAGATACGACGTGCCCCTCGAGTACCGCTCCTACATCCAGAGCAAGGGTCGTGCGAGAAACAAGAAGGCCAATTTCGTGATATTAGTGGATGAAGAGAACAGACAGAAATTTAATAGTCAGTACAGAGCGTTTCAGCAGGTGGAGTCGTACCTGAACGGGATGCTGTGCGGCGGTGAGGTGAGGAACACTCCGTCTGAAGCCGAAGTCAAAGAGAATCTGAACGATGATGATTGTATCCCGGCCTATAAAACGAGAAACGGAAATTCCTTATTCGCGGTGTCCGCTATAAGTCTGTTGAACCGCTACTGCTCAGTGCTGCCTCACGACCAGTATACCACGATCCTGCCCATGGTAATCCAGGAGCCGGCGCAGGGGAACAAGACCATGGTCACCATCATCATGCCGATCGCCTGCCCCATCAAGGAACCGGTCCAGGGCCGGCCCATGTCCAGTGTTAAGAATGCAAAGAGATCTGCGGCCTTGAACGTGTGCATCAAACTTCATAAGGCCGGAGAACTGGACTACGAGACCCTACTGCCAAAAGTGAAGGGGCTAATAGACTTTTCTACGACGGACGTGTCGTCGTGTTTTCCGAACTGGCGCGATGAGACCGAGGTCTCGGACGAGGGTCCCCCGGGAACCAAGAAGAGGGTCAGGAAACATCCCGTACATTATCCCAAATGCCTGGACGGCCCGAAGACGAGGGACGAGCTCACCAAATGTTATCTCCACGTCATAAAGCTGGAGACAGCCTTCGACGAGCCGAGCGACGCCAGGGAGAAGGCGTTGTACGACATGCTGCGGAGGAAGGAGGGCTACGGGTTTATCACGTTCGATAGGCTGCCGGAGCTGTGCGCGTTCCCTATGTTCCTCACAGTGGGGGAGGTCCGCACTACGCTTCAAGTTAACTACGCCGCCATCATCCTCGATACGTCTCACTTAGAACTCATCAAGCAGTTCCATTTCTTCATATTCGATCAAGTGTTGGAGATCGCCAAAAAGTTTCTAGTGCACGACGGGCGGGTCAACAACCTGTACGCGGTGCCCCTGAAATACGACAACGGTTACGATATAGACTGGAACGTGATGCAGACATACAAACAGATAGCGCCGTGCGACGAGCCCACGGCACAGGAGCGCGCGTCTCTGAGTGTCACCAAGGAAATGTACGACAACTGTGTAGTGACGCCCTGGTACAGAGGCAGTCTACACCCCGACAGGTACATAGTGTCCAGAGTTCTGGAATACCTCACGCCGTACTCCAAGTTCGAGGACAACTCGTTCGAGACCTTTGCCGATTACTACAGCAACAAATACAACCTTGAGATCCTCGGCAGGAAGGACCAGCCGCTGCTGGAGGTCCGTAACATAAGCAGTCGTATGAACTGCCTGCTCCCCCGCGCGGCGACCATCCGCGGCCTGAGCGACAAGCAGCGCCGGCTGGCGTCGCTAGCTCACGGCGACTACAAACCTAGAGAATTCACTGAACTCTTCGTGCCAGAGTACTGTGTGCGGGCGGACTACCCCGCGCATCTATGGTACAAGGCCATCATTCTACCCAGCATCGTCCACCGAGTCACCATGTTACTGATCGCTGAGGAGCTGCGAGTGCAGATCCTGACCGACACCCAGAAAAGCCAGGGCAGGCTGACTAAAGGTGTAAGTTGGCTTCCCATAGAAGTGGATCACTTTGTTGTGAAGAGATCCTTGCTGTCCAATTTAGACGAACCTGCTCCTATAAACAGCGTCGACAGAATAAACAACCCCATAGACGAAACGGCACAGAAACCGCCCAACATCGTCTCCCTGAAGCAGAGCGTGTATCAACTCCAGAAGAAGAAGGTGTCCAAGAACTATCCTTGGGACGAGAGCATGGAACCCATTGATATCGAGCGGAACCTGTCCCGGGTCACGGTTATGGACATAGAGTGCTACGACTCCTTCGTGACGGCGCCCCTTG
CTGCAAAGGAGCCCGTCTCTCTGCTTCGGAATTCACCCAAAGTGAAGACGCCCCTCTCCACCGCCCTCCTGCCGCCGCCGCTACGATACAAGGACACTATATCCGTTCTATCCACCGCAAGTTCCCCTCGCGGGCCCGAGCCTCGCGAGGTGTTGAGCGCGCTCACCCTCATCAAGTCTAACGACACCTTCAACCTGGAGCGCTCCGAGACCCTGGGCGACTCCTTCCTCAAGTTCGCAGCCAGCCTATACCTGTTCCACAAGTTCCCGCAGATGGACGAGGGACAACTCACCAACCTCAAGAGCCGGCTTATTGGAAATAGAAATCTGTATTACGCCGGCGAGCGTGCTGGGTTGGCTGGCCGTATGAAAGTGGAACAGTTTAGTCCTCGGAGCGACTTCCTGGTGCCCGGGTTCCTCGCACCCAGCGAGCTCGTCTCATTCATCACAAGACGGAAGTTAATCATCGAGTTATTAAATGCGAGCAAAGTGTTGTCTCAAACAAAACTGCCGTGGCAACACCAGATTTCACGCCAGCAGCGCCTTTATAAAAGCGTAACTTATGTCGATGAATTTTTTAACGTAATTTTCCACGATGAGGGAAATTTTTGA

Protein sequence:

>DPOGS209191-PA
MSKRKLNNFSLSDKLKIVSGLESGKLGKQIMSESRLTESAYRKIINICQWSEGQGSIKRRRLSEFSDVEKCLFPDITLNVSTPVSKDLVQISEDVTTFRKYTRDDNVQNCVIWCDDGMDSGNKAGKSLILDRELEKVPSKQEALKALETLQKFYKNTAFDPTIINRINELEKYTHRLQTASQKKLIKYLKPHDKMDATDDEILPRLYQTQLEEIAVKKNTIIFLPTGSGKTHIAISLIKRLRDTLQKPWGFGGKRCFFLVNTVPLVAQQKKTIERSCPVHGVAGFSGEDNVDYWNKAQWDSELSKYQVIVMTCQILSDMLTHQYLQLKDISLLIFDECHHGVEDHPMRLIMKHFLNCPKNEQPRVLGLTATLLNSNVKLNKIQETIRHLETTFQASIATADDMGEVAMYSTNPREQLLYYRRPSATPAASEAVDLLRRLQQIVLQFELPNTVQKHDIELEPHQRDITGDPKKIIKAVKNMIESMILSICDLGVYVGALGLLANIVALERRKRWASSQVEELLYTVTITAAVEARSVLLDSMKNESGYEKIIRHSSEKTVQLLNILKEYNPRTPVRRALKVNQERNALCGLILTQQRFTAKILYNLLKDVAEVNKEEFGFLKHDFIVGFNISPLCSTREQHYNKKISEQALLKFKNGDLNCLIATSVVEEGLDVPQCTLVLRYDVPLEYRSYIQSKGRARNKKANFVILVDEENRQKFNSQYRAFQQVESYLNGMLCGGEVRNTPSEAEVKENLNDDDCIPAYKTRNGNSLFAVSAISLLNRYCSVLPHDQYTTILPMVIQEPAQGNKTMVTIIMPIACPIKEPVQGRPMSSVKNAKRSAALNVCIKLHKAGELDYETLLPKVKGLIDFSTTDVSSCFPNWRDETEVSDEGPPGTKKRVRKHPVHYPKCLDGPKTRDELTKCYLHVIKLETAFDEPSDAREKALYDMLRRKEGYGFITFDRLPELCAFPMFLTVGEVRTTLQVNYAAIILDTSHLELIKQFHFFIFDQVLEIAKKFLVHDGRVNNLYAVPLKYDNGYDIDWNVMQTYKQIAPCDEPTAQERASLSVTKEMYDNCVVTPWYRGSLHPDRYIVSRVLEYLTPYSKFEDNSFETFADYYSNKYNLEILGRKDQPLLEVRNISSRMNCLLPRAATIRGLSDKQRRLASLAHGDYKPREFTELFVPEYCVRADYPAHLWYKAIILPSIVHRVTMLLIAEELRVQILTDTQKSQGRLTKGVSWLPIEVDHFVVKRSLLSNLDEPAPINSVDRINNPIDETAQKPPNIVSLKQSVYQLQKKKVSKNYPWDESMEPIDIERNLSRVTVMDIECYDSFVTAPLAAKEPVSLLRNSPKVKTPLSTALLPPPLRYKDTISVLSTASSPRGPEPREVLSALTLIKSNDTFNLERSETLGDSFLKFAASLYLFHKFPQMDEGQLTNLKSRLIGNRNLYYAGERAGLAGRMKVEQFSPRSDFLVPGFLAPSELVSFITRRKLIIELLNASKVLSQTKLPWQHQISRQQRLYKSVTYVDEFFNVIFHDEGNF-
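
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes the transcript is entirely coding (the nucleotide sequence starts with ATG and ends with the stop codon TGA, and the trailing "-" in the protein sequence marks that stop). The function name `expected_transcript_length` is hypothetical.

```python
def expected_transcript_length(protein_aa: int, stop_codon: bool = True) -> int:
    """Return the coding length in bp for a protein of `protein_aa` residues.

    Assumes no untranslated regions: every base belongs to a codon.
    """
    codons = protein_aa + (1 if stop_codon else 0)
    return 3 * codons


# 1536 residues plus one stop codon gives 1537 codons, i.e. 4611 bp,
# which matches the transcript length listed for DPOGS209191-TA.
print(expected_transcript_length(1536))
```

If a record's transcript included untranslated regions, the listed length would exceed this value, so a mismatch does not by itself indicate an error.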