Monarch geneset OGS2.0

DPOGS209940
TranscriptDPOGS209940-TA3564 bp
ProteinDPOGS209940-PA1187 aa
Genomic positionDPSCF300567 + 11954-21858
RNAseq coverage124x (Rank: top 57%)
Annotation
HeliconiusHMEL0144110.057.67% 
BombyxBGIBMGA001833-TA2e-11964.16% 
DrosophilaDcr-1-PA7e-11946.44% 
EBI UniRef50UniRef50_UPI00015B5D4A8e-12243.42%UPI00015B5D4A related cluster n=1 Tax=unknown RepID=UPI00015B5D4A
NCBI RefSeqXP_002032258.11e-16834.74%GM23619 [Drosophila sechellia]
NCBI nr blastpgi|1953311352e-16734.74%GM23619 [Drosophila sechellia]
NCBI nr blastxgi|1565530565e-11843.18%PREDICTED: endoribonuclease Dcr-1 [Nasonia vitripennis]
Group
Gene OntologyGO:00055158.6e-32protein binding
GO:00168911.9e-26endoribonuclease activity, producing 5'-phosphomonoesters
GO:00055245.9e-13ATP binding
GO:00043865.9e-13helicase activity
GO:00036765.9e-13nucleic acid binding
KEGG pathway 
InterPro domain[927-1109] IPR0031008.6e-32Argonaute/Dicer protein, PAZ
[659-758] IPR0050341.9e-26Dicer double-stranded RNA-binding fold
[378-434] IPR0016505.9e-13Helicase, C-terminal
Orthology groupMCL11692 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209940-TA
ATGTCATCCTCATTGGCGAAGCCCAAAGAGGTTATAATAGAGTACAGTCCCCGTACCAAAAGTCCAGAACTACCAGCTGATTATATACAGCTGGAGGAGTATATGAGGGAGTCCGTCAAAAGTGCGCTGGACTTCATAAATGAGCACAGGTACGACCCCACGGAGATCTACGGTGAAGATATGTATGAGGAGTTCAAAAATATCCCCGACCCCACCCTGGAGCCCAAGGCCATATTCAAGCAGTTCCTATACGTGCTGGACCAACTCGGCCCTTACGCGGCGGACAAAGCGGCCTTCTCGCTGCTCATCAAATTGGAAAAACTTAAAATCAAAGTGCCATACGAGAGACATTTCCTTCTTTTGTGCGTGTGTACTTCCGTTTTCGTAAAGATAAGATGTTACACAGACAGCGTTCTGTCAAAGTATGATAACGATTGGGAAAAGATAACCACATTCTTGACTCCGAAGATGCTCAGATTAGCTGAGATTTTAGAGCAATTCTATCAGAACGAATCGTCAGATAAAGCAGAGAATGAAAAAGACACGAAAAATTGCATCGAAGATAAAGTCGAAGACAATGTTAATAAAGTTATAAATGATAACTTGAATAGGGATAACACAAACGATGTCTCAGATCAGTCCCACGAGGCGAAAGCAGATGTATCGAAAACAAACGAGCTTTTAAATGAAATAGAAAAATGCGATTTCGTAACGCTTGGCAACAAAATCGAAGACAGGGTTCATATGTACGAGGCTAATCTTAAAGACTATGAAGAAAACTATATGGGAGATAAAGTCACAAATAGCTTATGTTCGGTAGTTGAGAACAGAATTGGTGAAAGTGACATAAAAGTTGATGACAGTCTCAAAACGGATAACGTTATTGATGTAATGAGTGAGTCTCCGAAGTTCAAACGCCAGGGTCATAGGTCACGCGGTCGGGGTCGAGTGTCGAGGGTGAACAAGGTCCAAATGCAGGCAAACCCAGACGCGCTCTGTGGTATTGTGTTCGTGAAGGAGCCTTTGATGGCAAAGATATTGTTCATGTTAATTGTGGACTTATCTCGTTCCCGTCCCACGCTGTCACATCTATGCGCCCAGTACTGTACTGATGAGGGAGACGCCAGAGAACATAGGCGACATGAGGAAGTGCTTAAGAAGTTCCGTATGCATGAATGTAATCTATTACTAGCGACGTCGGTGCTCGAAGAAGGTATAGATCTGCCCCGGTGTAATCTCGTTGTGAGATGGGATTTACCACCTTCGTACCGGTCACATGCTCTGTGCCGAGGTCGAGCACGCGCTTGTCGTGCTGCGGTCACGTTACTGTCGAAGGCCGAAACGCATACTGATCGGCTGCTACATCATCTCGCTATTTATAGGGAGCTAGATCAGATAATAACAAGGAAATGCGGCTGCGGAATCCAATATGAACCTCCGCAGGAGGAAGAACATCACGCGGATACAGTCGCTTTCATGGTTAAATCCTATACACCTTTAGAGAATAAAGTCAAAAAACTTGAAACTATAGAAAGCGACGATATAAGCGATAATGTTAGCCCGTGTTTAGAAAATTCTTTGAATGATACTAAAGAAAACGTAGTTTTAGGTGATAGTGAGATAGATAATTGTATTACAGAAAACGAAGACTGTACCAAAGATAATAATAGTAAGATATGCCAAGACTCTGTAACAGAAAACGATGTTAATATTAAAGCACTTAGCGAAATCTTAATCAATGGTGACAAAGATAATCTTGGTGAACGTATGAAGGATTTACAAAATAAAATCGAATCAGAGGCATCGATTGACGAAATATCCGAAGGAATCGCAAATATTGAAGTCAAAGATGATTCGGACGTAAACAAAACATTAGATTTTCGCGAAAACGACAAAAAACTAGAGAACGTAAAGTACGAATATAAAATTAAAAGTGGCAACGATGAAACTTTGGCGAGCGTGGATCTAAATACAGCTATAGCATTGATAAACAGATACTGTGGTAAACTACCCTCGGATACGTTCACACGTCTAACGCCTCAGTGGCGTACTGAGCGCGTTATTCTGAAAGATCGTCGAGGCGAGCCGCGCGACGCGTATGTATGTACACTGAGGATGCCGTTGAACTGTCCCGTGAAGTTTAACATCGTGGGTCATCCAATGCCAACGAGGTCGTTGGCTCGTCGGATGGCAGCTCTGTTGGCGAGTAGGGCGTTACATCGCGCTGGGGAACTTGACGATCAATTGCTACCAATAGGTAAAGAAAACTTCAAAGCCGCTGAAATGGAGAACATAAATTTGGAAGAATCGGGTGAGAACGCGCGACCGGGGACCACGAAGCGCAGACAGTATTACTATAAACGGACGGCATCTGCGTTCACGAATTGTAGACCAGTAGTGGAATATACGGACGAAGACATAGAAAAAAAACCTCATACTTGCGATGAGGGAAAAGTTGATAATGAAACCAGATCACAAGCAGAGAAATTATATAATGATATACCAACCGAAACGAAGGTAGTCCACTTAGTTGGTGCTATCCAAAATGCAGACCTCGGCGAGGAAGCCAGAGAAACCGCAGCAGTACTCCTGCGAAGCTTTATGGAGTTTGTTTTCTCGGACGTGCTTCGAGTGCGACGCCGCGGCATGAAGCTGGCGGACACTACTCACAATAACTACTACGTAGTTCCGACCGTTAAGAAAACAACATCCGATGGCGCCACATGGATAGACATAGACTGGCCTTTCTTAGACTTAATATATCAGCACACAGAAGAGAAGAGATGTTTAGAAATAGAGAAGCCCATGTTGTGGGAAGAAGAGAAGAAGAAAGAGGAGAAGAAGGGAAAGAAGATTGAAAATCCGCTGCTTAAACCAGGAGAGGTATTCGTGTTTGACGCGGAGAAGTACAAGGAAGCGGTCGTGACGCCCTGGTATAGAAATCAGGATCAGCCTCAATTCTTCCTGGTTGCGGAGATATGTTGGAACCTGACGCCGGAGTCGTCGTTCCCGTCAGCGCAACACGCGAGTTTCCGCTCGTACTACAAGAGCAAATACGGCGCTGAACTGACACAGAGCAATCAGCCGTTACTCGACGTGGACCACACGAGCGCTAGGCTGAACCTGCTTACACCACGGTACGTGAATCGTAAGGGCGTGGCTCTACCTGTGTCGTCGGAGCGCACGCGTCGCGCGAAACGTGATCGTCTCGATCAAAAACAGATTCTAGTGGCGGAGTTGTGTATGAGACATCCTTTCGCTGCGCCTCTATGGTGTGCAGCGGTTGCGTTACCATGCGCTTTGTACCGTATAAACGCGCTATTGCTGGCTGATGAAGTGAGGCGTGCTGTGTCAAGGGATGTGGGAGTTGGGCAGCCTAGAGATGACATATCATGGCCGCCGTTGGATTTTGGATGGAGTTTAGCGGAGGTTCTGAGCGCTGATTCTGACAAAGGGGATAAGAAAAAAGAAAATGAAATGAAGCGATTGCTGTTCCAGAGATGGAGAAAAAGAGAGTATAAAAGAAACAGAAGAGAGACCGGAAGTCAAAGAGAAGACTATTAA

Protein sequence:

>DPOGS209940-PA
MSSSLAKPKEVIIEYSPRTKSPELPADYIQLEEYMRESVKSALDFINEHRYDPTEIYGEDMYEEFKNIPDPTLEPKAIFKQFLYVLDQLGPYAADKAAFSLLIKLEKLKIKVPYERHFLLLCVCTSVFVKIRCYTDSVLSKYDNDWEKITTFLTPKMLRLAEILEQFYQNESSDKAENEKDTKNCIEDKVEDNVNKVINDNLNRDNTNDVSDQSHEAKADVSKTNELLNEIEKCDFVTLGNKIEDRVHMYEANLKDYEENYMGDKVTNSLCSVVENRIGESDIKVDDSLKTDNVIDVMSESPKFKRQGHRSRGRGRVSRVNKVQMQANPDALCGIVFVKEPLMAKILFMLIVDLSRSRPTLSHLCAQYCTDEGDAREHRRHEEVLKKFRMHECNLLLATSVLEEGIDLPRCNLVVRWDLPPSYRSHALCRGRARACRAAVTLLSKAETHTDRLLHHLAIYRELDQIITRKCGCGIQYEPPQEEEHHADTVAFMVKSYTPLENKVKKLETIESDDISDNVSPCLENSLNDTKENVVLGDSEIDNCITENEDCTKDNNSKICQDSVTENDVNIKALSEILINGDKDNLGERMKDLQNKIESEASIDEISEGIANIEVKDDSDVNKTLDFRENDKKLENVKYEYKIKSGNDETLASVDLNTAIALINRYCGKLPSDTFTRLTPQWRTERVILKDRRGEPRDAYVCTLRMPLNCPVKFNIVGHPMPTRSLARRMAALLASRALHRAGELDDQLLPIGKENFKAAEMENINLEESGENARPGTTKRRQYYYKRTASAFTNCRPVVEYTDEDIEKKPHTCDEGKVDNETRSQAEKLYNDIPTETKVVHLVGAIQNADLGEEARETAAVLLRSFMEFVFSDVLRVRRRGMKLADTTHNNYYVVPTVKKTTSDGATWIDIDWPFLDLIYQHTEEKRCLEIEKPMLWEEEKKKEEKKGKKIENPLLKPGEVFVFDAEKYKEAVVTPWYRNQDQPQFFLVAEICWNLTPESSFPSAQHASFRSYYKSKYGAELTQSNQPLLDVDHTSARLNLLTPRYVNRKGVALPVSSERTRRAKRDRLDQKQILVAELCMRHPFAAPLWCAAVALPCALYRINALLLADEVRRAVSRDVGVGQPRDDISWPPLDFGWSLAEVLSADSDKGDKKKENEMKRLLFQRWRKREYKRNRRETGSQREDY-