Monarch geneset OGS2.0

DPOGS200366
Transcript: DPOGS200366-TA  1989 bp
Protein: DPOGS200366-PA  662 aa
Genomic position: DPSCF300026 + 850253-853327
RNAseq coverage: 326x (Rank: top 35%)
Annotation
Heliconius: HMEL000013  0.0  73.28%
Bombyx: BGIBMGA005660-TA  0.0  78.86%
Drosophila: unk-PD  0.0  50.92%
EBI UniRef50: UniRef50_G6CMS5  0.0  100.00%  Putative unkempt n=2 Tax=Endopterygota RepID=G6CMS5_DANPL
NCBI RefSeq: XP_002425759.1  0.0  56.50%  zinc finger protein CCCH domain-containing protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|261335965  0.0  73.73%  putative unkempt [Heliconius melpomene]
NCBI nr blastx: gi|261335965  0.0  74.18%  putative unkempt [Heliconius melpomene]
Group:
Gene Ontology: GO:0008270  1.4e-06  zinc ion binding
Gene Ontology: GO:0003676  1.4e-06  nucleic acid binding
KEGG pathway:
InterPro domain: [271-296] IPR000571  1.4e-06  Zinc finger, CCCH-type
Orthology group: MCL12465  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200366-TA
ATGCCGTCGGAATCTAAGCCTTTGTTGACAGCCCAAACAGAAAAACCAAATCATTATAAATATCTAAAGGAATTCAGAGTAGAGCAGTGCCCGTCATTTTTACAACATAAATGTACACAACATAGACCCTTTACATGTTTTCACTGGCATTTTAATAATCAGAGAAGAAGACGACCAGTTCGTAAAAGGGATGGATCGTTTAATTATAGCGCCGACAACTATTGTACCAAATATAACGAGACTTCGGGAATCTGCGAAGATGGTGACGAATGTCCATACTTGCATAGAACAGCAGGGGACACGGAACGTAGATATCACTTGCGCTACTACAAAACATGTATGTGCGTACATGACACTGATACTAGGGGTCTCTGTACTAAGAATGGTGCGCACTGTGCATTCGCACACGGCGCCCCAGATCTCCGACCTCCAGTATTAGATATGAGAGAACTCCAAGCTCTGGAAAACCCCGATGGCTTGGACGGAGATGCTGCTGCGCCAAACGCATTAGATAGAGAAAGAAATCTAATGAATGAAGATCCAAAGTGGCAAGATACAAATTATGTACTATCTTCATATAAAACGGAACCTTGTAAGAGACCTCCTAGATTATGTAGACAGGGTTATGCATGCCCTCAATATCACAATAGTAAGGATAAGCGAAGATCTCCAAGGAAATATAAATATCGTAGCACACCATGTCCCAATGTTAAACATGGTGAGGAATGGGGTGAGCCTAGCAATTGTGAAGCCGGTGATGCTTGTGGTTACTGTCATACACGCACTGAACAACAGTTCCACCCTGAGATATACAAATCGACAAAATGCAATGACGTACAGCAGGCCGGCTACTGTCCTAGGGGATTATTTTGTGCATTTGCACATGTGGAACCTGAAGACCTGGGTGGAGCTCGCGATCTCACAGCTCCTTTAGATTGTGGAACAAATCTCGCGGACCTTTTATCTTCCGCGTTGCCGTCAGATAAAAAGAACGATATAGGAGTCCTCGGTGGGATGAGACCTCACTCGCCTGTGACGGCACATGTCAACGGCTCGGGGGATGGTTCCGAATGTGCCTCTACATCTTCAGGCGGTTCCAGCGCGGGCCGTGCTCCTGTTCGTACACTCCTACCTATGGCCTCTTCCATGGTGGACCACGACAAACCGCAGCCGTCATGGCCGCCGCGGGCTGCTTTCGAACCAAATCCTACCGTTTTTGAAGTGGTCGGCAATGCTCTGGACGACCTACATCTCGACGATCCTCTGAATTTCGCTGCATCCCTCGATCGCGACTTGTCTGATGGTGAAGGCCTACTCGTGGGCTCGGCTCCAGTCAACATCCCTCAGCCGCGGCCGACTCTCCGCGGGTTCAGTCCACCGCCGACCAGTTCACCACTGCCACCGTTCCTGCGCTACGCAACCAATGACACTGAGCGTCTATTTAATTCGCACGCAATAAAAGCTGGTAGTTTCGGTGCAGGCGGCGCGGGGCCGTTCGAATTTGGGGCGACCTCCCCTTCAGCGCCGGGTGAGCTGTCTCGGCTGCGAGAGGAGGTGGCCACCTCACGTGCCGCAGCCGCTAGATGGGACGAACGTATTGCTCAGGCTCGTTCAGCGTGCGAGGCCTGGCAGCGTGACTCGGAGGAAGCTAAACGTAAAGCGGTCATAGCGGAGCGTCAGCGCGACGAAGCATTGGCCCACGCGTTGGCACTTAAGCGTGAACTTGAAGCGGCGCGTGCTCCGCGACGAGAATTGCGCGGAATGCCACTGACAGCGCTTAAATCGCTTCAGGCGCAAGCTAGGAGCGAGCTCGAGGAGATAGAAAAGGTTCTGTATCTAGAGACTGCCACCAAATGTATGGTCTGCGAGGAACAACCGCGCAGCGTGACTCTGGCGCCCTGCAATCATTATGTTCTGTGTGACGGCTGTGCTGCAACCGCTAAAGAATGCCCCTACTGCCAGATGCCGCTCCAACAGCATCATCAGTGA

Protein sequence:

>DPOGS200366-PA
MPSESKPLLTAQTEKPNHYKYLKEFRVEQCPSFLQHKCTQHRPFTCFHWHFNNQRRRRPVRKRDGSFNYSADNYCTKYNETSGICEDGDECPYLHRTAGDTERRYHLRYYKTCMCVHDTDTRGLCTKNGAHCAFAHGAPDLRPPVLDMRELQALENPDGLDGDAAAPNALDRERNLMNEDPKWQDTNYVLSSYKTEPCKRPPRLCRQGYACPQYHNSKDKRRSPRKYKYRSTPCPNVKHGEEWGEPSNCEAGDACGYCHTRTEQQFHPEIYKSTKCNDVQQAGYCPRGLFCAFAHVEPEDLGGARDLTAPLDCGTNLADLLSSALPSDKKNDIGVLGGMRPHSPVTAHVNGSGDGSECASTSSGGSSAGRAPVRTLLPMASSMVDHDKPQPSWPPRAAFEPNPTVFEVVGNALDDLHLDDPLNFAASLDRDLSDGEGLLVGSAPVNIPQPRPTLRGFSPPPTSSPLPPFLRYATNDTERLFNSHAIKAGSFGAGGAGPFEFGATSPSAPGELSRLREEVATSRAAAARWDERIAQARSACEAWQRDSEEAKRKAVIAERQRDEALAHALALKRELEAARAPRRELRGMPLTALKSLQAQARSELEEIEKVLYLETATKCMVCEEQPRSVTLAPCNHYVLCDGCAATAKECPYCQMPLQQHHQ-
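
Consistency check (illustrative):

The transcript (1989 bp) and protein (662 aa) lengths agree if the coding sequence is 662 codons plus one stop codon. The following is a minimal Python sketch of how the nucleotide record could be translated and compared against the protein record. It assumes the standard genetic code and writes a stop codon as "-", as the protein record above appears to do. The function name `translate` is hypothetical and is not part of any database tooling.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are emitted as `stop_symbol` (assumed here to be "-").
    Raises ValueError if the length is not a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last four codons of DPOGS200366-TA.
print(translate("ATGCCGTCGGAATCTAAGCCT"))
print(translate("CATCATCAGTGA"))
```

Applied to the full DPOGS200366-TA sequence, the output can be compared directly with the DPOGS200366-PA record, including its trailing "-".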