Monarch geneset OGS2.0

DPOGS210386
TranscriptDPOGS210386-TA1179 bp
ProteinDPOGS210386-PA392 aa
Genomic positionDPSCF300025 + 974927-978728
RNAseq coverage1139x (Rank: top 11%)
Annotation
HeliconiusHMEL0055032e-15283.38% 
BombyxBGIBMGA011604-TA6e-16083.80% 
DrosophilaCG8635-PA9e-11863.01% 
EBI UniRef50UniRef50_Q7JWR91e-11563.01%Zinc finger CCCH domain-containing protein 15 homolog n=15 Tax=Eukaryota RepID=ZC3HF_DROME
NCBI RefSeqXP_969738.11e-13172.18%PREDICTED: similar to GA21225-PA [Tribolium castaneum]
NCBI nr blastpgi|3323729632e-13568.06%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3320199161e-15571.29%Zinc finger CCCH domain-containing protein 15-like protein [Acromyrmex echinatior]
Group
Gene OntologyGO:00082704.7e-08zinc ion binding
GO:00036764.7e-08nucleic acid binding
KEGG pathway 
InterPro domain[93-119] IPR0005714.7e-08Zinc finger, CCCH-type
Orthology groupMCL13271 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210386-TA
ATGCCGCCCAAAAAAGCTGCTGCAGCCAGCAAAAAGACTGAGGCGAAGAAGAAGGACAAAGTTATAGAGGATAAAACCTTTGGTCTCAAGAACAAGAAAGGTGCCAAACAACAGAAATTCATCCAGCAAGTGGAAAAACAAGTTAAAAGTGGTGGCATTCATCCAGCTAAGCCCATGGATGATAAGAAGAAGGATAAAGAACAGAAATTAAAGGAACAGAAAGAACTGGCCATGTTGTTCAAACCAGTGCAGACTCAAAAAGTTGAAAAAGGTATTGATCCTAAGTCGGTGGTGTGTACATTCTTTAAGCAGGGCCAGTGCACGAAAGGAGACAAATGTAAATTCTCACACGACCTTAGCATTGAAAGAAAAGCAGAAAAGAGGTCGCTCTATGTAGATATGAGAGATGATGAAGACACTATGGACAATTGGGACGAGGATAAATTAAAAGAAGTTGTTGAGAAGAAACATGGGGAAGGCAACAAACAGAGACCAGGCACCGACATTATTTGCAAGCACTTTTTGGAGGCTGTCGAGAAATCTAAATATGGTTGGTTCTGGGAATGTCCGTCGGGTACCAAGTGTATTTATAGACATGCCTTGCCGCCCGGCTTCGTCCTGAAACGTGATAAGAAGAAGTTGGAGGACAAGAAAAATGAGATTTCACTTGTAGATCTCATCGAAAGGGAAAGAGCAGCGCTTGGACCTAATCAGACAAAAATAACGTTGGAGACATTCCTCGCTTGGAAAAAGAAGAAAATCAAAGAAAAACAGGAAGCGTTCACGCAAGCCGAAGAACAGAAGAGAAATGACTTCAAGGCTGGTCGAGCTGTGGGCTTGTCTGGACGGGAGATGTTCGCCTTCGACCCGGCGCTAGCAGCTGATGACGATGATGATGAAGCCGTGGACCTGCGGTACTATGATGATGAAGAGGAACAAGACGACACGGAGTACAGGGACATACAACTGGACCTCATCGGCATGGAAGCCGCGGAGGTGGATGGCAGCGGCACTAAAGCGACGGAAGACAGGTTGCAGCAGCTGAGTCAAGGGTTAGAGAACGGAGAGAAGCAACCAGCGAGTGCTGAAGATGGATCGACCGCCGTCCCTATCAATGAGAACCTGTTCCTCGACGAAGACCTGGACGAGGAGCTGCAAAATCTTGATCTGGAAGATTAG

Protein sequence:

>DPOGS210386-PA
MPPKKAAAASKKTEAKKKDKVIEDKTFGLKNKKGAKQQKFIQQVEKQVKSGGIHPAKPMDDKKKDKEQKLKEQKELAMLFKPVQTQKVEKGIDPKSVVCTFFKQGQCTKGDKCKFSHDLSIERKAEKRSLYVDMRDDEDTMDNWDEDKLKEVVEKKHGEGNKQRPGTDIICKHFLEAVEKSKYGWFWECPSGTKCIYRHALPPGFVLKRDKKKLEDKKNEISLVDLIERERAALGPNQTKITLETFLAWKKKKIKEKQEAFTQAEEQKRNDFKAGRAVGLSGREMFAFDPALAADDDDDEAVDLRYYDDEEEQDDTEYRDIQLDLIGMEAAEVDGSGTKATEDRLQQLSQGLENGEKQPASAEDGSTAVPINENLFLDEDLDEELQNLDLED-