Monarch geneset OGS2.0

DPOGS215363
TranscriptDPOGS215363-TA1158 bp
ProteinDPOGS215363-PA385 aa
Genomic positionDPSCF300351 + 37117-40406
RNAseq coverage527x (Rank: top 24%)
Annotation
HeliconiusHMEL0045720.080.21% 
BombyxBGIBMGA008641-TA4e-8594.55% 
Drosophilakin17-PA1e-12253.87% 
EBI UniRef50UniRef50_Q8SXR22e-12053.87%RE65257p n=30 Tax=Coelomata RepID=Q8SXR2_DROME
NCBI RefSeqXP_002429462.13e-14265.21%zinc finger protein RTS2, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420179816e-14165.21%zinc finger protein RTS2, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420179813e-14465.21%zinc finger protein RTS2, putative [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain[52-178] IPR0194474.2e-49DNA/RNA-binding protein Kin17, conserved domain
Orthology groupMCL12787 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215363-TA
ATGGGTAAAGCAGAAAAAGGGACGCCAAAATACATAGCGAACAAAATCAAAGCAAAAGGCCTCCAAAAATTAAGATGGTACTGCCAGATGTGCCAGAAACAGTGCAGGGACGAGAACGGCTTCAAATGCCACACAATGTCCGAGTCCCATCAGCGCCAGTTGCTGCTCTTCGCTGACAATGCCTCCAAATACATAGACGAATTTTCGAAGGAATTCTCTGATGGTTATGTGGAGCTACTTCGTCGTCAGTTCGGTACCAAGCGTGTGAATGCCAATAAAGTTTACCAGGATTACATATCCCATCGCGACCATCTCCACATGAACGCGACCCAATGGGAGACTCTCACCGACTTCGTCAAGTGGCTAGGCCGGGAAGGCAAGTGTGTTGTCGACGAAACCGAAAAGGGTTGGTACGTGGCATATATAGATAGGGATCCGGCTACAATAGCTGCACAGGAAGCGAAGGCGAAGAAGGATAAGTGTGATAAAGACGACCAGGAGAGAATGTTAGAATTTATCAGACGTCAAGTTGAAAAGGGAAAGAAGGAAACGACGAGCGTTGAACCGAAATTCACCGAGCTGAAGAGGGAGGGCAGTCAGGAAAAGTTGACCTTAAACTTGAATATGAAGCGGAAGGTCGAAGAAGTTAAACCAGAAATAAAAGCTGCATTCAAAATGAAAGTGAAAGCTGAACCGGCGAAGAGAACAAAGACAGAGGAAAAGTCTAAACAGACGGCTCTGGATGAGATCATGGCGATGCAAGAGAGAGAGAAAGAGAGACACAACCGCAAGGATCACTGGCTGGTTGAAGGCATCATTGTGAAGATAGTTACCAAGTCGCTCGGTGACAAGTATTATAAGAGGAAAGGCACAATAACGAAGGTTGTGGACAAATATGGTGCTCATGTCAAGTTAACCGATGAAGCGGTCACATTGAAATTGGACCAGAATCATCTTGAAACTGTGATACCTTCACCGGGAAGACATGTGAAGTTTGTGAACGGAGCGTACAGGGGACAGATTGGTGTTTTAAAAGACATCAACACTGACAAATATTGCTGTGACGTAGAAATATCAGAAGGTCTATTAACAGGGAGAGTGGTGAAAGGCGTGCAGTACGAGGACATTAGTAAATTATCTTCGGGACAAATAAACTGA

Protein sequence:

>DPOGS215363-PA
MGKAEKGTPKYIANKIKAKGLQKLRWYCQMCQKQCRDENGFKCHTMSESHQRQLLLFADNASKYIDEFSKEFSDGYVELLRRQFGTKRVNANKVYQDYISHRDHLHMNATQWETLTDFVKWLGREGKCVVDETEKGWYVAYIDRDPATIAAQEAKAKKDKCDKDDQERMLEFIRRQVEKGKKETTSVEPKFTELKREGSQEKLTLNLNMKRKVEEVKPEIKAAFKMKVKAEPAKRTKTEEKSKQTALDEIMAMQEREKERHNRKDHWLVEGIIVKIVTKSLGDKYYKRKGTITKVVDKYGAHVKLTDEAVTLKLDQNHLETVIPSPGRHVKFVNGAYRGQIGVLKDINTDKYCCDVEISEGLLTGRVVKGVQYEDISKLSSGQIN-