Monarch geneset OGS2.0

DPOGS213651
TranscriptDPOGS213651-TA1539 bp
ProteinDPOGS213651-PA512 aa
Genomic positionDPSCF300165 + 198588-201672
RNAseq coverage64x (Rank: top 68%)
Annotation
HeliconiusHMEL0052543e-7949.84% 
BombyxBGIBMGA001681-TA9e-3232.04% 
DrosophilaMeics-PA4e-2924.61% 
EBI UniRef50UniRef50_D6WI126e-4531.39%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WI12_TRICA
NCBI RefSeqXP_002429661.19e-4736.16%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420183922e-4536.16%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420183922e-5436.16%krueppel c2h2-type zinc finger protein, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00056342.9e-09nucleus
GO:00082702.9e-09zinc ion binding
GO:00036761.2e-08nucleic acid binding
GO:00056222.4e-05intracellular
KEGG pathway 
InterPro domain[6-75] IPR0129342.9e-09Zinc finger, AD-type
[415-457] IPR0130871.2e-08Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL34983 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213651-TA
ATGTGTGAATGCAAGTGTAGATTGTGTTTAAAGGTATCATCTAATATTGTGGAAGAATTGTGGGAAGGATGTAAAATTTTGGAACAATTGTTTACCTGTCTGGGGCACCATGTAGTAATAAACAAAGATCTACCTAACAAAATATGCAATAATTGTGTCATGAAAATAGAAGATATATACAAATATCAACAATTTATCAGACAAAATGAAATAAAATTACAACAAGAATTAAATAATATTATAATTCATAGTGAAATTGTTAATATCAATGAAGTAAAAATAGAAGATCAAATATTAGATGAAGAAAAATCTATAGATCAAGAGAAAAATGTTAAAATTAAATCAGAAATTACTGAAGAAAATGTAAAAAATATAACAGAATTACATGGAGAAGTAATTCAGAAAAATGATGTTGAATTATATCCAGTCAAACAAGAAATTGAAAACAAAATTGTTTATAGTAATGAAGACAAAGATATTAATAAAGAAATATCAAATGCAATTAAACTAGACGAAACAAAGAGATTTTCATGTTTAACATGCTTTGAAGTATTCCCAAATCAATTGGAACTCTTAAGGCACTACCAGAATGTTGAATTAGAAAAATATAACAAGAATAATACAGATGTTGAAAGTAAACCTGTAAAGTACAACGTGTTCACAGACGATAATGGTTTATATTACAAATGTGAGAGGTGTTACAAAAAGTACCGACAGAAATCATACATAAACAGACATGTATTGAGTCACATAGAAAGAAGACCATTCCTCTGCAAGCTGTGCGGTAAAACTTACCAAACAGCATCAATAATAGTTTCCCATGGAAAGATACACACGGGAGATATATATGCATGTACATACAACTGTGGCTACCGATCTGTACACAAACATGTTGTCAAAAATCATGAGAAAAGACACAAAGGAGAATTTAAGTATAAATGCCAGACATGCGGCAAAGGATTCCAAGTGAGATCATGGTACCAACAGCATCAGAACATTCATGATGGAGTCAAGCCATTCAAATGCGATATCTGTGGCATGAGTTTTCATCTGCATCGATATCTAACTACACATCGAAGCAATGTACACCCACAGTCGTCTGTTCGCAAACCGTGGGTTTGCAAGCAATGTGAATATCCCTGTGACTCCAAAAATAGTCTAAATTTGCATTTGAAGGATAAACATGGCCTAGTTATAAAGAAATCAAACTTGTGTGATGTTTGTGGTAAAGTCTTGAAGGATTCCCAACAGTTGAAAGTACACAAGAGAGCCGTACACTTGAACATCAAGCCGTATGTCTGTGGTACATGTAACAAGTCGTTCCCCAAAAAATATACTCTCAAGAACCACGAACAGACACACAAAGGAAAGACGTTTTTATGTTCCATGTGTGACAAGATGTTCGCTAAAGACGCCAGTCTACAGAAACACGTACAAAGGTGTCATATAACAAGCAAGTACAGGTGTCACGAATGTGACAAATCATTTTCATCAAAGATGACATTGACAGTTCACGTGAAGAATTGCAACCGGAAGTGA

Protein sequence:

>DPOGS213651-PA
MCECKCRLCLKVSSNIVEELWEGCKILEQLFTCLGHHVVINKDLPNKICNNCVMKIEDIYKYQQFIRQNEIKLQQELNNIIIHSEIVNINEVKIEDQILDEEKSIDQEKNVKIKSEITEENVKNITELHGEVIQKNDVELYPVKQEIENKIVYSNEDKDINKEISNAIKLDETKRFSCLTCFEVFPNQLELLRHYQNVELEKYNKNNTDVESKPVKYNVFTDDNGLYYKCERCYKKYRQKSYINRHVLSHIERRPFLCKLCGKTYQTASIIVSHGKIHTGDIYACTYNCGYRSVHKHVVKNHEKRHKGEFKYKCQTCGKGFQVRSWYQQHQNIHDGVKPFKCDICGMSFHLHRYLTTHRSNVHPQSSVRKPWVCKQCEYPCDSKNSLNLHLKDKHGLVIKKSNLCDVCGKVLKDSQQLKVHKRAVHLNIKPYVCGTCNKSFPKKYTLKNHEQTHKGKTFLCSMCDKMFAKDASLQKHVQRCHITSKYRCHECDKSFSSKMTLTVHVKNCNRK-