Monarch geneset OGS2.0

DPOGS208692
TranscriptDPOGS208692-TA1320 bp
ProteinDPOGS208692-PA439 aa
Genomic positionDPSCF300043 - 498461-502868
RNAseq coverage136x (Rank: top 55%)
Annotation
HeliconiusHMEL0152213e-9745.15% 
BombyxBGIBMGA003341-TA3e-6133.93% 
DrosophilaCG15269-PA4e-2132.58% 
EBI UniRef50UniRef50_Q5TYW11e-2436.51%Zinc finger protein 658 n=22 Tax=Primates RepID=ZN658_HUMAN
NCBI RefSeqXP_001862549.13e-2338.58%zinc finger imprinted 3 [Culex quinquefasciatus]
NCBI nr blastpgi|2961895283e-2434.88%PREDICTED: zinc finger protein 658 [Callithrix jacchus]
NCBI nr blastxgi|2961895286e-3334.88%PREDICTED: zinc finger protein 658 [Callithrix jacchus]
Group
Gene OntologyGO:00036768.3e-07nucleic acid binding
KEGG pathway 
InterPro domain[134-168] IPR0130878.3e-07Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26688 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208692-TA
ATGCTGCAGTGCTGCTATTGTTCAAAGAAATTCAAATATGAAAGTGAAAAGAAACGACACGAAAAGAGTCATGATCCACAGTTTGAATGTGACAAGTGTTCGAAAAAGTTTAGTTTTCTTTCTGCTTTGAAGAGGCACCAAAAGCAGCATGAAAGGACTGGCAGTGTTAAATGTACAGAGTGCGGAAAATATTTTCTTGATAAAACTCTTCTAAACAGACATATGAAATATGCTCACCAAGGTACATACGTGTGTTCTAAATGTGAAGCGACGTTCAGTAGCGACCTGGCCTTGGCGTCACATCAGAAGAGCCACAAGCCGCGGTCAGAGCGCCGCTTCAAATGTAATTATGATGGATGTACAAAAACCTTCAACTACATACATCACTTGAAACACCACCAGCTCAGCCATACTAATAAAAAACAACACTATTGTAACGTATGTGGAAAAGGTTTTATTCAGTACCATCATTTCAAAGCTCACAGGAAGGTACATTCGCCTGAAACCTGGTTGCCGTGCACCATACCGGGTTGCAGCAAACAGTTCCCAGATGAATACGCTAGGAAACAACATCTTATGAAACATAAAGGCGCTGATGGTCCATCATCGGATTGTACGAGCAATTTTTCTCATACATCCGACATCAAAGAGTCACCAGATGAAATACTATGTTCGAGTTGCGGAGATATGATTGAAATTTCTTCACAAGCAACTCATCTAGAAAAATGTAAAACGAACACACGCTCTAAACATTCATTAACAAACCTAGAGGATTTGAATATAAGTAATAATAATAATAACAACAACGAGGAAATTAAAAATATCAATGACGGCATTTTGGACGTGTTTATAGACAGCAAAAAATATAAAAATAATCCAAAAAATACAGATAACTCCGACTATTATTCTATGGTTGGAAGTCTTAACTTTAATGACGATATTAAAGGAATTCCATCAGATAGCAGTAACGTGGCATGTGAAGACTGTGATTGTTATACCAAAGACATTATATGCGGTCCCCGAAATGATAAGGACGTACCGCAGATAGAGTATAGAAGCGACGGCGTCATTAAACTCAAGGACACGCTAGAAACAGGTGTCACTGAAGCATTCAAGACTAATAAGTTAGAAAACGAGGAGTTCTATATAAACGAGGTCCCATATAACAGTTGTCAAACAATTCTCGGAGGCTGTATAGTGAGTGGTGATGGCACCATCAGCGAAGGATGCCTCTGTGCGAAGATGGCTCTGAATGAACAAGAAGCCGTCGAACAGGAAATAGATGAAATAACACCACGACCGGTTACATCGACAGCATGA

Protein sequence:

>DPOGS208692-PA
MLQCCYCSKKFKYESEKKRHEKSHDPQFECDKCSKKFSFLSALKRHQKQHERTGSVKCTECGKYFLDKTLLNRHMKYAHQGTYVCSKCEATFSSDLALASHQKSHKPRSERRFKCNYDGCTKTFNYIHHLKHHQLSHTNKKQHYCNVCGKGFIQYHHFKAHRKVHSPETWLPCTIPGCSKQFPDEYARKQHLMKHKGADGPSSDCTSNFSHTSDIKESPDEILCSSCGDMIEISSQATHLEKCKTNTRSKHSLTNLEDLNISNNNNNNNEEIKNINDGILDVFIDSKKYKNNPKNTDNSDYYSMVGSLNFNDDIKGIPSDSSNVACEDCDCYTKDIICGPRNDKDVPQIEYRSDGVIKLKDTLETGVTEAFKTNKLENEEFYINEVPYNSCQTILGGCIVSGDGTISEGCLCAKMALNEQEAVEQEIDEITPRPVTSTA-