Monarch geneset OGS2.0

DPOGS215067
TranscriptDPOGS215067-TA1344 bp
ProteinDPOGS215067-PA447 aa
Genomic positionDPSCF300208 + 524461-527567
RNAseq coverage82x (Rank: top 64%)
Annotation
HeliconiusHMEL0045467e-3930.43% 
BombyxBGIBMGA011111-TA2e-3529.89% 
DrosophilaCG5245-PA5e-3227.55% 
EBI UniRef50UniRef50_UPI00023ADE1E2e-4431.31%UPI00023ADE1E related cluster n=1 Tax=unknown RepID=UPI00023ADE1E
NCBI RefSeqXP_002097392.13e-4030.46%GE26193 [Drosophila yakuba]
NCBI nr blastpgi|3017572041e-4632.14%PREDICTED: zinc finger protein 658-like [Ailuropoda melanoleuca]
NCBI nr blastxgi|3343478706e-5732.81%PREDICTED: zinc finger protein 850-like [Monodelphis domestica]
Group
Gene OntologyGO:00036761.1e-11nucleic acid binding
GO:00082703.5e-05zinc ion binding
GO:00056223.5e-05intracellular
KEGG pathway 
InterPro domain[330-354] IPR0130871.1e-11Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215067-TA
ATGGGGGTAGTGCGGAGTTTCACACTATGCGTAGATGGTGTATACCATTGTGAAGATTTAACACCGTTCTTGAAAATGAACCCCACCGTTCTAATTGAAAGAATAAAAATATTGGAATGCCTTAAATGCAATTACGCTTTCGATAACAAAATTGAATATTTGAAACATATTTTAATTCATCTGAGTAGTAATTGTTATGTAAAGTTAAAAAGGGAGAAAATATTCAAGTGTGCGAAATGCCTTAGAGTGTTTATATGTGAACATGATTACGCCCTCCACAAACTGATTCATGTGAAAGAAGAAGTCAGATCACAACAAAAGGCAGATTTAGTGTACAATCCAAGATACATTAAGAGGTTTAAATGCGAGGAATGCGGCTTAAAGTTTGTCGAGAGAAGCACATTGGAAGCTCATAAAATTTTACATGATCCGTTCCAACACATTTGCTTCTGCGGCATCGGTTATTACAAACAAATTGATCTGACAGCTCATAAAAATCTCGTTCATCCGGAATATAAAGAAATCGAGACAAACAATAACAACATTGAAGATAAAAGCAAAATGGTGACTAAAAAGAAGAGACAGTTGAAATTGTCTAGACATTACAATTGTAAGCATTGCACGGAAAGTTTTATGTGTAAAAAATCTTATAACGAACATCTCAAAACTCATATCGAAATACCATATGAGTGTGCCACTTGCTATGAAAGATTTGACACACAACAGCTGCTACATTTGCACACGGGCTTGCACACACTCTCATGCAATTTGTGCAACAAGACATTCAATAGCAGGCACAGCACATATTTGCACATAAACGGCCACACTTCAGAATTCAAATGTGATGTATGCCACAAGAATTTCAGAAGCAAGTATTTCTTGGACATACATATAAAAATACACCAAGGCGTTTTAAATTTCAAATGCACTTACTGTGGTAAGGAATGCTATAATAAAAGTGCTAAAACCTTGCATGAGAGGACCCATACGAAGGAGAAGCCATTTGTTTGTAGGTATTGCAATAAACCGTTTGGTGATCCATCGTCCTTGCTACGACACAAACGTATACATACCGGTGATAAAATCTACAAGTGTGATAAATGTCCGAAAGCCTTCACAGACATAAGCGGACTAAGAGGTCATCAACCTTCACATTCTAGTGTAAGGTACAAATGCGAAATTTGTGAGAAATCTTTAAAATCCAAACATAATTTAAAAAATCATTTCCTAACTGTGCATAGCCGTTTAAGGAAATATGAATGCCATTACTGCGGTAAGAGATTTATATTGAAAACATATTTAGCTTCTCATATACCGCGTATGCACAAAAACCTCTTCCAATAA

Protein sequence:

>DPOGS215067-PA
MGVVRSFTLCVDGVYHCEDLTPFLKMNPTVLIERIKILECLKCNYAFDNKIEYLKHILIHLSSNCYVKLKREKIFKCAKCLRVFICEHDYALHKLIHVKEEVRSQQKADLVYNPRYIKRFKCEECGLKFVERSTLEAHKILHDPFQHICFCGIGYYKQIDLTAHKNLVHPEYKEIETNNNNIEDKSKMVTKKKRQLKLSRHYNCKHCTESFMCKKSYNEHLKTHIEIPYECATCYERFDTQQLLHLHTGLHTLSCNLCNKTFNSRHSTYLHINGHTSEFKCDVCHKNFRSKYFLDIHIKIHQGVLNFKCTYCGKECYNKSAKTLHERTHTKEKPFVCRYCNKPFGDPSSLLRHKRIHTGDKIYKCDKCPKAFTDISGLRGHQPSHSSVRYKCEICEKSLKSKHNLKNHFLTVHSRLRKYECHYCGKRFILKTYLASHIPRMHKNLFQ-