Monarch geneset OGS2.0

DPOGS205372
TranscriptDPOGS205372-TA1569 bp
ProteinDPOGS205372-PA522 aa
Genomic positionDPSCF300373 - 86180-91124
RNAseq coverage148x (Rank: top 54%)
Annotation
HeliconiusHMEL0134430.070.04% 
BombyxBGIBMGA008773-TA4e-13957.87% 
DrosophilaCG12299-PA2e-2928.30% 
EBI UniRef50UniRef50_UPI0001CF20965e-3138.40%UPI0001CF2096 related cluster n=1 Tax=unknown RepID=UPI0001CF2096
NCBI RefSeqXP_002738230.12e-3037.05%PREDICTED: zinc finger protein 197-like, partial [Saccoglossus kowalevskii]
NCBI nr blastpgi|1942124351e-3027.44%PREDICTED: zinc finger protein 77-like [Equus caballus]
NCBI nr blastxgi|1564057652e-3731.15%predicted protein [Nematostella vectensis]
Group
Gene OntologyGO:00036762e-10nucleic acid binding
GO:00056342.5e-07nucleus
GO:00082702.5e-07zinc ion binding
KEGG pathway 
InterPro domain[449-476] IPR0130872e-10Zinc finger, C2H2-type/integrase, DNA-binding
[14-87] IPR0129342.5e-07Zinc finger, AD-type
Orthology groupMCL25537 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205372-TA
ATGGATGATATTATGGTAGAAAGTCTGCCCATACTCGGCACTTGTAGCTTATGTCTTGCGGAAGGAATAGTAAGAAGTATGATTCTGAAAGAAAATAATGAAACAAACAGAGAAAATTATATAGATATACTTTTAAAGTGTTTTTCAATCGATATGCTGTCGTTAGACCTGGACGATACTAAATACATGATATGCAGTCTGTGCATCAAACAGCTGGAAATCTGTCACAGGTTCAAAGAACAGGTGATAGTCTCGCTGAGGACGCTAGAAGCCAGCACTAGAATCAAGAAGGCGGACCAGTCTTATGATGTGGTTAAAATTGAGGGAACAGAAAATAATGTTAACTGTGTCAAGGAGGAGATGAATCCAGCGGTTAGACTCACCAGCTGTTCTGATGTTGAGGACTCAGTCCTAAACGACTTACTTATCAAACACGATCCTGATGAACTGAGATCGAGGCGTTCTAGACGGAACACGCTACTGGCTAAAAAGAAAGTGTCTTACTCCGTGCGAAAACAAGCGGAATTGAAGGAAACAGAGAGGATGTTGAAACGTGGACTGTTCCCGTTCAAGATTGGCAAAAATCAGACGTACACCTGCGCGATCTGCCCGGAAAAATCCACAGCCCTGGACGACATTAAATCCCACATAACGGATCACAACATCGCAAACATACACGTGGCCTTCAAAAAGACGATGACCTCCAACCAACACAGGTTCTATAAATACTCGACCAAACTGAAATGCAAATTATGCAAGGAGGACATAAGGGACTACGTCACACTCAAAAACCACATCGGCTCCTGCGTCAGGAGCAGCGCGAAATGCAACAATCTGCCGTTCAAACTGGAGAAAGACCAGCTGGACTGTCCGATATGCAAGAAGACGTTCCTGAACTTCGTGAGCCTGAACACGCACATGAACGTCCACTATCCTAACCACGTGTGTGACAACTGCGGCAAGGCGTTCGCTTCGAAGGCGCGACTCCGCGGTCACATGAGGACCCACGAAATCGGGGATTTCCCTTGCAGATACTGCGACCAGGTCTTCGATAGGGTCACCAAGCGAGAGAATCACGTCAGCAAAGAACACAAATCCGGCATAAGGTACGCCTGCAAGCGCTGCAACATATCCTTGACGTCGTTCTACGCCAGGCAGAAACATTTAGCGGAAGTCCACAACGAGGAGCTCAAGAGGTACAAGTGTAAGGCCTGCACCCAGAGCTATATAACGCCTGGACATCTATCGAGTCATGTCAGAAGAGATCACCTCAACGAGAGGAACCACAAATGCACTAAATGCGATCAGGCCTTCTATACGAGGAACTCGTTGAAGATGCACATGATCAAGCATGACGGGGAGCGCATACACACCTGCAACATCTGCAACAAGTCCTACCAGAGGAAGAAGACGCTGCGGGAGCACATGCGGATACATAACAACGACAAGAGGTTCGTGTGTCCAGTGTGTGCCAGAGCTTTCACGCAGAAGTGCACCCTCAAAGGTCATTTGAAAGTCCACGAACGAAGATTGGAGGACAACGTCCACCCGTCCGCTCAGATGCTGTAG

Protein sequence:

>DPOGS205372-PA
MDDIMVESLPILGTCSLCLAEGIVRSMILKENNETNRENYIDILLKCFSIDMLSLDLDDTKYMICSLCIKQLEICHRFKEQVIVSLRTLEASTRIKKADQSYDVVKIEGTENNVNCVKEEMNPAVRLTSCSDVEDSVLNDLLIKHDPDELRSRRSRRNTLLAKKKVSYSVRKQAELKETERMLKRGLFPFKIGKNQTYTCAICPEKSTALDDIKSHITDHNIANIHVAFKKTMTSNQHRFYKYSTKLKCKLCKEDIRDYVTLKNHIGSCVRSSAKCNNLPFKLEKDQLDCPICKKTFLNFVSLNTHMNVHYPNHVCDNCGKAFASKARLRGHMRTHEIGDFPCRYCDQVFDRVTKRENHVSKEHKSGIRYACKRCNISLTSFYARQKHLAEVHNEELKRYKCKACTQSYITPGHLSSHVRRDHLNERNHKCTKCDQAFYTRNSLKMHMIKHDGERIHTCNICNKSYQRKKTLREHMRIHNNDKRFVCPVCARAFTQKCTLKGHLKVHERRLEDNVHPSAQML-