Monarch geneset OGS2.0

DPOGS200505
TranscriptDPOGS200505-TA1311 bp
ProteinDPOGS200505-PA436 aa
Genomic positionDPSCF300450 - 54252-59211
RNAseq coverage589x (Rank: top 22%)
Annotation
HeliconiusHMEL0176224e-17564.03% 
BombyxBGIBMGA001717-TA5e-10459.54% 
DrosophilaZpr1-PA3e-12148.64% 
EBI UniRef50UniRef50_Q5TSZ97e-12751.63%AGAP004417-PA n=7 Tax=Pancrustacea RepID=Q5TSZ9_ANOGA
NCBI RefSeqXP_975557.13e-14457.98%PREDICTED: similar to zinc finger protein [Tribolium castaneum]
NCBI nr blastpgi|3504075789e-14558.18%PREDICTED: zinc finger protein ZPR1-like [Bombus impatiens]
NCBI nr blastxgi|3504075782e-14257.86%PREDICTED: zinc finger protein ZPR1-like [Bombus impatiens]
Group
Gene OntologyGO:00082702e-76zinc ion binding
KEGG pathway 
InterPro domain[239-396] IPR0044572e-76Zinc finger, ZPR1-type
Orthology groupMCL14362 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200505-TA
ATGTGTGAAGAACAGAAACCTCTGTTCAGAGATCTCGCAGGTGATGATCCGGAACCCGAGGTGACGGAGATCGAGTCGCTCTGTCTGAATTGCCACGAAAATGGCATGACTCGTCTTCTGTTAACTCGAATCCCGCACTACAAGAATGTCGTTATAATGTCCTTCAACTGTGAGCACTGCGGCTTTGAGAACAACGAGATACAACCAGGGGGAGCGTACGCCGAGTTAGGAGTACGTTGGAAGCTGAACATTCAAGAGTCTCGTGACCTGAATCGTCAGGTGGTGAAGAGTGACCATACAGCCGTCATCATACCGGAGCTGGACTTTGAAATACCGGCCCTGAGTCAGAAGGGAGTATTTTATAGTTGTAAATGTCATTATATATATAACCAGGCCGTTAGAAGACAACAGCATCCAGAACATGCGGAGCAAATAGATCAGTTCGTCGCCAAGCTGGAGGAACTTAGAAGTCTGCAAAAGACCTGGACCCTCATCATAGAAGATATAACCGGGAACTGTTTCGTAGAGAACCCAGAGGCTCCTAAAAAGGATCCGGGCTGTGTCAGGACGGACTTCAAGAGGAGTAAAGAGGATGATATCAAACTGGGAATATACACGGAGGGCTCGCAAGCCCTCGCCGGAGCCCTAACGTCGGAGGAGCCCTCGGTGGCTAGCTACGACCAGCTCGCATCGGACGAAGTGCTGCAGTTCAGGACCAACTGCCCTGAATGTAACGCACCAGCCGACACCAACATGAAGATCACTAAGATTCCGCACTTCAAGGAAGTTGTGATAATGGCGACTGTCTGTGACGCGTGCGGGCATCGGACCAATGAGGTGAAGTCAGGTGGTGGAGTTGAAGAGAAAGGCGTCAAGTTTGAAGTAAAGATCAGGAACAGAGAAGATTTCACCAGAGATATATTAAAGTCGGAGACATGTAACATGGGAATCCCCGAGCTGGAGTTGGAGGTGGGGGGCGCGGCCCTGGGAGGACGCTTCACGACGGTGGAGGGAGTCCTGACTGCGGTTAAGGACCAGATGAGGGAGGGGGTCGGGCTGGGGGACGCGGGGGGAGAGGCCAGGGAGAAAGTCGAGAGATGTATATCCAGCATAGACGAAATCTTAGAGGGCAAATCCACAGCCACATTGATCTTGGACGACCCCGCTGGGAATTCATACGTACAGAACCTTAGCGATGACCCCACAGTGTTTGATGATGGTCTCAAAGTAGTTCATTACGAGCGTTCCTATGAACAAAACGACGAGTTGGGACTGAATGATATGAAGACGGAAGGTTACGAAGAGAGTTGA

Protein sequence:

>DPOGS200505-PA
MCEEQKPLFRDLAGDDPEPEVTEIESLCLNCHENGMTRLLLTRIPHYKNVVIMSFNCEHCGFENNEIQPGGAYAELGVRWKLNIQESRDLNRQVVKSDHTAVIIPELDFEIPALSQKGVFYSCKCHYIYNQAVRRQQHPEHAEQIDQFVAKLEELRSLQKTWTLIIEDITGNCFVENPEAPKKDPGCVRTDFKRSKEDDIKLGIYTEGSQALAGALTSEEPSVASYDQLASDEVLQFRTNCPECNAPADTNMKITKIPHFKEVVIMATVCDACGHRTNEVKSGGGVEEKGVKFEVKIRNREDFTRDILKSETCNMGIPELELEVGGAALGGRFTTVEGVLTAVKDQMREGVGLGDAGGEAREKVERCISSIDEILEGKSTATLILDDPAGNSYVQNLSDDPTVFDDGLKVVHYERSYEQNDELGLNDMKTEGYEES-