Monarch geneset OGS2.0

DPOGS213224
Transcript: DPOGS213224-TA (957 bp)
Protein: DPOGS213224-PA (318 aa)
Genomic position: DPSCF300394 - 86496-87452
RNAseq coverage: 5x (Rank: top 88%)
Annotation
Heliconius        HMEL016959            1e-136    72.53%
Bombyx            BGIBMGA002238-TA      6e-124    68.24%
Drosophila        hkb-PA                3e-53     40.06%
EBI UniRef50      UniRef50_D6X1H7       2e-58     69.18%    Huckebein n=2 Tax=Tribolium castaneum RepID=D6X1H7_TRICA
NCBI RefSeq       XP_001807455.1        2e-59     69.18%    PREDICTED: similar to GA22020-PA [Tribolium castaneum]
NCBI nr blastp    gi|189240762          4e-58     69.18%    PREDICTED: similar to GA22020-PA [Tribolium castaneum]
NCBI nr blastx    gi|270014226          5e-59     69.43%    huckebein [Tribolium castaneum]
Group
Gene Ontology     GO:0003676            1.3e-13   nucleic acid binding
                  GO:0008270            4.1e-06   zinc ion binding
                  GO:0005622            4.1e-06   intracellular
KEGG pathway      dan:Dana_GF16814      2e-53
                  K12379 (HKB)          maps-> MAPK signaling pathway - fly
InterPro domain   [237-259] IPR013087   1.3e-13   Zinc finger, C2H2-type/integrase, DNA-binding
                  [267-289] IPR007087   4.1e-06   Zinc finger, C2H2
Orthology group   MCL17485              Insect specific

Nucleotide sequence:

>DPOGS213224-TA
ATGTTTACTGTTAAACAGAACTACTCCAGGCTGTTCAGACCTTGGGATGTGAGTGAAAGCGAAGACATTGAAACAAAAACAAGTGAACAAACAGAGTTACGAGATGCTGCAGTTAAGATAAAGAAAGATATGAAACGCGAGAGCAGGTTGAGGAAAATACGGGGTGTCTTCAAAAGAAACGATGTCAACCAAGACGCCACAGTCTCCACGACAACGGCTGATGATGAAGAGAGAACGCTTAAAGTCGAAGGTGTTCCCAATGAACCCACTGATTTATCAGATTCTCCTGTAGCCAGCTTAAGCTTATGTTGCGATAGATTACTCCAAGAACCAGAGAAAATGTCAAAAACAGATAAAATCAGAGATATCCCAAAAACACCAGACAATGTATATCCGCAAAACTTATTCCCTCAAAGTATAGTGTCATCAAATATGTATAATCCCGGATACATACCTGATTATGTAACATCAGACATAGCCCAAACACTCGGCGTGCCTCCAACCGATCCCCTATTACTGGAATCACTGGCACAAGGATACGCGATGGAATTAGAATACGCCAGAATACTCCAACAAGAAAACGAAGCGAAGCTCCTGAACGCCAGGAAACAGAGACCAAAGAAATACAAATGTCCGCACTGCCAAGTCGGCTTCTCCAACAACGGCCAGCTGAAAGGTCACATACGGATCCACACAGGAGAGAGGCCTTACAAGTGTGACGAGAAGAACTGCGGCAAGACCTTCACGAGGAACGAGGAATTGACGCGTCATAAGAGGATACATTCAGGAGTTCGTCCATACCCGTGTCCCACTTGTGGGAAGAAGTTCGGAAGGCGTGATCATTTGAAGAAACACACAAGAACTCACTACATGCAAGCCGAGAGAATGATGCCAGTGTTCGTACCTCTGACAGCCGTCCAACACGTTGCAGGATTCCCATACTTATATGGCTATTAG

Protein sequence:

>DPOGS213224-PA
MFTVKQNYSRLFRPWDVSESEDIETKTSEQTELRDAAVKIKKDMKRESRLRKIRGVFKRNDVNQDATVSTTTADDEERTLKVEGVPNEPTDLSDSPVASLSLCCDRLLQEPEKMSKTDKIRDIPKTPDNVYPQNLFPQSIVSSNMYNPGYIPDYVTSDIAQTLGVPPTDPLLLESLAQGYAMELEYARILQQENEAKLLNARKQRPKKYKCPHCQVGFSNNGQLKGHIRIHTGERPYKCDEKNCGKTFTRNEELTRHKRIHSGVRPYPCPTCGKKFGRRDHLKKHTRTHYMQAERMMPVFVPLTAVQHVAGFPYLYGY-
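
The figures in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the genomic coordinates are 1-based and inclusive and that the standard genetic code applies. The helper names `span_length` and `translate` are hypothetical. It translates only the first 18 and last 9 nucleotides of DPOGS213224-TA, copied from the sequence above, rather than the full transcript.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def span_length(start: int, end: int) -> int:
    """Length of a genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def translate(nt: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    usable = len(nt) - len(nt) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # Genomic span versus the stated transcript length (957 bp).
    print(span_length(86496, 87452))
    # Codon count minus the stop codon versus the stated protein length (318 aa).
    print(957 // 3 - 1)
    # First six and last three codons of the transcript shown above.
    print(translate("ATGTTTACTGTTAAACAG"))
    print(translate("GGCTATTAG"))
```

Under these assumptions the span is 957 bases, which equals the transcript length, and 319 codons less one stop codon gives the 318 aa listed for DPOGS213224-PA. The trailing `-` in the protein sequence corresponds to the `*` this sketch emits for the terminal TAG codon.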