Monarch geneset OGS2.0

DPOGS212699
Transcript: DPOGS212699-TA | 720 bp
Protein: DPOGS212699-PA | 239 aa
Genomic position: DPSCF300012 - 849720-852335
RNAseq coverage: 139x (Rank: top 55%)
Annotation
Heliconius: HMEL015533 | 2e-47 | 43.27%
Bombyx: BGIBMGA014538-TA | 3e-16 | 54.67%
Drosophila: CG17359-PA | 8e-18 | 27.56%
EBI UniRef50: UniRef50_Q9VUB3 | 1e-15 | 27.56% | CG17359 n=1 Tax=Drosophila melanogaster RepID=Q9VUB3_DROME
NCBI RefSeq: XP_001359871.2 | 7e-17 | 26.61% | GA18174 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastp: gi|198455135 | 1e-15 | 26.61% | GA18174 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastx: gi|189241407 | 2e-19 | 27.75% | PREDICTED: similar to novel KRAB box and zinc finger, C2H2 type domain containing protein [Tribolium castaneum]
Group
Gene Ontology: GO:0005634 | 9.5e-14 | nucleus
  GO:0008270 | 9.5e-14 | zinc ion binding
  GO:0003676 | 3.2e-13 | nucleic acid binding
  GO:0005622 | 4.8e-05 | intracellular
KEGG pathway: dpe:Dper_GL19495 | 5e-14
  K02215 (CF2) maps-> Dorso-ventral axis formation
InterPro domain: [12-80] | IPR012934 | 9.5e-14 | Zinc finger, AD-type
  [205-233] | IPR013087 | 3.2e-13 | Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group: MCL26039 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212699-TA
ATGGAAGAAAAAAGGAAAAATAAATTATTCGACATATGTAGACTGTGTTTAGTGAAACGTGGATTTTGTGATATTACTGACAGAAACGAATTGTGCGATAATATTCACAAATTCACCGGTCTCAAGATATCCGCGTCGGACAAATTGCCTCAGAAAATTTGCAGAAATTGTTTAGACATAGTAAATAAAGCATCAGAATTAAGAAACATGGCTAGGAAGAATGATAAACATTTAAGATCTTTATTTGATTGCCCCGTGGAAGATCCGGTACCGGAATTTGAAGCGGGATCATCAAGTGAGAAGAGTGAAATTGTTTTACAAGAGACGGAAAGAAAATGCTTCTCCCTAGCCGTCAGGCAAGACCTGTTTAACAAATCCAAAACAAATGATGAAGAAGATAGCGAAGCTGTTAATTCTCTGAATCTCATCCACTCAATGAAGCCCGAAAATGAGGACACGGACAGTGAATATAAATGTCCGGTCTGCTCGAAGAATTTTGCAAAATGGAGGAAACTGTACGTACACCATCGCTTGCACAATAAATGTTACAAGTGTCCGATTGATATATGCGTAAAGAGGTTTGCTACAAAAGGAGATCTTGAGAAACATATACGAACACACACGGGAGAAAAGCCTTATGTCTGTAATGAATGTGAGAAACGTTTTGCTCAGCGCGGGACATTGAAGGCTCATAAAGAATCCGTGCACCCTGTAACCTGA

Protein sequence:

>DPOGS212699-PA
MEEKRKNKLFDICRLCLVKRGFCDITDRNELCDNIHKFTGLKISASDKLPQKICRNCLDIVNKASELRNMARKNDKHLRSLFDCPVEDPVPEFEAGSSSEKSEIVLQETERKCFSLAVRQDLFNKSKTNDEEDSEAVNSLNLIHSMKPENEDTDSEYKCPVCSKNFAKWRKLYVHHRLHNKCYKCPIDICVKRFATKGDLEKHIRTHTGEKPYVCNECEKRFAQRGTLKAHKESVHPVT-
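
Consistency check (illustrative):

The transcript and protein records above should agree with each other: a 720 bp coding sequence holds 240 codons, which is 239 amino acids plus one stop codon. The Python sketch below shows one way to check this by translating a coding sequence with the standard genetic code. It is not part of the original record. The names `translate` and `CODON_TABLE` are hypothetical helpers introduced here, and the sketch assumes the transcript is an uninterrupted, in-frame coding sequence. It writes the stop codon as `*`, whereas the record above writes it as `-`.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 24 and last 15 nucleotides of DPOGS212699-TA, copied from the record.
print(translate("ATGGAAGAAAAAAGGAAAAATAAA"))  # start of the protein
print(translate("CACCCTGTAACCTGA"))           # end of the protein, with stop
```

Passing the full DPOGS212699-TA sequence to `translate` and comparing the result (with `*` replaced by `-`) against DPOGS212699-PA would confirm that the two records match end to end.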