Monarch geneset OGS2.0

DPOGS212907
TranscriptDPOGS212907-TA762 bp
ProteinDPOGS212907-PA253 aa
Genomic positionDPSCF300285 - 106520-107973
RNAseq coverage116x (Rank: top 58%)
Annotation
HeliconiusHMEL0042395e-8981.50% 
BombyxBGIBMGA013985-TA1e-11477.38% 
DrosophilaCG13367-PA1e-4353.47% 
EBI UniRef50UniRef50_E2BYA91e-6453.20%GATA zinc finger domain-containing protein 1 n=7 Tax=Endopterygota RepID=E2BYA9_HARSA
NCBI RefSeqXP_967180.25e-7054.18%PREDICTED: similar to GATA zinc finger domain-containing protein 1 (Ocular development-associated gene protein) [Tribolium castaneum]
NCBI nr blastpgi|1892397219e-6954.18%PREDICTED: similar to GATA zinc finger domain-containing protein 1 (Ocular development-associated gene protein) [Tribolium castaneum]
NCBI nr blastxgi|1892397212e-6854.18%PREDICTED: similar to GATA zinc finger domain-containing protein 1 (Ocular development-associated gene protein) [Tribolium castaneum]
Group
Gene OntologyGO:00063554.3e-06regulation of transcription, DNA-dependent
GO:00435654.3e-06sequence-specific DNA binding
GO:00082704.3e-06zinc ion binding
GO:00037004.3e-06sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[6-32] IPR0006794.3e-06Zinc finger, GATA-type
Orthology groupMCL13008 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212907-TA
ATGCCTAAACCGACTTGTGTTCAATGTAATACAAACGATTCCCTACTATGGAGAAGCGCTGAAAACGGACAAATATGCAACGAATGCCACTTAGCAAATACCGCAAGTAGTGAAACAAATTTAGATACGGCCATCAAAACCGAAAATGATGACAAGAAAGAAGACAAAGACGATTGTGATAGTAAGAATGGAAAAAATGAAGGCGAGACCACTCCAGCAAAAGCAACGGGTAAAGGTACGAGGAAAAGCACTCGTTCTACAAGGTACAAAGCTAAAGGACCAACAGTGAACCAATCAAAACCACCGGCACCCCGTGGTAGAGGTCGCAGAAGTATATTTAAGAGACAACCATTAAAAGCCCCAACAGCGACGGCTACAGTTGTTGCGAGTGAATCAGTATTTTTCAAGGGTACATACATTCAAGTTGGCGATATCGTTTCAATGATTGACATTGACGGCGGCACATACTATGCACAGATAAGAGGTTTCCTTACAGATCAGTACTGTGAGAAAAGTGCCGTAGTCACTTGGCTGCTACCAACACAGGCCAGTCCGCCCCCAGACCAAAGTTTTGATCCAGCTACATACATTATAGGTCCAGAAGAAGATCTTCCTCGTAAACTGGAATACATGGAATTTGTAATGCATGCACCTTCAGACTACTATAAATCCAGTACAAGCCCATATCCCCTCACCGACAGGGAGCTAAATAACCAAAGTGGTTTCATATGGACCTCTATGGAATCTAAGGAAAGAGGATAA

Protein sequence:

>DPOGS212907-PA
MPKPTCVQCNTNDSLLWRSAENGQICNECHLANTASSETNLDTAIKTENDDKKEDKDDCDSKNGKNEGETTPAKATGKGTRKSTRSTRYKAKGPTVNQSKPPAPRGRGRRSIFKRQPLKAPTATATVVASESVFFKGTYIQVGDIVSMIDIDGGTYYAQIRGFLTDQYCEKSAVVTWLLPTQASPPPDQSFDPATYIIGPEEDLPRKLEYMEFVMHAPSDYYKSSTSPYPLTDRELNNQSGFIWTSMESKERG-