Monarch geneset OGS2.0

DPOGS200468
TranscriptDPOGS200468-TA1140 bp
ProteinDPOGS200468-PA379 aa
Genomic positionDPSCF300260 + 295066-300149
RNAseq coverage1073x (Rank: top 12%)
Annotation
HeliconiusHMEL0104154e-16396.14% 
BombyxBGIBMGA011179-TA2e-14490.84% 
Drosophilapho-PB6e-7079.74% 
EBI UniRef50UniRef50_E5RWX70.084.83%Pleiohomeotic n=4 Tax=Coelomata RepID=E5RWX7_BOMMO
NCBI RefSeqNP_001164362.18e-12059.28%pleiohomeotic [Tribolium castaneum]
NCBI nr blastpgi|3198030270.084.83%pleiohomeotic [Bombyx mori]
NCBI nr blastxgi|3198030270.084.83%pleiohomeotic [Bombyx mori]
Group
Gene OntologyGO:00036763.5e-18nucleic acid binding
KEGG pathway 
InterPro domain[2-341] IPR0171148.1e-125Transcription factor yin/yang
[219-249] IPR0130873.5e-18Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL16556 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200468-TA
ATGATGAATCAAGAAATTATGTGTGAAGTTGAGATTGGAACACACGATGGCATGGTCGAGATAGATGTGGATGAACAGGTCGGCGTTAAGTATGAAGAAATTGAGACGGGTCAGTACGAGGAATATATAACCGATGATCAGTATGAAGAGGTTGAGGAACAGGTTATCGGTATGCAGCATTTAGAGGAAGTTGGCAGCGAGGAGGTTATACTGCCAGGGATATCCGAGGAGGTTCTACATAATGATGCGACCTACGACAATGTGTACCAGCCTCCCGGAAACAGCAGAAGAGGTCGTGGCAGGCCGAGAGTGAACGCCTCCAACAACATACACCATTTGATGCCAATAGGTGAAGTACCCATGGGTCTCATGGAGAACAATTCAGGACGTGCTGCTCGTAGATGGGAACAGAAACAGGTCCAGATCAAAACAATGGAGGGAGAGTTCTCAGTCACCATGTGGGCCACCGGAGAGGATGACGATGACGGATCGAACCCCGAACCCGATCCTGATTACACCGAGTACATGACGGGGAAGAAGAATGTGCTGGGAAATGATAATATGCCAGGCTTAGACTTATCAGATCCGAAACAATTGGCGGAGTTCGCGCGTCCAGGACACAAGATCAGATTGAAGAAACCCTCCCCGGAGTCATCAGACAGAACCATAGCGTGTCCACACAAGGGTTGTTCGAAGATGTTCAGAGACAATTCGGCCATGAGGAAACATTTGCACACGCACGGACCGAGAGTTCACGTCTGTGCTGAATGCGGCAAAGCGTTTGTTGAGAGTTCGAAGTTGAAACGTCACCAGCTGGTCCACACGGGGGAGAAGCCGTTCCAGTGCACCTTCGAGGGTTGCGGGAAGAGATTCTCATTGGACTTTAATCTCAGAACCCACGTCCGTATCCACACGGGAGATCGTCCCTATGTATGTCCGTTCGACGGCTGCAATAAGAAATTCGCGCAGTCAACGAACCTGAAGTCACACATACTGACACACGCCAAGGCCAAATCAAGAAACTCACTGTCTCGGAACGGTAACGCCTATGAATCAGCCAGAACGACGCCACAATACGTCCAGCTGGAGGTGTCTCCGGACGACTCAAATCCTCAACTAATCTTCTACACTCACGAGTGA

Protein sequence:

>DPOGS200468-PA
MMNQEIMCEVEIGTHDGMVEIDVDEQVGVKYEEIETGQYEEYITDDQYEEVEEQVIGMQHLEEVGSEEVILPGISEEVLHNDATYDNVYQPPGNSRRGRGRPRVNASNNIHHLMPIGEVPMGLMENNSGRAARRWEQKQVQIKTMEGEFSVTMWATGEDDDDGSNPEPDPDYTEYMTGKKNVLGNDNMPGLDLSDPKQLAEFARPGHKIRLKKPSPESSDRTIACPHKGCSKMFRDNSAMRKHLHTHGPRVHVCAECGKAFVESSKLKRHQLVHTGEKPFQCTFEGCGKRFSLDFNLRTHVRIHTGDRPYVCPFDGCNKKFAQSTNLKSHILTHAKAKSRNSLSRNGNAYESARTTPQYVQLEVSPDDSNPQLIFYTHE-