Monarch geneset OGS2.0

DPOGS203984
Transcript          DPOGS203984-TA    1194 bp
Protein             DPOGS203984-PA    397 aa
Genomic position    DPSCF300005 + 1131242-1165902
RNAseq coverage     106x (Rank: top 60%)
Annotation
Heliconius        HMEL017925          8e-155    85.25%
Bombyx            BGIBMGA002126-TA    1e-36     94.29%
Drosophila        ap-PA               2e-96     51.06%
EBI UniRef50      UniRef50_F2Z7K4     0.0       90.16%    Bmptp-Z and Bmap-A fusion protein alpha n=8 Tax=Obtectomera RepID=F2Z7K4_BOMMO
NCBI RefSeq       NP_001139341.1      8e-112    62.90%    apterous a [Tribolium castaneum]
NCBI nr blastp    gi|328925128        0.0       90.45%    apterous A splicing isoform type B [Bombyx mori]
NCBI nr blastx    gi|328925128        0.0       90.45%    apterous A splicing isoform type B [Bombyx mori]
Group
Gene Ontology
GO:0003677    2.3e-23    DNA binding
GO:0006355    2.3e-23    regulation of transcription, DNA-dependent
GO:0043565    1.4e-22    sequence-specific DNA binding
GO:0003700    1.4e-22    sequence-specific DNA binding transcription factor activity
GO:0005515    3.3e-21    protein binding
GO:0008270    8.6e-16    zinc ion binding
KEGG pathway
InterPro domain
[283-350]    IPR012287    2.3e-23    Homeodomain-related
[284-346]    IPR001356    1.4e-22    Homeobox
[268-343]    IPR009057    3.3e-21    Homeodomain-like
[124-178]    IPR001781    8.6e-16    Zinc finger, LIM-type
Orthology group    MCL15589    Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203984-TA
ATGGGAGTTTACGAAGAGAGAGGGGCGATGCACTGGCAACAGAATGAGCGATATCTCTCCACGTACGAGACGGGGTCAGAGTTGTCGCCTGTTGCCCCAGCAGCGTCGCCGGGATCACCCAGAGACTGCACCTCGTGTCGCAAGCGAGAACCTCCAGATGAACCCGCTCCACCCGCTGAGGATGCTTGCGCTGGCTGCGGAGCACGAATAACTGATAGATACTACCTTCTAGCGCTGGAGCGGCGCTGGCACACCCCATGCCTCAGGTGCTGTGAATGCAAGATGCCTCTCGACTCTGAACAGAGATGTTATGCTCGTGACAGCAATATATTTTGCAAGAATGACTACTTCAGGTTGTACGGTTCAAAGCGGTGTTCTCGTTGCAACACGACCATTTCAGCATCAGAATTGGTGATGAGAGCGCGCGACTTGGTCTTTCACGTCCACTGTTTCTCCTGTGCACTCTGCAGCGCCCGACTCACAAAAGGCGACACATTCGGCATCAGGGATTCAGCTGTTTATTGCAGGCTACACTACGAAACTATGCCGGATTATGCTCCCCATATGTCTGTACCGGGGCCTCCACAGATGTGTCCAGGTCCTTACGCCGGCCCTCCACCGGGTTCGCACTACCCACCATACCCCTCTCCTGAGTTCTCCCGAGTGGAGCCCGATGTCCCCAAAGGCTCTTTCTTCAACGGGGTATCAGCTCCTCCGCCGAGACAAAAAGGCCGTCCGAGAAAAAAGAAGCCTAAAGACCAAGATTTAATGACAGCAAATCTTGATCTCAACCCCGACTACTTGGAGATGGGTTTCCGGGGCGGCGGCGGGCTGGGATCCACATCACGCACCAAGCGCATGCGCACCAGCTTCAAGCACCACCAGTTACGCACCATGAAGTCATACTTCGCCATCAACCATAACCCAGACGCAAAAGACCTGAAGCAATTGAGCCAGAAGACTGGCCTTCCCAAAAGGGTGTTACAGGTATGGTTTCAAAATGCGCGGGCAAAATGGCGGCGTATGGTGACAAAGCAAGAGAACAAGATGACGGACAAATGTTCTCCAGACGGCTCCTTGGAGATGGACATGTACCACGGACCGATGGGTTCCATACAATCCTTACCCCCGCACAGCCCACCCTACAGCGTGATGGGAGGCCCCCCGAGCCCGAACTCGATGGACTGTCCGTAG

Protein sequence:

>DPOGS203984-PA
MGVYEERGAMHWQQNERYLSTYETGSELSPVAPAASPGSPRDCTSCRKREPPDEPAPPAEDACAGCGARITDRYYLLALERRWHTPCLRCCECKMPLDSEQRCYARDSNIFCKNDYFRLYGSKRCSRCNTTISASELVMRARDLVFHVHCFSCALCSARLTKGDTFGIRDSAVYCRLHYETMPDYAPHMSVPGPPQMCPGPYAGPPPGSHYPPYPSPEFSRVEPDVPKGSFFNGVSAPPPRQKGRPRKKKPKDQDLMTANLDLNPDYLEMGFRGGGGLGSTSRTKRMRTSFKHHQLRTMKSYFAINHNPDAKDLKQLSQKTGLPKRVLQVWFQNARAKWRRMVTKQENKMTDKCSPDGSLEMDMYHGPMGSIQSLPPHSPPYSVMGGPPSPNSMDCP-
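
The record gives a 1194 bp transcript and a 397 aa protein, and the protein sequence ends in "-". The following is a minimal sketch of how the two sequences can be checked against each other. It assumes the standard genetic code applies and that the trailing "-" stands for the stop codon; neither is stated in the record. The names `translate` and `expected_transcript_length` are hypothetical helpers introduced here for illustration.

```python
# Standard genetic code (assumed), built in the usual TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed to match the
    trailing "-" in the protein FASTA entry). Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


def expected_transcript_length(protein_length):
    """Coding length in bp for a protein of the given length plus one stop codon."""
    return 3 * (protein_length + 1)


# First six and last four codons of DPOGS203984-TA, copied from the record.
print(translate("ATGGGAGTTTACGAAGAG"))   # MGVYEE
print(translate("GACTGTCCGTAG"))         # DCP-
print(expected_transcript_length(397))   # 1194
```

Passing the full DPOGS203984-TA sequence to `translate` and comparing the result with the DPOGS203984-PA entry would extend the same check to the whole record.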