Monarch geneset OGS2.0

DPOGS207175
Transcript        DPOGS207175-TA  639 bp
Protein           DPOGS207175-PA  212 aa
Genomic position  DPSCF300001 + 4890251-4892089
RNAseq coverage   58x (Rank: top 69%)
Annotation
Heliconius      HMEL008471        8e-54  95.10%
Bombyx          BGIBMGA000638-TA  7e-45  74.66%
Drosophila      caup-PA           9e-42  89.47%
EBI UniRef50    UniRef50_E2AIB6   7e-42  77.50%  Homeobox protein caupolican n=9 Tax=Formicidae RepID=E2AIB6_CAMFO
NCBI RefSeq     XP_002048160.1    2e-43  87.37%  GJ11493 [Drosophila virilis]
NCBI nr blastp  gi|332024321      2e-42  77.50%  Homeobox protein araucan [Acromyrmex echinatior]
NCBI nr blastx  gi|170027820      3e-47  92.78%  iroquois-class homeodomain protein irx [Culex quinquefasciatus]
Group
Gene Ontology  GO:0003677  1.1e-22  DNA binding
               GO:0006355  1.1e-22  regulation of transcription, DNA-dependent
               GO:0005515  6.5e-18  protein binding
               GO:0005634  3.1e-14  nucleus
               GO:0043565  2.5e-11  sequence-specific DNA binding
               GO:0003700  2.5e-11  sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain  [72-141]  IPR012287  1.1e-22  Homeodomain-related
                 [53-129]  IPR009057  6.5e-18  Homeodomain-like
                 [87-126]  IPR008422  3.1e-14  Homeobox KN domain
                 [70-134]  IPR001356  2.5e-11  Homeobox
Orthology group  MCL15530  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207175-TA
ATGGCTTCTCGATCGATCGAGTACTTACGCCCACGTTGTACCCACGCCTGCGCTGCAACGCCCTGCAATGAGGTGCAAGCTCAGACCGACTGGGGACCATTTTTTAACTATTTTCACGCACGAAACCTATTTATTGGGCCCTTCAGACCGCGCCTTTGTCATGTTGCCGGGAAAATCGAAGTCAGATATGGAGCTGGATACGATTTAGCTGCCAGGCGAAAAAACGCTACCAGAGAATCAACAGCTACGCTCAAAGCATGGCTGAACGAGCATAAGAAAAACCCTTACCCCACGAAGGGGGAAAAGATTATGCTGGCGATCATCACCAAGATGACGCTGACACAGGTCTCGACGTGGTTCGCGAACGCAAGACGGCGCCTGAAGAAAGAAAACAAGATGACTTGGGAGCCGAAGAATAAAACGGATGATGACGAAGACACGATGCTCTCCGACGAAGAAAGGGAACAGGACGATAAAATAAAAGCTAACAAAGTGAATTGGCATAACCCGGTTGAGAGCCGGAGGGTGTACGGGCCACCGGGAGGAAGTCGGGGGGAGGAAAGGGAACGCGGGACGGAGGGGGGAAGGGCGTGTGACAGGAGCGGGGAAGGGGGGGCGGTGGTGGTCTCAAGCGTGTAA

Protein sequence:

>DPOGS207175-PA
MASRSIEYLRPRCTHACAATPCNEVQAQTDWGPFFNYFHARNLFIGPFRPRLCHVAGKIEVRYGAGYDLAARRKNATRESTATLKAWLNEHKKNPYPTKGEKIMLAIITKMTLTQVSTWFANARRRLKKENKMTWEPKNKTDDDEDTMLSDEEREQDDKIKANKVNWHNPVESRRVYGPPGGSRGEERERGTEGGRACDRSGEGGAVVVSSV-
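
The transcript and protein lengths in this record (639 bp and 212 aa) can be checked against each other, since a complete coding sequence of 639 bases holds 213 codons, the last of which is the stop. The sketch below is a minimal illustration of that check, assuming the standard genetic code and that the transcript is the full coding sequence. The helper names `translate` and `expected_protein_length` are hypothetical and not part of the database; the sketch writes the stop codon as `*`, whereas the record above marks it with `-`.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (first base varies slowest, third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding the stop codon."""
    return cds_length_bp // 3 - 1


# First five codons and last three codons of DPOGS207175-TA, copied from the record.
print(translate("ATGGCTTCTCGATCG"))
print(translate("AGCGTGTAA"))
print(expected_protein_length(639))
```

Passing the full DPOGS207175-TA sequence to `translate` and comparing the result (with `*` replaced by `-`) against DPOGS207175-PA would extend the same check to every residue.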