Monarch geneset OGS2.0

DPOGS203956
Transcript: DPOGS203956-TA (1032 bp)
Protein: DPOGS203956-PA (343 aa)
Genomic position: DPSCF300005 + 315123-341456
RNAseq coverage: 10x (Rank: top 84%)
Annotation
  Heliconius       HMEL013520         5e-140   95.15%
  Bombyx           BGIBMGA002017-TA   2e-134   98.30%
  Drosophila       acj6-PA            1e-141   72.36%
  EBI UniRef50     UniRef50_P24350    2e-139   72.36%   Inhibitory POU protein n=47 Tax=Coelomata RepID=IPOU_DROME
  NCBI RefSeq      XP_001651018.1     7e-152   78.27%   inhibitory pou [Aedes aegypti]
  NCBI nr blastp   gi|349584653       0.0      98.25%   abnormal chemosensory jump 6, isoform C [Bombyx mori]
  NCBI nr blastx   gi|349584653       0.0      98.25%   abnormal chemosensory jump 6, isoform C [Bombyx mori]
Group
Gene Ontology
  GO:0006355   4.1e-53   regulation of transcription, DNA-dependent
  GO:0003700   4.1e-53   sequence-specific DNA binding transcription factor activity
  GO:0005515   3e-36     protein binding
  GO:0003677   5e-33     DNA binding
  GO:0043565   1.4e-17   sequence-specific DNA binding
KEGG pathway 
InterPro domain
  [181-258]   IPR000327   4.1e-53   POU-specific
  [200-217]   IPR013847   3e-36     POU
  [186-258]   IPR010982   5e-33     Lambda repressor-like, DNA-binding
  [260-336]   IPR012287   6.8e-18   Homeodomain-related
  [279-341]   IPR001356   1.4e-17   Homeobox
  [263-338]   IPR009057   2e-16     Homeodomain-like
Orthology group: MCL11579, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203956-TA
ATGGCACCCCCGGGCTGTTTCCCGGGTCGGTACAGCCCCACGTACCGCAGTTCAGACCCTCGGCGCTGCGTGCCGAATCCTTCCAGTCGAATACTGGAGGATGCATCATTAATGTGCAACTCGTGGTCACCGCGTCACAATGGTGACATATTCAGCGGTCTGAACGACGGGTTGCTGAGCAGGGCGGAGGCGTTGGCTGCGGTAGACATCGGCAAGCATCAGTCGGGACCTCAACCGGGCCCTCCGCTACCACAGCTGAAGCATGATATGGTGTATCACCACGGAGTGGGAGGTCCACCACCCCATAATGCAAGACCACATCAGATGGGCCACCATGGCATGGAAGGGCTAGACATGCTTGATCCGCTCACATCCTCGTCAATGACTACCTTAGCACCTATGGGTGAAGCCGCACCACCGCATCATCAATTGCATGGCTATGGCGCTATGAACCATGTTATGAATCACCACCACCACACAGGAGGCCTTGGACATGCCCCACCAGCACACCTTGGACATCCGGCTGCGGCTTTACACCCAGATACAGATACAGATCCTCGTGAACTCGAGGCATTTGCCGAAAGGTTCAAACAAAGAAGAATAAAACTGGGAGTAACTCAAGCTGATGTTGGTAAGGCACTAGCTAATCTTAAGCTGCCAGGTGTTGGAGCGTTATCCCAAAGTACAATTTGTAGATTCGAAAGTTTGACTCTAAGTCATAACAACATGATAGCATTAAAACCTATTTTACAAGCCTGGTTAGAAGAAGCAGAAGCGCAGGCAAAAAATAAAAGAAGGGACCCAGATGCACCCAGCGTTTTGCCAGCCGGAGAAAAAAAAAGAAAGAGAACATCCATAGCGGCACCAGAAAAGAGAAGTCTCGAAGCTTATTTTGCTGTTCAGCCGCGTCCATCGGGTGAAAAAATTGCAGCGATTGCAGAAAAGTTAGATCTCAAGAAAAACGTAGTTCGGGTATGGTTCTGCAACCAGAGGCAAAAACAGAAACGTATGAAATTCGCTGCACAACACTGA

Protein sequence:

>DPOGS203956-PA
MAPPGCFPGRYSPTYRSSDPRRCVPNPSSRILEDASLMCNSWSPRHNGDIFSGLNDGLLSRAEALAAVDIGKHQSGPQPGPPLPQLKHDMVYHHGVGGPPPHNARPHQMGHHGMEGLDMLDPLTSSSMTTLAPMGEAAPPHHQLHGYGAMNHVMNHHHHTGGLGHAPPAHLGHPAAALHPDTDTDPRELEAFAERFKQRRIKLGVTQADVGKALANLKLPGVGALSQSTICRFESLTLSHNNMIALKPILQAWLEEAEAQAKNKRRDPDAPSVLPAGEKKRKRTSIAAPEKRSLEAYFAVQPRPSGEKIAAIAEKLDLKKNVVRVWFCNQRQKQKRMKFAAQH-
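
The transcript and protein lengths in this record fit together if the transcript is read as a coding sequence that ends in one stop codon: 1032 bp is 344 codons, giving 343 residues plus the stop. The Python sketch below is one way to check that kind of consistency. It is not part of the record: the names `translate` and `expected_protein_length` are hypothetical, the standard genetic code (NCBI translation table 1) is assumed, and the stop codon is written as `-` only to match the trailing `-` in the protein sequence above.

```python
# Standard genetic code (NCBI translation table 1); assumed, not stated in the record.
# Codons are enumerated with each base position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are emitted as `stop_symbol`; codons containing
    characters other than A, C, G, T are emitted as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count for a CDS-only transcript ending in a single stop codon."""
    return transcript_bp // 3 - 1
```

For example, `expected_protein_length(1032)` returns 343, and translating the first 30 nucleotides of DPOGS203956-TA, `translate("ATGGCACCCCCGGGCTGTTTCCCGGGTCGG")`, returns `MAPPGCFPGR`, the first ten residues of DPOGS203956-PA. Passing the full nucleotide sequence should reproduce the full protein sequence, including the trailing `-`, if the assumptions above hold.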