Monarch geneset OGS2.0

DPOGS204459
Transcript         DPOGS204459-TA   1119 bp
Protein            DPOGS204459-PA   372 aa
Genomic position   DPSCF300002 + 395149-397964
RNAseq coverage    440x (Rank: top 28%)
Annotation
Heliconius       HMEL006244         4e-162   82.95%
Bombyx           BGIBMGA007802-TA   3e-161   81.00%
Drosophila       repo-PA            3e-38    62.05%
EBI UniRef50     UniRef50_D6WMA5    4e-58    51.10%   Reversed polarity n=1 Tax=Tribolium castaneum RepID=D6WMA5_TRICA
NCBI RefSeq      XP_969909.1        7e-59    51.10%   PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp   gi|91083441        1e-57    51.10%   PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx   gi|91083441        1e-65    46.30%   PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology    GO:0003677   2.6e-29   DNA binding
                 GO:0006355   2.6e-29   regulation of transcription, DNA-dependent
                 GO:0043565   1.5e-27   sequence-specific DNA binding
                 GO:0003700   1.5e-27   sequence-specific DNA binding transcription factor activity
                 GO:0005515   2.5e-25   protein binding
KEGG pathway 
InterPro domain  [88-170]    IPR012287   2.6e-29   Homeodomain-related
                 [111-173]   IPR001356   1.5e-27   Homeobox
                 [102-180]   IPR009057   2.5e-25   Homeodomain-like
Orthology group  MCL17821   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204459-TA
ATGTACGTGTGCGGGGAAAGCGGCGCTGGGCCGCGGTACGAGTGCGCCTTCGACGGTGGCATGGAACAGCCCTTCGACGAGCACATGTTCAGCGAGTTTGGCAAGGAGAGACAGGTGCAGGTTGTTGTCGGAGCCAGCGGGGAACTGCAGTACCGTGACGAGCTGCCGGTGTATGCAACGGCTGAACAAAAACGGAAGGACGAACCGCTCCTGCTGCAGGCGGTAGAGGTTCAGCCTTCCCAGCACTCCCAGCATGTCCCAACAACTACCACAACCACGACAACTTCAAAGAAAAGCGACAAAAAGAAAAGTGACAATAACGGCATTAAAAAGAAAAAAACGAGAACTACTTTTACTGCCTATCAGTTGGAAGAATTGGAAAGAGCCTTTGAACGTGCTCCATATCCTGATGTGTTCGCCCGAGAGGAACTAGCTCTGAAGTTGAATTTATCCGAATCAAGAGTTCAGGTTTGGTTTCAAAACAGAAGAGCGAAATGGCGTAAGCGTGAACCACCAAGAAAGACAGGATACATAGGATCCAGCTCACCGAGTTCTACCACATTAGGTGGTGGCTTTTCGGGTATCGGAGGCAACTTGCCAGCATTCCCTCAAAACGGCTTACCAGCACCTTCAGATTCTTGGTCCTACCAACACTCGTACGAACTGTCATCACATCATTTACTGTCTTCGGGCAGTAGTGGTTATCCCGCTTTCAACACGCAACCGGCTTATTCTTATACCACAGTGCTGAACGGACATGACGGACAAATGTTCGCGCCACGGCATTCATACGAGTACGGAGAGGGCAGCCCGCCCCCGCTAGGCGTACGTGACTATCCCATGATTGCTTCACACTCCCCGCAGATGGAAACCCACGGGCACGAAGACAAATTAGAATACCGTGGCCATGAACACGAAGACAAATATTCAGCGTGTGCCTTACAAGAGGAACCGCCGCGGTACACCTCCCCACCCGAAGATTATGACAAATGTAATATGGTGCCTCATGACAAACATTACGAAATTGATCGCCACTCTGAACTAGCACAGCCTGTAGTAGTCAAAATGGAACCCAGCCCCGGCCAAGCATACACGTCACTGCCCCCTTTTTTGAATTGA

Protein sequence:

>DPOGS204459-PA
MYVCGESGAGPRYECAFDGGMEQPFDEHMFSEFGKERQVQVVVGASGELQYRDELPVYATAEQKRKDEPLLLQAVEVQPSQHSQHVPTTTTTTTTSKKSDKKKSDNNGIKKKKTRTTFTAYQLEELERAFERAPYPDVFAREELALKLNLSESRVQVWFQNRRAKWRKREPPRKTGYIGSSSPSSTTLGGGFSGIGGNLPAFPQNGLPAPSDSWSYQHSYELSSHHLLSSGSSGYPAFNTQPAYSYTTVLNGHDGQMFAPRHSYEYGEGSPPPLGVRDYPMIASHSPQMETHGHEDKLEYRGHEHEDKYSACALQEEPPRYTSPPEDYDKCNMVPHDKHYEIDRHSELAQPVVVKMEPSPGQAYTSLPPFLN-
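
The two records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the transcript is a plain coding sequence translated with the standard genetic code, and that the trailing "-" in the protein record marks the stop codon. The helper names read_fasta_record and translate are hypothetical, chosen only for illustration.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
# Assumption: the record uses the standard (nuclear) code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def read_fasta_record(text):
    """Split a single FASTA record into (identifier, sequence)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    return lines[0][1:], "".join(lines[1:]).upper()


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be '-',
    matching the protein record); unknown codons become 'X'.
    """
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# The first five codons of DPOGS204459-TA and its last four codons.
name, start = read_fasta_record(">DPOGS204459-TA\nATGTACGTGTGCGGG\n")
print(name, translate(start))      # DPOGS204459-TA MYVCG
print(translate("TTTTTGAATTGA"))   # FLN-
```

Both fragments agree with the start (MYVCG) and end (FLN-) of the DPOGS204459-PA record. Passing the full nucleotide record through the same two functions and comparing the result with the protein record extends the check to the whole sequence.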