Monarch geneset OGS2.0

DPOGS201237
Transcript        DPOGS201237-TA  933 bp
Protein           DPOGS201237-PA  310 aa
Genomic position  DPSCF300037 - 154604-161614
RNAseq coverage   8x (Rank: top 85%)
Annotation
Heliconius      HMEL015396        4e-16  38.03%
Bombyx          BGIBMGA012499-TA  5e-63  87.80%
Drosophila      lms-PA            7e-18  56.00%
EBI UniRef50    UniRef50_D6WAI4   7e-43  42.27%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WAI4_TRICA
NCBI RefSeq     XP_974114.1       1e-43  42.27%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp  gi|91077640       3e-42  42.27%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx  gi|91077640       1e-41  41.72%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene Ontology     GO:0006355  6.9e-23  regulation of transcription, DNA-dependent
                  GO:0043565  6.9e-23  sequence-specific DNA binding
                  GO:0003700  6.9e-23  sequence-specific DNA binding transcription factor activity
                  GO:0003677  5.2e-21  DNA binding
                  GO:0005515  8.1e-21  protein binding
                  GO:0005634  1e-05    nucleus
KEGG pathway 
InterPro domain   [201-263]  IPR001356  6.9e-23  Homeobox
                  [190-259]  IPR012287  5.2e-21  Homeodomain-related
                  [200-274]  IPR009057  8.1e-21  Homeodomain-like
                  [223-234]  IPR020479  1.8e-06  Homeobox, eukaryotic
                  [230-239]  IPR000047  1e-05    Helix-turn-helix motif, lambda-like repressor
Orthology group   MCL17878  Insect specific

Nucleotide sequence:

>DPOGS201237-TA
ATGAACTCTATAGATTATATATCTAGCACGTGTGAGAATGATACAGGGAAGTTTGATAATAATGTCAGTGTCACGAATGATTTGACGGAGGAACGTACAGAGATACGGACATTCGTGGAATCGGATTCCGATTTGGACAGTGAAAATGAAACCGTCGATGTCATATGTGAGAACAGTGAACAAATGAATAGTATTAATTACAATACAATAGCTTACGGCTCCGTCGATAGTATTATATACAGCAAAAACTTATACGCTAACAAAATGACGGTGAAAAAGAAACGGGTTTGTGATAGTGAACAAATTAAAATTATAAACGATTCCGCAAACAGTTTATTACAGGCGAAGGGGAAAAACTTTCTGATAGACAGTATACTAGGCAACGACGAGAGTAAAACTCAAAGAAAACTCGTCAAAGACGCCGATAACCCAGAGGAGACGGGAGACGAACATAATGTGTCATCAACCTCAACATGTCCGGACATCTCCATCAACAGTGCTGATGTCTTGGCGGGACATGCGTACGCGCATTGGCTGGCGACACAACAACCTACTTTTTATGATGACAAAAATAATCGGAGGCAGAAACGTTCCGGACCAGAGAGGAAACCTCGACAAGCATACAGCGCTAAACAACTAGAGAGACTCGAATCTGAATTTAAGTTGGACAAATATCTGAGCGTATCAAAAAGATTGGAGCTCTCCAAGGCGCTCGGACTCACTGAGGTTCAAATAAAAACGTGGTTTCAAAATCGAAGGACAAAATGGAAGAAACAACTCACATCTCGTCTCAAGATCGCCCAGCGTCAGGGATTATTTCCCGGACATATCTTCGGACACGCCCCTCAGACTTATTCACTTATAAATCCTTATACCTACAGTCCATTAAGCTGCATGTTCACCCCCGTGACGTTGCCGACGTCGCAACCATGA

Protein sequence:

>DPOGS201237-PA
MNSIDYISSTCENDTGKFDNNVSVTNDLTEERTEIRTFVESDSDLDSENETVDVICENSEQMNSINYNTIAYGSVDSIIYSKNLYANKMTVKKKRVCDSEQIKIINDSANSLLQAKGKNFLIDSILGNDESKTQRKLVKDADNPEETGDEHNVSSTSTCPDISINSADVLAGHAYAHWLATQQPTFYDDKNNRRQKRSGPERKPRQAYSAKQLERLESEFKLDKYLSVSKRLELSKALGLTEVQIKTWFQNRRTKWKKQLTSRLKIAQRQGLFPGHIFGHAPQTYSLINPYTYSPLSCMFTPVTLPTSQP-
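
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard genetic code and assumes that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` and the `stop_symbol` parameter are hypothetical names introduced for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stops are written as stop_symbol.

    Assumption: the record writes the stop codon as "-", so that is the default.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First nine codons of DPOGS201237-TA
print(translate("ATGAACTCTATAGATTATATATCTAGC"))  # MNSIDYISS
# Last five codons of DPOGS201237-TA, ending in the TGA stop
print(translate("ACGTCGCAACCATGA"))              # TSQP-
# Length check: 310 residues plus one stop codon account for 933 bp
print(3 * (310 + 1))                             # 933
```

Passing the full DPOGS201237-TA sequence to `translate` and comparing the result with DPOGS201237-PA extends the same check to the whole record. The sketch does not handle ambiguous bases such as N.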