Monarch geneset OGS2.0

DPOGS207623
TranscriptDPOGS207623-TA777 bp
ProteinDPOGS207623-PA258 aa
Genomic positionDPSCF300199 - 210489-213520
RNAseq coverage131x (Rank: top 56%)
Annotation
HeliconiusHMEL0093166e-11886.21% 
BombyxBGIBMGA014549-TA3e-6468.22% 
Drosophilaal-PA2e-5055.29% 
EBI UniRef50UniRef50_G6DBX94e-7160.44%Paired-like family homeodomain transcription factor n=3 Tax=Nymphalidae RepID=G6DBX9_DANPL
NCBI RefSeqNP_001107838.12e-5957.03%aristaless [Tribolium castaneum]
NCBI nr blastpgi|2814851308e-11686.59%paired-like family homeodomain transcription factor [Heliconius erato]
NCBI nr blastxgi|2814851302e-12486.59%paired-like family homeodomain transcription factor [Heliconius erato]
Group
Gene OntologyGO:00063556.3e-30regulation of transcription, DNA-dependent
GO:00435656.3e-30sequence-specific DNA binding
GO:00037006.3e-30sequence-specific DNA binding transcription factor activity
GO:00036779.4e-30DNA binding
GO:00055151.8e-27protein binding
GO:00056343.2e-06nucleus
GO:00072752.6e-05multicellular organismal development
KEGG pathway 
InterPro domain[38-100] IPR0013566.3e-30Homeobox
[24-98] IPR0122879.4e-30Homeodomain-related
[29-107] IPR0090571.8e-27Homeodomain-like
[67-76] IPR0000473.2e-06Helix-turn-helix motif, lambda-like repressor
Orthology groupMCL20935 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207623-TA
ATGGGAGTGTCTGATACCGGTTCATCCGCTACTCCTGAACTACCAGTCCACGATATCGATCGGCCAGGGTCGGGTAGTGGAGTCGATGACGAAGACATCCCGAGGAGGAAACAGAGGAGGTACAGAACGACCTTCACCAGCTACCAACTAGATGAACTGGAGAAAGCCTTCGGAAGAACTCACTATCCAGATGTTTTTACAAGGGAGGAATTGGCTCTCAAAATTGGACTCACTGAAGCAAGAATACAGGTGTGGTTTCAAAACCGGAGAGCAAAATGGCGCAAGCAAGAAAAGGTGGGTCCCCACGCTCATCCCTACGGCGGATACTTGGGAGGACAGCCTTTGCCAACAGCCGCAATGCCAGTATCGCCACACTCACTGACACAACTTGGCTTCGGATTGAGGAAGCCTTTCGACAGCTCCTTGGCTACATTCAGGTATGCCAGTAGTCCACTGTTTGGGACGCAATACCTACCACCGCTGACCCGGCCTCATCTATTCGGTGCTCCGTTGTACGCCACCTCGCCAGCTCATTTTCATTCTCTTTTCGCTAACCTAACCGCACCAGAACCACCGCGCGCATCACCCGAACATTCCCGATTATCCCCCGAGGTCACCCGATCTCCATCTCTTTCTCCCCCCATCTCCCCCGGTAGTGAAACTCTCCCCCACGTTGAAGATGTTAGAAGTTCCAGTATAGCAGCCTTAAGGCTAGCTGCGAGAGAACACGAATTAAGATTAGAACTGTTGCGGCAAAGAGCCGATTTAATTTGTTAG

Protein sequence:

>DPOGS207623-PA
MGVSDTGSSATPELPVHDIDRPGSGSGVDDEDIPRRKQRRYRTTFTSYQLDELEKAFGRTHYPDVFTREELALKIGLTEARIQVWFQNRRAKWRKQEKVGPHAHPYGGYLGGQPLPTAAMPVSPHSLTQLGFGLRKPFDSSLATFRYASSPLFGTQYLPPLTRPHLFGAPLYATSPAHFHSLFANLTAPEPPRASPEHSRLSPEVTRSPSLSPPISPGSETLPHVEDVRSSSIAALRLAAREHELRLELLRQRADLIC-