Monarch geneset OGS2.0

DPOGS207048
TranscriptDPOGS207048-TA876 bp
ProteinDPOGS207048-PA291 aa
Genomic positionDPSCF300001 + 1944867-1957120
RNAseq coverage75x (Rank: top 65%)
Annotation
HeliconiusHMEL0068712e-6179.47% 
BombyxBGIBMGA012988-TA2e-5895.65% 
DrosophilaLim1-PA5e-7379.44% 
EBI UniRef50UniRef50_UPI0002060FFC4e-7268.02%UPI0002060FFC related cluster n=1 Tax=unknown RepID=UPI0002060FFC
NCBI RefSeqXP_002426470.14e-7595.17%LIM/homeobox protein Lhx3, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700092192e-8259.62%lim1 [Tribolium castaneum]
NCBI nr blastxgi|2700092191e-8660.13%lim1 [Tribolium castaneum]
Group
Gene OntologyGO:00036778.1e-22DNA binding
GO:00063558.1e-22regulation of transcription, DNA-dependent
GO:00435654.3e-21sequence-specific DNA binding
GO:00037004.3e-21sequence-specific DNA binding transcription factor activity
GO:00055151.4e-20protein binding
KEGG pathway 
InterPro domain[92-172] IPR0122878.1e-22Homeodomain-related
[113-175] IPR0013564.3e-21Homeobox
[97-172] IPR0090571.4e-20Homeodomain-like
Orthology groupMCL10839 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207048-TA
ATGAAAGGAAAATCCGGAGACAACGGCACATCTCTCCCGAAACTTCCGAGCGGTTTTGTCGATCGTTTATCGAACTTATCGAAAATTTCCTCACCAGCGGATTCCCTCCTCGGCTCAGGCTCTGACGAAGAGGAGGAAGAAGAATCCCGTGCGGGGAACAACTCTAGCAGCCCCGCTCACGCTCCCCACCCAGCCTTGCACCCAGACCTCTCGCATAACGGGGATGCCAAACCTCATGAGGACTCAGAGGACCAGGGTTCGTTAGACGGCGATCCGGAGACCAGGGATTCGCAAGCTGAGAACAAGTCACCAGACGACGGAAATGGCGGCTCCAAGAGGAGAGGTCCAAGGACGACAATAAAAGCCAAGCAGCTGGAGATACTGAAATCTGCCTTCTCACAGACACCGAAGCCAACGAGGCATATCCGGGAACAGCTGGCCAAGGAAACTGGATTACCGATGAGGGTCATACAGGTTTGGTTTCAAAACAAGAGATCAAAAGAGAGACGTTTGAAACAGTTGACATCTATGGGAAGGGGTCCATTCTTTGGTTCCTCACGCAAGATGCGTGGCTTCCCAATGAATCTGTCCCCCGGGGGACTGGAGGAGGGGCCTCCGGGGTTCCCATACTTCGCGACAGCAGATGGGAAGTTCGAGTTTGGATATGGCCCGCCGTTCCACCATGATGCTCCATTCTTCCATCCTCCACCGAACATGCCCTTCAACCAGCCAGGTGGTATGGAGTCGCTGCAGGGCGGCGAGTTCCCTGAACAGTTCCCCCCGCCCGATCACCTAGTGTTGCCGCGACCCTCGTCCCCGGAGTTCACTTTCGGGGACGCCCCGCCTCCCCTACACCCCGAGGGGCTGGTCTGGTAG

Protein sequence:

>DPOGS207048-PA
MKGKSGDNGTSLPKLPSGFVDRLSNLSKISSPADSLLGSGSDEEEEEESRAGNNSSSPAHAPHPALHPDLSHNGDAKPHEDSEDQGSLDGDPETRDSQAENKSPDDGNGGSKRRGPRTTIKAKQLEILKSAFSQTPKPTRHIREQLAKETGLPMRVIQVWFQNKRSKERRLKQLTSMGRGPFFGSSRKMRGFPMNLSPGGLEEGPPGFPYFATADGKFEFGYGPPFHHDAPFFHPPPNMPFNQPGGMESLQGGEFPEQFPPPDHLVLPRPSSPEFTFGDAPPPLHPEGLVW-