Monarch geneset OGS2.0

DPOGS208364
TranscriptDPOGS208364-TA984 bp
ProteinDPOGS208364-PA327 aa
Genomic positionDPSCF300146 - 384245-385466
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0087942e-16787.57% 
BombyxBGIBMGA012234-TA0.096.04% 
DrosophilaCG32105-PB2e-11560.67% 
EBI UniRef50UniRef50_Q9VTW53e-11360.67%CG32105 n=16 Tax=Arthropoda RepID=Q9VTW5_DROME
NCBI RefSeqXP_967240.11e-13774.11%PREDICTED: similar to CG32105 CG32105-PB [Tribolium castaneum]
NCBI nr blastpgi|910779542e-13674.11%PREDICTED: similar to CG32105 CG32105-PB [Tribolium castaneum]
NCBI nr blastxgi|2700028591e-14375.30%LIM homeobox transcription factor 1, beta [Tribolium castaneum]
Group
Gene OntologyGO:00063553.9e-22regulation of transcription, DNA-dependent
GO:00435653.9e-22sequence-specific DNA binding
GO:00037003.9e-22sequence-specific DNA binding transcription factor activity
GO:00055157.4e-20protein binding
GO:00036771.8e-18DNA binding
GO:00082701.7e-13zinc ion binding
KEGG pathway 
InterPro domain[130-192] IPR0013563.9e-22Homeobox
[114-189] IPR0090577.4e-20Homeodomain-like
[118-191] IPR0122871.8e-18Homeodomain-related
[47-104] IPR0017811.7e-13Zinc finger, LIM-type
Orthology groupMCL11701 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208364-TA
ATGCGAGTAGGCGATTTATCTTGGCACGAACACTGCCTGAGTTGCTGTGTCTGCGGCTGTCCCCTTGCCCATACATGTTACACTAGAAATGCGAAACTATATTGCAAACCAGATTATGACAGATTATTCGGCGTCAAGTGTACTCGCTGCGGAGACAGATTGCTACCCCAGGAGATGGTCATGAGAGCGCAGCAGTACGTCTTTCACATTCAATGCTTTGTTTGTGTCATGTGCTGCCAACCTCTGCAAAAAGGCGAACAATATGTCATCAGGGCTGGACAGATATTTTGCAGACAAGACTTTGAGAAAGAAATGTATCTGATGCAACATGCCGAAGATGATATGATAATAGATGATTCGGAGCGGCCTCGTGACGGAAGGAGGGGACCAAAACGCCCCAGAACTATACTAACATCAGCTCAGCGACGGCAATTCAAAGCTTCTTTTGAAGTAAGCCCGAAACCGTGCCGCAAAGTACGAGAAGCTTTAGCTAAAGACACGGGATTGAGTGTGAGAGTGGTCCAGGTGTGGTTCCAGAATCAGAGAGCTAAAATGAAAAAGATACAACGCAAGGCGAAACAAGAAGGTGATAAAAATAATGACAAGGACAAAGACAAAGATGAAAAGAGTATAAAACAAGAGTCGCCCTCCAGTGAACACGGGAATTATCTTGGCTTGGACAACTCTTATTCAGCTTCTAGTCAACCACTCAACCCGAATTTGCCATATTCTCCTGATGACTATCCAGCTCACTCCGGAGACAGCTTCTGCAGTTCGGATATTTCTCTTGATGGGAGTAATTTCGATCAACTTGATGAAGGTACTTCTGATACCATGAGCCTACAGAACTTGGAGGTACCTCACCTTCCTCATCATGGGAACCATTCGTCACACGAGCCCCTGAATCTTGGCACAGGGGCTGTTGTCAACCCCATCGATAAGTTATATCTAATGCAAAATTCATATTTTAGTACAGATCATTGA

Protein sequence:

>DPOGS208364-PA
MRVGDLSWHEHCLSCCVCGCPLAHTCYTRNAKLYCKPDYDRLFGVKCTRCGDRLLPQEMVMRAQQYVFHIQCFVCVMCCQPLQKGEQYVIRAGQIFCRQDFEKEMYLMQHAEDDMIIDDSERPRDGRRGPKRPRTILTSAQRRQFKASFEVSPKPCRKVREALAKDTGLSVRVVQVWFQNQRAKMKKIQRKAKQEGDKNNDKDKDKDEKSIKQESPSSEHGNYLGLDNSYSASSQPLNPNLPYSPDDYPAHSGDSFCSSDISLDGSNFDQLDEGTSDTMSLQNLEVPHLPHHGNHSSHEPLNLGTGAVVNPIDKLYLMQNSYFSTDH-