Monarch geneset OGS2.0

DPOGS207539
TranscriptDPOGS207539-TA912 bp
ProteinDPOGS207539-PA303 aa
Genomic positionDPSCF300456 + 17220-23995
RNAseq coverage10x (Rank: top 84%)
Annotation
HeliconiusHMEL0119852e-2465.00% 
BombyxBGIBMGA000211-TA9e-2794.83% 
Drosophilahbn-PA5e-3558.39% 
EBI UniRef50UniRef50_D6WQD45e-3850.76%Homeobrain n=1 Tax=Tribolium castaneum RepID=D6WQD4_TRICA
NCBI RefSeqXP_315326.41e-3875.25%AGAP005311-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2700109932e-3750.76%homeobrain [Tribolium castaneum]
NCBI nr blastxgi|1582939677e-3961.19%AGAP005311-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00063551.9e-27regulation of transcription, DNA-dependent
GO:00435651.9e-27sequence-specific DNA binding
GO:00037001.9e-27sequence-specific DNA binding transcription factor activity
GO:00036772.7e-27DNA binding
GO:00055154.3e-26protein binding
KEGG pathway 
InterPro domain[125-187] IPR0013561.9e-27Homeobox
[122-186] IPR0122872.7e-27Homeodomain-related
[116-194] IPR0090574.3e-26Homeodomain-like
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207539-TA
ATGTTCAAACTAAATGTTGTTGTAATCTGCGAGGTACCGTCGTCGGTAGGACTCGACCTCCGATATCCCCCGGTTCATAAATCGGCGACGACTCGCTCGCACGAAATTCGTCTCTATTTGCATCGCAGTAGGGTTCAAGTTAAGGGACGTCGGTACGTAATTAGCGAGTTTAAAGAGCAGAAGGGTGAAGGCGTCGAGTTGAAAAGCGAAGGCGTGCACTCTGACTCGGATCACGAACACGAACAGGAACACGAACATGAACACGAACACGAGCATGAGCACGAACACGAGCACGAACACGAGCCGAACGTCGAGCCGGGGCGAGCAGACCACGAGCCGCTAGAAGCACTTGACGCCGGGAGACCACGCAAGGTGCGCCGCAGTCGGACCACCTTCACCACCTACCAGCTTCACGAGCTGGAGAGAGCCTTCGACAAGACGCAGTACCCGGACGTGTTCACTAGAGAGGAGCTCGCACTACGTCTGGACCTCAGCGAGGCGAGGGTTCAGGTGTGGTTCCAGAACAGACGCGCCAAGTGGCGCAAAAGAGAGAAGGCACTGGGGAGAGAACACGCGCCTTTCCTGCATCACGAACATGGTGTGGGGGAGTGGGGCGGCGGTATGGGCGTGGGTGGTGTGGGCGTGGGTGTCGGTGTAGGCGGCGGGGAATGGTGGCTCGGTCTGGGCGCTCCACTGTGGCACGACGCGCCTCCCGCCGCAGCCTTCAGAGCTTTACTTCACAGGTACGTGCTGGCGCTGCCTCCGCCGGCCGCGATCTCCCCCCCTGCTCGGTCTCCCCCTCCATCCCCAGCTCGTAGCTCTCCTCCCCGGCCGGTAGCCCCAATGGCGCCCGCGCCGCTCTCGGAGCCACTACGCCTACTACACGACCGCTACAGCCGCGTACACACGTAG

Protein sequence:

>DPOGS207539-PA
MFKLNVVVICEVPSSVGLDLRYPPVHKSATTRSHEIRLYLHRSRVQVKGRRYVISEFKEQKGEGVELKSEGVHSDSDHEHEQEHEHEHEHEHEHEHEHEHEPNVEPGRADHEPLEALDAGRPRKVRRSRTTFTTYQLHELERAFDKTQYPDVFTREELALRLDLSEARVQVWFQNRRAKWRKREKALGREHAPFLHHEHGVGEWGGGMGVGGVGVGVGVGGGEWWLGLGAPLWHDAPPAAAFRALLHRYVLALPPPAAISPPARSPPPSPARSSPPRPVAPMAPAPLSEPLRLLHDRYSRVHT-