Monarch geneset OGS2.0

DPOGS210970
TranscriptDPOGS210970-TA1575 bp
ProteinDPOGS210970-PA524 aa
Genomic positionDPSCF300004 - 287523-290340
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0250320.080.81% 
BombyxBGIBMGA006400-TA0.075.24% 
Drosophilapb-PA6e-4869.23% 
EBI UniRef50UniRef50_D6W9442e-7543.22%Proboscipedia n=5 Tax=Tribolium castaneum RepID=D6W944_TRICA
NCBI RefSeqNP_001107807.13e-7643.22%maxillopedia [Tribolium castaneum]
NCBI nr blastpgi|154503274e-7543.12%homeodomain transcription factor Maxillopedia [Tribolium castaneum]
NCBI nr blastxgi|1672342101e-9344.32%maxillopedia [Tribolium castaneum]
Group
Gene OntologyGO:00063554e-29regulation of transcription, DNA-dependent
GO:00435654e-29sequence-specific DNA binding
GO:00037004e-29sequence-specific DNA binding transcription factor activity
GO:00036771.1e-24DNA binding
GO:00055157.6e-24protein binding
KEGG pathway 
InterPro domain[31-93] IPR0013564e-29Homeobox
[31-93] IPR0122871.1e-24Homeodomain-related
[5-92] IPR0090577.6e-24Homeodomain-like
[53-64] IPR0204792.1e-07Homeobox, eukaryotic
Orthology groupMCL22123 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210970-TA
ATGAGCAAAAACTTGCCGAGTTGCACTGGCGCGGGCGGCTACGAACAATCGCTAAGGAAATTACTTCGAATTACGTACGAAAATGGCTTGCCTCGTAGGCTGCGCACAGCTTACACAAATACTCAGCTGTTGGAATTGGAGAAGGAATTTCACTTCAACAAGTACCTGTGCAGGCCGCGAAGAATTGAGATTGCTGCGTCCCTTGACCTTACTGAGAGACAGGTTAAAGTATGGTTTCAAAATCGACGAATGAAACACAAACGACAAACGTTAAGCAAAAGCGAAGACGGCGATGATAAGGACTCCACTACTTCCGAGGGTGGCAAGAGTTCTAAAACAGGTTTGGAAAAATTCCTTGATGACGACGGTCCTCTATCGGGCAAGAAGAGTTGTCAAGGATGTGAACTGCCACCAGGTGCTCTATGCTCTCCTACTGAAGATCTTCCGGAGCTGACATCCCGAACAAGAAATAATAATACTCCAAGCGCAACCAATAACAATAGCTTTGGGAGTGACGGTGCTTCTAGTGTTGCTTCATCTTCTTCGCTTGATAAACTCGCAGAAGAAGACTCCCGAGAAGGTCCACCTCCGGTTAACGCTGTTAATGTACCTAGGAATCTAGCGAAAAGAATTAAGCAGGAATCAAGGAAGCGATCTCCATCATTAGATGCCACAGGATGTAAAGTGTCTCCATCGTCTTCTAAAGAGGGCCTTATAGGAGTTACGGGTTTACCAGATGGCAAATTCTCATCAGTAAACTTAACACCATCATCTACCCCCGGCACACCGTCTAGTATGCATCAAAGTCCACTCGGAGCATATCCCAGGCCCTCACCCCCACACGCACCTGGAGCTCCTTTGACACAAACCGTACCTAATGCGATACCACCATATGTAATTAGAGGCAATGCACCTCCAGGTCAATTTGTACCTCACCCCGACTTTCGCATGGAAACAAAACAATTCGTCGGTAAGCTCGCCCAGTACCCGCAAAATAACAGATCGTATGAAGCTTACTCAGCGCTACAAGGTAGCGACCATCATATATATCCAGGACGTAATCCGACTTCAAGAGCAACTAACGGCATTGGTTCTAGACAATCTTATCCTCACGAAATGTATCAGAACTATGCTTACACTGGATACGGAAAAGATCAGACCGGGTATGGTCATCCCGGTTACGATCAAGGTCAAAGTTACCCAGCCGAAATGGGCTACCCCAATAGCCATTACGGATACCACTACCACGAAAGTGGACAGCACGACCACGGCCACGGATACTTCAGTAGCGAAGGACAGAAGAACATGCACGGCCACGAATATTCAAAGAATTATTACGACACAAACTCTTACAATCAACAAGGGAGTACACAACCCAGTTATGGTCCCAACAACCCGCAAGGCGAAGGTTACACAGGAACAGCTGAGTGCGGGGAGGGTTACGGATCTTTTCAACAATTCTATGAAGCGACTCACGCTACACCCGCGACCGGAGAAAACTCTAATTCCTCGTCAGACTTCCACTTTCTAAGCAATCTGGCTAACGACTTTGCTCCTGAATATTACACCATTTGA

Protein sequence:

>DPOGS210970-PA
MSKNLPSCTGAGGYEQSLRKLLRITYENGLPRRLRTAYTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQVKVWFQNRRMKHKRQTLSKSEDGDDKDSTTSEGGKSSKTGLEKFLDDDGPLSGKKSCQGCELPPGALCSPTEDLPELTSRTRNNNTPSATNNNSFGSDGASSVASSSSLDKLAEEDSREGPPPVNAVNVPRNLAKRIKQESRKRSPSLDATGCKVSPSSSKEGLIGVTGLPDGKFSSVNLTPSSTPGTPSSMHQSPLGAYPRPSPPHAPGAPLTQTVPNAIPPYVIRGNAPPGQFVPHPDFRMETKQFVGKLAQYPQNNRSYEAYSALQGSDHHIYPGRNPTSRATNGIGSRQSYPHEMYQNYAYTGYGKDQTGYGHPGYDQGQSYPAEMGYPNSHYGYHYHESGQHDHGHGYFSSEGQKNMHGHEYSKNYYDTNSYNQQGSTQPSYGPNNPQGEGYTGTAECGEGYGSFQQFYEATHATPATGENSNSSSDFHFLSNLANDFAPEYYTI-