Monarch geneset OGS2.0

DPOGS211000
TranscriptDPOGS211000-TA1170 bp
ProteinDPOGS211000-PA389 aa
Genomic positionDPSCF300004 + 399096-406877
RNAseq coverage43x (Rank: top 72%)
Annotation
HeliconiusHMEL0250539e-17793.85% 
BombyxBGIBMGA006486-TA2e-8889.10% 
DrosophilaDfd-PA7e-6665.85% 
EBI UniRef50UniRef50_Q9XZR56e-16889.29%Dfd protein n=2 Tax=Arthropoda RepID=Q9XZR5_BOMMO
NCBI RefSeqNP_001037341.11e-16889.29%transcription factor deformed [Bombyx mori]
NCBI nr blastpgi|1129836142e-16789.29%transcription factor deformed [Bombyx mori]
NCBI nr blastxgi|1129836140.090.05%transcription factor deformed [Bombyx mori]
Group
Gene OntologyGO:00063552.3e-26regulation of transcription, DNA-dependent
GO:00435652.3e-26sequence-specific DNA binding
GO:00037002.3e-26sequence-specific DNA binding transcription factor activity
GO:00036773.6e-26DNA binding
GO:00055151.4e-23protein binding
GO:00056342.4e-05nucleus
KEGG pathway 
InterPro domain[181-243] IPR0013562.3e-26Homeobox
[175-241] IPR0122873.6e-26Homeodomain-related
[165-238] IPR0090571.4e-23Homeodomain-like
[203-214] IPR0204792.7e-08Homeobox, eukaryotic
Orthology groupMCL17806 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211000-TA
ATGAACGGTGGATACCAGCCGCAGCCCGATCCGAAATTCCCACCCTCGGAAGAGTACAGCCAGGCTGATTACATACCACCAGGAGAATACTACCAGCACCAACACCTACAGTATGGCTACCAGCACCAGTATCCAGTACAGATTGGTACAGGATATGGATATGGTGGTTATTACCATCCTATACCACAGCACCCACCTGTAGCGCCACCACCACGGCCGCCGGAACCGTCTCATCCGTCTGAAGCGGACCACGGCTTTGTACCGCACAACGATACAGCCCCAGGATTAGCTGAACTGGGGTTAAGGCTGGAGAGGCATATAGAAGAAGCTGCACCAGCTGGGAGAAAACTCCAGGCGTTAGGCCGCGAAGGCTCACCCGCCTCGGAGTTGGATGACGAAAGACTACTCCTAGACAGCCCTCCGGTAGAAGACGATGATTCCTCTATATGTTCTGATAATACAGATAGAGTTATTTACCCCTGGATGAAGAAGATTCATGTCGCTGGAGCCTCGAATGGAAACTATCAACCTGGAATGGAGCCTAAGAGACAGCGGACGGCGTACACGAGGCACCAAATATTAGAATTGGAGAAGGAGTTCCATTACAATCGCTACCTTACTAGAAGAAGGAGAATAGAAATAGCGCACACTCTAGTCCTATCGGAGAGACAAATAAAAATATGGTTCCAGAACAGAAGAATGAAATGGAAGAAAGACAATAAACTACCGAATACCAAAAATGTTAGGAGAAAAACTAATCCGGCCGGAGTGACTACGACCACAACAAAGGGAGCCACGCCAAAGAGTAGATCTAATAAAAATACAAATAATAATGATAAGAAAAAATCAAATAACAATGGACGTCCGGACAGTATAGAGAATGTGACAGATAATATTATGAACGAGCTGCACTGTGTCGGTGTTCAGCAGAACATGTCGCATGATCCGGCGATAAGTCCGATGACGCATCCATCTAACCAGATGAATCCAGGACATCCTATGACGGCACACCACTTGCACATGATGGGAATGCACGCTGGGCTGGACCCGTCACTAGTGGCCAATCTATCAGCCCACGGGTCACCAGTTGGCAGTACCGGCTGCGGGACTGGCATCAGTACTGGCAACGCACCCGTCATCAAATCGGACTATGGACTCACCGCCTTATAA

Protein sequence:

>DPOGS211000-PA
MNGGYQPQPDPKFPPSEEYSQADYIPPGEYYQHQHLQYGYQHQYPVQIGTGYGYGGYYHPIPQHPPVAPPPRPPEPSHPSEADHGFVPHNDTAPGLAELGLRLERHIEEAAPAGRKLQALGREGSPASELDDERLLLDSPPVEDDDSSICSDNTDRVIYPWMKKIHVAGASNGNYQPGMEPKRQRTAYTRHQILELEKEFHYNRYLTRRRRIEIAHTLVLSERQIKIWFQNRRMKWKKDNKLPNTKNVRRKTNPAGVTTTTTKGATPKSRSNKNTNNNDKKKSNNNGRPDSIENVTDNIMNELHCVGVQQNMSHDPAISPMTHPSNQMNPGHPMTAHHLHMMGMHAGLDPSLVANLSAHGSPVGSTGCGTGISTGNAPVIKSDYGLTAL-