Monarch geneset OGS2.0

DPOGS207210
TranscriptDPOGS207210-TA1158 bp
ProteinDPOGS207210-PA385 aa
Genomic positionDPSCF300001 + 5994616-5997116
RNAseq coverage154x (Rank: top 53%)
Annotation
HeliconiusHMEL0061814e-13467.07% 
BombyxBGIBMGA010706-TA9e-9368.66% 
Drosophilabap-PA8e-3275.90% 
EBI UniRef50UniRef50_D6WZY43e-3657.24%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZY4_TRICA
NCBI RefSeqXP_975015.15e-3757.24%PREDICTED: similar to Homeobox protein bagpipe (NK-3) [Tribolium castaneum]
NCBI nr blastpgi|910910109e-3657.24%PREDICTED: similar to Homeobox protein bagpipe (NK-3) [Tribolium castaneum]
NCBI nr blastxgi|910910103e-3457.24%PREDICTED: similar to Homeobox protein bagpipe (NK-3) [Tribolium castaneum]
Group
Gene OntologyGO:00036774.7e-24DNA binding
GO:00063554.7e-24regulation of transcription, DNA-dependent
GO:00435655.7e-23sequence-specific DNA binding
GO:00037005.7e-23sequence-specific DNA binding transcription factor activity
GO:00055152.9e-22protein binding
KEGG pathway 
InterPro domain[138-222] IPR0122874.7e-24Homeodomain-related
[162-224] IPR0013565.7e-23Homeobox
[136-223] IPR0090572.9e-22Homeodomain-like
[184-195] IPR0204791.6e-06Homeobox, eukaryotic
Orthology groupMCL19587 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207210-TA
ATGGAGTCCAAATTAAATTACAAAGACACTCGCATACAGGACAGTGATAAAACCAATATAAATCTGACGACACCGTTCTCAATAAACGACATTCTGACTAGAGAGAATGAATCCAAGTGCAATTTTGAAAATGGAATGTTTTGCTCGGACGCGTTCGGCGGGAAAATGAAATGTCACAAACCGGCCAGTTACAATAAAGACGATTATAAGAAAGAGGCCACTGAAAAGAGTATGAATTACTACGACGATAATTATAAGGATTACACTGATGATGGAGCCATTGATATGTCTAGGAAGAATAATTTCCCAGTCACAGAATTATCCGATGACTACGACTCCCGCTCGTCCAACACTGGCTCTCCGCGACGTCACTCCTCCCCCTCCTGCGACAGCTACCGGCCCTTCATACAGTACGACAGGAAGTGCAGCACCCCCACAGACTCTAGACACGGGTACATGGAGTACACACAGAACAGCGGGCGCAAGAAACGCTCCAGGGCCGCCTTCTCCCACGCTCAGGTGTACGAGCTGGAGAGGAGATTCAGTCAACAGAGGTATCTATCGGGACCAGAGCGCGCTGACCTCGCCGTCAGTCTGAAACTGACGGAGACGCAGGTCAAGATCTGGTTCCAGAACCGACGGTACAAGACTAAGAGGAAACAGATGCAGTTACAAGAGAGCGGCTTACTAGCGAACCACGCGAGGAAGGTGGCTGTGAAGGTGCTGGTTAACGGACAGCTCCCGGACATTAAGGCGTACCAAGCACCGTGCAAGAACCCAGCACTGTTCCCGCCACCGATGCTTGATAACTCGATAATAAAGCACTTCGCCGGCGTCTACGATTTTGCTAAAAATCCCGAATACAAGGCACTCCTGCAAGAGTCCTACAACCAGGCTGTGTTACAATCGCTGTACGCTCCGCACATGGGGGTAAATTACCCGAATTTACCCTTATCGTATATGTATTACCCCGGACATCCAGCGTTCGGGATGCCGTGCGACATGAATGGGGAGACTAGCGACAAGTACGCGGGTAACAGGGGTAACGGGGGTAACAAAGGTAACGAGGGTAACAAGGGTAACGATAAATGCAAACCAATGGACGTCAACGAGAATAGCAACGATAGTGTTGCGAGCAACATAGAAGTCGAAAATTAA

Protein sequence:

>DPOGS207210-PA
MESKLNYKDTRIQDSDKTNINLTTPFSINDILTRENESKCNFENGMFCSDAFGGKMKCHKPASYNKDDYKKEATEKSMNYYDDNYKDYTDDGAIDMSRKNNFPVTELSDDYDSRSSNTGSPRRHSSPSCDSYRPFIQYDRKCSTPTDSRHGYMEYTQNSGRKKRSRAAFSHAQVYELERRFSQQRYLSGPERADLAVSLKLTETQVKIWFQNRRYKTKRKQMQLQESGLLANHARKVAVKVLVNGQLPDIKAYQAPCKNPALFPPPMLDNSIIKHFAGVYDFAKNPEYKALLQESYNQAVLQSLYAPHMGVNYPNLPLSYMYYPGHPAFGMPCDMNGETSDKYAGNRGNGGNKGNEGNKGNDKCKPMDVNENSNDSVASNIEVEN-