Monarch geneset OGS2.0

DPOGS214398
Transcript: DPOGS214398-TA, 978 bp
Protein: DPOGS214398-PA, 325 aa
Genomic position: DPSCF300069 - 241326-245357
RNAseq coverage: 165x (Rank: top 51%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL005588        3e-106   74.74%
Bombyx          BGIBMGA011254-TA  2e-121   68.25%
Drosophila      hth-PC            3e-28    52.29%
EBI UniRef50    UniRef50_D6WYT8   3e-83    53.57%    Putative uncharacterized protein n=2 Tax=Endopterygota RepID=D6WYT8_TRICA
NCBI RefSeq     XP_970138.1       5e-84    53.57%    PREDICTED: similar to Homeobox protein PKNOX2 (PBX/knotted homeobox 2) (Homeobox protein PREP-2) [Tribolium castaneum]
NCBI nr blastp  gi|91087887       9e-83    53.57%    PREDICTED: similar to Homeobox protein PKNOX2 (PBX/knotted homeobox 2) (Homeobox protein PREP-2) [Tribolium castaneum]
NCBI nr blastx  gi|91087887       3e-90    54.86%    PREDICTED: similar to Homeobox protein PKNOX2 (PBX/knotted homeobox 2) (Homeobox protein PREP-2) [Tribolium castaneum]
Group:
Gene Ontology:
  GO:0003677  1.4e-22  DNA binding
  GO:0006355  1.4e-22  regulation of transcription, DNA-dependent
  GO:0005634  2.6e-19  nucleus
  GO:0005515  1e-16    protein binding
  GO:0043565  2.5e-09  sequence-specific DNA binding
  GO:0003700  2.5e-09  sequence-specific DNA binding transcription factor activity
KEGG pathway:
InterPro domain:
  [218-282]  IPR012287  1.4e-22  Homeodomain-related
  [233-272]  IPR008422  2.6e-19  Homeobox KN domain
  [214-291]  IPR009057  1e-16    Homeodomain-like
  [215-280]  IPR001356  2.5e-09  Homeobox
Orthology group: MCL15964 (Multiple-copy universal gene)

Nucleotide sequence:

>DPOGS214398-TA
ATGCAGGGCGGCGCCCCCGTCACTGACGCCGATCAGGCACAGTTCGAGGCGGACAAGCGAGCTGTCTACAAACATCCTCTATTCCCGCTTCTGGCGCTGCTGTTAGAGCGATGCGAGCAGGCGACTGCTGGGGCGGAGCCCCCGGCCGCGGATGCCTTCGGGGCCGACCTGCAGGCCTTCGTACAGCACCAGCGAAGGGACCGCCGCCCCTTCCTGGTGGACGACCCCGAGATCGACGGCTTGATGATTAAGAGCATCCAGGTGCTCCGCATACACCTACTGGAGCTGGAGAAGGTGCAGGAGCTGTGCAGGGACTTCTGCGGAAGATACATAGCCTGTCTCAAGACGAAGATGCAAAGTGAAAATCTTCTGAGGACCGATTACTCTGCTGAATTACTAATAATTTCTGTTCTATATGTCCACCAGTCCCCGGCTCCCCCGCCGCCGCCATCTTTCCCCCCGCCTTACCCCGCGCTGACTGATCTAGCCTACCCCAGAGATAGCGCATCTCTCGTAGTCCAAGGTTCAACACCTATCGGTCAAATCGGAGCTGGTTTAATATCCACTGATTCATTTACAGGCACCAACTCTAATTCGTGTTCTTCGGTATCTGGTTCCCCGCCACCCGCTGATGATGATGATGAGGGAGTCAAGCGGGGAGTCCTGCCGCGACACGCGACCCAGGTCATGAGGGCCTGGCTGTTCCAACACCTGGTGCATCCTTATCCCACGGAGGAGGAAAAGCGCTCCCTGGCGGCGCAGACGAGACTCACCCTCCTCCAGGTCAACAACTGGTTCATTAACGCCAGGAGACGCATACTCCAGCCCATGCTGGACTGTCAAGAGAAACCTGGTGGTAAGAAGAGTAAAAACGGGTCATCCATCAGCAAGCGGTACTGGCCGGATGCTCTCACCAACCAGCAGTTCACAGCTGGTATGTGTTCATATGAAGCATATATTTTTATCATAACAGTATAA

Protein sequence:

>DPOGS214398-PA
MQGGAPVTDADQAQFEADKRAVYKHPLFPLLALLLERCEQATAGAEPPAADAFGADLQAFVQHQRRDRRPFLVDDPEIDGLMIKSIQVLRIHLLELEKVQELCRDFCGRYIACLKTKMQSENLLRTDYSAELLIISVLYVHQSPAPPPPPSFPPPYPALTDLAYPRDSASLVVQGSTPIGQIGAGLISTDSFTGTNSNSCSSVSGSPPPADDDDEGVKRGVLPRHATQVMRAWLFQHLVHPYPTEEEKRSLAAQTRLTLLQVNNWFINARRRILQPMLDCQEKPGGKKSKNGSSISKRYWPDALTNQQFTAGMCSYEAYIFIITV-
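
The transcript and protein lengths listed in this record (978 bp and 325 aa) can be cross-checked against the two sequences. The following is a minimal sketch, not part of the original record. It assumes that the transcript is coding sequence from the start codon through the stop codon, that the standard nuclear genetic code applies, and that the trailing `-` in the protein sequence marks the stop codon. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed to match the
    trailing '-' used in this record). Ambiguous bases such as N are
    not handled and raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a transcript that is all CDS, excluding the stop codon."""
    codons, remainder = divmod(transcript_bp, 3)
    if remainder:
        raise ValueError("transcript length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    # First 12 codons and last 3 codons of DPOGS214398-TA, copied from the record.
    print(translate("ATGCAGGGCGGCGCCCCCGTCACTGACGCCGATCAG"))  # MQGGAPVTDADQ
    print(translate("ACAGTATAA"))                              # TV-
    print(expected_protein_length(978))                        # 325
```

Passing the full DPOGS214398-TA sequence to `translate` should, under the same assumptions, give a string that can be compared directly with DPOGS214398-PA.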