Monarch geneset OGS2.0

DPOGS207209
TranscriptDPOGS207209-TA1032 bp
ProteinDPOGS207209-PA343 aa
Genomic positionDPSCF300001 + 5905476-5917164
RNAseq coverage105x (Rank: top 60%)
Annotation
HeliconiusHMEL0061782e-6977.30% 
BombyxBGIBMGA010693-TA1e-4964.97% 
DrosophilaDr-PA5e-4045.25% 
EBI UniRef50UniRef50_D6WZY94e-6151.11%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZY9_TRICA
NCBI RefSeqXP_975059.17e-6251.11%PREDICTED: similar to Drop CG1897-PA [Tribolium castaneum]
NCBI nr blastpgi|910910141e-6051.11%PREDICTED: similar to Drop CG1897-PA [Tribolium castaneum]
NCBI nr blastxgi|910910143e-6248.75%PREDICTED: similar to Drop CG1897-PA [Tribolium castaneum]
Group
Gene OntologyGO:00063555.6e-26regulation of transcription, DNA-dependent
GO:00435655.6e-26sequence-specific DNA binding
GO:00037005.6e-26sequence-specific DNA binding transcription factor activity
GO:00036771e-22DNA binding
GO:00055155.5e-22protein binding
GO:00056349.8e-07nucleus
KEGG pathway 
InterPro domain[223-285] IPR0013565.6e-26Homeobox
[203-283] IPR0122871e-22Homeodomain-related
[214-292] IPR0090575.5e-22Homeodomain-like
[252-261] IPR0000479.8e-07Helix-turn-helix motif, lambda-like repressor
[245-256] IPR0204794.9e-06Homeobox, eukaryotic
Orthology groupMCL13978 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207209-TA
ATGGCGATACAGAGCGTGTCCAAGTATCAGAGCAGTGCCGTCGAAGCCACAGAGCCTCCGAAGGCTTCGAGGATAAGTTTCAGCGTAGCATCGATATTGGCTGACACGAAACAGAATGAAACCGCCGAGATGTTGAGACATCATCTGACGATATCGGAATCTCCGCTACGACAGTCCCCTGCTTCTCAACCGGAACTGCCTGGGGGGAAATCTCTGTCGCGACCGGCTTCCACGACGCCGCCGCTCAATCTGAACGTAACCGCGTCCTCGGACGAGGAGTACGAAGATTCCGTGAAAGAGGACTCGATAGTGGATGTCGAAGATTTACAGAACAGTCTTCAGAGTGACGATGATGAGAGAGTGGATAAGGAGCGGATGGGTCCAATACGCCCGACGCCGTTCAGCGCGTTGGCGGCGGCGGCGGCGGCCTACCACAGCCTCACCTGGCCGGCACCACCCTCAGTGGTGCCAACCTTCGGACCGATGTTCCAATCACACTTCCCTGTCGGACACATTACCGATAACATCACAAGTGTAGATTATACGGTAGTGGGATCCCATCACGCCATCCGCACTTCCTGCAAGAGCAAATTAGAGGCCGCAAACAATCATGATGCAAACGGAGAGCCGCCCAAGTTGAAGTGCAATTTGCGTAAGCACAAGCCGAATCGCAAGCCGCGGACACCGTTCACAGCACAGCAATTACGTGCTCTGGAGAGCAAGTTTGTGGATAAGCAATATCTGAGCATTGCTGAGCGAGCGGAATTCTCCTCGTCACTCGGCCTGTCAGAGACTCAGGTAAAAATCTGGTTCCAGAATCGTAGGGCGAAAGCAAAACGAGTTCAGGAAGCTGAGATAGAAAAGTTGAAAATGGCACAGTTTGCTCGTCATCCGCATCACATGTACACGCATCCCTTACAGCAGTACTTCCCCCATCACCTCATGGGCAGACCCTTGCCGCCAATGATGCCGCACTTGACCATGCAATCACCAGGATCCCCATCACCAAACACACAGACAAACGGACAGTGA

Protein sequence:

>DPOGS207209-PA
MAIQSVSKYQSSAVEATEPPKASRISFSVASILADTKQNETAEMLRHHLTISESPLRQSPASQPELPGGKSLSRPASTTPPLNLNVTASSDEEYEDSVKEDSIVDVEDLQNSLQSDDDERVDKERMGPIRPTPFSALAAAAAAYHSLTWPAPPSVVPTFGPMFQSHFPVGHITDNITSVDYTVVGSHHAIRTSCKSKLEAANNHDANGEPPKLKCNLRKHKPNRKPRTPFTAQQLRALESKFVDKQYLSIAERAEFSSSLGLSETQVKIWFQNRRAKAKRVQEAEIEKLKMAQFARHPHHMYTHPLQQYFPHHLMGRPLPPMMPHLTMQSPGSPSPNTQTNGQ-