Monarch geneset OGS2.0

DPOGS202358
TranscriptDPOGS202358-TA1104 bp
ProteinDPOGS202358-PA367 aa
Genomic positionDPSCF300104 - 208060-224217
RNAseq coverage20x (Rank: top 79%)
Annotation
HeliconiusHMEL0029013e-7596.38% 
BombyxBGIBMGA013358-TA5e-4948.59% 
DrosophilaPoxn-PA7e-6085.19% 
EBI UniRef50UniRef50_D6WT292e-7152.45%Pox neuro n=2 Tax=Endopterygota RepID=D6WT29_TRICA
NCBI RefSeqXP_973036.13e-7252.45%PREDICTED: similar to Pox neuro CG8246-PA [Tribolium castaneum]
NCBI nr blastpgi|910876315e-7152.45%PREDICTED: similar to Pox neuro CG8246-PA [Tribolium castaneum]
NCBI nr blastxgi|910876312e-7351.38%PREDICTED: similar to Pox neuro CG8246-PA [Tribolium castaneum]
Group
Gene OntologyGO:00036779.2e-86DNA binding
GO:00063559.2e-86regulation of transcription, DNA-dependent
GO:00055152.7e-37protein binding
KEGG pathway 
InterPro domain[17-142] IPR0015239.2e-86Paired box protein, N-terminal
[18-145] IPR0090572.7e-37Homeodomain-like
[20-85] IPR0119915e-35Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL19647 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202358-TA
ATGTTTACTTATTCTAGATTCGTAGTTACAAATGCTGGTACAATTCCAGGTCAAGCAGGCGTGAACCAGCTCGGGGGTGTGTTCGTGAACGGTCGCCCTCTCCCAGACGTGGTGAGGAAGAGGATCGTGGAGCTAGCCATCCTCGGAGTACGGCCGTGTGACATCAGCAGACAACTACTCGTATCACACGGCTGCGTCTCAAAAATATTGACCAGATTCTACGAAACTGGATCCATAAGACCCGGTTCCATCGGCGGGAGTAAGACCAAGCAAGTGGCTACTCCTACGGTGGTGAAGAAGATACTACGTCTGAAACAAGAGAACCCAGGCATGTTTGCGTGGGAAATACGTGAGCGGTTGCTGAGCGCGAGGGTCTGCGAGCCTCACTCCATACCCTCAGTGTCATCTGTTAACAGGATATTGCGGAACAGCGGCCTGGTATGGAACGAGGAAGACGGAAGACACGAACCATTCCCGCCATCGGAGTTACAAAACAACATGGCGGACTATATGTCAATGAAGACGCCGTTACCTCCCATACAGCAGACGTCTCCGTATTTCGCGCACAACTCCGTTAGAGTTCAACCTCCGACAGAAACGACTTATGATCGCAGACTCGCCACATCATGGCTATTGGCCAATCAGGTTCAAGCTCAAACACTCCTGAAACCGTATCCGATATCCCCCTGGCAGAGAGTCATGATGCCGTATCCGGATTCGAAGAACTTCACACCGTACGCTCTGAATCTACACAGCGAGCTTCTGAACAGAATCAACCCGGAAGAAGTGAAATCGGAGACATCCGAACACATCTCCGTCGAGACGAGCGACGATAGCACAGATAAACCGGAAGATCAAGAAGCTAAGAAGGAAAAAAAGAAGAATCCGTACTCCATAGAAGAGTTACTGAAGAAGCCGGACAAAATAGTAACATCAAATCCCGTCGCGTTCCAGAACTTTCTGCGTCAACCGAGCGGCAGCATGATAGAGTACAGCCAAGAGAAGACCAGCGACAGAAGCTCTCCAGCGAGCTACTGTTCCATCCACAGCGGCACGTCCAACGACTTCGATTCAACCAGTTCAGAACTAAAAACGGGTAATTGA

Protein sequence:

>DPOGS202358-PA
MFTYSRFVVTNAGTIPGQAGVNQLGGVFVNGRPLPDVVRKRIVELAILGVRPCDISRQLLVSHGCVSKILTRFYETGSIRPGSIGGSKTKQVATPTVVKKILRLKQENPGMFAWEIRERLLSARVCEPHSIPSVSSVNRILRNSGLVWNEEDGRHEPFPPSELQNNMADYMSMKTPLPPIQQTSPYFAHNSVRVQPPTETTYDRRLATSWLLANQVQAQTLLKPYPISPWQRVMMPYPDSKNFTPYALNLHSELLNRINPEEVKSETSEHISVETSDDSTDKPEDQEAKKEKKKNPYSIEELLKKPDKIVTSNPVAFQNFLRQPSGSMIEYSQEKTSDRSSPASYCSIHSGTSNDFDSTSSELKTGN-