Monarch geneset OGS2.0

DPOGS216048
Transcript        DPOGS216048-TA   1350 bp
Protein           DPOGS216048-PA   449 aa
Genomic position  DPSCF300067 - 192037-198162
RNAseq coverage   30x (Rank: top 76%)
Annotation
Heliconius      HMEL015048         0.0      92.00%
Bombyx          BGIBMGA009025-TA   0.0      88.50%
Drosophila      gsb-PA             8e-101   76.07%
EBI UniRef50    UniRef50_G6DKT3    0.0      100.00%   Gooseberry n=2 Tax=Endopterygota RepID=G6DKT3_DANPL
NCBI RefSeq     XP_974212.2        4e-133   59.74%    PREDICTED: similar to AGAP010358-PA [Tribolium castaneum]
NCBI nr blastp  gi|270012821       3e-134   60.48%    gooseberry [Tribolium castaneum]
NCBI nr blastx  gi|270012821       1e-129   60.31%    gooseberry [Tribolium castaneum]
Group
Gene Ontology   GO:0003677   2.1e-82   DNA binding
                GO:0006355   2.1e-82   regulation of transcription, DNA-dependent
                GO:0005515   2e-41     protein binding
                GO:0043565   4.3e-26   sequence-specific DNA binding
                GO:0003700   4.3e-26   sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain [1-121]     IPR001523   2.1e-82   Paired box protein, N-terminal
                [1-122]     IPR009057   2e-41     Homeodomain-like
                [1-65]      IPR011991   1.2e-36   Winged helix-turn-helix transcription repressor DNA-binding
                [159-230]   IPR012287   3.1e-27   Homeodomain-related
                [163-225]   IPR001356   4.3e-26   Homeobox
Orthology group MCL12776   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216048-TA
ATGAATCAGTTAGGCGGAGTCTTCATTAACGGACGTCCTCTGCCGAACCACATCCGTCTAAAAATCGTGGAGATGGCAGCGGCGGGTGTTCGGCCTTGTGTCATCTCAAGACAGTTGAGGGTCTCCCACGGCTGTGTCTCTAAAATACTCAATAGATACCAGGAAACTGGATCTATTCGCCCTGGTGTGATAGGGGGATCGAAGCCGAGAGTAGCCACGCCCGAAGTTGAAAACAGGATCGAAGAATTGAAGAGACAAAACCCAGGTATATTTTCCTGGGAAATTAGAGATAAATTAATAAAAGAAGGCATATGTGATAAAAACACCGCGCCATCAGTAAGTTCGATTTCGCGACTCATAAGAGGAGGCAAAAGGGACGAATCAGATCCTAGAAGGAACCACAGTATAGATGGTATTCTTGGACCATCCTCATCGTGTGAGGATAGTGATACGGAGTCTGAGCCTGGTATAACGTTAAAAAGAAAACAACGAAGATCTAGAACAACCTTTTCTGGAGATCAACTTGAAGCTCTCGAGCGCGCTTTCACGCGAACCCAATACCCAGACGTTTACACTCGTGAAGAATTAGCTCAAAAGACGAAGTTGACCGAAGCACGTGTTCAGGTATGGTTCTCAAACCGAAGAGCACGTCTTCGCAAACAACTGAACTCACAACAATTGAGTGCTTTTAATACAATGTCTTTACAATCTGCATTTCCCTCCGTTCATCAACAATACGAACCACCAACAACATTTAATGCACAGTGCGCGTCGTGGCAACAATCATATTCTGCTTTGGGTAGCAGTTCTGTTCTGAATTCTGCTTTGGCACCTTCTTTACATCAGTCATCATTATCAGCTCCTTCTGTTTGTCAATCTGCTCTTACAGCACCATCCCTCCATCCTCCCACATCGAGTTCATATTCATCAGGAAACTTGACGCCATTGTCACATTCATCTGAACTGCCTACACCTATACAAGCCTCTACTGATGCTACACCACCAAGTTCAAGCCCAATCACTTCCAGCCCTGCTGGAAATCAAAGCGGTGGCATCACTTACCAACATCCAACTTACACTAATACTAGTGATGTAGTTTCACACCCATACGGCTATGGTGACTACGCAAAACAAGAACATATGTCAGCACATAACCACTGGACTTCAAGACAACTGAGTGGACATTCACAAAATAAATTAGCAGAAGTCAGCGCTTGGCCAGAAAATTATAGTTCATTCTTTGGCGCGAATACTCACTATGCGTCGCACGCGCATTCACCTAGTGAAGCAAAGTCTGGTTATCCCTATATAGGACAGCTTGGCGGAATGGATATGGGGAGAGTTCATTGA

Protein sequence:

>DPOGS216048-PA
MNQLGGVFINGRPLPNHIRLKIVEMAAAGVRPCVISRQLRVSHGCVSKILNRYQETGSIRPGVIGGSKPRVATPEVENRIEELKRQNPGIFSWEIRDKLIKEGICDKNTAPSVSSISRLIRGGKRDESDPRRNHSIDGILGPSSSCEDSDTESEPGITLKRKQRRSRTTFSGDQLEALERAFTRTQYPDVYTREELAQKTKLTEARVQVWFSNRRARLRKQLNSQQLSAFNTMSLQSAFPSVHQQYEPPTTFNAQCASWQQSYSALGSSSVLNSALAPSLHQSSLSAPSVCQSALTAPSLHPPTSSSYSSGNLTPLSHSSELPTPIQASTDATPPSSSPITSSPAGNQSGGITYQHPTYTNTSDVVSHPYGYGDYAKQEHMSAHNHWTSRQLSGHSQNKLAEVSAWPENYSSFFGANTHYASHAHSPSEAKSGYPYIGQLGGMDMGRVH-