Monarch geneset OGS2.0

DPOGS200482
TranscriptDPOGS200482-TA1134 bp
ProteinDPOGS200482-PA377 aa
Genomic positionDPSCF300158 - 188273-214457
RNAseq coverage29x (Rank: top 76%)
Annotation
HeliconiusHMEL0128040.092.84% 
BombyxBGIBMGA010418-TA9e-6893.57% 
DrosophilaPtx1-PD1e-9960.26% 
EBI UniRef50UniRef50_D6WA805e-10057.21%Ptx1 n=4 Tax=Coelomata RepID=D6WA80_TRICA
NCBI RefSeqNP_001091838.10.092.88%pituitary homeobox1 [Bombyx mori]
NCBI nr blastpgi|1482986611e-17992.88%pituitary homeobox1 [Bombyx mori]
NCBI nr blastxgi|1482986610.092.88%pituitary homeobox1 [Bombyx mori]
Group
Gene OntologyGO:00036775.7e-29DNA binding
GO:00063555.7e-29regulation of transcription, DNA-dependent
GO:00435657.3e-26sequence-specific DNA binding
GO:00037007.3e-26sequence-specific DNA binding transcription factor activity
GO:00055155.7e-24protein binding
GO:00056341.9e-08nucleus
GO:00072751.9e-08multicellular organismal development
KEGG pathway 
InterPro domain[32-375] IPR0162336.8e-123Homeobox Pitx/unc30
[120-202] IPR0122875.7e-29Homeodomain-related
[141-203] IPR0013567.3e-26Homeobox
[132-210] IPR0090575.7e-24Homeodomain-like
[330-346] IPR0036541.9e-08Paired-like homeodomain protein, OAR
Orthology groupMCL10826 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200482-TA
ATGGAGCCGCTCAGCGAGAACCTTTGCCTCAGTGAGCTGGACCTCCACGGAGGACACCTCCACGAGTCCGTGGCGGCCTCCGTGGGCTCGGCCATCTCCAGTCTAATGGCTCCTCTACATCACGAGCCCCAACACGCACCCTTGCACCACGCGCCGCCTCTTCATCACGAACCCTTGGAGAAACTTAAATTGTGGGCTGAGACTGGTGAGTTCCGGGACAGTGGCCACTCGTCCCTGGCGTCAGGGTCCAGCGTGGAACAGTCGCTGTACGCTCCGCCGAGACCACGGAGAGATAGGAAGATCACCGATGACAATCGTCAAATGTCATCCGAGCCGACAATAAAGACGGAATCGACGACGCCGGGGGTCGGGCCTGATGAATCCTCGTTGGATGATAAGGGGGATAAAAAGAACAAGCGGCAAAGAAGACAGCGGACCCACTTCACCAGCCAACAGTTGCAAGAGTTGGAGGCGACCTTCGCCAGAAACCGCTACCCGGATATGTCCACCCGAGAAGAGATAGCCATGTGGACCAACCTCACTGAGGCGAGGGTCAGGGTATGGTTCAAAAACAGGAGAGCCAAGTGGAGGAAGCGCGAGAGGAATGCGATGAACGCAGCGGCAGCGGCCGCGGCCGACTTCAAGAACGGTTTCAGCACTCAGTTCAACGGTTTGATGCAGCCCTTTCCCGACACCGAGGCCTTGTACTCCTCGTATCCTTACAACAACTGGGCGGCTAAGGTCCCGAGCCCTTTGGGGACGAAGAGTTTCCCGTGGCCGGTGAATCCGCTCGGATCCGTGGTGCCCGCGAGCCATCACCAGGGTTCCGTGAACTGTTTCAACACGGCGACCTCGATGGGCGCGGTGGGCGGCGGCGGGATGGCCGGCGTCGGGAGTTCCGCAGGCGTGGGAGGCGCCGTCGCCCCCTGTCCCTACACGGCTCCCCCCAACCCGTACAGTATGTACCGCGCCGAGCCTTGCCCGGCGATGTCGTCGTCCATCGCCTCGCTGCGGCTCAAGGCCAAGCAGCACTCCTCGGGCTTCAGTAGCGGCTACGGCGGCGTGTCGCCCGTGTCCAGGGCCGGCTCGGCGCCTCTCTCGGCCTGTCAGTACGGCGGCGGCGAGCGGCTCTGA

Protein sequence:

>DPOGS200482-PA
MEPLSENLCLSELDLHGGHLHESVAASVGSAISSLMAPLHHEPQHAPLHHAPPLHHEPLEKLKLWAETGEFRDSGHSSLASGSSVEQSLYAPPRPRRDRKITDDNRQMSSEPTIKTESTTPGVGPDESSLDDKGDKKNKRQRRQRTHFTSQQLQELEATFARNRYPDMSTREEIAMWTNLTEARVRVWFKNRRAKWRKRERNAMNAAAAAAADFKNGFSTQFNGLMQPFPDTEALYSSYPYNNWAAKVPSPLGTKSFPWPVNPLGSVVPASHHQGSVNCFNTATSMGAVGGGGMAGVGSSAGVGGAVAPCPYTAPPNPYSMYRAEPCPAMSSSIASLRLKAKQHSSGFSSGYGGVSPVSRAGSAPLSACQYGGGERL-