Monarch geneset OGS2.0

DPOGS216135
TranscriptDPOGS216135-TA1641 bp
ProteinDPOGS216135-PA546 aa
Genomic positionDPSCF300182 + 567808-604269
RNAseq coverage45x (Rank: top 71%)
Annotation
HeliconiusHMEL0122146e-9590.52% 
BombyxBGIBMGA009432-TA2e-5266.46% 
Drosophilatoy-PA1e-11768.69% 
EBI UniRef50UniRef50_F4WTH37e-13553.78%Paired box protein Pax-6 n=6 Tax=Coelomata RepID=F4WTH3_ACREC
NCBI RefSeqXP_002423925.11e-14063.37%Paired box protein Pax-6, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700063822e-15870.02%twin of eyeless [Tribolium castaneum]
NCBI nr blastxgi|2700063827e-15968.13%twin of eyeless [Tribolium castaneum]
Group
Gene OntologyGO:00036771.5e-50DNA binding
GO:00063551.5e-50regulation of transcription, DNA-dependent
GO:00055158.5e-27protein binding
GO:00435652.4e-26sequence-specific DNA binding
GO:00037002.4e-26sequence-specific DNA binding transcription factor activity
KEGG pathwaybta:2868572e-104 
 K08031 (PAX6)maps-> Maturity onset diabetes of the young
InterPro domain[160-267] IPR0015231.5e-50Paired box protein, N-terminal
[211-275] IPR0119911.3e-33Winged helix-turn-helix transcription repressor DNA-binding
[211-275] IPR0090578.5e-27Homeodomain-like
[362-424] IPR0013562.4e-26Homeobox
[337-422] IPR0122877.7e-26Homeodomain-related
Orthology groupMCL15122 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216135-TA
ATGCAACGATTACATATTCAGGTCTTTGTTTACGAATTCATCGCTGTGAAGGACATTGAAACAAACATTATTTGTGTTGTATCTGTGCGCGAGACTCCGCCCTCCGTGCTCACACGGGTGACTTGCAATCAAGTCGCCAGTCGCTACCGAGCCGTGTTAACGACCGCACCTACATCACAAACATACACACATAACGTTCAATATTATATTAATACAGTGACAGTGTACTATACGAACGTAATAACATACGGACTGTTTTATATTAGTGTGGCGCTGCACGGGGGCTGGGGGCGGGCGGCGCCTGCGCCCGCGCACGCAGCCACCACGCTGGTAGGCGGACTGGTCGCGCCGCCGCGGGAACCAACCACGGCGGCAGAAGCCTTCTGGAAGATGCCGCATAAAGGCGTTTATGCTCTAGACGAGCTGATGCATAGCGCGGCGATGGGTGGTGGAGCGCTGTTCGGATGCTCGTCTGCAGGGCACAGCGGCATCAACCAGCTCGGAGGGGTCTATGTGAACGGCAGACCGCTCCCGGACTCCACTCTCCAAGATACTGGGAAGAACGATTCGTCAAAAAATACCTCTATCCACAGATATTACGAGACTGGTTCCATCAAACCTAGAGCGATCGGTGGTTCGAAGCCAAGAGTGGCGACCACTCCCGTGGTCCAGAAGATAGCTGACTACAAGAGAGAATGTCCATCCATCTTCGCCTGGGAGATAAGGGACCGTCTGCTCAGCGAGAACGTCTGCAACAATGATAATATACCAAGCGTGTCATCAATAAACCGTGTGCTGCGTAATCTCGCCTCTCAGAAGGAGCAGGCAGCGTCAGCACAGAACGACAGCGTTTACGAGAAGCTGAGAATGTTCAACGGCCAGGCGGCCACGGGTTGGTGGTACCCAGGGTTACCGACCGCACCAGCACCAACCATACCCGCGCCGATACCGCAACAGCTGAACAGACCGGAGGAACATAAACGAGCAGATACGCTGCAATCGGAGGCTGGGTCTGATGGGAACAGCGAGCACGCGTCGTCTGGAGATGAAGACTCGCAAATGAGGCTGAGGCTGAAGAGGAAGCTGCAAAGGAACAGAACGTCCTTCACAAACGATCAGATAGATAGTCTCGAAAAAGAGTTCGAGCGCACTCACTACCCGGATGTTTTCGCGCGGGAACGACTGGCGGAAAAGATCGGATTACCTGAGGCACGTATCCAGGTGTGGTTTTCAAACCGTCGAGCTAAGTGGCGTCGTGAGGAGAAGCTTAGGAGCCAAAGAAGAGACGCGCCCGCGTCGCCCGCGCCTCCGGCTAGGCTGCCGTTGAATGGCGGGTTCAACTCCATGTACAGCCCCATACCACAACCTATCGCCACCATGACTGATACATATAGTTCGATGTCGTCCGGTCTGTCGTCCTCGTGTCTCCAGCAACGTGACGGTGGGTATCCGTACATGTTCGGGGACGTCCTCTCGGGCGGCGGGTACAGAGCGCCCGCGGCACACCAGCAACACGCCGCGTACAGCCAGCCACAGAGCGCGGGCAGCACCGGTGTGATATCGGCGGGTGTGAGCGTCCCCGTCCAAATACCTTCTCAGGGGCCGGACCTCGCGTCGAATTACTGGGGTAGGCTTCAGTGA

Protein sequence:

>DPOGS216135-PA
MQRLHIQVFVYEFIAVKDIETNIICVVSVRETPPSVLTRVTCNQVASRYRAVLTTAPTSQTYTHNVQYYINTVTVYYTNVITYGLFYISVALHGGWGRAAPAPAHAATTLVGGLVAPPREPTTAAEAFWKMPHKGVYALDELMHSAAMGGGALFGCSSAGHSGINQLGGVYVNGRPLPDSTLQDTGKNDSSKNTSIHRYYETGSIKPRAIGGSKPRVATTPVVQKIADYKRECPSIFAWEIRDRLLSENVCNNDNIPSVSSINRVLRNLASQKEQAASAQNDSVYEKLRMFNGQAATGWWYPGLPTAPAPTIPAPIPQQLNRPEEHKRADTLQSEAGSDGNSEHASSGDEDSQMRLRLKRKLQRNRTSFTNDQIDSLEKEFERTHYPDVFARERLAEKIGLPEARIQVWFSNRRAKWRREEKLRSQRRDAPASPAPPARLPLNGGFNSMYSPIPQPIATMTDTYSSMSSGLSSSCLQQRDGGYPYMFGDVLSGGGYRAPAAHQQHAAYSQPQSAGSTGVISAGVSVPVQIPSQGPDLASNYWGRLQ-