Monarch geneset OGS2.0

DPOGS216098
TranscriptDPOGS216098-TA708 bp
ProteinDPOGS216098-PA235 aa
Genomic positionDPSCF300182 - 416708-439747
RNAseq coverage20x (Rank: top 80%)
Annotation
HeliconiusHMEL0097025e-6297.41% 
BombyxBGIBMGA009432-TA3e-7075.14% 
Drosophilatoy-PA1e-7381.66% 
EBI UniRef50UniRef50_F4WTH33e-7367.12%Paired box protein Pax-6 n=6 Tax=Coelomata RepID=F4WTH3_ACREC
NCBI RefSeqXP_001944246.14e-7582.66%PREDICTED: similar to toy [Acyrthosiphon pisum]
NCBI nr blastpgi|2700063827e-7778.33%twin of eyeless [Tribolium castaneum]
NCBI nr blastxgi|2700063822e-7571.63%twin of eyeless [Tribolium castaneum]
Group
Gene OntologyGO:00036776e-90DNA binding
GO:00063556e-90regulation of transcription, DNA-dependent
GO:00055151.3e-40protein binding
KEGG pathwayxtr:4484475e-70 
 K08031 (PAX6)maps-> Maturity onset diabetes of the young
InterPro domain[18-142] IPR0015236e-90Paired box protein, N-terminal
[19-144] IPR0090571.3e-40Homeodomain-like
[21-86] IPR0119911.1e-36Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL18880 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216098-TA
ATGCATAGCGCGGCGATGGGTGGTGGAGCGCTGTTCGGATGCTCGTCTGCAGGGCACAGCGGCATCAACCAGCTCGGAGGGGTCTATGTGAACGGCAGACCGCTCCCGGACTCCACGAGGCAGAAGATAGTGGAACTGGCGCACTCCGGAGCACGACCCTGCGACATCAGCAGGATACTCCAAGTCAGCAACGGATGCGTCTCCAAGATACTGGGAAGGTATTACGAGACCGGTTCTATAAGACCGCGCGCCATCGGCGGTTCCAAGCCCCGTGTGGCTACAGCCGAAGTTGTTAGCAAGATAGCCCAGTACAAGAGAGAATGTCCATCCATCTTCGCCTGGGAGATAAGGGACCGTCTGCTCAGCGAGGGTGTCTGCACGTCAGATAATATTCCAAGCGTTTCTTCAATAAACCGTGTGTTACGAAACCTGGCTGCTCAGAAAGAAAAGTCAAGCAACCAGCAACCATCCAACGACTGTTCGACGCCTGTATACGAGCGGTTGCGATTACTTGGTACCCCGGGATCGGCTCCGACGTGGCCGAGGTCGCCCTGGCCGACGCAGATAGACACCAGAACACCACCTTACCAACTACACAGCTTGAGTCCTGGACCTCAGGCTATTGGCTGCAACGGCACAGAACTGCCGGTCATGAAGAAAGGGTTGAAAGTCCAGTGGGCGGAGCGCGCATTACTGCCGCCCTGGTGA

Protein sequence:

>DPOGS216098-PA
MHSAAMGGGALFGCSSAGHSGINQLGGVYVNGRPLPDSTRQKIVELAHSGARPCDISRILQVSNGCVSKILGRYYETGSIRPRAIGGSKPRVATAEVVSKIAQYKRECPSIFAWEIRDRLLSEGVCTSDNIPSVSSINRVLRNLAAQKEKSSNQQPSNDCSTPVYERLRLLGTPGSAPTWPRSPWPTQIDTRTPPYQLHSLSPGPQAIGCNGTELPVMKKGLKVQWAERALLPPW-