Monarch geneset OGS2.0

DPOGS204182
Transcript: DPOGS204182-TA, 951 bp
Protein: DPOGS204182-PA, 316 aa
Genomic position: DPSCF300034 + 124798-155285
RNAseq coverage: 62x (Rank: top 68%)
Annotation
Heliconius: HMEL009964  2e-67  85.42%
Bombyx: BGIBMGA005108-TA  6e-67  84.72%
Drosophila: dsx-PA  7e-31  92.06%
EBI UniRef50: UniRef50_D6WUC6  1e-50  43.59%  Doublesex n=3 Tax=Endopterygota RepID=D6WUC6_TRICA
NCBI RefSeq: NP_001036871.1  8e-75  50.61%  doublesex isoform F [Bombyx mori]
NCBI nr blastp: gi|302513045  2e-109  72.78%  female-specific doublesex isoform F1 [Antheraea mylitta]
NCBI nr blastx: gi|302513035  2e-124  73.42%  female-specific doublesex isoform F1 [Antheraea assama]
Group
Gene Ontology:
  GO:0007548  2.9e-24  sex differentiation
  GO:0005634  2.9e-24  nucleus
  GO:0003677  2.9e-24  DNA binding
  GO:0006355  2.9e-24  regulation of transcription, DNA-dependent
  GO:0003700  2.9e-24  sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain:
  [223-281]  IPR014932  2.6e-30  Doublesex dimerisation
  [31-84]  IPR001275  2.9e-24  DM DNA-binding
Orthology group: MCL17281  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204182-TA
ATGGTCTCCGTGGGCGCGTGGAGGCGTCGCACTCCCGACGACTGTGAAGACAGGTCGGAACCCGGCGCCTCCAGCTCTGGAGTCCCGCGGGCGCCGCCGAACTGCGCCCGCTGCCGCAACCACAGGCTGAAGATCGAGCTGAAAGGCCACAAGCGCTATTGCAAGTACCGCTACTGTACTTGTGAGAAGTGCCGTCTCACCGCCGACAGGCAGAGGGTTATGGCTCTGCAAACGGCGTTGCGGCGAGCCCAAGCGCAGGACGAAGCGCGGGCTCGGGCTCTGGAGACGGGAATTCAGCCGCCAGGGATAGAATTGGACAGGCCGGAGCCTCCGACAGTGAAGGCTCCTAGGAGTCCCGTGGTGCCGCCACCGGCTGCTATTGACCGTCGATCGCTGGGCTCAGCCAGCTGTGATTCAATTCCCGAATCACCTCCGGGAATGTCGCCGTATTCAGCTCCTCCGGCGTCCGCGCCTCCACAGCCGACCATGCCGCCGCTACTGCCGCCACAGCAACCAGAAGACAATCCCTCTGTATTTATTTATATTCTTGTGAACCGTGAATGGGATAGCTTTAATGTTGATGCGTTCATCCCGCTCCCACCTGACTATCCCGGGATGGATTGGGATCCCCATTGGCCGGGGATTGGGGAGGATTCGTTTTCACTGGAGTCTTTGGTGGAGAACTGTCATAAGCTGCTGGAGATGTTCCACTACTCCTGGGAGATGATGCCACTGGTGCTGGTCATCCTGAACTACGCTGGCAGCGACCTCCAGGAGGCTGCCAGGAAGATCGACGAAGGGAAGATGATTATCAACGAATACGCCAGGAAACATAATCTGAACATATTCGATGGGCACGAGCTACGGAACTCGACTCGACAGAAAATGCTGAGCGAAATTAATAATATAAGTGGTGTACTGTCATCGTCCATGAAGTTGTTTTGCGAATGA

Protein sequence:

>DPOGS204182-PA
MVSVGAWRRRTPDDCEDRSEPGASSSGVPRAPPNCARCRNHRLKIELKGHKRYCKYRYCTCEKCRLTADRQRVMALQTALRRAQAQDEARARALETGIQPPGIELDRPEPPTVKAPRSPVVPPPAAIDRRSLGSASCDSIPESPPGMSPYSAPPASAPPQPTMPPLLPPQQPEDNPSVFIYILVNREWDSFNVDAFIPLPPDYPGMDWDPHWPGIGEDSFSLESLVENCHKLLEMFHYSWEMMPLVLVILNYAGSDLQEAARKIDEGKMIINEYARKHNLNIFDGHELRNSTRQKMLSEINNISGVLSSSMKLFCE-