Monarch geneset OGS2.0

DPOGS203877
TranscriptDPOGS203877-TA1506 bp
ProteinDPOGS203877-PA501 aa
Genomic positionDPSCF300402 - 38435-45742
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0081340.081.99% 
BombyxBGIBMGA003821-TA7e-10792.50% 
Drosophilasim-PA5e-15071.39% 
EBI UniRef50UniRef50_Q7QE883e-15869.48%AGAP000773-PA n=22 Tax=Bilateria RepID=Q7QE88_ANOGA
NCBI RefSeqXP_967930.22e-17061.24%PREDICTED: similar to Single minded [Tribolium castaneum]
NCBI nr blastpgi|1892417103e-16961.24%PREDICTED: similar to Single minded [Tribolium castaneum]
NCBI nr blastxgi|1892417101e-17362.52%PREDICTED: similar to Single minded [Tribolium castaneum]
Group
Gene OntologyGO:00055156.2e-18protein binding
GO:00071652e-13signal transduction
GO:00048712e-13signal transducer activity
GO:00063558.3e-12regulation of transcription, DNA-dependent
GO:00056345.5e-07nucleus
GO:00037005.5e-07sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[251-337] IPR0136556.2e-18PAS fold-3
[80-146] IPR0000142e-13PAS
[82-142] IPR0137678.3e-12PAS fold
[15-30] IPR0010675.5e-07Nuclear translocator
[2-58] IPR0115984.2e-06Helix-loop-helix DNA-binding
Orthology groupMCL10938 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203877-TA
ATGAAGGAGAAGAGCAAGAACGCGGCACGTTCGAGGAGGGAGAAGGAAAACGCTGAGTTCCTCGAACTAGCTAAACTGTTACCACTACCATCAGCCATCACCTCACAGCTGGACAAGGCGTCGGTGATACGGCTCACCACAAGTTACCTGAAGATGAGGCAGGTCTTCCCTGATGGTCTGGGAGACGCCTGGGGCGCCGCCCCTCCTCCACCACAGCCCAGGGAACTCTCAATACGAGAGCTGGGATCCCATCTCCTGCAGACCCTCGATGGGTTTATATTCGTGGTGTCACCAGATGGAAAGATTATGTACATAAGTGAGACGGCGTCCGTTCATCTCGGACTTAGTCAGGTGGAATTGACCGGGAACTCTATATACGAGTACATCCACCAAGCTGATCACGAGGAGATGTCCGCGGTGCTCAGCCTTCAGCATCCGCACACGTATGCTGGACCGCCGGCCGTTGGGTATCCTGTAGGTGGTACCTGGAGTCCCAACGTGGACGTGGAGTGTGAGAGAGCCTTCTTCATCAGGATGAAGTGCGTCCTCGCTAAGAGGAACGCTGGCCTCACCACGTCAGGGTATAAGGTCATCCACTGTTCTGGATACCTCCGCGCCCGCCGCTTCGGCGACGGCACGGCTCCTCTCGGGCTGGTCGCCGTCGGCCACTCCCTCCCGCCGTCAGCCGTCACCGAGCTGAAGCTCCACTCCAACATGTTCATGTTCCGCGCCTCGCTGGACATGAGGCTCATCTTCCTGGACGCCAGGGTGGCGTCTCTCACCGGCTACGAGCCTCAGGACCTCATCGAGAAGACCCTGTACCACTACATCCACGGCACGGACGTGCTGCACATGAGATACTCGCACTGCACGCTGCTGACCAAGGGCCAGGTGACGTCGCGCTACTACCGCTTCCTGACCAAGTCCGGCGGCTGGGTGTGGATGCAGAGCTACGCCACCATCGTGCACAACTCCCGGTCCTCGCGCCCGCACTGCATCGTGTCCGTCAACTACGTGCTCAGCGACGTGGAGGAGAAGAACCTCGTCCTCAACATAGAGCAGGGCCCGCCCAAGGCGAGCCCCGAGCCGCAGCCGCCCGCCGCCAAGGCGCCGCACCCCGCGGGCGAGGACTTCGGCGACGGCTACGGCTATCCCGAGTACAGCCTGCCGGTCATACCCTCGTACGACGCGCACGAGGACTACCAGAACGGCTACCAGGAGATGTTCTACGAGAACTACGCGGAACCGGAGGTGGTCAACTACGTCTACCCTCAGAACCAGCGGCCGTTCTCGGCGAGCTCGTCCTCCTGCAGCTCGGTGGAGAGCTCGGAGGTCAACCAGTACAACTACACCAACCTCATCTCGTTCTACGGACACGGCGCCCAGGGCCAGAGGCAGGCGGAGGGCTTCAGCAGCTTCGCCAAGAACCCGAGCGCCGCGCCGGACGGGTTCGCCGGCGTCATCGTGGACAACACGCAGTTCCACAGCAACGAGTACGTGCACTGA

Protein sequence:

>DPOGS203877-PA
MKEKSKNAARSRREKENAEFLELAKLLPLPSAITSQLDKASVIRLTTSYLKMRQVFPDGLGDAWGAAPPPPQPRELSIRELGSHLLQTLDGFIFVVSPDGKIMYISETASVHLGLSQVELTGNSIYEYIHQADHEEMSAVLSLQHPHTYAGPPAVGYPVGGTWSPNVDVECERAFFIRMKCVLAKRNAGLTTSGYKVIHCSGYLRARRFGDGTAPLGLVAVGHSLPPSAVTELKLHSNMFMFRASLDMRLIFLDARVASLTGYEPQDLIEKTLYHYIHGTDVLHMRYSHCTLLTKGQVTSRYYRFLTKSGGWVWMQSYATIVHNSRSSRPHCIVSVNYVLSDVEEKNLVLNIEQGPPKASPEPQPPAAKAPHPAGEDFGDGYGYPEYSLPVIPSYDAHEDYQNGYQEMFYENYAEPEVVNYVYPQNQRPFSASSSSCSSVESSEVNQYNYTNLISFYGHGAQGQRQAEGFSSFAKNPSAAPDGFAGVIVDNTQFHSNEYVH-