Monarch geneset OGS2.0

DPOGS208609
Transcript: DPOGS208609-TA, 1884 bp
Protein: DPOGS208609-PA, 627 aa
Genomic position: DPSCF300052 + 336882-343228
RNAseq coverage: 30x (Rank: top 76%)
Annotation
Heliconius: HMEL016595  3e-179  89.21%
Bombyx: BGIBMGA013425-TA  8e-168  83.77%
Drosophila: Adf1-PC  1e-08  32.00%
EBI UniRef50: UniRef50_UPI000206035D  5e-10  36.73%  UPI000206035D related cluster n=1 Tax=unknown RepID=UPI000206035D
NCBI RefSeq: XP_001978064.1  1e-10  23.18%  GG17894 [Drosophila erecta]
NCBI nr blastp: gi|345494764  5e-12  26.12%  PREDICTED: hypothetical protein LOC100679762 [Nasonia vitripennis]
NCBI nr blastx: gi|345494764  3e-13  25.47%  PREDICTED: hypothetical protein LOC100679762 [Nasonia vitripennis]
Group
Gene Ontology: GO:0003677  1.5e-06  DNA binding
KEGG pathway:
InterPro domain: [15-95]  IPR006578  6.1e-19  MADF domain
                 [301-337]  IPR004210  1.5e-06  BESS motif
Orthology group: MCL20565  Insect specific

Nucleotide sequence:

>DPOGS208609-TA
ATGAAGGTTAAAGTGAAGGGAGCTAGGTCCAAGAATATGGACAAGCTGATAGCCGCTGTCCAAGCACGCGAATGCTTGTGGGATAAGAGCTATAGAGGTCACAGGAATCGGTTTAAGTTGGAGCGTTATTGGAATGAAGTGGCCGCGGAAGTTGGTACTACAAGTATCAATTGCAGAAAGAGATGGAAGAACCTGAAGGACCAATGTCGGAAAGAGATGAAAAAGAACAACGAATCAGAATGGCCTCACTTCCAGAAATTGAAGTTCATCCACCATCAGTTCCTGGCTGAAGAGGAGGAAGGCGACGAGGACACAACAGATGAAGTCTTTAATGGCTTGGATGAATCTATTAAACGGCCCAAGTTGGGTTACACCATGAGAAAGAAATTGCTTATGAACAAGAGAAGAACAATTGACGTTGATATGGACAAGCTGATAGAACTGGTGCAGGCCAGGGAGATTATATGGAACAGACAGCTGAAGGGACATCACAATTGGTATAAATTGGATGAAAGTTGGAAAGAAATTTCTCAACAACTTGGAGTTACGCGCGATGAAGCCAGGCTTAAATGGAAATATCTACGCGACCAAGCTAGAAAGGAATGTCGTAAGCAAGTATCAGAATGGGAATATCTACCGAAGCTTCAGTTCTTAACAAATCAGTTTAATGACTACGAAGGTCATGACACAACAGACGACCACTTCAACCAGGACTTCGAACCCACGCCTGAACAAAGCAGCAGTTATTACTTGGAACCACCTGCAACAAATGATATAAGTGTCAAAGATGATGAATTCGACGAATTCGATACTAAACCTATCATAATGGAGACAGATTTTTACGATGACGATGATGAAGCCCAAAAAGGGACAGGAAATGTTGGAACTCCTGAACCAGCTAAGGACGAAGATATCGGTTTTTTCAATAGTCTCATACCCCATGTTAAGAAACTACTGCCAGCGAAGAAATTGATGCTGCGGATGAAGATCCAGGAACTGGTTTATAATACAGTGTACAGTATCAATTGCAGAAAGAGATGGAAGAACCTGAAGGACCAATGTCGGAAAGAGATGAAAAAGAACAACGAATCAGAATGGCCTCACTTCCAGAAATTGAAGTTCATCCACCATCAGTTCCTGGCTGAAGAGGAGGAAGGCGACGAGGACACAACAGATGAAGTCTTTAATGGCTTGGATGAATCTATTAAACGGCCCAAGTTGGGTTACACCATGAGAAAGAAATTGCTTATGAACAAGAGAAGAACAATTGACGTTGATATGGACAAGCTGATAGAACTGGTGCAGGCCAGGGAGATTATATGGAACAGACAGCTGAAGGGACATCACAATTGGTATAAATTGGATGAAAGTTGGAAAGAAATTTCTCAACAACTTGGAGTTACGCGCGATGAAGCCAGGCTTAAATGGAAATATCTACGCGACCAAGCTAGAAAAGAATGTCGTAAGCAAGTATCAGAATGGGAATATCTTCCGAAGCTTCAGTTCTTAACAAATCAGTTTAATGACTACGAAGGTCATGACACAACAGACGACCACTTCAACCAGGACTTCGAACCCACGCCTGAACAAAGCAGCAGTTATTACTTGGAACCACCTGCAACAAATGATATAAGTGTCAAAGATGATGAATTCGACGAATTCGATACTAAACCTATCATAATGGAGACAGATTTTTACGATGACGATGATGAAGCCCAAAAAGGGACAGGAAATGTTGGAACTCCTGAACCAGCTAAGGACGAAGATATCGGTTTTTTCAATAGTCTCATACCCCATGTTAAGAAACTACTGCCAGCGAAGAAATTGATGCTGCGGATGAAGATCCAGGAACTGGTTTATAATACAGTGTACAGTGAGACCTAG

Protein sequence:

>DPOGS208609-PA
MKVKVKGARSKNMDKLIAAVQARECLWDKSYRGHRNRFKLERYWNEVAAEVGTTSINCRKRWKNLKDQCRKEMKKNNESEWPHFQKLKFIHHQFLAEEEEGDEDTTDEVFNGLDESIKRPKLGYTMRKKLLMNKRRTIDVDMDKLIELVQAREIIWNRQLKGHHNWYKLDESWKEISQQLGVTRDEARLKWKYLRDQARKECRKQVSEWEYLPKLQFLTNQFNDYEGHDTTDDHFNQDFEPTPEQSSSYYLEPPATNDISVKDDEFDEFDTKPIIMETDFYDDDDEAQKGTGNVGTPEPAKDEDIGFFNSLIPHVKKLLPAKKLMLRMKIQELVYNTVYSINCRKRWKNLKDQCRKEMKKNNESEWPHFQKLKFIHHQFLAEEEEGDEDTTDEVFNGLDESIKRPKLGYTMRKKLLMNKRRTIDVDMDKLIELVQAREIIWNRQLKGHHNWYKLDESWKEISQQLGVTRDEARLKWKYLRDQARKECRKQVSEWEYLPKLQFLTNQFNDYEGHDTTDDHFNQDFEPTPEQSSSYYLEPPATNDISVKDDEFDEFDTKPIIMETDFYDDDDEAQKGTGNVGTPEPAKDEDIGFFNSLIPHVKKLLPAKKLMLRMKIQELVYNTVYSET-
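
The transcript and protein lengths in the record (1884 bp and 627 aa) can be cross-checked with a little arithmetic: 1884 / 3 = 628 codons, one of which is the terminal stop codon. The following is a minimal sketch of that check plus a small translator, assuming the standard genetic code applies to this transcript and that the trailing `-` in the protein sequence denotes the stop codon. The function names `expected_protein_length` and `translate` are hypothetical helpers written for illustration and are not part of the geneset tooling.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
# Assumption: the transcript is translated with the standard table.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def expected_protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


if __name__ == "__main__":
    # Lengths as listed in the record: 1884 bp transcript, 627 aa protein.
    print(expected_protein_length(1884) == 627)
    # First seven codons and last four codons of DPOGS208609-TA.
    print(translate("ATGAAGGTTAAAGTGAAGGGA"))
    print(translate("AGTGAGACCTAG"))
```

Passing the full DPOGS208609-TA sequence to `translate` and comparing the result with DPOGS208609-PA would extend the same check to every residue.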