Monarch geneset OGS2.0

DPOGS213641
TranscriptDPOGS213641-TA1407 bp
ProteinDPOGS213641-PA468 aa
Genomic positionDPSCF300165 - 78860-85727
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0045896e-16479.90% 
BombyxBGIBMGA004582-TA2e-12481.92% 
DrosophilaFoxP-PC1e-8458.66% 
EBI UniRef50UniRef50_Q9VH872e-9263.57%CG16899 n=6 Tax=Drosophila RepID=Q9VH87_DROME
NCBI RefSeqXP_001600027.12e-10771.88%PREDICTED: similar to IP01211p [Nasonia vitripennis]
NCBI nr blastpgi|1953886008e-9463.88%GJ23621 [Drosophila virilis]
NCBI nr blastxgi|1953886001e-8864.26%GJ23621 [Drosophila virilis]
Group
Gene OntologyGO:00063552.4e-37regulation of transcription, DNA-dependent
GO:00435652.4e-37sequence-specific DNA binding
GO:00037002.4e-37sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[372-453] IPR0017662.4e-37Transcription factor, fork head
[362-457] IPR0119912.5e-33Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL12394 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213641-TA
ATGAGAAAAATAGAACAGAAGTTGGAGGAGAGCCAGCTCCCTGAGGAAGATCGGATGGCAGGAGATGCGGTCTGGGGAACGATAGGAGGTCCGGAGCCGGCACTGAACTCAAGTCCGGGGGGCGCAGCCAAGCCGCCGCCTCTCAACTCATTCGAGCTGGACCGCTGTTTGGATAGGGAGGAGTCCGGTCGTTGGGGCGAGGCGGCTCGTCGTCGCCGGCGCAGTTCCCCGCGCCGTTCGCCCCCGCCCGGAAATGAACCCCTGTCCCTCGCCAGGGCGTCCGCCCCGGCCTCTCCCCTGGTGACGTCATCTCCATCGTCCCCGTCTCCCGCTCGCTCGCCGCTGCTGGCGCCGGACCTGCTGGCGATGCAGCTCCTCGACCAGCACTCCCAGCTGCAGGCGCTCATGAAGCAGAGACTCTTCCACCAGCACCACCTGCAGAAACAGCACATGTCGTCGGAGGCAGCTAAGCGTCAGTTGGAACAGTCCCGCCTCCAGGACCAGATCAACCTGAACCTTCTCTCTCAGTCTCACCTCCAGCCGCCGGAGACTTCTCCCAGTCTCCAGCAGCAGCAGTTGGTCCAGCAGCTCCAGGCGGTCCAGCGGCAGTATCTCATGCACGCTCCTATGTCCGTCCCACCAAACGCTCCTCCAGACTACGACACGGGCAGCGAGTTGGAGGAGCACCCTCTGTTCGGTCGCGGGGTGTGCAAGTGGCCGGGCTGCGACGCGCTCGCTGAGGACTTCCAAGCCTTCCTCAAACACCTGGAGGCGGCCCACACCCTGGACGACCGGTCGGCGGCGCAGGCGCGGGTCCAGATGCAGGTGGTCGCCCAGCTGGAACTCCAGCTGAGGCGGGAGAGGGACCGCCTGGCCGCCATGATGAGACACCTGCACGCCGCCAGGGACAACCACAACAAGATGCACGTAGTTATCAGTATGTCCGTGTCGCCAGGTCCGGCCAGCGAGGGCTCCTCCCCCGGGCCCGTGCGCCGCCGCGTGTCGGACAAGTCCGGAGTCGCCATCGCTGGAGGTCTTCCGTACATGCTCGAAAGAGCCGGATTAGACGTTCAACAAGAGATACAGAGGAACCGGGAGTTCTACAAGACGGCCGACGTGCGGCCGCCCTTCACCTACGCGAGTCTCATACGGCAGGCCATCATCGAGTCCCCGGACAAACAGCTGACTCTCAACGAGATCTACAACTGGTTCCAGTCCACCTTCTGCTACTTCCGACGGAACGCCGCCACCTGGAAGAACGCCGTCCGCCACAACCTGTCGCTCCACAAGTGCTTCATGAGGGTGGAGAACGTGAAGGGGGCCGTGTGGACCGTGGACGAGGTGGAGTTCTACAAGAGGAGGCCCCAGCGGGCCCACGCCGCCATCCACACCGGGTACTGCTCCTGA

Protein sequence:

>DPOGS213641-PA
MRKIEQKLEESQLPEEDRMAGDAVWGTIGGPEPALNSSPGGAAKPPPLNSFELDRCLDREESGRWGEAARRRRRSSPRRSPPPGNEPLSLARASAPASPLVTSSPSSPSPARSPLLAPDLLAMQLLDQHSQLQALMKQRLFHQHHLQKQHMSSEAAKRQLEQSRLQDQINLNLLSQSHLQPPETSPSLQQQQLVQQLQAVQRQYLMHAPMSVPPNAPPDYDTGSELEEHPLFGRGVCKWPGCDALAEDFQAFLKHLEAAHTLDDRSAAQARVQMQVVAQLELQLRRERDRLAAMMRHLHAARDNHNKMHVVISMSVSPGPASEGSSPGPVRRRVSDKSGVAIAGGLPYMLERAGLDVQQEIQRNREFYKTADVRPPFTYASLIRQAIIESPDKQLTLNEIYNWFQSTFCYFRRNAATWKNAVRHNLSLHKCFMRVENVKGAVWTVDEVEFYKRRPQRAHAAIHTGYCS-