Monarch geneset OGS2.0

DPOGS207171
TranscriptDPOGS207171-TA1173 bp
ProteinDPOGS207171-PA390 aa
Genomic positionDPSCF300001 + 4732523-4745107
RNAseq coverage113x (Rank: top 59%)
Annotation
HeliconiusHMEL0078074e-8068.59% 
BombyxBGIBMGA000635-TA2e-6278.74% 
Drosophilacroc-PA1e-3147.62% 
EBI UniRef50UniRef50_UPI0002247A0E4e-5147.01%UPI0002247A0E related cluster n=1 Tax=unknown RepID=UPI0002247A0E
NCBI RefSeqXP_001603061.12e-5146.67%PREDICTED: similar to forkhead protein/ forkhead protein domain [Nasonia vitripennis]
NCBI nr blastpgi|3454965911e-5047.01%PREDICTED: hypothetical protein LOC100119258 [Nasonia vitripennis]
NCBI nr blastxgi|3504022523e-5944.53%PREDICTED: hypothetical protein LOC100742971 [Bombus impatiens]
Group
Gene OntologyGO:00063555.3e-55regulation of transcription, DNA-dependent
GO:00435655.3e-55sequence-specific DNA binding
GO:00037005.3e-55sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[79-169] IPR0017665.3e-55Transcription factor, fork head
[73-175] IPR0119914.4e-43Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL17122 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207171-TA
ATGTGTCTCCAGGAGTCAAACACTCCAGACTCCTTCAATCAGCCAGAGATGAAGGAAGCTGAAGATTTCTCTCGGGTCTATCAGACCCTTACCCTCTCTGCGCTTTCCAGAGACGATTCTGCTAATTCACCTACAAGCAGCGAGAATAAGCCCAAGCCAAAGCCTACCCCGGCAGCTTGTCCTGGCAGTTCGCCTGAAATGAACCCACAGTCTACTACACCGTCTTCACAGGCACTCACAAAACCGCCATACTCTTACGTGGCTCTGATTGCTATGGCTATCACCAATAGCCAGAATAAGCGCGCAACCCTAAGTGAAATATACGCTTACATTACCAAAAAATTTCCTTTCTTCGAGAAGGATAAAAAGGGCTGGCAAAATTCAATCAGACACAATCTGAGCCTGAACGAATGCTTCATTAAAGTACGCAGAGAGGGCGGAAGTGAAAGCAAGGGAAATTATTGGACACTTGATCCGCAATGCGGAGACATGTTCGTGAATGGCAACTTCAGGCGGCGACGTCGTATGAAGAGACCATTCAGGGCCGCTCCATATAAGACAATGTTCGACGGCTACGTCGCCCACGGTGGTCAACATCCCCACATGCCCATCCAGCTCGGGCACAGGAACTACTTCGGTTCTAGTACACCCTATCCTCCGTCTTACCCGAGATATGATGCATGGCTGAGTCAGCCGACAGGCGGATTGGGTTACCCTGCTCCGATAGCTCGCAGTCCCCCTGGTTGCTCCCCCCAGGCGTCTAACGTGAACCCCTTCTCCACCCACCAAAACCAAGGACAGTTACAGAGCCCGTTGCAATCCATGCAACCGATGACAATGAATTACAATACGCTCAATGTTGCCGCCATAGGTGAGTTTGATGGCTCTTCTAGTCCTGGATCTGGTTACGCCGCTGGTAGCTTCTCTCCAAATCGTCATCATGATATTGTCACTTTATCTGATGCTGTTTCTCGTTTTTCTTTTTGGCCCGAAGGTGGATCGTCAAGTCCCAACTCTGGATATGTCCCAACTAACTTCTCCCCCCGCAGACATGAAGCTGTCTCCTCCTCCGATGCTGCTGGTCGCTACTCTTTCTGGCCTGACGTTGGTGTTAAAGAAGAATCGTCTTCAAGCCTTGTGGCGAATGGAGGTTATACTAAGTATTTCATGTAA

Protein sequence:

>DPOGS207171-PA
MCLQESNTPDSFNQPEMKEAEDFSRVYQTLTLSALSRDDSANSPTSSENKPKPKPTPAACPGSSPEMNPQSTTPSSQALTKPPYSYVALIAMAITNSQNKRATLSEIYAYITKKFPFFEKDKKGWQNSIRHNLSLNECFIKVRREGGSESKGNYWTLDPQCGDMFVNGNFRRRRRMKRPFRAAPYKTMFDGYVAHGGQHPHMPIQLGHRNYFGSSTPYPPSYPRYDAWLSQPTGGLGYPAPIARSPPGCSPQASNVNPFSTHQNQGQLQSPLQSMQPMTMNYNTLNVAAIGEFDGSSSPGSGYAAGSFSPNRHHDIVTLSDAVSRFSFWPEGGSSSPNSGYVPTNFSPRRHEAVSSSDAAGRYSFWPDVGVKEESSSSLVANGGYTKYFM-