Monarch geneset OGS2.0

DPOGS203265
TranscriptDPOGS203265-TA1647 bp
ProteinDPOGS203265-PA548 aa
Genomic positionDPSCF300229 + 34847-38644
RNAseq coverage386x (Rank: top 31%)
Annotation
HeliconiusHMEL0102145e-2458.02% 
BombyxBGIBMGA000445-TA1e-13068.80% 
Drosophilajumu-PA7e-6744.79% 
EBI UniRef50UniRef50_D6X3302e-8038.74%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X330_TRICA
NCBI RefSeqXP_001843251.15e-8845.23%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700307501e-8645.23%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|3504217614e-8640.34%PREDICTED: hypothetical protein LOC100748977 [Bombus impatiens]
Group
Gene OntologyGO:00063551.3e-42regulation of transcription, DNA-dependent
GO:00435651.3e-42sequence-specific DNA binding
GO:00037001.3e-42sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[298-391] IPR0017661.3e-42Transcription factor, fork head
[295-396] IPR0119917.1e-34Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL34488 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203265-TA
ATGGATCTGTACATCACTGACTCACTCCAGGACATGCTGGATATGGACATCAAAAACGAAATAGCGACAGACTTGAGCAGTATAACAGATTTTTCTGATTCATTAGGATTAAATTTCTCAGAAATGCCACCGTTATTAGACATGGAAACAGATAATTCTGTGACGTGGCTGAATAACTCTTCGAGTTTCGTACACAATCTCGATCTGTATGGATCGGAAGCGAACGCTGTCATGGTCAATCCGAACTCCGTAATGCCGTCGACGTTCGCTGAAACTCCAGTCAAAAGTATTGTAAAAGAAGAGGCTTCACATTTGCTGCTCACTTCAGCTGCGAATAACGATCTCACCAATAACACGTCGCTATCGAGTCCTAAAGAGGAAAAAAGTCATTTAACATTCTCACCGAACGCCATCAAGGTGGCTAAAGTACAGGAATCGGAAGACACAAAAAACAAGAAGCCGATGGAGGAGGCGACGCAGATGGTCATTTATGTGCGTAAACAAGACAAGACTGTAGTCAAAGATTTATTAAAGGATTTAGACACGAATAAAACTAAGAGTTCAACCTTAACGCCCACCGTCAGAATAAAGTCGAGTCAACAGGAAGTATTAAAAATAAATAATAAGAACTGTTCAGTTTTAAACACGAACCAAAAACTATCACAGTCTTTAGGCACGAAAACTATTATATCCGGTAATATACACATATTGGACGCACAGCAATCTAGGACAATTTTAGCTAATGGTAACAAACAAGCTACGATATTAATTGATAATTCATCACTAAACAATAGCAGGCAAATAATTAAGACCTCAGTAACTGGTGCATTTACAGTAGACACTAGCCAAGCTAAGTACGTAAATAATTCAAGTAAAACAGTCGCGGGAGAGTTTCCAAAGCCGGCCTATTCGTATTCATGTTTAATAGCAATGGCGCTAAAAAACTCGAGAACGGGAAGCTTACCAGTGTCAGAGATTTATAATTTTATGTGTCAACATTTCCCCTATTTCAAAACCGCACCAAACGGTTGGAAGAATTCCGTAAGGCATAATCTAAGCTTAAACAAATGTTTCGAAAAGATTGAAAAACCATCGACGAATGGAAGTCAACGGAAAGGCTGTCTATGGGCTATGAATCCATCGAAAGTCGGCAAAATGGATGAAGAAGTCCAGAAATGGTCCAGGAAGGATCCTCAGGCAATCAAGAAAGCTATGATTTATCCAGAGACCTTGGAAGCGTTGGAACGCGGAGAGATGAAGTACAGCGGGTTCGGCAGCGACAATGACGCGGACGAAGATAATGATAACGATAATGACACAGAGGACTTGGACTTGGAGATAGACCCTGAGGAATCAGATCAAGAATTGGAAGTAGAAGAAGTGGAAGGAACGGGCATGGTTGGGGCTTACCGCGTACTGGCTCCAGGGCTATATGGAGACCTGAGTGATGTTGAGGTATTGGATCAGTCGTACGAAGAGATCGACATCGATACTAAACCAGTAAAATTAGACCTATCTGTTACCGAAAATTATACGATCCATTCCGCTAAACGGGCAAAGACGAGCTTCATATACCAGCCGGTGACGTCACAGACGCACACGAGTCGAAGAAAGACGCCGCTCGTCAACAGAATAGCGTTAGTTTAA

Protein sequence:

>DPOGS203265-PA
MDLYITDSLQDMLDMDIKNEIATDLSSITDFSDSLGLNFSEMPPLLDMETDNSVTWLNNSSSFVHNLDLYGSEANAVMVNPNSVMPSTFAETPVKSIVKEEASHLLLTSAANNDLTNNTSLSSPKEEKSHLTFSPNAIKVAKVQESEDTKNKKPMEEATQMVIYVRKQDKTVVKDLLKDLDTNKTKSSTLTPTVRIKSSQQEVLKINNKNCSVLNTNQKLSQSLGTKTIISGNIHILDAQQSRTILANGNKQATILIDNSSLNNSRQIIKTSVTGAFTVDTSQAKYVNNSSKTVAGEFPKPAYSYSCLIAMALKNSRTGSLPVSEIYNFMCQHFPYFKTAPNGWKNSVRHNLSLNKCFEKIEKPSTNGSQRKGCLWAMNPSKVGKMDEEVQKWSRKDPQAIKKAMIYPETLEALERGEMKYSGFGSDNDADEDNDNDNDTEDLDLEIDPEESDQELEVEEVEGTGMVGAYRVLAPGLYGDLSDVEVLDQSYEEIDIDTKPVKLDLSVTENYTIHSAKRAKTSFIYQPVTSQTHTSRRKTPLVNRIALV-