Monarch geneset OGS2.0

DPOGS215040
TranscriptDPOGS215040-TA1038 bp
ProteinDPOGS215040-PA345 aa
Genomic positionDPSCF300208 - 512162-513521
RNAseq coverage26x (Rank: top 77%)
Annotation
HeliconiusHMEL0020167e-13772.73% 
BombyxBGIBMGA005669-TA4e-12164.46% 
Drosophilafd68A-PG5e-1438.21% 
EBI UniRef50UniRef50_D6NPF88e-7476.12%Putative forkhead box J protein (Fragment) n=125 Tax=Heliconius RepID=D6NPF8_9NEOP
NCBI RefSeqXP_002423753.14e-3840.22%forkhead protein/ forkhead protein domain, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2951505064e-7476.62%putative forkhead box J protein [Heliconius erato favorinus]
NCBI nr blastxgi|2951506439e-8276.35%putative forkhead box J protein [Heliconius erato emma]
Group
Gene OntologyGO:00063551.4e-18regulation of transcription, DNA-dependent
GO:00435651.4e-18sequence-specific DNA binding
GO:00037001.4e-18sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[157-230] IPR0017661.4e-18Transcription factor, fork head
[148-230] IPR0119912.5e-17Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL17792 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215040-TA
ATGCATATAACGTCGAGCTTCTTTCCGGCGTCTGGCGAGATGCTGGACGCCTCCGTGAATCTTTACGGGGATTCGGGAGATCAGTGCGGGTTTTTCACTTTGGACGAAGCGGTTCCAGAATCTTCGCATACCGTTGAAATTGAATACGTTTATGAACAACCGGATGCTGGCAAAAACGAGGAGTTGCGGACATCAAAAGCTTCCTCGGCCCAACCGAAGAAGACCCAAGCGAAACAAAAGCAGATCAAAGTAGTCGAAGAGGAGTCTGAGACGGATTTGACCAATCTAACATGGTTGCAAAATATCACCAATATCATGGCAATGCCCCAATTTCCGATACCGCCAATGTCACCGAATCCTCAGGTGAAAGTTCAACCTCAGAATACCAGGCTGCAGAAGTTCAACCAAACTATCGCCAAGTGCCAGAAAGACTTCATGGAGAACAAAGAGGAATATCAAAAGAACAGCGATAAGAAGCCTCCATATTCTTACAGCACTTTGATCTGTATGGCCATGCGGTATAATAATGACAAAATGACACTATCCGCGATCTATTCTTGGATTCGTGACAGCTTTAAGTACTATCGGAACGCGGATCCCACTTGGCAGAATTCAAAAGTAGCTCGTTCTAAACACGAGCCGGGTAAAGGTGGATTCTGGAAACTCGATCTGGCTCATTTGGAAGGAACTAAACGTATATCAAACCGTCCCCACAAAAAGAAGAAGAACGAGACCAAAACAGAAGCGAAATTTGACAGGAAGGTGTCCGAAGAGAAGACGGCTGTTGCCAATATACCACAAGTTGATTGCGTCAATATGGAAGATGTCGGGCAAGCGGTGACACTTCATTTGCCTGAATTTAATCTCCCTTTGCCCGATATTGAAATGACAAATGTGGGGGCAAATGTCATTGTGGAGCCTGTAGCGCCGCCCCTGATGCCAGAAGACGACTTGTCTTCTTTGCTATTAAGTCCCACAGATTGGGAAGACCTTCAATTGGACATGTTAGACAATTATCTGGATTCTTGTTTCAAGTAA

Protein sequence:

>DPOGS215040-PA
MHITSSFFPASGEMLDASVNLYGDSGDQCGFFTLDEAVPESSHTVEIEYVYEQPDAGKNEELRTSKASSAQPKKTQAKQKQIKVVEEESETDLTNLTWLQNITNIMAMPQFPIPPMSPNPQVKVQPQNTRLQKFNQTIAKCQKDFMENKEEYQKNSDKKPPYSYSTLICMAMRYNNDKMTLSAIYSWIRDSFKYYRNADPTWQNSKVARSKHEPGKGGFWKLDLAHLEGTKRISNRPHKKKKNETKTEAKFDRKVSEEKTAVANIPQVDCVNMEDVGQAVTLHLPEFNLPLPDIEMTNVGANVIVEPVAPPLMPEDDLSSLLLSPTDWEDLQLDMLDNYLDSCFK-