Monarch geneset OGS2.0

DPOGS202854
TranscriptDPOGS202854-TA912 bp
ProteinDPOGS202854-PA303 aa
Genomic positionDPSCF300018 + 1146530-1147441
RNAseq coverage143x (Rank: top 54%)
Annotation
HeliconiusHMEL0026942e-13680.25% 
BombyxBGIBMGA010297-TA1e-12274.05% 
Drosophilaslp2-PA6e-5971.60% 
EBI UniRef50UniRef50_B0XKS36e-6458.75%Fork head domain transcription factor slp2 n=2 Tax=Culicinae RepID=B0XKS3_CULQU
NCBI RefSeqNP_001170880.11e-11273.18%forkfead transcription factor G1 [Bombyx mori]
NCBI nr blastpgi|2944598952e-11173.18%forkfead transcription factor G1 [Bombyx mori]
NCBI nr blastxgi|2944598954e-13077.96%forkfead transcription factor G1 [Bombyx mori]
Group
Gene OntologyGO:00063553.3e-61regulation of transcription, DNA-dependent
GO:00435653.3e-61sequence-specific DNA binding
GO:00037003.3e-61sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[70-160] IPR0017663.3e-61Transcription factor, fork head
[62-165] IPR0119911e-44Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL25728 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202854-TA
ATGGTGAAGTTCGAAGGAGACTTTAGTATTAACGCTATATTAATGAACCACGCGGTGACCAAGCCGCCACCCTCACCGACCACCTCGTCTACGGCGACTCCCTCGGAGGCCGACCTCAGTGACTCGGAGCTAGACGTCACGGGCACCGGCTCCGAGCCCGTTGACTGCTCTAAGCCCAGGATGGAGGACGACAAGAAGGACAAGAAGCATGAGAAACCGGCCTACAGCTACAACGCTCTAATCATGATGGCCATACGGAATAGTCCTGAGAAACGCCTCACGCTCAACGGCATCTACGAGTACATCATGACAAACTTCCCCTACTACAAGGAGAACCGACAGGGCTGGCAGAACTCCATCCGTCACAATTTGAGTCTGAACAAGTGTTTTGTGAAAGTGCCGCGACACTACGACGACCCCGGTAAGGGTAACTACTGGATGTTGGACGCCTCGGCTGAAGATGTCTTCATCGGTGGAACCACAGGCAAGCTTCGCCGGCGCTCGGCTCTCAACGGCAGATCCCGTCTCGCGTGTTTCAAGCGGCCGCTATTCCCCGGCGCCCCGCTCGCCGGTCCATACCCTCCTGCAACATACTCTCAGCTAGTCGGCCTCTACTCACAGCTGCTGTACCAGAGGTATGCGCCGATGCAGATGAAGACTCCTCCGGTGACACCCGGTCCAGTTCATCCGGCGTTCAGAAACGAGATGGCCTACACCAGTCTCCCGTACTCACCTCTGTACGGTGACCGCCTTCCCTCAGGTCCTTTCTGCCAGCCCCTGGTCCACTCTCCCCCGACACACTCACCACCCACATCCGGATCCTCCAGCCCCGAACTGACGTCGCCCTCGCCCTCCCATCACCCTCTCTCGCCTCACATCTTCAAGCCCGTAACTGTTCTCACACGACAGTGA

Protein sequence:

>DPOGS202854-PA
MVKFEGDFSINAILMNHAVTKPPPSPTTSSTATPSEADLSDSELDVTGTGSEPVDCSKPRMEDDKKDKKHEKPAYSYNALIMMAIRNSPEKRLTLNGIYEYIMTNFPYYKENRQGWQNSIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDASAEDVFIGGTTGKLRRRSALNGRSRLACFKRPLFPGAPLAGPYPPATYSQLVGLYSQLLYQRYAPMQMKTPPVTPGPVHPAFRNEMAYTSLPYSPLYGDRLPSGPFCQPLVHSPPTHSPPTSGSSSPELTSPSPSHHPLSPHIFKPVTVLTRQ-