Monarch geneset OGS2.0

DPOGS209081
TranscriptDPOGS209081-TA1275 bp
ProteinDPOGS209081-PA424 aa
Genomic positionDPSCF300175 + 55035-64723
RNAseq coverage775x (Rank: top 17%)
Annotation
HeliconiusHMEL0056833e-15173.71% 
BombyxBGIBMGA010490-TA1e-2754.35% 
Drosophilafd68A-PG2e-5253.36% 
EBI UniRef50UniRef50_D6WY203e-8449.25%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WY20_TRICA
NCBI RefSeqXP_623740.24e-8851.79%PREDICTED: similar to forkhead box K1 [Apis mellifera]
NCBI nr blastpgi|3407251579e-8752.78%PREDICTED: forkhead box protein K2-like [Bombus terrestris]
NCBI nr blastxgi|3407251572e-9752.35%PREDICTED: forkhead box protein K2-like [Bombus terrestris]
Group
Gene OntologyGO:00063553.7e-49regulation of transcription, DNA-dependent
GO:00435653.7e-49sequence-specific DNA binding
GO:00037003.7e-49sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[142-233] IPR0017663.7e-49Transcription factor, fork head
[136-237] IPR0119912.8e-39Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL11577 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209081-TA
ATGTGGGCTGCTCGCCTGTGCTCTGCAGCTCTTGTGCTAGATGCAAAATGTACTCTCCGTTTTCCGTCTACCAACATCCGCCTGGAGTTTCAGTCGTTGGTTGAGGAGAGTGGCGTCGGGTCTGGGGGAGCAGGCCCACCCCTGCCCCCGCTACGTATTTCCATACCAGTGGACAACGATGGACGGAGTCCTGCGCCCTCGCCGACAGGTACGATAAGCGCCACCAACAGCTGTCCCACGTCGCCCCGGGGCGCCGGCTCCTCCGGCCGACGACACCCGGACCTCGGCCTGGTGGCGCAGTACGCCGCCCTGGCCGACCACCAGCGACCGAACTCGAACGGAACCGCGGCTTCGTCTACGTCGGACTCCGGCTACAGTTCCCGGGACGCCCGTGATGCCAGGGAGCATCGCGAGGGTCGGGACGAGGCTAAACCACCGTACAGCTACGCCCAGCTGATAGTCCAAGCTGTTGCTTCGGCGGCGGACAAGCAGCTCACGCTGAGCGGCATCTACAGCTACATCACCAAGCACTACCCCTACTACCGGACCGCCGACAAGGGCTGGCAGAACTCGATCCGACACAACCTGTCGCTCAACCGTTACTTCATCAAGGTTCCTCGTAGTCAGGAAGAGCCGGGCAAGGGCAGCTTTTGGCGTATCGACCCACAGAGTGAAGGGAAACTCATCGAGCTGGCCTTCAGACCTCGCCGCCCGAGGGGAGTTCAGTTCAGGGCACCCTTCGGACTCTCCTCAAGGAGCGCTCCTACTTCTCCGTCTCAAGTCGGCGTCTCCGGGCTGGTCACGCCTGAGGAGTTGTCGCGAGAACCCACGCCTGACCTCTTCACCGCCGAGGAACATGAACAACAGCAGTCCGGCCAACAACGCTTGTCATCATCGTCACAATATCTGTTCCCGCAGAGAAGTGGGGTCAGTCAGAGCGCGCCCGGATCACCTGGTCACGGCGTGTACGCGGGCGGCAGCGGCTTAGTGATGGCCGGACATCAGATAACGGTTGTCACCAACGGAGCTGGAGGGGAGAGAGAAGAGAAGTACGTGGTGGGCACGTCGGGGGGTGGGCTGGTGTCGATACCCGAGGAGGAGGTCCAGGCTGCCAACCTACTGCTTCATCAGCACTCACCTTACTACGCCGGGTACAGCGGTGACGAGAACTGCGCGCTGGGTGGAGAGTTGGTTATAGAGGAGGCGCCCGACGACCCGCCACACAAGAGACCCAAGCATCATGTGTCCGATATAGAAGATCGACGCGCGTATTGA

Protein sequence:

>DPOGS209081-PA
MWAARLCSAALVLDAKCTLRFPSTNIRLEFQSLVEESGVGSGGAGPPLPPLRISIPVDNDGRSPAPSPTGTISATNSCPTSPRGAGSSGRRHPDLGLVAQYAALADHQRPNSNGTAASSTSDSGYSSRDARDAREHREGRDEAKPPYSYAQLIVQAVASAADKQLTLSGIYSYITKHYPYYRTADKGWQNSIRHNLSLNRYFIKVPRSQEEPGKGSFWRIDPQSEGKLIELAFRPRRPRGVQFRAPFGLSSRSAPTSPSQVGVSGLVTPEELSREPTPDLFTAEEHEQQQSGQQRLSSSSQYLFPQRSGVSQSAPGSPGHGVYAGGSGLVMAGHQITVVTNGAGGEREEKYVVGTSGGGLVSIPEEEVQAANLLLHQHSPYYAGYSGDENCALGGELVIEEAPDDPPHKRPKHHVSDIEDRRAY-