Monarch geneset OGS2.0

DPOGS202627
Transcript: DPOGS202627-TA | 1443 bp
Protein: DPOGS202627-PA | 480 aa
Genomic position: DPSCF300371 - 49574-62150
RNAseq coverage: 316x (Rank: top 36%)
Annotation
Heliconius: HMEL010214 | 5e-174 | 88.76%
Bombyx: BGIBMGA008311-TA | 1e-104 | 65.76%
Drosophila: CHES-1-like-PB | 1e-40 | 69.52%
EBI UniRef50: UniRef50_E2AJK1 | 3e-53 | 47.95% | Forkhead box protein N3 n=2 Tax=Formicidae RepID=E2AJK1_CAMFO
NCBI RefSeq: XP_625198.2 | 3e-57 | 44.88% | PREDICTED: similar to checkpoint suppressor 1 [Apis mellifera]
NCBI nr blastp: gi|307191795 | 6e-53 | 36.03% | Forkhead box protein N3 [Harpegnathos saltator]
NCBI nr blastx: gi|307191795 | 1e-52 | 35.86% | Forkhead box protein N3 [Harpegnathos saltator]
Group
Gene Ontology: GO:0006355 | 1.6e-43 | regulation of transcription, DNA-dependent
               GO:0043565 | 1.6e-43 | sequence-specific DNA binding
               GO:0003700 | 1.6e-43 | sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain: [283-372] | IPR001766 | 1.6e-43 | Transcription factor, fork head
                 [280-367] | IPR011991 | 4.5e-34 | Winged helix-turn-helix transcription repressor DNA-binding
Orthology group: MCL17828 | Insect specific

Nucleotide sequence:

>DPOGS202627-TA
ATGGCGCCAATGCGCGGCGAACGCGGCGGTTCACCGGCGAGGGGTGCTCGAGCGTCCCTGCCCCTGCCGCGGCGTCTGATCAGACAGATAGCGAGACATGTGGCGTCCAGCGGGAGGTCGCTTAGACGGCTGGAGGTCTCCGGCAGGGTTATTGACAACTTTGATGACATCGATGATGGATCCGATGAATCCGACATGGAGACACAGACGGCCGTTGCCTGCGCCGGTAACTCTGGTAATTTACCACCAGCTGATCACGATGACGATCTCACCAGTCTCAGTTGGTTGCAGGACAAGAATTTACTGAGCGGAATAAATCTAACTAAAAGTGACATAGAGGACAGCAAACTGATCAGCAGTCAGGTGGTTGTGAAAATGGCGCCAATGCGCGGCGAACGCGGCGGTTCACCGGCGAGGGGTGCTCGAGCGTCCCTGCCCCTGCCGCGGCGTCTGATCAGACAGATAGCGAGACATGTGGCGTCCAGCGGGAGGTCGCTTAGACGGCTGGAGGTCTCCGGCAGGGTTATTGACAACTTTGATGACATCGATGATGGATCCGATGAATCCGACATGGAGACACAGACGGCCGTTGCCTGCGCCGGTAACTCTGGTAATTTACCACCAGCTGATCATGATGACGATCTCACCAGTCTCAGTTGGTTGCAGGACAAGAATTTACTGAGCGGAATAAATCTAACTAAAAGTGACATAGAGGACAGCAAACTGATCAGCAGTCAGGTGGTTGTGAAAACAGAGCCATCTATAACTCCGCCGCCGTCGTCGCGAGTCCCCTCCCCTCCCCGCACCCCCTGTAAAGCTCCCCCCTCCCCTCCCGCCGTGAACACCCACACCAAGCCTCCCTACTCCTTCTCCTGCCTCATCTTCATGGCGATAGAAGCCGCCCCAGCCAGGGCTCTCCCAGTTAAGGAGATATACGCCTGGATAGTCAGACACTTCCCATACTTCAAGCACGCACCACAGGGCTGGAAGAACAGCGTCAGGCATAATCTATCATTGAACAAATGCTTCCATAAGGTGGCAGCGGCTCCTGGGTTAGGGAAAGGTTCGCTTTGGACTGTCGACCCTCAGCACCGATCGTCTCTTCTTCAGGCCTTCGGTCGCCAGCCTGTGCCTCCGGTGGAGGTGGAGGAGCAAGAGACGTCGACCAGTGTAAAGAACACTCCAGACCCCCAACTGTTCCCGTACCTGGCCCGCAGATTGGCCGACGCCCGTTCCCCTCCCGGCGCCGACGAGTACCTCGCCGCGGCCACTGTACTAGCCATGAAGTATGGACCCGCTGTACTAGACCAGCTCCCTCCGGAGGCTCACCTGGTTATATCTCGGTGTGCTCGTGACGAACACTCGTATAGTGGCGGCGAGGAGCGGCGTACGGCCGAGGCCCTCCTCAACCTGGCCGGGGTCAGACCTCACGCCCCCAGCTAG

Protein sequence:

>DPOGS202627-PA
MAPMRGERGGSPARGARASLPLPRRLIRQIARHVASSGRSLRRLEVSGRVIDNFDDIDDGSDESDMETQTAVACAGNSGNLPPADHDDDLTSLSWLQDKNLLSGINLTKSDIEDSKLISSQVVVKMAPMRGERGGSPARGARASLPLPRRLIRQIARHVASSGRSLRRLEVSGRVIDNFDDIDDGSDESDMETQTAVACAGNSGNLPPADHDDDLTSLSWLQDKNLLSGINLTKSDIEDSKLISSQVVVKTEPSITPPPSSRVPSPPRTPCKAPPSPPAVNTHTKPPYSFSCLIFMAIEAAPARALPVKEIYAWIVRHFPYFKHAPQGWKNSVRHNLSLNKCFHKVAAAPGLGKGSLWTVDPQHRSSLLQAFGRQPVPPVEVEEQETSTSVKNTPDPQLFPYLARRLADARSPPGADEYLAAATVLAMKYGPAVLDQLPPEAHLVISRCARDEHSYSGGEERRTAEALLNLAGVRPHAPS-