Monarch geneset OGS2.0

DPOGS214188
TranscriptDPOGS214188-TA1095 bp
ProteinDPOGS214188-PA364 aa
Genomic positionDPSCF300014 - 15523-21894
RNAseq coverage394x (Rank: top 31%)
Annotation
HeliconiusHMEL0023042e-14175.30% 
BombyxBGIBMGA006232-TA4e-9460.90% 
DrosophilaDp-PA7e-4459.86% 
EBI UniRef50UniRef50_F4WEU71e-6347.35%Transcription factor Dp-1 n=10 Tax=Pancrustacea RepID=F4WEU7_ACREC
NCBI RefSeqXP_001605304.13e-6446.71%PREDICTED: similar to transcription factor Dp-2 (E2F dimerization partner 2) [Nasonia vitripennis]
NCBI nr blastpgi|3407094156e-6447.02%PREDICTED: transcription factor Dp-1-like [Bombus terrestris]
NCBI nr blastxgi|3072132121e-6447.00%Transcription factor Dp-1 [Harpegnathos saltator]
Group
Gene OntologyGO:00063551.5e-27regulation of transcription, DNA-dependent
GO:00056671.5e-27transcription factor complex
GO:00037001.5e-27sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[19-319] IPR0156482.4e-82Transcription factor DP
[173-265] IPR0119913.8e-40Winged helix-turn-helix transcription repressor DNA-binding
[179-258] IPR0033161.5e-27Transcription factor E2F/dimerisation partner (TDP)
[265-312] IPR0148892.7e-11Transcription factor DP, C-terminal
Orthology groupMCL13627 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214188-TA
ATGTCCCAGAATACTAGTATAGTAAATTTCCTAATTCACGACGCTAATGGACAGCCCCAAATGGTTAAAGTAGTTCAAAATACAAATACCCAACCCATTGCTCAGTTAAAATTAAAACAAAATCCTGTGAAGGTTTTTAAATTATCTTCCTCAGTTGAAGGAAATACTCAGCCTCCTGTTCTGCAACCAATTCAGGTTACAGGCATCAAATGTGCTTCAAAATTGGTACAAGTACCTGTTGATACTTTAGCAAGGTTGCAAAGAATTAAATCTGAACCTGAATCAAGTGTGAATATTGAAAATAATATTGAGAATCAAAGGCAGTTAGGTGAGGTTAATGTTATACAATCCATACCACCAGAACTGGTGCCTATAGTAACAAAATCTGAGCCTGAGGATACTGTAGAGGGACACAAAGAGAGGTTCACATCTAGTAATTCTATAATGTCTCATGCTTCACGGAGAAGACATGATTCTGACAATGATCCACCTGCAGAATACACGACGAAAAGACGTAAGCATGCAGATAAAGTGGGCAAAGGTCTTAGGCATTTTTCAATGAAAGTATGTGAAAAAGTAAGAAACAAAGGATTTACTTCATACAATGAAGTGGCTGATGAATTAGTTTTAGAATTTGCAGCAGGAATGCACGGTTCAGCTGATAGCCAACAATACGATCAAAAGAATATTAGGAGAAGAGTATATGACGCGCTCAATGTACTAATGGCCATGAATATTATATCAAAGGAGAAAAAAGAAATTCGATGGTTAGGGCTTCCAACTAACTCGGTGCAGGAATGTTCGGCTCTTGAAAAAGAAAAACAAACTAAAGTGGAGCAAATACAGAAGAAAACACAGCAATTACAAGAACTTATATTACAGCATATATCATTCAAAAGTTTAATAGAAAGAAATAAAGAAGCTGAAAATAAAGAAATGGGTAGGGGAACGAGTAATGCTCTTGCTGATATCCTTGACGATGAAGATGTAGATGAACTTGGTGCCGATGAAGAGGCATTAGAGGGTGAAGCAGAAGGAGAGGAGCAAGAAGATGGTAATGATTATAGTGAAGACAGCAGTGATGTGGATGTCTAG

Protein sequence:

>DPOGS214188-PA
MSQNTSIVNFLIHDANGQPQMVKVVQNTNTQPIAQLKLKQNPVKVFKLSSSVEGNTQPPVLQPIQVTGIKCASKLVQVPVDTLARLQRIKSEPESSVNIENNIENQRQLGEVNVIQSIPPELVPIVTKSEPEDTVEGHKERFTSSNSIMSHASRRRHDSDNDPPAEYTTKRRKHADKVGKGLRHFSMKVCEKVRNKGFTSYNEVADELVLEFAAGMHGSADSQQYDQKNIRRRVYDALNVLMAMNIISKEKKEIRWLGLPTNSVQECSALEKEKQTKVEQIQKKTQQLQELILQHISFKSLIERNKEAENKEMGRGTSNALADILDDEDVDELGADEEALEGEAEGEEQEDGNDYSEDSSDVDV-