Monarch geneset OGS2.0

DPOGS214573
TranscriptDPOGS214573-TA1158 bp
ProteinDPOGS214573-PA385 aa
Genomic positionDPSCF300050 - 670591-673297
RNAseq coverage16040x (Rank: top 1%)
Annotation
HeliconiusHMEL0091062e-16784.87% 
BombyxBGIBMGA005140-TA3e-13876.71% 
Drosophilakay-PF6e-1832.98% 
EBI UniRef50UniRef50_F4W7931e-4542.42%Transcription factor kayak n=4 Tax=Formicidae RepID=F4W793_ACREC
NCBI RefSeqNP_001164292.12e-4445.57%kayak isoform A [Tribolium castaneum]
NCBI nr blastpgi|3072057834e-5543.70%hypothetical protein EAI_03399 [Harpegnathos saltator]
NCBI nr blastxgi|2828481478e-6844.05%kayak isoform A [Tribolium castaneum]
Group
Gene OntologyGO:00056347.4e-18nucleus
GO:00036777.4e-18DNA binding
GO:00063551.9e-11regulation of transcription, DNA-dependent
GO:00435651.9e-11sequence-specific DNA binding
GO:00037001.9e-11sequence-specific DNA binding transcription factor activity
GO:00469831.7e-08protein dimerization activity
KEGG pathway 
InterPro domain[132-148] IPR0008377.4e-18Fos transforming protein
[137-201] IPR0048271.9e-11Basic-leucine zipper (bZIP) transcription factor
[137-197] IPR0116161.7e-08bZIP transcription factor, bZIP-1
Orthology groupMCL14739 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214573-TA
ATGCAGAACATCGATCCCCTAGAGTTCGCCAACATGCTGGCCACCGAGCTGTGGTGCCAACAGTTGGCCAACCTAGAGGGCCTCCAGTCGGGGGTCCCAACCCGCACCACGGCGACCATAACCCCCACACAGTTGCGCAACTTCGAACAGACTTACATCGAGTTGAGCAACTGTCGCAGCGAACAGACGAACCACGCGGGCTTCGTACCGCCTTCCGTCACGCAGGCCAACACTTACGGCATCCTGAATTCGGCGGCTTACTGTGACTCGGGCCCGACGACATTGCATGTGTCGCCGGGCCCACTATCAGCGAGCGGCGACAGCAGCAGCAGTCCCGGTCTACCGGCTCCGAAGAGGAGGAACATGGGCGGCAGACGACCAACTAAAGCCCCCCAAGACATCTCTCCAGAAGAAGAGGAACGCAGGAAGATACGCCGCGAGAGAAACAAAATGGCCGCAGCACGCTGTCGCAAACGAAGGTTGGACCATACAAACGAATTGCAAGAGGAAACCGATAAGTTGGAAGAAAAGAAGCAAGCGTTACAGGATGAGATCCGCAAGCTGAGCTCGGACAGGGATTCGCTACAGGCTCTACTCCAGAATCATATGCACAGCGGGTGCAGATTGAATAAGCGATCTACCAGCCCGCCGGATGTAAAGCCATTCCAGGACTCGTACGACTACCAGGAGATGACCGGCCAGGGGGTCAGGGTCAAGGAGGAGGTGATGGACCCCACAGTGGACCCCGTCCTAGGGTTGGATAACGAAATATTCACCTCGCCGACGCCGGACAAAAGGATAATGCTGTCGGCGGCTAATCCGGCCGTGTTAACGGGCACCTCGCTGGACACACCCCCCGTCCGGCCCTCCAGACCCAGCTTCCTCCAAGTACCCCACACACTCACACCCGCACAGATCCACAACAACAAACTGAGCAACAACAACAAAATCCCCGGCATCGAGATCAGCACGCCAAGCAACGGTATACCATTCAACTTCGACAGCCTCATGGAAGGCGGGACGGGGCTGACACCCGTGCACCCTCATCCGTTCGCACACTCTCACCCACACGCGCACCCTCACCCATGCGCACAGCAGCAGCGCGCCGCCCCCGACCTCGCCTCGCCGGATGCACAGAACAGCCTCGTCAGCCTCTGA

Protein sequence:

>DPOGS214573-PA
MQNIDPLEFANMLATELWCQQLANLEGLQSGVPTRTTATITPTQLRNFEQTYIELSNCRSEQTNHAGFVPPSVTQANTYGILNSAAYCDSGPTTLHVSPGPLSASGDSSSSPGLPAPKRRNMGGRRPTKAPQDISPEEEERRKIRRERNKMAAARCRKRRLDHTNELQEETDKLEEKKQALQDEIRKLSSDRDSLQALLQNHMHSGCRLNKRSTSPPDVKPFQDSYDYQEMTGQGVRVKEEVMDPTVDPVLGLDNEIFTSPTPDKRIMLSAANPAVLTGTSLDTPPVRPSRPSFLQVPHTLTPAQIHNNKLSNNNKIPGIEISTPSNGIPFNFDSLMEGGTGLTPVHPHPFAHSHPHAHPHPCAQQQRAAPDLASPDAQNSLVSL-