Monarch geneset OGS2.0

DPOGS214084
Transcript: DPOGS214084-TA (1707 bp)
Protein: DPOGS214084-PA (568 aa)
Genomic position: DPSCF300014, 2405054-2429014
RNAseq coverage: 505x (Rank: top 25%)
Annotation
Heliconius       HMEL002726         1e-121   87.12%
Bombyx           BGIBMGA005253-TA   2e-82    87.69%
Drosophila       pdm3-PD            4e-119   76.41%
EBI UniRef50     UniRef50_F4X5B6    2e-155   54.16%   POU domain, class 6, transcription factor 2 n=10 Tax=Coelomata RepID=F4X5B6_ACREC
NCBI RefSeq      XP_391982.3        4e-143   53.42%   PREDICTED: similar to CG11641-PA [Apis mellifera]
NCBI nr blastp   gi|307170609       2e-162   58.80%   POU domain, class 6, transcription factor 2 [Camponotus floridanus]
NCBI nr blastx   gi|322801731       6e-161   57.62%   hypothetical protein SINV_00668 [Solenopsis invicta]
Group
Gene Ontology
GO:0006355   6.2e-27   regulation of transcription, DNA-dependent
GO:0003700   6.2e-27   sequence-specific DNA binding transcription factor activity
GO:0005515   3.1e-21   protein binding
GO:0003677   8.2e-21   DNA binding
GO:0043565   1.1e-17   sequence-specific DNA binding
KEGG pathway 
InterPro domain
[388-480]   IPR000327   6.2e-27   POU-specific
[407-424]   IPR013847   3.1e-21   POU
[393-480]   IPR010982   8.2e-21   Lambda repressor-like, DNA-binding
[501-563]   IPR001356   1.1e-17   Homeobox
[490-558]   IPR012287   6.1e-17   Homeodomain-related
[500-568]   IPR009057   1.7e-16   Homeodomain-like
Orthology group: MCL14996 (Insect specific)
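
The InterPro coordinates listed in this record can be sanity-checked against the protein length. The sketch below is illustrative only: it assumes the bracketed ranges are 1-based and inclusive (the record does not state the convention), and the names DOMAINS, span_length and overlaps are hypothetical helpers, not part of any database API.

```python
# Values copied from this record: protein length and InterPro hits.
PROTEIN_LENGTH = 568

DOMAINS = [
    (388, 480, "IPR000327", "POU-specific"),
    (407, 424, "IPR013847", "POU"),
    (393, 480, "IPR010982", "Lambda repressor-like, DNA-binding"),
    (501, 563, "IPR001356", "Homeobox"),
    (490, 558, "IPR012287", "Homeodomain-related"),
    (500, 568, "IPR009057", "Homeodomain-like"),
]


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def overlaps(a, b):
    """True if two (start, end) inclusive intervals share a residue."""
    return a[0] <= b[1] and b[0] <= a[1]


for start, end, accession, name in DOMAINS:
    # Every hit should fall inside the 568 aa protein.
    assert 1 <= start <= end <= PROTEIN_LENGTH, accession
    print(f"{accession}  {start}-{end}  {span_length(start, end)} aa  {name}")
```

Under that assumption, the POU-specific hit (388-480) and the Homeobox hit (501-563) do not overlap, which fits a POU protein carrying a POU-specific region followed by a homeodomain.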

Nucleotide sequence:

>DPOGS214084-TA
ATGACATTGTCGTTTCAAGCGGAGTCAGACCGTGAGAGCGGCGCTAGTTCGCCGGAGGGTGCGCGGGCGCACGCCGACCAGCGCTCGCCGTCGCCGCGCTCGCGAACGCCGCACCAGCACATGAATGGCTCCATGGCATCGATGTTCCAAAATCTGCAAAATTTGGCGAACATGCAACAGAGCATGCCGCTGTCGCAGCAACAAATGTCGCAACAAATGTCACAACAAATGTCGCAGCTGGCTGCCAACCTGCAGGGTCTCACCTCAATGCCATCCAACCCCGTCATCAACTCGCCTCTCAACCTGAGTGTCAGTGCCCCAGGCATGGGATCTCCTACCCCAGTTAACAGCAGTATGCTGCCGCCGGCTATGCCATCACCTATGCCGCAGCTCATCCTGGCCTCTGGACAGCTAGTACAGGGCATACAGGGTGCACAACTGCTGATACCTACTTCTCAAGGTATAGCGACACAAACAATTCTCACCATACCCGTAAATCACGTGAACTCCAACGATCAAATGGTAAATCTCGCTCTGAACAATGGCCAAGTGGTATCCACATCTCTGGCCAATTTACAAGCGATGGCCCAACCCCACCAACTACTAAACTCCAACCCGCAACAAACGTCCAACATTCGGCCGAACATGCTAAATCCAACACTATCGAACGCGCTCCTCAATCCGGGACTGCCAAACTTTTTATCCAACGGAGCGACAAATGCGCAAGAACTGCTGCAAGCATTACAACAGCCGCAAGGGAATCACAATCTCCTACAGACAGTTCAACAAAACAATATGCCACAACAAATGCAAGGCCGAAGATCGTCCTCCCCACGACCTGACAGACATTACAAGGAGAGGGAAAGCTTCGAGCGGTTCGCAGGAGGATCGAGGGAAAGGAACGAGAGAGAAAACTCGGGAGCGGCAGCCCTGAATAGTATTAATAGGCTCGCAGCATCCAACGGCGAGATTACCATAACAACGTCCCATTCAACAGCGGGTACAACAAGCAGTGCGGGTAGTGTTGGCAGTGCGCCTACAGCATCACCTCACGCTCCCGTCAAGCTCTCACCAAGCTCTGTCAAGTCACCAGCACATGACGAGGACCTGTTGGCCGATTCACCTAATCAGCCAACTATAAGTCAGTCGACGGGCAACGTTGTTGATGGAATCAATCTAGAAGACATCAAGGAGTTCGCAAAGGCATTCAAATTACGACGACTAGGCCTAGGGCTGACGCAGACCCAGGTCGGACAAGCGCTTTCCGTCACCGAAGGGCCCGCTTACAGTCAGAGCGCCATTTGCAGTGCCCTGGCTTCGCAGATGCTAGCAGCTCAGCTGTCTTCACAGCAACAAAACATATTTGAGAAATTGGATATAACTCCAAAAAGTGCGCAGAAAATCAAACCGGTGCTTGAACGTTGGATGAAGGAAGCTGAAGAGAGGTACGCGTCCGGTCAGAACCATCTAACGGATTTCATAGGCATGGAGCCGAGCAAGAAACGCAAACGACGGACGTCCTTCACGCCGCAGGCTCTCGAACTACTCAACGCTCACTTCGAACGAAACACGCACCCATCTGGAACAGAAATAACCGGTCTGGCTCACCAGCTCGGCTACGAGCGGGAGGTCATCAGAATATGGTTCTGCAACAAACGACAGGCTTTAAAAAACACCGTGCGAATGATGTCCAAAGGGATGGTCTAA

Protein sequence:

>DPOGS214084-PA
MTLSFQAESDRESGASSPEGARAHADQRSPSPRSRTPHQHMNGSMASMFQNLQNLANMQQSMPLSQQQMSQQMSQQMSQLAANLQGLTSMPSNPVINSPLNLSVSAPGMGSPTPVNSSMLPPAMPSPMPQLILASGQLVQGIQGAQLLIPTSQGIATQTILTIPVNHVNSNDQMVNLALNNGQVVSTSLANLQAMAQPHQLLNSNPQQTSNIRPNMLNPTLSNALLNPGLPNFLSNGATNAQELLQALQQPQGNHNLLQTVQQNNMPQQMQGRRSSSPRPDRHYKERESFERFAGGSRERNERENSGAAALNSINRLAASNGEITITTSHSTAGTTSSAGSVGSAPTASPHAPVKLSPSSVKSPAHDEDLLADSPNQPTISQSTGNVVDGINLEDIKEFAKAFKLRRLGLGLTQTQVGQALSVTEGPAYSQSAICSALASQMLAAQLSSQQQNIFEKLDITPKSAQKIKPVLERWMKEAEERYASGQNHLTDFIGMEPSKKRKRRTSFTPQALELLNAHFERNTHPSGTEITGLAHQLGYEREVIRIWFCNKRQALKNTVRMMSKGMV-
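
The transcript and protein entries above can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon; translate and expected_cds_length are hypothetical helper names. Only the first and last few codons of DPOGS214084-TA are used here, copied from the nucleotide sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stop codons are written as stop_symbol."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length):
    """Nucleotides needed for a protein plus one stop codon."""
    return 3 * (protein_length + 1)


first_codons = "ATGACATTGTCGTTTCAAGCGGAGTCAGAC"  # first 30 nt of the transcript
last_codons = "AAAGGGATGGTCTAA"                 # last 15 nt of the transcript

print(translate(first_codons))
print(translate(last_codons))
print(expected_cds_length(568))
```

The length check agrees with the record: a 568 aa protein plus one stop codon corresponds to the 1707 bp reported for the transcript.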