Monarch geneset OGS2.0

DPOGS208840
TranscriptDPOGS208840-TA1122 bp
ProteinDPOGS208840-PA373 aa
Genomic positionDPSCF300036 + 811730-814745
RNAseq coverage1691x (Rank: top 8%)
Annotation
HeliconiusHMEL0154307e-16789.66% 
BombyxBGIBMGA007946-TA0.083.89% 
Drosophilaexd-PA1e-17986.28% 
EBI UniRef50UniRef50_P404261e-13571.15%Pre-B-cell leukemia transcription factor 3 n=276 Tax=Metazoa RepID=PBX3_HUMAN
NCBI RefSeqXP_002100763.18e-18086.51%GE17244 [Drosophila yakuba]
NCBI nr blastpgi|1954791022e-17886.51%GE17244 [Drosophila yakuba]
NCBI nr blastxgi|1954791024e-17886.77%GE17244 [Drosophila yakuba]
Group
Gene OntologyGO:00056347.6e-102nucleus
GO:00037007.6e-102sequence-specific DNA binding transcription factor activity
GO:00036773.2e-29DNA binding
GO:00063553.2e-29regulation of transcription, DNA-dependent
GO:00055151.4e-19protein binding
GO:00435655.7e-19sequence-specific DNA binding
KEGG pathway 
InterPro domain[36-234] IPR0055427.6e-102PBX
[235-305] IPR0122873.2e-29Homeodomain-related
[219-295] IPR0090571.4e-19Homeodomain-like
[236-295] IPR0013565.7e-19Homeobox
Orthology groupMCL11422 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208840-TA
ATGGACGATCCGAATAGAATGATGGCGCACAGCGGTGGTCTCATGGGACCCCAAGGCTATGGTCTTCCTGGCGGCGATGGGGCGCCCGCCACTGGTGAAGGCGAAGCCCGAAAACAAGACATCGGGGAAATTTTACAACAAATCATGAATATTACCGATCAAAGTCTCGATGAGGCTCAAGCAAGAAAACATACACTGAATTGCCACAGAATGAAACCTGCACTTTTCTCTGTATTGTGTGAAATTAAAGAAAAAACAGTTTTGTCTCTTCGCAACACGCAAGAGGAGGAGCCCCCAGATCCGCAACTTATGCGTTTAGACAACATGCTGATTGCTGAAGGGGTAGCGGGTCCGGAGAAAGGTGGCGGTGCTGGTGCTGCTGCCTCGGCATCAGCGGCCGCGGGAGAGTGGGGCAGTGTGGCTCAAGCAGATAACGCGATCGAGCACTCGGACTACCGCGCGAAGCTGGCCCAGATCAGACAGATCTATCACCAGGAACTGGACAAGTACGAGAACGCCTGCAACGAGTTCACCACACACGTCATGAACCTGTTACGAGAGCAGAGCCGCACCAGACCCATCACTCCCAAGGAAATAGAGCGCATGGTGCAGATCATACACAAGAAGTTCAGTTCCATTCAGATGCAGCTGAAGCAGTCCACCTGCGAGGCCGTCATGATCCTGCGTTCTCGTTTCCTGGACGCTCGCAGAAAGCGGCGCAACTTCAGCAAGCAGGCGTCCGAGATCCTGAACGAGTACTTCTACTCGCACCTGTCCAACCCCTACCCCAGCGAGGAGGCCAAGGAGGAGCTGGCGCGCAAGTGCGGCATCACCGTCTCCCAGGTGTCCAACTGGTTCGGCAATAAACGTATTCGCTACAAGAAGAACATCGGCAAGGCGCAGGAGGAGGCGAACCTGTACGCCGCCAAGAAAGCCGCTGCAGCGGGGGCGTCACCGTACTCGATGGGCGCCGCGTCGGGGACGGCCACCCCCATGATGTCTCCGGCGCCCACGCAGGACTCCATGGGGTACGCCCTGCCGGCGGCCGGCTACGACCAGCCTCAACCACCATACGACACCTCCATGTCCTACGACCCCATGCATCAGGACCTGTCGCCTTAG

Protein sequence:

>DPOGS208840-PA
MDDPNRMMAHSGGLMGPQGYGLPGGDGAPATGEGEARKQDIGEILQQIMNITDQSLDEAQARKHTLNCHRMKPALFSVLCEIKEKTVLSLRNTQEEEPPDPQLMRLDNMLIAEGVAGPEKGGGAGAAASASAAAGEWGSVAQADNAIEHSDYRAKLAQIRQIYHQELDKYENACNEFTTHVMNLLREQSRTRPITPKEIERMVQIIHKKFSSIQMQLKQSTCEAVMILRSRFLDARRKRRNFSKQASEILNEYFYSHLSNPYPSEEAKEELARKCGITVSQVSNWFGNKRIRYKKNIGKAQEEANLYAAKKAAAAGASPYSMGAASGTATPMMSPAPTQDSMGYALPAAGYDQPQPPYDTSMSYDPMHQDLSP-