Monarch geneset OGS2.0

DPOGS212956
TranscriptDPOGS212956-TA3267 bp
ProteinDPOGS212956-PA1088 aa
Genomic positionDPSCF300057 + 26813-42845
RNAseq coverage248x (Rank: top 42%)
Annotation
HeliconiusHMEL0219450.093.36% 
BombyxBGIBMGA001739-TA0.064.12% 
DrosophilaStat92E-PF6e-10846.78% 
EBI UniRef50UniRef50_F4WR541e-17561.51%Signal transducer and activator of transcription 5B n=28 Tax=Coelomata RepID=F4WR54_ACREC
NCBI RefSeqNP_001157388.10.083.99%signal transducer and activator of transcription [Bombyx mori]
NCBI nr blastpgi|172255660.085.01%signal transducer and activator of transcription short form [Spodoptera frugiperda]
NCBI nr blastxgi|172255660.085.18%signal transducer and activator of transcription short form [Spodoptera frugiperda]
Group
Gene OntologyGO:00056349.5e-197nucleus
GO:00071659.5e-197signal transduction
GO:00063559.5e-197regulation of transcription, DNA-dependent
GO:00048719.5e-197signal transducer activity
GO:00037009.5e-197sequence-specific DNA binding transcription factor activity
GO:00055091.3e-42calcium ion binding
GO:00055154.7e-36protein binding
KEGG pathwaydre:4454741e-136 
 K11223 (STAT5A)maps-> Pathways in cancer
    Acute myeloid leukemia
    Jak-STAT signaling pathway
    ErbB signaling pathway
    Chronic myeloid leukemia
InterPro domain[563-1019] IPR0012179.5e-197STAT transcription factor, core
[636-900] IPR0089674.8e-91p53-like transcription factor, DNA-binding
[644-898] IPR0138013.1e-80STAT transcription factor, DNA-binding
[182-325] IPR0123453.9e-43STAT transcription factor, DNA-binding, subdomain
[789-913] IPR0119921.3e-42EF-hand-like domain
[2-181] IPR0138001.4e-41STAT transcription factor, all-alpha
[1-181] IPR0159882.1e-41STAT transcription factor, coiled coil
[914-1017] IPR0009804.7e-36SH2 motif
Orthology groupMCL11010 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212956-TA
ATGGCTTCCGAGGAGATCAGAAGCCTCCAGGCTAATATAGAGTCCTTCTCGCTGCAGTACCACGAGTGCCTCAAGAATAAGGGTCACATAAACTATTTGCAACAGAGCGGTCCCATGACCAACGACCGGCGCGAGCTGGAGGCGTGCCTCCGCGTGCAGATCGAGGATATGGAGAGGAAACTCAACGCGCTGGTGGCCCAAATTAACCAGTCCCAAATGGAGTTGGTGGATCACATGAAAGAAAACATCACCAACCTGAGACAGCTGCAGAGCCAGGTGCTGGACGAGGAGCTGATCAAGTGGAAGCGTGAACAGCAGCTGTCAGGAAACGGTGTGCCGATGCAGTCCAACCTGAACAGTATACAGGAGTGGTGTGAGCTGCTGGCCGAGCTGATATGGAGTACGAGACAGCAGGTCTGCAATGTGGGAAGGATTAACAGCAAGACCATAGTGGAGCTGCGGCAACCCCACCTCGCCGAGATGCTGGACGACATGAGCAAGCAGGTGACAAGTCTTCTGTCTACTTTGGTGACTTCAACGTTCGTCATTGAGAAACAACCTCCGCAGGTCATGAAGACAAACACACGCTTCACAGCCACCGTCCGCCTGTTAGTCGGTGGTCAACTGAACGTGCACATGACACCGCCGCGCGTCACGGTGGTGATAATATCGGAGCAACAGGCCCAGCTGCTGCTGAAGAGCGACGTGTCCGGGGGCCGGGGCAAACAGCCCCAGGAGTGTGGGGACATCTTGAACAACAGCGGCTGTATGGAGTACCAGCCGACTTGCAGGCAGCTCAGTGTGAGCTTCCGTAACATGCAGCTGCGTAAGATCAAACGAGCCGAGAAGAAGGGAACAGAGAGTGTGATGGACGAGAAGCTGACGCTGTTGTTCCAGTCGGAGTTCAACGTTGGCGGAGGGGAGCTGGTCTTCCAGGTGTGGACTCTATCTCTGCCGGTGGTTGTGATCGTCCACGGTAACCAGGAGCCCCACGGCTGGGCCACCGTTACTTGGGACAACGCTTTCAGCCCCCCGGGGAGAGTACCATTCGCTGTTCCTGATAAGGTGACCTGGGGTCAGCTAGCTGAGACGCTCCGCATCAAGTTCTGCTCAGCCACAGGTGGTGACCTGTCCGAAGACAACTTGAGGTTCCTCGCTGAAAAAATATTCAGTTCGCACGTCGACAGAACGAGCCTGCCGCTGACAGCGCTCGAGCTGAACAATATGAGCGTTAGCTGGACACAGTTCTGTAAGGACGCTCTGCCTGATAGGAACTTCACCTTCTGGGAGTGGTTCTACATGGTGGTTAAGGTCACCAGGGACTATCTGAGGACGTTGTGGTGTGACCGGCGTTCAATGAGAATGGATTGTGGTTTAACCCATCAGCCTAATAAGTTGCGTGGCGAGGGTAACGAGGTGTTCAGCCTCCAGCCGTTCACGTCCCGCGACCTGATGCTGCGCTCCCTGGCCGACCGTATCCTGGATCTCGCGCAGTTGCAATTCCTTTACCCAAACGTCGCCAAAGATGACGTCTTCTCCAAATACTACACTAAGCCCGAGAACGAGATGCTGAAGAACGGGTATGTGAAGCCCGTCCTGGTGACGACCCTACCTCCCTACATGTCCGCCTCCCCGGCCTACGCACACTCCCCGGACTCGCACAGGAACACGCCCTCTGTGCACAGCAGGTGGAAGCGTGAACAGCAGCTGTCAGGAAACGGTGTGCCGATGCAGTCCAACCTGAACAGTATACAGGAGTGGTGTGAGCTGCTGGCCGAGCTGATATGGAGTACGAGACAGCAGGTCTGCAATGTGGGAAGGATTAACAGCAAGACCATAGTGGAGCTGCGGCAACCCCACCTCGCCGAGATGCTGGACGACATGAGCAAGCAGGTGACAAGTCTTCTGTCTACTTTGGTGACTTCAACGTTCGTCATTGAGAAACAACCTCCGCAGGTCATGAAGACAAACACACGCTTCACAGCCACCGTCCGCCTGTTAGTCGGTGGTCAACTGAACGTGCACATGACACCGCCGCGCGTCACGGTGGTGATAATATCGGAGCAACAGGCCCAGCTGCTGCTGAAGAGCGACGTGTCCGGGGGCCGGGGCAAACAGCCCCAGGAGTGTGGGGACATCTTGAACAACAGCGGCTGTATGGAGTACCAGCCGACTTGCAGGCAGCTCAGTGTGAGCTTCCGTAACATGCAGCTGCGTAAGATCAAACGAGCCGAGAAGAAGGGAACAGAGAGTGTGATGGACGAGAAGCTGACGCTGTTGTTCCAGTCGGAGTTCAACGTTGGCGGAGGGGAGCTGGTCTTCCAGGTGTGGACTCTATCTCTGCCGGTGGTTGTGATCGTCCACGGTAACCAGGAGCCCCACGGCTGGGCCACCGTTACTTGGGACAACGCTTTCAGCCCCCCGGGGAGAGTACCATTCGCTGTTCCTGATAAGGTGACCTGGGGTCAGCTAGCTGAGACGCTCCGCATCAAGTTCTGCTCAGCCACAGGTGGTGACCTGTCCGAAGACAACTTGAGGTTCCTCGCTGAAAAAATATTCAGAACGAGCCTGCCGCTGACAGCGCTCGAGCTGAACAATATGAGCGTTAGCTGGACACAGTTCTGTAAGGACGCTCTGCCTGATAGGAACTTCACCTTCTGGGAGTGGTTCTACATGGTGGTTAAGGTCACCAGGGACTATCTGAGGACGTTGTGGTGTGACCGATTGATCATGGGTTTCATCCAGAAGAAGCAAGCGGAGGAAATGCTGTCCAAGTGTCCCCCGGGCACGTTCCTGTTGAGATTCAGTGACTCGGAGCTGGGAGGTATCACCATAGCTTGGACTGGCGAGGGTAACGAGGTGTTCAGCCTCCAGCCGTTCACGTCCCGCGACCTGATGCTGCGCTCCCTGGCCGACCGTATCCTGGATCTCGCGCAGTTGCAATTCCTTTACCCAAACGTCGCCAAAGATGACGTCTTCTCCAAATACTACACTAAGCCCGAGAACGAGATGCTGAAGAACGGGTATGTAAAGCCCGTCCTGGTGACGACCCTACCTCCCTACATGTCCGCCTCCCCGGCCTACGCACACTCCCCGGACTCGCACAGGAACACGCCCTCTGTGCACAGCAGTTACTTCAGCGCGTCTACACCAGCTCAGACAGAGAGTAGCTTCATGGAGAGCGAACTGTTCGAACAGATAAGGGCCTTCGATCACGAGGAGTTTGACGACTTCGACTTCTACGGGGGTAACGCGGCCATGAAGTGA

Protein sequence:

>DPOGS212956-PA
MASEEIRSLQANIESFSLQYHECLKNKGHINYLQQSGPMTNDRRELEACLRVQIEDMERKLNALVAQINQSQMELVDHMKENITNLRQLQSQVLDEELIKWKREQQLSGNGVPMQSNLNSIQEWCELLAELIWSTRQQVCNVGRINSKTIVELRQPHLAEMLDDMSKQVTSLLSTLVTSTFVIEKQPPQVMKTNTRFTATVRLLVGGQLNVHMTPPRVTVVIISEQQAQLLLKSDVSGGRGKQPQECGDILNNSGCMEYQPTCRQLSVSFRNMQLRKIKRAEKKGTESVMDEKLTLLFQSEFNVGGGELVFQVWTLSLPVVVIVHGNQEPHGWATVTWDNAFSPPGRVPFAVPDKVTWGQLAETLRIKFCSATGGDLSEDNLRFLAEKIFSSHVDRTSLPLTALELNNMSVSWTQFCKDALPDRNFTFWEWFYMVVKVTRDYLRTLWCDRRSMRMDCGLTHQPNKLRGEGNEVFSLQPFTSRDLMLRSLADRILDLAQLQFLYPNVAKDDVFSKYYTKPENEMLKNGYVKPVLVTTLPPYMSASPAYAHSPDSHRNTPSVHSRWKREQQLSGNGVPMQSNLNSIQEWCELLAELIWSTRQQVCNVGRINSKTIVELRQPHLAEMLDDMSKQVTSLLSTLVTSTFVIEKQPPQVMKTNTRFTATVRLLVGGQLNVHMTPPRVTVVIISEQQAQLLLKSDVSGGRGKQPQECGDILNNSGCMEYQPTCRQLSVSFRNMQLRKIKRAEKKGTESVMDEKLTLLFQSEFNVGGGELVFQVWTLSLPVVVIVHGNQEPHGWATVTWDNAFSPPGRVPFAVPDKVTWGQLAETLRIKFCSATGGDLSEDNLRFLAEKIFRTSLPLTALELNNMSVSWTQFCKDALPDRNFTFWEWFYMVVKVTRDYLRTLWCDRLIMGFIQKKQAEEMLSKCPPGTFLLRFSDSELGGITIAWTGEGNEVFSLQPFTSRDLMLRSLADRILDLAQLQFLYPNVAKDDVFSKYYTKPENEMLKNGYVKPVLVTTLPPYMSASPAYAHSPDSHRNTPSVHSSYFSASTPAQTESSFMESELFEQIRAFDHEEFDDFDFYGGNAAMK-