Monarch geneset OGS2.0

DPOGS206305
TranscriptDPOGS206305-TA3474 bp
ProteinDPOGS206305-PA1157 aa
Genomic positionDPSCF300082 - 1092189-1132362
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0082350.088.55% 
BombyxBGIBMGA014124-TA0.078.55% 
DrosophilaCamta-PF8e-11734.81% 
EBI UniRef50UniRef50_E0VVQ00.043.03%Calmodulin-binding transcription activator, putative n=2 Tax=Neoptera RepID=E0VVQ0_PEDHC
NCBI RefSeqXP_968552.20.049.47%PREDICTED: similar to calmodulin-binding transcription activator [Tribolium castaneum]
NCBI nr blastpgi|1892410120.049.47%PREDICTED: similar to calmodulin-binding transcription activator [Tribolium castaneum]
NCBI nr blastxgi|1892410120.049.73%PREDICTED: similar to calmodulin-binding transcription activator [Tribolium castaneum]
Group
Gene OntologyGO:00056342.1e-48nucleus
GO:00063552.1e-48regulation of transcription, DNA-dependent
GO:00055162.1e-48calmodulin binding
GO:00055151.1e-05protein binding
KEGG pathway 
InterPro domain[4-107] IPR0055592.1e-48CG-1
[526-607] IPR0147563.4e-20Immunoglobulin E-set
[693-792] IPR0206832.2e-07Ankyrin repeat-containing domain
Orthology groupMCL11713 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206305-TA
ATGGCCCTGGAAATTGCGGCCATACTGATCAGCTTCGATAAACATGGCGATTGGCAGTCCAAGGAAGTCAAAATACGACCAAAAAGTGGGTCTATGCTTCTGTACAGCAGAAAAAAAGTCCGTTACCGACGTGATGGATACTGTTGGAAGAAGCGGAAAGACGGAAAGACCACCCGAGAAGACCACATGAAGCTTAAAGTGCAAGGAACTGAGTGCATCTATGGTTGTTATGTGCACTCGGCAATTCTACCTACTTTCCATCGGAGATGTTACTGGCTTTTGCAGAATCCCGACATAGTACTGGTCCACTATCTGAACGTGCCGTACCCAGATGACAACAAGTTGGCTACCGTAGCCCCGAGCCTCGCGCTCTGGGCTGATAAAAAGGAATGGACGAAAGATGAACTCGTCAGTCAATTGAAGCCAATGTTCTTTAGCGAAGATGAACCGGACGTAAATAGTGACTTGGAAATATCTACAACGGAGACGGTGGAAGCTATTGTTGGTCAGTTGATGGAGAAGCAAAGAGCAGCGAGAGCTGCCGCTTTGGCTCGCCAACTTGAATGCGGCTGCCCAGACTCCACCTGCCAAGATTCAAGGACATGTGCTCACCCAATGCGGCGTATTCAAACAGCAAAGGCACCGGCTTCTGACCATCATGTTTCCTCAACTACTGGCCCGTCACCAAGACCAATGGCTCAACCACCACGACAGTACACCAGAGACCATAGAGCTACGACACAATCGTCACCGTTACTGCTGTCGCTGGGTCAGATACAAGGTGGTGGGGGACTTCTCATATTAAATGGCACCAGCAACAGTTCTCAACAGCATTCATCATTAGTTTCACCTCTGTCTGTTACATCATTTGTTTGCGAGGAACCTAGAGACAGGTATCGTCAACAGTACAAACCGACATTCGTCCTGAAAAGGGAAATACCGGATAGTCAACAAAACACATGTTTGACTAATACTGAATCAACGTTTGAAGTGGAGAGTCGAGTTGAAGAAAAAGTTGAAATTGAAACTTTTGATCGAAAAATAAAGATGGAACCTAGAAGTAGAAATAATATAATAGCTAGTGCACCAGCGACGCCGTCACGTTACCCAGACTTGGTGGAACGATTGGAAAGTAAAATTCATACAGACCATTGTGAAGATACGCTGGTTTTGCTTGGGACTGATGCCCATTTGGAATCATCTAGTGGGTTTTTCGATGAAACATTGGAGCTATCTCACGAGGATATACAGAAGACATTGTCAGCGAATATGCCAACATGTGAATTAAATCGAAGTGGAGTGAGATCAACTGAAACCGCCAATGTGATGGTATCGGGAATAGATACTATGGACTTTATAGAGAGTTGTGAAGCTGTCGCTTCCCCTACACATGTGGTTGATGATAATGTGTTTGTAAATTTAGATGCTTTTGATATGCTCGGTGACTTTCCGGAATTGGAGGTATTGGATCCCAGCACTATATCTACTAATCCCGCGAATCTTTGTGGAAATTCTCCTCAAACGGAGGAAAACAACGATAAAATGCAGACTGATAGTCCAAGGGAAGGTGCACTTAGCATTACTGACTATTCTCCTGAATGGGCGTATCCTGAAGGTGGTGTCAAGGTACTGGTAGCTGGGCCTTGGACGGAAACCTCCGATCAGTATACCATTCTTTTTGACAACTTTCCGGTACCTTCAATATTGGTGCAGAATGGTCTACTTAGATGTTATTGTCCAGCTCATGAGGCCGGGTTGGCAGCATTGCAAGTAGCTCGAGCTGGTCGCGTAGTATCTGACACGGTGGTGTTCGAATATAAGGCAGGTCCAATGTTGGCGCCGTCCTCACCCGCTTCAGCGCCGCTGCCTTCTTTGGATCTTAGACGATTCTCGTTGTTGCAGCGTCTGCAGCGGCTGCACGGGCGTTTGCAACTGAAGACGGAACCAATGGATGATAATAACCAGATTGAAGATGTGCAGTTATATTCAAATCCAAAATTTGAGGATCGTCTCGTGGTTTTTTGCCAATTCTTAAGTAACCGGTCGTTCGGTAACTCCGAAGGATTCACTACGGAGCCTGGTGAAGACAGTTCCACCATATTACATCTAGCTGCAGCTTTGGGTTACACGAAGCTGACAACAGCTTTGTTGAGGTGGAGACAGGACGATAATAGCTTAGCTTTGGAGAAGGAAGTTAATTTGGGAGCTAGAGATAGCGACAATTGTACCCCCTTGATGGTAGCAAGCGCGCTCGGTCACTCAGACACCGCGTTAGTGTTGGCTCGCTGGGCGGCGGGGACGCGGCGGGAGGCCGGGGCTAGAGCGGCGGTTGCTGCAGCACGGCGCGGGGGGCACAGCACCCTCGCAGCTGCCTTAGAGAGAATACAGGGGGACTGCGTGTTCAGAAGACCGCTCAGTTTATCTCAAAAGAATAGAGCTGGCAGTTTGGAGAGTAATTTAGTGAAACGGCCCTCCATCGACAGCGGTATCAACATGGCTGATGCCTTTAGATCCAGTTCAGCTATAGACAAAACTGACACTAATTCGTCCAGATGGGAACGAAGTATGTCGCTTCCACTGGACTCGGATACCGAGGACAGCTTCGGTGACATGAAACTTGGTCGCAGGATGGACCTAGCTCTATGGGAGCAGGATGACCGTGTCTTCACGTTGGCTGAGCAGATTATAGCCGCTATGCCGGAGAGAATTAAGAATGAGGGTATTCTTTCGTGCGACCTGGACAGCGGCGCTTGCAGCGAGGACGTGCTGATGGTGCCGTTGTTAGATGACGCTTCAACCTTCAGCAGCGAGTTCAGCTTTGAGTTTTGTGATAACACATACAGATACACTGGGGCATCTACTCCATCATCAGGCTCCGTGTCTCCAGGCTCTGCGCTGTCTCCTCCGCCCTCTTCACCCCTCGCCCCCGCCTCTGCCACTCTACAAGAGTTCCTCAACACGACGCACTTTTCCAGCTTAACTCTAAACGACCGGGAGCAGCGCGAGTTATACTCAGCGGCGATCACGATCCAGAAGGCCTATCGTCAGTACCGCGGTAGACAGTTGCAGCGCCGGGCTGCCGCCGCTGCAATCACCATACAGAACTGCTATCGTCGATACAAACAGTTCGCGTACTTGAAGCAAATGCACGCAGCGGCGACGGTTATCCAACGAGGATACCGCGGATTGAGGGAGAGACGGCTCAATAACACCAACTACGTCAAGCGAACATACTCTCAGAGGAGACAACACCAAGCAGCGAGGAAAATCCAACAGTTCATGAGGCAAACCAAGATCAAGTTGCAGAGAGAGCGAGCCGCAAACGCGAAGGCGGCCCTGCGCTCCCCGGATGCCCACCAAAGCTCGTCGCAGCCCATCACCAGTACACCCAATAGGATCATTGACTATCTAGCACCTGAATCACCGATGAACGCAGATGATGACCTTCTGATCGAGCTTCTGTTTAAAATGTGA

Protein sequence:

>DPOGS206305-PA
MALEIAAILISFDKHGDWQSKEVKIRPKSGSMLLYSRKKVRYRRDGYCWKKRKDGKTTREDHMKLKVQGTECIYGCYVHSAILPTFHRRCYWLLQNPDIVLVHYLNVPYPDDNKLATVAPSLALWADKKEWTKDELVSQLKPMFFSEDEPDVNSDLEISTTETVEAIVGQLMEKQRAARAAALARQLECGCPDSTCQDSRTCAHPMRRIQTAKAPASDHHVSSTTGPSPRPMAQPPRQYTRDHRATTQSSPLLLSLGQIQGGGGLLILNGTSNSSQQHSSLVSPLSVTSFVCEEPRDRYRQQYKPTFVLKREIPDSQQNTCLTNTESTFEVESRVEEKVEIETFDRKIKMEPRSRNNIIASAPATPSRYPDLVERLESKIHTDHCEDTLVLLGTDAHLESSSGFFDETLELSHEDIQKTLSANMPTCELNRSGVRSTETANVMVSGIDTMDFIESCEAVASPTHVVDDNVFVNLDAFDMLGDFPELEVLDPSTISTNPANLCGNSPQTEENNDKMQTDSPREGALSITDYSPEWAYPEGGVKVLVAGPWTETSDQYTILFDNFPVPSILVQNGLLRCYCPAHEAGLAALQVARAGRVVSDTVVFEYKAGPMLAPSSPASAPLPSLDLRRFSLLQRLQRLHGRLQLKTEPMDDNNQIEDVQLYSNPKFEDRLVVFCQFLSNRSFGNSEGFTTEPGEDSSTILHLAAALGYTKLTTALLRWRQDDNSLALEKEVNLGARDSDNCTPLMVASALGHSDTALVLARWAAGTRREAGARAAVAAARRGGHSTLAAALERIQGDCVFRRPLSLSQKNRAGSLESNLVKRPSIDSGINMADAFRSSSAIDKTDTNSSRWERSMSLPLDSDTEDSFGDMKLGRRMDLALWEQDDRVFTLAEQIIAAMPERIKNEGILSCDLDSGACSEDVLMVPLLDDASTFSSEFSFEFCDNTYRYTGASTPSSGSVSPGSALSPPPSSPLAPASATLQEFLNTTHFSSLTLNDREQRELYSAAITIQKAYRQYRGRQLQRRAAAAAITIQNCYRRYKQFAYLKQMHAAATVIQRGYRGLRERRLNNTNYVKRTYSQRRQHQAARKIQQFMRQTKIKLQRERAANAKAALRSPDAHQSSSQPITSTPNRIIDYLAPESPMNADDDLLIELLFKM-