Monarch geneset OGS2.0

DPOGS214121
Transcript:        DPOGS214121-TA    3018 bp
Protein:           DPOGS214121-PA    1005 aa
Genomic position:  DPSCF300014 - 1635453-1641761
RNAseq coverage:   480x (Rank: top 26%)

Annotation
Heliconius:        HMEL011387          0.0    90.00%
Bombyx:            BGIBMGA006169-TA    0.0    83.44%
Drosophila:        Nipped-A-PA         0.0    62.21%
EBI UniRef50:      UniRef50_E2AUX1     0.0    69.92%    Transformation/transcription domain-associated protein n=8 Tax=Formicidae RepID=E2AUX1_CAMFO
NCBI RefSeq:       XP_393981.2         0.0    74.02%    PREDICTED: similar to transformation/transcription domain-associated protein isoform 1 [Apis mellifera]
NCBI nr blastp:    gi|383854900        0.0    74.12%    PREDICTED: transformation/transcription domain-associated protein isoform 1 [Megachile rotundata]
NCBI nr blastx:    gi|383854902        0.0    74.12%    PREDICTED: transformation/transcription domain-associated protein isoform 2 [Megachile rotundata]
Group:
Gene Ontology:     GO:0005515    3.2e-55    protein binding
                   GO:0016772    4.8e-45    transferase activity, transferring phosphorus-containing groups
                   GO:0016773    1.2e-10    phosphotransferase activity, alcohol group as acceptor
KEGG pathway:
InterPro domain:   [20-325]     IPR003151    3.2e-55    PIK-related kinase, FAT
                   [575-969]    IPR011009    4.8e-45    Protein kinase-like domain
                   [680-893]    IPR000403    1.2e-10    Phosphatidylinositol 3-/4-kinase, catalytic
Orthology group:   MCL12356    Single-copy universal gene

Nucleotide sequence:

>DPOGS214121-TA
ATGAAGGATGCTCTCGCAGAAGTCGAGTATAATTGTCCAAAGGAATTAGCGTGGCTGGTGAACCTGTACCGCGGCTACCTGTGCATCTGTGCGGGCGGCGAGCAGCAGCTGAGCGGCGTGGAGCGGCACGCGGAGGCGGCGGCGGCTCAGTGCCTGCGGGAGTGGCGCCGCCTGCCGCGACTCGTTGCGCACGCGCATCTGCCGCTCCTGAGAGCCGCGCAACAGCTCATGGAGCTCAGCGAGGCCGCGCAGATACATACCGGTCTCCTTCATAGTCGGCCAACGTCGCTGCACGACATGAAGGCCATAGTGAAGACGTGGCGCAACCGCCTGCCCGTAGTAGCTGACCCGCTGTCTCACTGGGGCGCCATCTTTACTTGGCGACAACACCATTACCAGTTCATCGCCTCCCATTACGACTCCCAGACAGACCACGCCTCAAACCATAGTATGCTAGGAGTGCATGCCAGCGCTCAGGCGATTATCCATTTTGCCAAAATAGCAAGGAAGCACAATTTATCTGGCGTCTGCCTTGATTCCTTGCACCGTATATACACCATCCCGAGCGTTCCAATAGTGGATTGCTTCCAGAAAATAAGACAGCAAGTCAAATGTCACATTCAGATGTCCTGGACGGAGGGCAAAGACGAGTTACAGGAAGGATTGGATATGATAGAGTCGACTAACTTTAAATACTTCACGAAGGAAATGACGGCAGAGTTTTACGCGTTTAAAGGACTGTTACTAGCTCAGTTAGGCCGCTCGGAGGATGCCAACAAAGCATTCGCCGCGGCGGTGCAGTTGCATGACACTCTGGTGAAAGCTTGGGCTCTATGGGGAGATTATTTAGAACAGATTTTCATCAGAGACCCGAGACAGACACAAGTCGGCGTCTCCGCCATGACATGCTTCTTACATGCCTGTAGACATCAGAATGAATCAAAATCAAGAAAATATTGTGCGAAGGTTCTTTGGATGTTAAGTTTTGACGATGAAAAGAATAGTCTAGCGGATGCTCTGGATAAGTATTCTGTGGGTGTCCCGCCTGTGCAATGGTTGCCATGGATACCACAATTGTTGGCTTGCCTTGTACAATATGACGGCAATGTTATATTAAACTTGTTGAGTCATGTGGGACGGCTCTATCCCCAAGCAGTATACTTCCCCATCCGCACTCTGTATTTAACTTTGAAGATCGAGCAACGTGAAAGACACAAAAGCGCCGAAAACCTCGCTGCTAACCAACCGACCACCACCACAGCGAACACGGGTGTAAAGACGGAAGGCGCGTCGACATCATCGGGATCGGGCGGCGAGGCGGGGCCGATTAAGGCGACGCTGCCTATGTGGCGTTGCTCCAAGATCATGCAGCTTCAGAGGGAAATACATCCCACCGTACTTTCCTCGCTAGAAGGCATCGTCGACCAGATGGTGTGGTTCCGTGAGAACTGGTACGAGGAGGTGCTCCGTCAGCTTCGTGCCGGTTTGGCCAAATGCCACGTGGTGGCATTCCATCACCGCGCCGCCGTGGCCGTGGCCACCGTCACGCCGCACACGCTCAATTTCATTAAGAAGGTTGTCTCTACCTTCGGCATAGGGATCGGCATCACCGCCTCGGAGAGTTTGGCGCGGCGAGCGCAGGCCACGGTCCAGGACCCCGTGTTCCACAGGATGAAGACCCAGTTCACGGCCGACTTCGACTTCACACAACCCAACGCCATGAAGCTGCAGAATCTCATACAGAAGTTGCGCAAGTGGGTCAAGATACTCGAGGCTAAGACGAAAGTGTTGCCCAAATCGTTCCTGATAGAAGAGAAGTGCAGATTCCTGTCCAACTTTAGTCTTAAGACGGCAGAAGTGGAGCTGCCAGGGGAATTCTTACTGCCGAAACACACTCACTACCACGTCAGAATCGCAAGATTTATGCCTAGAGTGGAGATTGTACAGAAGCATAACACATCAGCTCGAAGATTGTATATACGAGGTCATAACGGGAAAAT
ATATCCTTACTTGGTAGTCAACGACTCCGGTCTGGGAGACGCTAGAAGAGAGGAACGCGTGCTCCAGTTGCTGCGGATGCTGAATCATTATCTCGGCAAACAGAAGGAAACGTCGAGAAGATTCCTTCACTTCACGGTACCCCGCGTCGTATCGGTATCTCCGCAGATGCGTCTCGTAGAGGACAATCCTAGTTCGATATCACTGCTTGATATATATCGAACTGAATGTGCCAATAGGGGCGTGGAGTATGACGCGCCGGTGGCTCGGTACTATGAACGTCTGGCGGCCGTGCAGGCGCGAGGCTCGCAGGCCTCACATCAGGTGCTGCGGGATATCCTACGAGAGGTTCAAGCTACGATGGTTCCCCGCGGTCTGGTCCGCGAGTGGGCAGCTCGTACGTTCTCGTCGCCCACTGATTATTGGACCTTCCGCAAGATGGTGACACTGCAGCTGTCTCTGGCCGCGTGCGCGGAGTATGTGCTCCACCTCACCAGACTCAACCCGGACATGTTGTACGTGCACCACGACTCCGGCCTGCTCAATGTGGCGTACTTCAAGTTCGACGTCGATGACACCACAGGCGAGCTGGATGGTAACCGACCTGTGCCGTTCCGTCTAACTCCTAACATCTCTGAGCTGGTGACTCACATCGGCATCACGGGTCCTCTGACAGCCTCCGTCATAGCCGTAGCCCGCTGTCTGGTGACACCGAATTTCAAGATACAATCGATCCTGCGCACCATACTTCGAGACGAAATGGTCACCGGTTACAGGAAGCGGCTGGAAGATAAATCCGGTTTGCCGACGGCCGGAACATCCGCGGAGAATAAACCCATGGAGATAGACAACGAGACCATCATAAACATGGTCACCAAGGCCGTTACTGTTATTATGAACAGACTGAACTCGCTGGCCCTCTTCGACGGCCCGGACAGTAAGGTGGCCACTCTAGTGACGGCAGCCAACAGTCATGACAATCTCTGCAGGATGGACCCCGCGTGGCATCCGTGGCTGTAG

Protein sequence:

>DPOGS214121-PA
MKDALAEVEYNCPKELAWLVNLYRGYLCICAGGEQQLSGVERHAEAAAAQCLREWRRLPRLVAHAHLPLLRAAQQLMELSEAAQIHTGLLHSRPTSLHDMKAIVKTWRNRLPVVADPLSHWGAIFTWRQHHYQFIASHYDSQTDHASNHSMLGVHASAQAIIHFAKIARKHNLSGVCLDSLHRIYTIPSVPIVDCFQKIRQQVKCHIQMSWTEGKDELQEGLDMIESTNFKYFTKEMTAEFYAFKGLLLAQLGRSEDANKAFAAAVQLHDTLVKAWALWGDYLEQIFIRDPRQTQVGVSAMTCFLHACRHQNESKSRKYCAKVLWMLSFDDEKNSLADALDKYSVGVPPVQWLPWIPQLLACLVQYDGNVILNLLSHVGRLYPQAVYFPIRTLYLTLKIEQRERHKSAENLAANQPTTTTANTGVKTEGASTSSGSGGEAGPIKATLPMWRCSKIMQLQREIHPTVLSSLEGIVDQMVWFRENWYEEVLRQLRAGLAKCHVVAFHHRAAVAVATVTPHTLNFIKKVVSTFGIGIGITASESLARRAQATVQDPVFHRMKTQFTADFDFTQPNAMKLQNLIQKLRKWVKILEAKTKVLPKSFLIEEKCRFLSNFSLKTAEVELPGEFLLPKHTHYHVRIARFMPRVEIVQKHNTSARRLYIRGHNGKIYPYLVVNDSGLGDARREERVLQLLRMLNHYLGKQKETSRRFLHFTVPRVVSVSPQMRLVEDNPSSISLLDIYRTECANRGVEYDAPVARYYERLAAVQARGSQASHQVLRDILREVQATMVPRGLVREWAARTFSSPTDYWTFRKMVTLQLSLAACAEYVLHLTRLNPDMLYVHHDSGLLNVAYFKFDVDDTTGELDGNRPVPFRLTPNISELVTHIGITGPLTASVIAVARCLVTPNFKIQSILRTILRDEMVTGYRKRLEDKSGLPTAGTSAENKPMEIDNETIINMVTKAVTVIMNRLNSLALFDGPDSKVATLVTAANSHDNLCRMDPAWHPWL-
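
The transcript and protein lengths listed in this record can be cross-checked against each other: 3018 bp is 1006 codons, that is, 1005 residues plus one stop codon, which the protein sequence writes as a trailing "-". The Python sketch below is not part of the original record. It assumes the standard genetic code and uses hypothetical helper names (`translate`, `expected_protein_length`) to translate the first and last codons of DPOGS214121-TA and compare them with the ends of DPOGS214121-PA.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
# Assumption: the record uses the standard nuclear code (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; stop codons are written as '-', as in the record."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append("-" if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # First six and last five codons copied from the nucleotide sequence above.
    print(translate("ATGAAGGATGCTCTCGCA"))   # start of DPOGS214121-TA
    print(translate("CATCCGTGGCTGTAG"))      # end of DPOGS214121-TA, including the stop
    print(expected_protein_length(3018))     # transcript length from the record
```

To check the whole entry, pass the full nucleotide sequence (with line breaks removed) to `translate` and compare the result with the protein sequence.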