Monarch geneset OGS2.0

DPOGS213804
Transcript: DPOGS213804-TA (1182 bp)
Protein: DPOGS213804-PA (393 aa)
Genomic position: DPSCF300106 - 43749-45278
RNAseq coverage: 712x (Rank: top 18%)
Annotation
Heliconius        HMEL014725          0.0      86.29%
Bombyx            BGIBMGA009298-TA    2e-155   81.93%
Drosophila        IP3K1-PA            2e-89    49.27%
EBI UniRef50      UniRef50_D6WZL6     9e-112   58.38%   Putative uncharacterized protein n=4 Tax=Endopterygota RepID=D6WZL6_TRICA
NCBI RefSeq       XP_397256.3         5e-114   61.83%   PREDICTED: similar to Inositol 1,4,5-triphosphate kinase 1 CG4026-PA [Apis mellifera]
NCBI nr blastp    gi|340711173        1e-114   62.97%   PREDICTED: inositol-trisphosphate 3-kinase A-like [Bombus terrestris]
NCBI nr blastx    gi|383852324        1e-110   56.82%   PREDICTED: inositol-trisphosphate 3-kinase A-like [Megachile rotundata]
Group
Gene Ontology     GO:0008440            2.7e-71   inositol trisphosphate 3-kinase activity
KEGG pathway      ame:413817            1e-113
    K00911 (E2.7.1.127, ITPK) maps-> Phosphatidylinositol signaling system
    Inositol phosphate metabolism
    Calcium signaling pathway
InterPro domain   [108-394] IPR005522   2.7e-71   Inositol polyphosphate kinase
Orthology group   MCL15115              Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213804-TA
ATGACTGTAACCAGCAAGATGAACAGTGTCTCTCGCCAGTCGATGCTGAGCCGTCTCGGCCCTGGCGGGCTCCTTTATGCCGCAAACAAAATTATAGCGGTTGCTTGCAAGGATGAAGAACGCTGGCGTGTTAAGAGTGAGGGTGCTTTGTTCAAACGCTGGAAGCGTAATAACTCTTCCGCGGAGAACACCTCCCCTGTTCCACCAGTATCAGCCAGCCCAGCATCGCAACCGTCAGATCTGCTCAAATTCTTAGCAATCAATGCACTAGAGCTTTCAGCACCGGCATCTGATGCTTTGTTGAAATCTCGTTCCTCGGAATGGTTTCAATTGTCGGGACACCCAGGTTCCCTAGCACCTGCTGGCCCTGGAACAGTTTGGAAACGTCGCGCAGCGGGTGACCGTCCTGGACACAACCCTGAGCGCGACGCTTACGAAGCCCTCGCAGCATGCCCCCACATGCGCACCGCGATACCCCGCTATTACCGTGATTTGGAATATGACGGTGAACACTTCATTGAACTGCAAGACCTCCTTCACGGGTTCCGTGATCCACATGTTATGGATGTTAAAATGGGAACCCGGACATTCCTCGAAGATGAAGTTAGTAACGCTCGCGCCCGACAAGATCTTTATGAGAAAATGGTACGTGTAGACCCCAATGCACCCACTGAAGAAGAGCACGCCGCTAGAGCGGTCACCAAACTGCGATATATGCAGTTCCGTGAGCAGTGCTCATCATCAGCCCAGCAAGGCTTCCGCATAGAAGCTGTTAAAGTCCCAGGCCAGCCACCACTTACTGACCTTCAAAAAGTTCGTGAACCGGAACAACTTAAGGCTACTGTTGCTCGGTTCCTCGGTAACGATAAGCGCGCCCAACGCGCCATAGCAGCTCGATTGCGCGAGATAAGAAGTCTATTTGAAAAATCTGAATTCTTCCGGGAACATGAGATTGTTGGCAGCAGCATTTTTATAATCTACGACGATGAACGCGTGGGAGCCTGGCTGATAGACTTCGCAAAGACGCGTCGCTTGCCGAAAGACGTTAAAGTAACACATAGAGAAGAATGGCAGCAAGGTAACCACGAAGAAGGTTTCTTGTATGGATTAGATAGATTAATAAATACCATAGAGGCAGCAGAACTCTCCGAAACAATAATTGAACCGGACGTATCACGCTGA

Protein sequence:

>DPOGS213804-PA
MTVTSKMNSVSRQSMLSRLGPGGLLYAANKIIAVACKDEERWRVKSEGALFKRWKRNNSSAENTSPVPPVSASPASQPSDLLKFLAINALELSAPASDALLKSRSSEWFQLSGHPGSLAPAGPGTVWKRRAAGDRPGHNPERDAYEALAACPHMRTAIPRYYRDLEYDGEHFIELQDLLHGFRDPHVMDVKMGTRTFLEDEVSNARARQDLYEKMVRVDPNAPTEEEHAARAVTKLRYMQFREQCSSSAQQGFRIEAVKVPGQPPLTDLQKVREPEQLKATVARFLGNDKRAQRAIAARLREIRSLFEKSEFFREHEIVGSSIFIIYDDERVGAWLIDFAKTRRLPKDVKVTHREEWQQGNHEEGFLYGLDRLINTIEAAELSETIIEPDVSR-
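
The transcript and protein records above can be cross-checked against each other: 1182 bp is 394 codons, which is 393 amino acids plus one stop codon, shown here as the trailing "-". The following is a minimal sketch of that check, assuming the standard genetic code; the function name translate and the stop_symbol parameter are illustrative and are not part of the database. Only the first and last few codons of DPOGS213804-TA are used so the example stays short.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are rendered as stop_symbol to match the "-" that ends
    the protein record. Raises ValueError if the length is not a
    multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First eight and last five codons of DPOGS213804-TA, copied from the record.
first_codons = "ATGACTGTAACCAGCAAGATGAAC"
last_codons = "GACGTATCACGCTGA"

print(translate(first_codons))  # start of the protein record
print(translate(last_codons))   # end of the protein record, including stop

# Length bookkeeping: 1182 bp -> 394 codons -> 393 residues + 1 stop.
transcript_bp = 1182
print(transcript_bp // 3 - 1)
```

To check the whole record, pass the full nucleotide sequence to translate and compare the result with the protein sequence as listed.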