Monarch geneset OGS2.0

DPOGS208514
Transcript | DPOGS208514-TA | 1059 bp
Protein | DPOGS208514-PA | 352 aa
Genomic position | DPSCF300064 - 263023-268050
RNAseq coverage | 238x (Rank: top 43%)
Annotation
Heliconius | HMEL002162 | 0.0 | 86.97%
Bombyx | BGIBMGA008451-TA | 1e-160 | 79.82%
Drosophila | IP3K2-PB | 4e-139 | 75.08%
EBI UniRef50 | UniRef50_Q5TX95 | 1e-142 | 80.34% | AGAP002194-PA n=6 Tax=Anopheles gambiae RepID=Q5TX95_ANOGA
NCBI RefSeq | XP_565625.3 | 6e-144 | 78.03% | AGAP002194-PA [Anopheles gambiae str. PEST]
NCBI nr blastp | gi|347967363 | 4e-142 | 80.34% | AGAP002194-PA [Anopheles gambiae str. PEST]
NCBI nr blastx | gi|347967361 | 2e-138 | 79.33% | AGAP002194-PB [Anopheles gambiae str. PEST]
Group
Gene Ontology | GO:0008440 | 3.1e-149 | inositol trisphosphate 3-kinase activity
KEGG pathway | aga:AgaP_AGAP002194 | 2e-143
  K00911 (E2.7.1.127, ITPK) maps to:
    Phosphatidylinositol signaling system
    Inositol phosphate metabolism
    Calcium signaling pathway
InterPro domain | [32-345] | IPR005522 | 3.1e-149 | Inositol polyphosphate kinase
Orthology group | MCL14747 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208514-TA
ATGTACTCCGGTGACATCTGTTTGAACATTAAGGATTTTATGGATGTCGTGTACTGTGCTTATATTTTTAATTCTTCTAAGCAATCAGAACGATGGAGGAAGCTGAGGAATATTGTGCAATGGACACCCTTCTTTCAAACCTACAAGAAGCAGAGGTACCCTTGGGTACAGCTGGCTGGTCACCAAGGGAACTTCAAAGCCGGCCCTGACCAAGGAACCATATTAAAAAAGCTGAGCCCTCAAGAGGAGAGATGCTTTAAGTTGCTGATGAAGGATGTGTTACGACCTTTCGTCCCTGGTTACAAAGGGCAGGTCACATGCGAAGACGGCGAATTATATTTACAACTCCAAGATCTCCTCAGCGATTTTGACTGTCCCTGCGTTATGGATTGCAAGATTGGCGTACGGACTTATCTGGAAGAGGAATTAGCTAAGGCTAAGGAGAAAACCAAGTTAAGAAAAGACATGTACGAGAAAATGATCCAAATAGATCCAAAGGCGCCAACAGAGGAAGAGCACAGAAGCAAAGGAGTTACAAAGCCACGGTACATGATTTGGAGGGAAACAATCAGTTCTACTTCGACATTGGGTTTCAGGATAGAGGGAGTGAAAAAGGCTGATGGAACGAGCACAAAAGACTTCAAGACCACAAAAACGAGAGATCAAATTGTCGAAGCCTTTAAAGATTTCGCTAACACCTCCACTGCCGTGCCAAAATATCTCGAACGGCTGAAGGCTATTCGGACGACTCTTATGGAATCAAACTTCTTCAGAACTCACGAACTTATAGGCAGTTCCTTGCTCTTCGTTCACGACAAAAGAAAAGCCTCTATTTGGATGATAGATTTCGCTAAAACAGTACCTGTGCCAGAGGATATAACTATCGACCACGATTCCGCTTGGAAGGTCGGTAACCATGAAGACGGCTACCTTATCGGCATCAATAACTTAATATCAATCTTCGAATCCCTTATCAAGGACGATAACGGTAACATAGATCAGTGTTATTCGAATTTAAGTTTAGATAAAAACGTAAGAAGAGACAGCTTAGCCACTTGA

Protein sequence:

>DPOGS208514-PA
MYSGDICLNIKDFMDVVYCAYIFNSSKQSERWRKLRNIVQWTPFFQTYKKQRYPWVQLAGHQGNFKAGPDQGTILKKLSPQEERCFKLLMKDVLRPFVPGYKGQVTCEDGELYLQLQDLLSDFDCPCVMDCKIGVRTYLEEELAKAKEKTKLRKDMYEKMIQIDPKAPTEEEHRSKGVTKPRYMIWRETISSTSTLGFRIEGVKKADGTSTKDFKTTKTRDQIVEAFKDFANTSTAVPKYLERLKAIRTTLMESNFFRTHELIGSSLLFVHDKRKASIWMIDFAKTVPVPEDITIDHDSAWKVGNHEDGYLIGINNLISIFESLIKDDNGNIDQCYSNLSLDKNVRRDSLAT-
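
The transcript and protein lengths above are consistent with each other: 1059 bp is 353 codons, that is, 352 residues plus one stop codon. The following Python sketch shows one way to check that the protein sequence matches the transcript. It assumes the standard genetic code and that the trailing "-" in the protein record marks the stop codon; the function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" by default, which is
    assumed here to be the convention used in the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five codons of DPOGS208514-TA.
print(translate("ATGTACTCCGGTGAC"))  # MYSGD
# Last three codons of DPOGS208514-TA, ending in the TGA stop codon.
print(translate("GCCACTTGA"))        # AT-
```

Passing the full DPOGS208514-TA sequence to `translate` and comparing the result with the DPOGS208514-PA record would perform the same check over the whole gene model.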