Monarch geneset OGS2.0

DPOGS209523
TranscriptDPOGS209523-TA1500 bp
ProteinDPOGS209523-PA499 aa
Genomic positionDPSCF300127 + 495072-507522
RNAseq coverage1134x (Rank: top 11%)
Annotation
HeliconiusHMEL0162620.073.70% 
BombyxBGIBMGA007441-TA0.068.33% 
DrosophilaInos-PA5e-17157.50% 
EBI UniRef50UniRef50_O974778e-16957.50%Inositol-3-phosphate synthase n=10 Tax=Eukaryota RepID=INO1_DROME
NCBI RefSeqXP_002016409.15e-17357.09%GL11560 [Drosophila persimilis]
NCBI nr blastpgi|665467862e-16959.29%PREDICTED: inositol-3-phosphate synthase 1-B isoform 1 [Apis mellifera]
NCBI nr blastxgi|1258085167e-16457.09%GA24769 [Drosophila pseudoobscura pseudoobscura]
Group
Gene OntologyGO:00086541.1e-221phospholipid biosynthetic process
GO:00060211.1e-221inositol biosynthetic process
GO:00045121.1e-221inositol-3-phosphate synthase activity
GO:00054883e-80binding
KEGG pathwaydpe:Dper_GL115601e-172 
 K01858 (E5.5.1.4, INO1)maps-> Inositol phosphate metabolism
    Streptomycin biosynthesis
InterPro domain[5-480] IPR0025871.1e-221Myo-inositol-1-phosphate synthase
[380-455] IPR0160403e-80NAD(P)-binding domain
[284-382] IPR0130217.5e-38Myo-inositol-1-phosphate synthase, GAPDH-like
Orthology groupMCL11545 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209523-TA
ATGACTACAACCTCAAACCTAACGGTGTCTTCGCCAAATATAAAATATACCGACGATTATATATTTTCCCAGTATGAATATCAGGAAACTCTTGTTGATACGGTGGATAACAAGTTGGTGGCAAAGCCCTACAGTACATCACTATCAATTAGAACTGATCGAAGAGTGGGGAAAGTTGGTGTAATGCTGGTGGGTTGGGGAGGCAACAATGGGTCCACGTTCACAGCCGCTGTTATAGCTAACAGAGAAAAGCTGACTTGGAACACCAAGGATGGTGAGATGGATTCTAACTGGTATGGCTCTATAACTCAAGCATCCACAGTTCGTCTTGGTATTGACGAGAGGGGTATGGACGTGTACGTTGCTATGTCACAGCTGCTGCCAATGGTTGATCCCAATGATTTGGTTATAGATGGATGGGATATAAGCCCCATGAATCTAGCTGACGCCATGGTGCGTGCCAAGGTCATTGACTACGACCTGCAGCAGAAGCTCCGCAAGAAGATGCAGACCATGAAACCCCGTCCGGCTATATACGATCCGGATTTCATAGCTGCCAACCAGGCCGATCGTGCTTTAAACCTCATCCGCGGCACTCGACACGAGCAGTACCTACAGATAAGAGCTGACATCAAAGACTTCAGAGACAAGAATAACCTGGATAAAGTCATCGTATTGTGGACAGCGAACACTGAAAGATTCTGCGAAGTAACTCCGGGTGTTCATGATACAGCTGAGAATTTGCAGAAGGCTTTACAGCAGAATGTGTCCGAGGTGTCCCCTTCAACCATCTTCGCCATGGCAGCCGTCGACGAGGGGCAATGTAAAGCGCATGGCTCTCTGATTCCCGAGTGTTTGAAACCGGTGTCCATAGTAAGCTACAACCACCTGGGCAACAACGACGGAAAGAACCTCTCGGCTCCGAAGCAGTTCCGCTCTAAAGAGATAACGAAAAGCAACGTGGTAGACGACATGGTGGAGGCGAACCGTCTGCTGTACTCCGAGGGGGAGAAGCCCGACCACGTGGTCGTCATTAAATACGTCCCCTACGTCGGCGACTCCAAACGCGCCATGGACGAGTACACGTCCAAGATCCTGCTGCACGGGACCAACACCATCGCCGTCCACAACACCTGCGAGGACTCGCTGCTGGCCACGCCGCTCATCCTCGACCTGCTGCTGCTGGCCGAGCTGTTCACCAGGGTCAGCTTCCGCAGGGACGAGTCGGAAGAGTGGAGCCCCATGCACGCGGTGTTGTCTTCCCTGGCGTACTTGTTGAAGGCGCCCCTGGTCCCGGCCGGAGCGCCCGTGGTCAACGCCCTCTTCAAGCAGCGAGCCAACATAGAGAACCTGCTCCGCGCTTGTCTCTCGCTGCCTCCGCTCCACCACCTGCAGCTGGAACACAAGGTGCCGTTCCTGATGAAGGAGCTGCGTTCGGGCGCCATGTTCGAGTCGCCGCCGAAGAAACAGAAGCTGTCCCACCAGAACGGGGATCACTGA

Protein sequence:

>DPOGS209523-PA
MTTTSNLTVSSPNIKYTDDYIFSQYEYQETLVDTVDNKLVAKPYSTSLSIRTDRRVGKVGVMLVGWGGNNGSTFTAAVIANREKLTWNTKDGEMDSNWYGSITQASTVRLGIDERGMDVYVAMSQLLPMVDPNDLVIDGWDISPMNLADAMVRAKVIDYDLQQKLRKKMQTMKPRPAIYDPDFIAANQADRALNLIRGTRHEQYLQIRADIKDFRDKNNLDKVIVLWTANTERFCEVTPGVHDTAENLQKALQQNVSEVSPSTIFAMAAVDEGQCKAHGSLIPECLKPVSIVSYNHLGNNDGKNLSAPKQFRSKEITKSNVVDDMVEANRLLYSEGEKPDHVVVIKYVPYVGDSKRAMDEYTSKILLHGTNTIAVHNTCEDSLLATPLILDLLLLAELFTRVSFRRDESEEWSPMHAVLSSLAYLLKAPLVPAGAPVVNALFKQRANIENLLRACLSLPPLHHLQLEHKVPFLMKELRSGAMFESPPKKQKLSHQNGDH-