Monarch geneset OGS2.0

DPOGS202266
TranscriptDPOGS202266-TA1041 bp
ProteinDPOGS202266-PA346 aa
Genomic positionDPSCF300032 - 498920-500891
RNAseq coverage338x (Rank: top 34%)
Annotation
HeliconiusHMEL0056038e-17482.56% 
BombyxBGIBMGA004919-TA6e-12769.84% 
DrosophilaCG7789-PA1e-9451.61% 
EBI UniRef50UniRef50_Q9VAG92e-9251.61%CG7789 n=14 Tax=Metazoa RepID=Q9VAG9_DROME
NCBI RefSeqXP_975068.15e-10255.81%PREDICTED: similar to AGAP004654-PA [Tribolium castaneum]
NCBI nr blastpgi|3123748544e-10155.85%hypothetical protein AND_15426 [Anopheles darlingi]
NCBI nr blastxgi|910871013e-9955.81%PREDICTED: similar to AGAP004654-PA [Tribolium castaneum]
Group
Gene OntologyGO:00044378.5e-137inositol or phosphatidylinositol phosphatase activity
KEGG pathwaytca:6639481e-101 
 K01082 (E3.1.3.7)maps-> Sulfur metabolism
InterPro domain[1-344] IPR0007608.5e-137Inositol monophosphatase
Orthology groupMCL13502 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202266-TA
ATGTCATCAACGGCACCCTTAATTATAAGGTTATTAGCTTCGTCGGTGTCTGTGGCAGGTCGAGCTGGCAAAATTGTGAGAGATGTTATGAGCAGAGGGGAACTAGGAATAGTTGAAAAGGGCAAAGACGACTATCAGACTGAAGCTGATAGATCAGCACAGCGATGTATCATATCATCCTTATCATCACTGTATCCTAAAGTCAATATTATTGGAGAGGAAGGAAATCCTGACTCAGAGGGTGAAATCCCAAGTGAATGGCTTCAGATTGAGGCAGACAAGGAAGTTATGTTATTGGAATGTCCTTCAAGTTTGCAAGGTGTTAAAGAAGAGGATATTGTTGTATGGGTCGACCCTCTGGACGGGACTTCCGAGTATACTCAAGGTGCGTTAATGTTGAAGCGGGATCCTTGGGAAAGATGGAAAATGTATTTAACACATCCACTGATAAGTATCAAATGGTATTTAAGTTTACGACACTCAAACTATGGTTTCCTCGAGCATGTTACTGTGCTTATCGGTATATCAGTCAATGAGAAGCCAGTGGCCGGTGTTATACATCAGCCATATTTCAAAACTATGGTCGGGGAGGAGAAGCAAATGGGTAGAACGATTTGGGGCCTCCAAGGAGTTGGTGTCGGAGGCTACATACCAGCTCCTCCACCAGATTCGCTTGTCATCACGACAACAAGAAGTCATTCAAACCCAGTTGTAGAGAAAGCTTTGCAAGTCATGAATGCAGCACAAGTTTTGAGGGTTGGCGGTGCTGGTTATAAGGTGTTACAGTTACTAGAGGGTAAAGCATCAATATATGTATTTGCAAGTCCCGGATGCAAAAAGTGGGATACATGCGCTCCTGAGGCCGTACTTAGTGCTGCTGGCGGGAAATTGACAGATATTTTGGGGGATTTCTACAAATATGGAGCATCGGTCACCCATCCTAACAAGACTGGCGTGTTGGCTGCGGTTAACGATGAGCTTCACAACTATGCATTGAGTCGGATACCACAAGAATTAAAAGACAAACTATCCGGTAATTAA

Protein sequence:

>DPOGS202266-PA
MSSTAPLIIRLLASSVSVAGRAGKIVRDVMSRGELGIVEKGKDDYQTEADRSAQRCIISSLSSLYPKVNIIGEEGNPDSEGEIPSEWLQIEADKEVMLLECPSSLQGVKEEDIVVWVDPLDGTSEYTQGALMLKRDPWERWKMYLTHPLISIKWYLSLRHSNYGFLEHVTVLIGISVNEKPVAGVIHQPYFKTMVGEEKQMGRTIWGLQGVGVGGYIPAPPPDSLVITTTRSHSNPVVEKALQVMNAAQVLRVGGAGYKVLQLLEGKASIYVFASPGCKKWDTCAPEAVLSAAGGKLTDILGDFYKYGASVTHPNKTGVLAAVNDELHNYALSRIPQELKDKLSGN-