Monarch geneset OGS2.0

DPOGS200817
Transcript         DPOGS200817-TA    1719 bp
Protein            DPOGS200817-PA    572 aa
Genomic position   DPSCF300071 - 821962-833227
RNAseq coverage    423x (Rank: top 29%)
Annotation
Heliconius         HMEL013037         0.0      75.36%
Bombyx             BGIBMGA009872-TA   0.0      76.07%
Drosophila         CG5150-PA          1e-124   48.77%
EBI UniRef50       UniRef50_G3GBT4    8e-138   50.30%   Alkaline phosphatase n=2 Tax=Obtectomera RepID=G3GBT4_TRINI
NCBI RefSeq        NP_001036856.2     2e-140   53.22%   soluble alkaline phosphatase [Bombyx mori]
NCBI nr blastp     gi|255683283       3e-139   53.22%   alkaline phosphatase [Bombyx mori]
NCBI nr blastx     gi|255683283       4e-136   53.01%   alkaline phosphatase [Bombyx mori]
Group
Gene Ontology      GO:0008152   1.1e-165   metabolic process
                   GO:0003824   1.1e-165   catalytic activity
                   GO:0016791   8.4e-161   phosphatase activity
KEGG pathway       aag:AaeL_AAEL009077   5e-124
                   K01077 (E3.1.3.1, phoA, phoB) maps-> Two-component system
                                                        Folate biosynthesis
                                                        gamma-Hexachlorocyclohexane degradation
InterPro domain    [58-534]   IPR017849   1.1e-165   Alkaline phosphatase-like, alpha/beta/alpha
                   [88-530]   IPR001952   8.4e-161   Alkaline phosphatase
                   [55-540]   IPR017850   3.5e-143   Alkaline-phosphatase-like, core domain
Orthology group    MCL24814   Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200817-TA
ATGACTGTAGTGCAGTTGGTTAAGACGCGGGATCGCGGTCGAACGGCTATGATGGTGCGCTTGACATTTGCTGTTCTGTTCGTAATAACGTGCGCCGGCGCAGAAGATGTCGTCTCCACACCGAAGCCGAAAAAAGTTCTGGACACTGTTATGAATCCAGCGTACATACCTCTGGAAGAGAAGCATGGATCGTTCTGGAAGAAATCTGCTTCGGAAGCCTTGAAATCGAAATTAAAAGAAACGGTTAACACTAACAAAGCTAAAAATGGTATACTCTTCATCGGTGATGGAATGTCTATTGCGACTATCATGGCAGCGAGAACACTGGCCGGCCAAGTCGAGAGGGGTCTAGGTGAAGATAATGTGCTCGCCTTTGAGAAGTTCCCGATAGCTGGACTGGCGAGGACATATTGCATTGATGCACAAGTCGCGGATTCGGCGTGCACCGCCACTTCCTACCTCTCTGGTGTGAAAACCAAATACGGAGTCATAGGCTTAGATGGCAACGCGACCAGGGGTTCCTGCTTATCCCAGTTGCATAAAGCGAACTGGTCTCCTTCTATAGGCCAGTGGGCTTTGGAGAACGGATTGGATGTTGGTTTGGTAACCACCACAAGAGTTACTCACGCCTCACCGGCTGGTATGTACGCCCATACATCCGAGCGGAACTGGGAGTCAGATGCTGATGTACCCGAAGAATGCCTCAGTTTAGGATGCCAGGATATCGCCTACCAGCTCGTCACTGGCAACCCTGGGCGACATTTCAAGGTCATCATGGGAGGAGGTCGTCGTGAATTTTTACCAAATGTTACATCTCCATTGACGAATTCGACAGGCAGAAGACGAGATGGAGTGGATCTCACAGAGCTGTGGCATCAGGATAAATTGGAAAGAAACGCAACACATCAATATGTCACCGAGAGGAATGAGCTAATGAAAGTCTTCGAATCAGATGATTTGCCGGAATACCTACTAGGATTGTTCCAAGATGATCATATGGATTACCACTTACAAGCTAAGGACCAACCAACCCTCGAAGAGATGGTGGAAGTCGCCATTAAAGTGCTATCAAGGAGCTCAAAGGGATACTTTTTGTTTGTTGAAGGTGGAAGGATCGACCACGCCCATCACGATAGCTATGCCTACTTGGCCCTAGACGAAACCATAGAATATTCGAAGGCAGTACAAAAAGCAAAATCCCTAACAAATGAAACGGACACATTGATAGTTGTTAGCTCAGATCACGCACACACTATGACAGTAGCTGGTTACCCCTCCAGGGGTAACGATATTCTGGGTCTTGTGGATGCAGCTCATGGGTCAGATGGCAAGCCATACACAACAATAAGTTACGCCAACGGTAAAGCCACGTCTATCACTGATAAAGGAAGAGTAGATTTGTCTTTGGAAAGCGATTTCAAGAACAGGAAATTGGACTATTCTTATCCAAGTTTGGTTCCGTTGGACTCCGAAACCCACGGGGGTGAGGATGTTGCTGTTTTCGCGAGTGGACCCTGGCAGCATCTGTTCACATCCAGCTACGAGCAGTCCGCTATACCACATTTCATGAGTTTTGCCATGTGCCTCAACGATATAAAACATGAACAATGCAAACGGCATAGAACTTTATGGACGTCTAGTAGTACCATGAATAAACCAATCAAGTATTATATCTTAAGTTTATTAGCTGTGATTTTTATAAGTATATCCTATTTCTAG

Protein sequence:

>DPOGS200817-PA
MTVVQLVKTRDRGRTAMMVRLTFAVLFVITCAGAEDVVSTPKPKKVLDTVMNPAYIPLEEKHGSFWKKSASEALKSKLKETVNTNKAKNGILFIGDGMSIATIMAARTLAGQVERGLGEDNVLAFEKFPIAGLARTYCIDAQVADSACTATSYLSGVKTKYGVIGLDGNATRGSCLSQLHKANWSPSIGQWALENGLDVGLVTTTRVTHASPAGMYAHTSERNWESDADVPEECLSLGCQDIAYQLVTGNPGRHFKVIMGGGRREFLPNVTSPLTNSTGRRRDGVDLTELWHQDKLERNATHQYVTERNELMKVFESDDLPEYLLGLFQDDHMDYHLQAKDQPTLEEMVEVAIKVLSRSSKGYFLFVEGGRIDHAHHDSYAYLALDETIEYSKAVQKAKSLTNETDTLIVVSSDHAHTMTVAGYPSRGNDILGLVDAAHGSDGKPYTTISYANGKATSITDKGRVDLSLESDFKNRKLDYSYPSLVPLDSETHGGEDVAVFASGPWQHLFTSSYEQSAIPHFMSFAMCLNDIKHEQCKRHRTLWTSSSTMNKPIKYYILSLLAVIFISISYF-
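
The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database. It assumes that the 1719 bp transcript is entirely coding sequence ending in a stop codon (the nucleotide sequence starts with ATG and ends with TAG, and the trailing "-" in the protein sequence is taken to mark the stop), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases.

    Assumption: the sequence is a complete open reading frame, so its
    length must be a multiple of three.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon is counted in the nucleotide length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


def genomic_span(start: int, end: int) -> int:
    """Return the span in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
print(expected_protein_length(1719))   # 572, matching the listed 572 aa
print(genomic_span(821962, 833227))    # 11266 bp on scaffold DPSCF300071
```

Under these assumptions the transcript and protein lengths agree (573 codons, one of them the stop). The genomic span is several times longer than the transcript, which would be consistent with the gene containing introns, although the record itself does not list the exon structure.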