Monarch geneset OGS2.0

DPOGS209233
TranscriptDPOGS209233-TA1680 bp
ProteinDPOGS209233-PA559 aa
Genomic positionDPSCF300204 + 281226-292424
RNAseq coverage8x (Rank: top 86%)
Annotation
HeliconiusHMEL0175130.077.23% 
BombyxBGIBMGA011463-TA3e-13376.07% 
DrosophilaCG5656-PA5e-7637.32% 
EBI UniRef50UniRef50_E0VJV34e-16159.35%Alkaline phosphatase, tissue-nonspecific isozyme, putative n=7 Tax=Neoptera RepID=E0VJV3_PEDHC
NCBI RefSeqXP_973094.14e-17456.13%PREDICTED: similar to tissue-nonspecific alkaline phosphatase [Tribolium castaneum]
NCBI nr blastpgi|910765007e-17356.13%PREDICTED: similar to tissue-nonspecific alkaline phosphatase [Tribolium castaneum]
NCBI nr blastxgi|910765004e-17456.13%PREDICTED: similar to tissue-nonspecific alkaline phosphatase [Tribolium castaneum]
Group
Gene OntologyGO:00081526e-132metabolic process
GO:00038246e-132catalytic activity
GO:00167912.6e-127phosphatase activity
KEGG pathwayxla:3805892e-93 
 K01077 (E3.1.3.1, phoA, phoB)maps-> Two-component system
    Folate biosynthesis
    gamma-Hexachlorocyclohexane degradation
InterPro domain[58-472] IPR0178496e-132Alkaline phosphatase-like, alpha/beta/alpha
[8-463] IPR0019522.6e-127Alkaline phosphatase
[26-473] IPR0178502.1e-114Alkaline-phosphatase-like, core domain
Orthology groupMCL17411 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209233-TA
ATGCTTGCTTGGTGCTGTATCGTGATCATGATGTGCACGATCAGCACTGGAGTGTTATCGCCTGCACCGATAAATGCTGAAGTTAAGGAAACATATGATCGACATTACTGGTATGCAAAAGCACAGGAAACTCTACAAAAACGCTTGCAATACGCATCGGAAAAAAACCCAACATACAATATCGACGCTCAAACTGGAGAGTCATCTGCTTGTGCTACAGCCCTCTTATGTGGAGTAAAGGCGAGATACGAGACTCTAGGTCTTGACTCTGGAGCTCGTTTCAACAACTGTGCTTCAGCAATAAACTCTAAAGTGACTTCTCTCATAGATTGGGCTCACGAAGCTGGTAAATCTAGTGGGATAGTAACAACAGCCCGTATAACACATGCCACACCTGCGGCACTCTATGCCCATGCTCCTTCAAGGTATTGGGAGGATGACAGTCGCGTTCCTCCTACAGTACGACGAGACTGCAAGGACATAGCATTGCAGCTTGTTGAAAATGATCCCGGAAAGAATATAAATGTAATCATGGGTGGTGGACGACGCCATTTTCTACCAACGGTTACGTCAGACCCTGAACATCCCAATAGAGAAGGAAGAAGGCTCGATGGACGAAATTTAGCTGAAGATTGGGCTAGAGAAAAGAAAAGACGACGTTTAAGAGCACAATATATCCACTCAAAGGAGCAATTAGCGAAACTGGATCCAAGAACTGTAGATTATGTTTTAGGATTATTTGAATACTCTCATATGGAGTTTAATGCGGAGCGAGGTGTGGTATTGGATACAGACGAAAATATAGGTTCAAGTTCAACAGTTAAAGCAGATGACCCGTCACTAGCAGACATGACCCGTACTGCATTATCAATTTTGTCTAAAAACGATAAAGGATTTTTTCTGTTATTAGAAGGAGGTAGAATAGACCACGCTCATCACTACAATAATCCTTATCGGGCTCTTGATGAAACTTTGGAATTGGAAACTGCACTATTAGCTGCCTTAGAGAAAGTAAACCCAGCGGAAACCCTAATTGTGGTGACGGCTGATCACGGCCACGTAATGACTTTTGGTGGTCAAGCTACCCCTAGGGGTCATCCAATTTTAGGTGCTGATACGGTAGTTTCCGACATAGATGGCTTGCGATACACGACATTGTTGTATGGAACTGGACCGGGACACTCGGAACCGAGAACGTTGCCATTGAATGGAACTTCAACAACTCCGGCTGATGCAGTGCACGCGGCAGCTGTGCCAAAACAATGGGCTACTCACGGTGGAGAGGACGTACCTATATATGCTTTAGGTCCCATGGCAACGGTATTGTTTGCCGGCGTTGTAGAACAAAGCTACATTCCACATGCCATCGCGTATGCGGCATGTTTAGCTCACCAGTCTCATCGTTGTCATGAAAATATGTTAAACCTTACTAAGCCGGAGGTAAAACTACCAAGTTGCATACAACCCGAAGTGAGTAGTGTATCCGCAGACGAAGCTAGAAGTGAAAGTACTAGCGTTCGAAGAGTCATCGTAGCATCAAGCGTTATGTCAGATGAACGTTTACCAAGGTCATCCACTACACTCTCCTCTCCGCTTAAGATGTCGTACTACCTTATAATTTTGATGTTTCACTCCATTGGATACTCTACTATTAGCGTTTCTATCGCTAGTAATATTTGA

Protein sequence:

>DPOGS209233-PA
MLAWCCIVIMMCTISTGVLSPAPINAEVKETYDRHYWYAKAQETLQKRLQYASEKNPTYNIDAQTGESSACATALLCGVKARYETLGLDSGARFNNCASAINSKVTSLIDWAHEAGKSSGIVTTARITHATPAALYAHAPSRYWEDDSRVPPTVRRDCKDIALQLVENDPGKNINVIMGGGRRHFLPTVTSDPEHPNREGRRLDGRNLAEDWAREKKRRRLRAQYIHSKEQLAKLDPRTVDYVLGLFEYSHMEFNAERGVVLDTDENIGSSSTVKADDPSLADMTRTALSILSKNDKGFFLLLEGGRIDHAHHYNNPYRALDETLELETALLAALEKVNPAETLIVVTADHGHVMTFGGQATPRGHPILGADTVVSDIDGLRYTTLLYGTGPGHSEPRTLPLNGTSTTPADAVHAAAVPKQWATHGGEDVPIYALGPMATVLFAGVVEQSYIPHAIAYAACLAHQSHRCHENMLNLTKPEVKLPSCIQPEVSSVSADEARSESTSVRRVIVASSVMSDERLPRSSTTLSSPLKMSYYLIILMFHSIGYSTISVSIASNI-