Monarch geneset OGS2.0

DPOGS210009
TranscriptDPOGS210009-TA933 bp
ProteinDPOGS210009-PA310 aa
Genomic positionDPSCF300327 + 1321-3650
RNAseq coverage157x (Rank: top 52%)
Annotation
HeliconiusHMEL0101832e-12666.45% 
BombyxBGIBMGA008401-TA2e-11662.58% 
DrosophilaCG16771-PA3e-7246.90% 
EBI UniRef50UniRef50_E0VGW06e-8250.52%Alkaline phosphatase n=4 Tax=Pediculus humanus corporis RepID=E0VGW0_PEDHC
NCBI RefSeqXP_002425354.11e-8250.52%Alkaline phosphatase, tissue-nonspecific isozyme precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3071837487e-8447.74%Alkaline phosphatase, tissue-nonspecific isozyme [Camponotus floridanus]
NCBI nr blastxgi|3071837488e-8348.21%Alkaline phosphatase, tissue-nonspecific isozyme [Camponotus floridanus]
Group
Gene OntologyGO:00081528e-82metabolic process
GO:00038248e-82catalytic activity
GO:00167912.1e-66phosphatase activity
KEGG pathwayame:4105308e-76 
 K01077 (E3.1.3.1, phoA, phoB)maps-> Two-component system
    Folate biosynthesis
    gamma-Hexachlorocyclohexane degradation
InterPro domain[1-289] IPR0178498e-82Alkaline phosphatase-like, alpha/beta/alpha
[1-294] IPR0178501.5e-71Alkaline-phosphatase-like, core domain
[1-283] IPR0019522.1e-66Alkaline phosphatase
Orthology groupMCL17730 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210009-TA
ATGGGCGGTGGAAGGCAGAGTTTGATGCAGAATGTCACCGAAACCTCCTCAGACCCCATGAACAGCTGGACCTGTTCCAGAAGAGATGGACGTGACCTCATCAAGGTATACAAAAAGGACAAAGAAGATAGGAAACTTAAATACAGTGTTCTATCCAATAACAGAGACCTGAAAAACCTGGACGTGGCCGAGACTGATTACGTGTTAGGAATATTCGCAAACGAGCACTTGCGATATGAAAGCCAGAGAGACAAAGGTCCGGAGGGAATGCCATCTATCAGCGACATGGTGGAAGCGGCTATAAAAGTATTACGGAAGAATAACAATGGATACTTTTTAATGGTGGAGGGCGGTAACATAGATATGGCACACCACAGAGGACGCGCAAAGCTAGCTGTCAACGAGTCGTCCGCCATGGATGAAGCTGTCAAGAAAGCTTTGGAAATTACAAATGAAGAAGAAACGTTGATAGTTGTTACAAGTGATCACTCACACACTTTGACGATCAACGGCTACCAGGACAGAGGCGGCAATATATTTGGTACAACAGGCCCATCCAAATACGACGGTCTTAACTACACGGTTATCTCTTACGGCACGGGCGGGCCGGGTTCATTCAAACATTCAATGACAACCATCGACAATGTCACTCGCGTCGTCAGAAGAGATCCGTCAGCTGTCAATACGGATGACATGCTGTACGAGCAGATCGCGGCCATAACGTTGGAAGAGAACAAACACGGCGGAAATGACGTCACAGTTTACGCTAAAGGTCCATTTTCTCACCTCTTCCACAACGTTCACGAGCAACATTACGTATTCCACGCCATATCCTACGCGGCCAAGCTCGGGGTATATTCGTCGGGTGAAAGTATAAGACACAATGTTGCCATAATAGCTGTTGTATTATTACCATTACTGCAATTGTTGTAA

Protein sequence:

>DPOGS210009-PA
MGGGRQSLMQNVTETSSDPMNSWTCSRRDGRDLIKVYKKDKEDRKLKYSVLSNNRDLKNLDVAETDYVLGIFANEHLRYESQRDKGPEGMPSISDMVEAAIKVLRKNNNGYFLMVEGGNIDMAHHRGRAKLAVNESSAMDEAVKKALEITNEEETLIVVTSDHSHTLTINGYQDRGGNIFGTTGPSKYDGLNYTVISYGTGGPGSFKHSMTTIDNVTRVVRRDPSAVNTDDMLYEQIAAITLEENKHGGNDVTVYAKGPFSHLFHNVHEQHYVFHAISYAAKLGVYSSGESIRHNVAIIAVVLLPLLQLL-