Monarch geneset OGS2.0

DPOGS215432
Transcript: DPOGS215432-TA (819 bp)
Protein: DPOGS215432-PA (272 aa)
Genomic position: DPSCF300436 + 93204-99785
RNAseq coverage: 110x (Rank: top 59%)
Annotation
Heliconius       HMEL010183         1e-90   68.56%
Bombyx           BGIBMGA008401-TA   1e-84   70.59%
Drosophila       CG16771-PA         6e-47   50.26%
EBI UniRef50     UniRef50_G9FJB7    3e-59   54.81%   Alkaline phosphatase 3 n=1 Tax=Aphis glycines RepID=G9FJB7_APHGL
NCBI RefSeq      XP_001600930.1     1e-59   55.32%   PREDICTED: similar to CG16771-PA, partial [Nasonia vitripennis]
NCBI nr blastp   gi|350397705       6e-67   57.55%   PREDICTED: alkaline phosphatase, tissue-nonspecific isozyme-like [Bombus impatiens]
NCBI nr blastx   gi|350397705       8e-65   56.94%   PREDICTED: alkaline phosphatase, tissue-nonspecific isozyme-like [Bombus impatiens]
Group
Gene Ontology    GO:0008152   1.2e-68   metabolic process
                 GO:0003824   1.2e-68   catalytic activity
                 GO:0016791   5.6e-46   phosphatase activity
KEGG pathway     ame:410530   4e-59
                 K01077 (E3.1.3.1, phoA, phoB) maps to:
                     Two-component system
                     Folate biosynthesis
                     gamma-Hexachlorocyclohexane degradation
InterPro domain  [22-205]   IPR017849   1.2e-68   Alkaline phosphatase-like, alpha/beta/alpha
                 [17-201]   IPR017850   6.4e-57   Alkaline-phosphatase-like, core domain
                 [49-203]   IPR001952   5.6e-46   Alkaline phosphatase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215432-TA
ATGATAAGAAGCGTGGTGGTATTGTTATTGATATCAAATATTGTGTCAGTGCGAAATACATTACGAAAAGATCCAGCCTATTGGAACAAACTAGCTGCCGATGAACTAGATGAAGCATTAAAATTAAAATGGAATCAAAACACTGCGAGGAATGTGATACTGTTCATTGGGGATGGGATGGGTCCGAACACTGTCACCGCTACGAGGATATACAAGGGAGGCGAAACCCATAAGCTTTCATACGAGAAGTTTCCACACATTGGATTGCTAAAAACATATTCCGCAAACAAAATGGTACCGGACTCGGCCAGCACAGCCACCGCTTTGCTCTGTGGTGTCAAAGTGAACCAGAACACAATAGGAGTAGATGCTTCAGTGAACCACAACGACTGCGTAGCGTCCCTCGATAATAGCACTAAACTGGAATCGCTGGCGACTTTAGCGCTGAAAGCAGGAAAAAGTGCTGGTTTTGTGACAACAATGCGTGTGACCCACGGTACACCTGGACCGATGTACGCACACAGCGCGGCTCGAGAATGGGAGTGTGACGCGAATATGGAGGATGAAGCGAAGGCCTGCAAGGATATAGCAAGACAACTGGTGGAAGACTGGCCGGGAAGAGATCTTCAGGGTTCCGTGACGCCGCCCGGCCCGCCCGGCGGCTCCGGCGGGTTCTGGTGTTGCAGGCGTTGCAAGTCACTCTCCCTTGAGCCGCCTGCTGAACTCCTCCTGCTGCAGCTCGCGCCACTCCTTCTTGGCCTTCATCCTCTTCAGCTGTCCGTCCCTGTAAGACGAGGTGTTAGTTCCGCCCCCGCCTAG

Protein sequence:

>DPOGS215432-PA
MIRSVVVLLLISNIVSVRNTLRKDPAYWNKLAADELDEALKLKWNQNTARNVILFIGDGMGPNTVTATRIYKGGETHKLSYEKFPHIGLLKTYSANKMVPDSASTATALLCGVKVNQNTIGVDASVNHNDCVASLDNSTKLESLATLALKAGKSAGFVTTMRVTHGTPGPMYAHSAAREWECDANMEDEAKACKDIARQLVEDWPGRDLQGSVTPPGPPGGSGGFWCCRRCKSLSLEPPAELLLLQLAPLLLGLHPLQLSVPVRRGVSSAPA-