Monarch geneset OGS2.0

DPOGS211688
Transcript: DPOGS211688-TA | 1599 bp
Protein: DPOGS211688-PA | 532 aa
Genomic position: DPSCF300374 - 191533-198661
RNAseq coverage: 56x (Rank: top 69%)
Annotation
Heliconius: HMEL014219 | 2e-96 | 46.30%
Bombyx: BGIBMGA011204-TA | 2e-63 | 36.73%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_P19926 | 2e-76 | 37.50% | Glucose-1-phosphatase n=356 Tax=Enterobacteriaceae RepID=AGP_ECOLI
NCBI RefSeq: XP_969328.1 | 2e-34 | 45.41% | PREDICTED: similar to Parcxpwfx02 [Tribolium castaneum]
NCBI nr blastp: gi|157371403 | 4e-80 | 38.58% | glucose-1-phosphatase/inositol phosphatase [Serratia proteamaculans 568]
NCBI nr blastx: gi|183598460 | 3e-75 | 38.50% | hypothetical protein PROSTU_01855 [Providencia stuartii ATCC 25827]
Group
Gene Ontology: GO:0003993 | 1.4e-31 | acid phosphatase activity
KEGG pathway: spe:Spro_3164 | 6e-81
 K01085 (agp) maps-> Glycolysis / Gluconeogenesis
InterPro domain: [158-472] IPR000560 | 1.4e-31 | Histidine phosphatase superfamily, clade-2
Orthology group: MCL16722 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211688-TA
ATGCTTCCCCGTTCATATTACACGGAACAAGACATAAGGAAGCGGCAGATTGATACATTAAAAATATTTAATGACAATGTTACTGAGATTGTTGAAAATACGGAATACAAAGTCGATTTTTCCGCTAATGGACACAACTTAAATCTTACAGTGATTCTTAACTCTGAATTTCCTAATGAGAAGCCTAACATATTCGTAAGCCCTGTCTTTCCTCACCCGTGGCTTGCTGAAAACTCAAACCAAGTGATTGGTGCACCAGGATTGGTGAACTATAGTCCACATTCTGATTTAGGACGGGTTGTTCAAGTTATTATACGCGAATTTCAGCGTTCAGCACCTAATATATACGGTCACGAAGATAAGTCCACAGACACCAGCCCAATGTCGCACTATAGCAACCAATCGCTGATGTTTCCAGAGTTGAATGAACTATCTATTGACGAATTACAGGAGATCATTGAGAATCCTGATCTACAGCAAGTGGTTATATTGAGCAGGCACAACATAAGGAGCCCTTTGGCGAGTTTTTTGAAGAAGTTCTCGCCTCATCCTTGGCCGGAATGGAATATAAGTGTTGGTTATTTGACAGAAAAAGGTGCTACTATGGAAGAAGACATGGGTGAATATATGTCCACTTGGTTGTGCACTGAGCTCTTCAAAGACAGCTGTCCCGAGGAGAGCTCCTTGCAAATATTCTCAAATTCTACTCAGAGAACTTACGAATCATCGAAAGCGTTTATTCGTGGTACTTTCAAAAATTGCAATAAAGTTTTAAGAGTTGGATCTGAGGAAATGGCGTCGTTGTTTGAAACTGTTGTCCGCAATGATTCAAAAGTGATGAAGGACCTTGTTCTTAACGAAATGAATACGAAAATAATGGAATTGGATACAAAAGAATCTTATAATTTATTGGAAGACATATTGGATATGAAAAATGCTGAAGTGTGCAAAATCGAGGGCATATGCAACTTTGATAAAGAAGACAGCGAAATTACATATGAATTCGGTAATTTGCTGAACGTCGAGGGCTCCTTGCTGTGGGCGAACCTGATAGTCGATTCGTTTCTTATGAGCTACTACGACGGATTTCAAATAGAAAACGTAGCTTGGGGAATGATCAAAGATTCTGGACAGTGGCGGACGCTCACAAGACTGATGATACAGTATCAGCACGTTGTTTTTAACAGTAAGTTAGTAGGGAGACAAGTGTCAAAACCTCTCCTTAGCTATATATCGTCTAAGTTTACGGCGGAAACAGAAAAAAAATTCATTTCGCTTCATGCCCATGACGCAAATTTATATTTTGTTCTGGCGGCACTGGAAGTTGAGGAGTTTGTGTTGCCAGAGCAATATGAAAGGACACCGATAGGCGGGAAGTTGGTGTTCCAGAGATGGCACGACGCTACACAGGGTAGAGATCTGTTTAAATTGAATTTTGTGTATTTAACCGTAGATCAGATAAGAGATGGGTCCAAACTATCAGCTAGTAATCCCCCACGATGGGTGCAGCTGTTTTTCAAGGATTGTCCCGTAGACTCAGACGGGTTCTGTTCTTGGGAAGATTTTGTTAATGTTCTAAATGATGCAGCCAGTTTTTAA

Protein sequence:

>DPOGS211688-PA
MLPRSYYTEQDIRKRQIDTLKIFNDNVTEIVENTEYKVDFSANGHNLNLTVILNSEFPNEKPNIFVSPVFPHPWLAENSNQVIGAPGLVNYSPHSDLGRVVQVIIREFQRSAPNIYGHEDKSTDTSPMSHYSNQSLMFPELNELSIDELQEIIENPDLQQVVILSRHNIRSPLASFLKKFSPHPWPEWNISVGYLTEKGATMEEDMGEYMSTWLCTELFKDSCPEESSLQIFSNSTQRTYESSKAFIRGTFKNCNKVLRVGSEEMASLFETVVRNDSKVMKDLVLNEMNTKIMELDTKESYNLLEDILDMKNAEVCKIEGICNFDKEDSEITYEFGNLLNVEGSLLWANLIVDSFLMSYYDGFQIENVAWGMIKDSGQWRTLTRLMIQYQHVVFNSKLVGRQVSKPLLSYISSKFTAETEKKFISLHAHDANLYFVLAALEVEEFVLPEQYERTPIGGKLVFQRWHDATQGRDLFKLNFVYLTVDQIRDGSKLSASNPPRWVQLFFKDCPVDSDGFCSWEDFVNVLNDAASF-
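
Consistency check (illustrative): the transcript and protein records above should agree, since 532 codons plus one stop codon give 3 x 533 = 1599 bp. The Python sketch below shows one way to confirm this by translating the nucleotide sequence and comparing it with the protein sequence. It assumes the standard genetic code and assumes that the trailing "-" in the protein record marks the stop codon. The names `translate` and `CODON_TABLE` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the protein record). Codons containing ambiguous bases
    are written as "X".
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons and last four codons of DPOGS211688-TA.
print(translate("ATGCTTCCCCGTTCATATTACACG"))  # MLPRSYYT
print(translate("GCCAGTTTTTAA"))              # ASF-
```

Applied to the full DPOGS211688-TA sequence, the same call can be compared directly with the DPOGS211688-PA string to check the two records against each other.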