Monarch geneset OGS2.0

DPOGS210764
TranscriptDPOGS210764-TA1188 bp
ProteinDPOGS210764-PA395 aa
Genomic positionDPSCF300312 - 104337-105524
RNAseq coverage2x (Rank: top 92%)
Annotation
HeliconiusHMEL0142193e-10549.74% 
BombyxBGIBMGA011201-TA3e-7140.92% 
Drosophila% 
EBI UniRef50UniRef50_P199263e-8340.05%Glucose-1-phosphatase n=356 Tax=Enterobacteriaceae RepID=AGP_ECOLI
NCBI RefSeq%
NCBI nr blastpgi|2389198424e-8640.34%unnamed protein product [Edwardsiella ictaluri 93-146]
NCBI nr blastxgi|2389198424e-8140.89%unnamed protein product [Edwardsiella ictaluri 93-146]
Group
Gene OntologyGO:00039936.5e-25acid phosphatase activity
KEGG pathwayeic:NT01EI_19467e-87 
 K01085 (agp)maps-> Glycolysis / Gluconeogenesis
InterPro domain[18-335] IPR0005606.5e-25Histidine phosphatase superfamily, clade-2
Orthology groupMCL16722 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210764-TA
ATGATTGTCAGAGTTTTTTGTTTGAATCTGATTGCTGTTGTTGCTTGTTATGAATTAAAACAAGTGCTTATATTGAGCAGGCATAATATAAGAAGTCCTTTAACCGAAAAATTGAAAAACTTTTCTCATAATCCTTGGCCGGAATGGAATGTTAGTGATCTATATTTGACCGAAAAAGGTGCCTTGATGGAAGAATACATGGGAGGATATATTTATAACTGGCTTGTTATGGAGAAGCTTCTTGTGGACGGTTGTCCAAAAGAAAGTTCTGTACATATTTATGCAAACACTAAGCAGAGGTGTCGGTCGTCTGCGAAAGCTTTTGTTCGTGGTGCCTTTGACAAGTGCAATATACGTGTATTCAGCATGTATCCCGATGATATGGATCCCATTTTTAATCCCATTCTTCGCAATGATTCTGAAATAGTCAAAGAGCCTATTCTGAAAGAAATGGAAAATAAATTAAGGAAACTGGATTTTAGTGATTCTTATTTGGTTCTGGAAAATATATTAGATTTGAAAAATTCTGTTATATGTCGAATCGGAAACCTGTGTCACTTTTGTGACGATGATTATAGCATTATTTATAATGTTGGAGAGGTGCCACACATAATTGGTCATTTTTCATGGGCATATCTAATAGTTGATTCTTTTCTTATGAGTTACTATGATGGACATGCTATGGAAGATGTAGCTTGGGGAAGGATTAAAGATTCAGCACAGTGGAAGACATTGACACGTATCATAAACGAAAATCTAAATATTTGTTTTAACAGTAAATTATTGGGAAGACAAGTCGCGAAACCTCTTCTTAAATATATATCGTCAGTGGTGACTCGTGAAAAACCAATAAAATTCACTTTACTACATGGTCATGATGCAAATTTATATTCTGTTTTAGCAGCACTGGACGTTGAGGATTTTCTGTTGCCAGAGCAATATGAAATCATACCAATAGGTGGGAAGTTGGTGTTCCAGAGATGGTACGACGCTACACAGGATAGAGATCTGTTTAAATTGGATTTTGTATATTTGACCGTAGATCAGATAAGAGATGGATTCAAACTATCAGCTAGTAATCCGCCTCGACGGGTGCAGATTTTCGTAAAAGATTGCCCTGTAGACTTAGATGGGTTCTGTTCTTGGGAAGAGTTTGTTAAAGTATTAAATGATGCAGCTAGTTTTTAA

Protein sequence:

>DPOGS210764-PA
MIVRVFCLNLIAVVACYELKQVLILSRHNIRSPLTEKLKNFSHNPWPEWNVSDLYLTEKGALMEEYMGGYIYNWLVMEKLLVDGCPKESSVHIYANTKQRCRSSAKAFVRGAFDKCNIRVFSMYPDDMDPIFNPILRNDSEIVKEPILKEMENKLRKLDFSDSYLVLENILDLKNSVICRIGNLCHFCDDDYSIIYNVGEVPHIIGHFSWAYLIVDSFLMSYYDGHAMEDVAWGRIKDSAQWKTLTRIINENLNICFNSKLLGRQVAKPLLKYISSVVTREKPIKFTLLHGHDANLYSVLAALDVEDFLLPEQYEIIPIGGKLVFQRWYDATQDRDLFKLDFVYLTVDQIRDGFKLSASNPPRRVQIFVKDCPVDLDGFCSWEEFVKVLNDAASF-