Monarch geneset OGS2.0

DPOGS205806
TranscriptDPOGS205806-TA1164 bp
ProteinDPOGS205806-PA387 aa
Genomic positionDPSCF300144 + 288176-294811
RNAseq coverage109x (Rank: top 59%)
Annotation
HeliconiusHMEL0122112e-10954.80% 
BombyxBGIBMGA010352-TA2e-7637.53% 
DrosophilaCG6465-PA5e-6734.01% 
EBI UniRef50UniRef50_E2BIJ94e-6335.46%Aminoacylase-1A n=3 Tax=Harpegnathos saltator RepID=E2BIJ9_HARSA
NCBI RefSeqXP_001952931.16e-7537.03%GF17518 [Drosophila ananassae]
NCBI nr blastpgi|2897415314e-7438.64%aminoacylase-1 [Glossina morsitans morsitans]
NCBI nr blastxgi|1947409063e-7037.31%GF17518 [Drosophila ananassae]
Group
Gene OntologyGO:00167871.5e-26hydrolase activity
GO:00081521.5e-26metabolic process
KEGG pathwayame:4089696e-69 
 K01436 (E3.5.1.14)maps-> Arginine and proline metabolism
InterPro domain[96-378] IPR0029331.5e-26Peptidase M20
Orthology groupMCL23625 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205806-TA
ATGCTTGGCTTTGTGTACCATTTATATGTAATGGTGAGCGTGGTTTTGTGCAATCCTATACATTATAATTATACATTAAAAGATTTCAATAATAATCCCGCTGTCAAGAAATTACAGGAATATATAACGATAGACTCGAGCCGTGTAGAAAATATCGAATTAGTAGTTGACTTCTGGAAGAGGCAAGCAGCGGATGTCGGCCTGTCTTTTGCGGTGTATAGACCAGCTGTGTTGCCAATATGTGTACTTACCTTAATAGGTCGTCAGCCGGACCTGCCTAGCATTATGCTAAATCATCACGGAGATGTGGTCCCAGCATACCACAGCATGTGGAAGTATCCTCCATATTCGGCACATATTGATGAAAACGGCGATTTATACGGACGAGGGGCCCAAGACACTAAAAGTGTTGGAATACAATATATAGAAGCTGTTAGAAGACTAATAAAAAATAACGTAACATTAGAAAGGACGTTACATCTTACTGTTATGCCAGATGAGGAATACGGCGGTAGCAAAGGTATCAAAGCTTTTATTTTGACGGATGTTTTTAAATCATTAAACATTGGATTTGCATTAGATGAAGGATTTACATCTGAAGATGACGTGATGCTCGCGTCTTACCAGGATAAGAGACCAGTTCAGGTGCGATTCAATATTATCGGTCAAGGTGGTCATGGTTCATCATTGGTGAACGGAAGTGCTATAGAAAAGGGAGGCATCGCCCCGAATGTTATTCCTAAAAATATTAGTGTAGTTATGGATATCAGGTTGGCGACATCAGTGAATGCCGCAGACGTACAAGCTATGTTGGATTCCTGGTTATCAAACCTCGGTGATGATTCTACTATGGAGTTTATCAGACTGGACGAGCGTTCACCAGCGACAGCAGTCGATAGCACCAATCCATTCTGGATAGCTATGAAGGATACATTGAATAATCGTGGCATCACAGTCACGCCTGTAGTGTTACCAGCTACATCAGACATGCTGGTATTGCGAGAGAAATTTTCAATACCAGCCATTGGATTCGCTCCCAGAAATAATATGAAAAATAAAATTCACGACGCCAACGAATACATACCTGTTAATAATTTCCTCAAAGGAATCGATATTTATTATGATTTAATACAAAAACTTGCCAACTTATCTCAAAATAGTTGA

Protein sequence:

>DPOGS205806-PA
MLGFVYHLYVMVSVVLCNPIHYNYTLKDFNNNPAVKKLQEYITIDSSRVENIELVVDFWKRQAADVGLSFAVYRPAVLPICVLTLIGRQPDLPSIMLNHHGDVVPAYHSMWKYPPYSAHIDENGDLYGRGAQDTKSVGIQYIEAVRRLIKNNVTLERTLHLTVMPDEEYGGSKGIKAFILTDVFKSLNIGFALDEGFTSEDDVMLASYQDKRPVQVRFNIIGQGGHGSSLVNGSAIEKGGIAPNVIPKNISVVMDIRLATSVNAADVQAMLDSWLSNLGDDSTMEFIRLDERSPATAVDSTNPFWIAMKDTLNNRGITVTPVVLPATSDMLVLREKFSIPAIGFAPRNNMKNKIHDANEYIPVNNFLKGIDIYYDLIQKLANLSQNS-