Monarch geneset OGS2.0

DPOGS210490
TranscriptDPOGS210490-TA870 bp
ProteinDPOGS210490-PA289 aa
Genomic positionDPSCF300186 - 258311-260445
RNAseq coverage72x (Rank: top 66%)
Annotation
HeliconiusHMEL0163426e-9672.48% 
BombyxBGIBMGA012581-TA3e-8261.60% 
DrosophilaCG32147-PA2e-3238.64% 
EBI UniRef50UniRef50_E1ZWM63e-4047.59%Pyroglutamyl-peptidase 1 n=3 Tax=Formicidae RepID=E1ZWM6_CAMFO
NCBI RefSeqXP_001121977.11e-4044.86%PREDICTED: similar to Pyroglutamyl-peptidase 1 (Pyroglutamyl-peptidase I) (Pyrrolidone-carboxylate peptidase) (5-oxoprolyl-peptidase) (PGP-I) [Apis mellifera]
NCBI nr blastpgi|3504193226e-4245.64%PREDICTED: pyroglutamyl-peptidase 1-like [Bombus impatiens]
NCBI nr blastxgi|3504193225e-4145.64%PREDICTED: pyroglutamyl-peptidase 1-like [Bombus impatiens]
Group
Gene OntologyGO:00065083.7e-21proteolysis
KEGG pathway 
InterPro domain[8-208] IPR0161257.3e-51Peptidase C15, pyroglutamyl peptidase I-like
[12-187] IPR0008163.7e-21Peptidase C15, pyroglutamyl peptidase I
Orthology groupMCL11985 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210490-TA
ATGTCCGACCTGGATTTAATGTTCAAACCTATAGTGCTCGTTACTGGGTTTGGACCATTTTTAAATCATCCTTTGAACGCTAGTTGGGAAGCTGTGAAGTTAATGAATAAAGAATCTATTGAGAAGAAACATAACGTCGAGCTAGTTCAGTTAGAAATTCCAGTAACTTACGAAAATGTTGACGAATTTGTGCCTGCTCTTTGGGATACCCACGAACCTAAATTAATGATCCACGTAGGTGTGTCAGGCATAGCCGACTCTATCACTCTGGAATGCCAAGCCCATAGGAAGGGATATCAAAGGCTTGACTACTTCGACAAATGTCCAGCGAATCACGCATGCCCAGCTACAGGAGCACTTCGCATCAAAACTAGATTGGATGTAGAGAAAATATGTAAAGAATTCAACGACGACTGTCCTCCGGAAACAAATGCTATAGTTTCCCTCGATGCTGGAAGATACCTGTGCGAGTACATCTACTACACATCACTAAGCGTCGACAACACCAGAACACTGTTTGTGCATGTTCCTGATGTCAATAAGTATAAATCGGAACAAACGGCCCGAGCACTGGAAGTAATTCTAGACCTTTGTATGAAACAGATAACGGCCATGGATGCAGCAGATAGCATGACTGAGAACTTACAAAGTAAATGTCAAGTATACCTGTGCGAGTACATCTACTACACATCACTAAGCGTCGACAACACCAGAACACTGTTTGTGCATGTTCCTGATGTCAATAAGTATAAATCGGAACAAACGGCCCGAGCACTGGAAGTAATTCTAGACCTTTGTATGAAACAGATAACGGCCATGGATGCAGCAGATAGCATGACTGAGAACTTACAGAGTAAATGTCAAGTGTGA

Protein sequence:

>DPOGS210490-PA
MSDLDLMFKPIVLVTGFGPFLNHPLNASWEAVKLMNKESIEKKHNVELVQLEIPVTYENVDEFVPALWDTHEPKLMIHVGVSGIADSITLECQAHRKGYQRLDYFDKCPANHACPATGALRIKTRLDVEKICKEFNDDCPPETNAIVSLDAGRYLCEYIYYTSLSVDNTRTLFVHVPDVNKYKSEQTARALEVILDLCMKQITAMDAADSMTENLQSKCQVYLCEYIYYTSLSVDNTRTLFVHVPDVNKYKSEQTARALEVILDLCMKQITAMDAADSMTENLQSKCQV-