Monarch geneset OGS2.0

DPOGS214444
TranscriptDPOGS214444-TA1149 bp
ProteinDPOGS214444-PA382 aa
Genomic positionDPSCF300494 + 53219-54459
RNAseq coverage7x (Rank: top 87%)
Annotation
HeliconiusHMEL0026228e-11955.24% 
BombyxBGIBMGA001275-TA9e-13661.26% 
Drosophila% 
EBI UniRef50UniRef50_E2ALP29e-9848.80%Plasma glutamate carboxypeptidase n=11 Tax=Endopterygota RepID=E2ALP2_CAMFO
NCBI RefSeqXP_001601839.13e-10548.70%PREDICTED: similar to ENSANGP00000013946 [Nasonia vitripennis]
NCBI nr blastpgi|3214580192e-10448.04%hypothetical protein DAPPUDRAFT_189393 [Daphnia pulex]
NCBI nr blastxgi|3214580193e-10148.04%hypothetical protein DAPPUDRAFT_189393 [Daphnia pulex]
Group
Gene OntologyGO:00082335.1e-17peptidase activity
GO:00065085.1e-17proteolysis
KEGG pathwaymxa:MXAN_01008e-56 
 K01423 (E3.4.-.-)maps-> Biotin metabolism
    Lysine degradation
InterPro domain[193-354] IPR0074845.1e-17Peptidase M28
[58-153] IPR0031372.9e-07Protease-associated domain, PA
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214444-TA
ATGAAGGATCTTACAAACAAGAACGGAGTATTTGATGTTCACACAGAGGATGTCATGGTTCCTGAATGGAAACGTGGGCTTGAATCCCTTAAAATGCTAAGACCACGACTAAAGCAACTGTCTATTCTTGGTATTGGACTCAGCGTTCCAACGGATGGAAAAATAACAGCAGAAGTTATTGTGGTGTCTAGTTTCGAAGAATTGGAAAGTATCGATAATGCCATAATTAAAGGCAAAATCATTCTTTTTAATACTAATTTTACTACTTACGAAAATACTATTCAGTACAGAAGAAATAGTGCCGTTAAAGCTGCTAGAAAGGGTGCTGTAGCAGTTTTAGTTCGAAGTATTACTCCGATTTCCTTATATACAACACATACTGGAGATTTAGTTTATGAACCCCACGTTAAAAAAATACCAGCTGCAGCAATTACAATAGAAGATGCTGATTTTTTACAGCGAATTCATAACCGTGGGGAAGTTATTGTTATAGAAATACAAATGTCAAATGAACTGAAAGCAAATATATCTAGAAATTTAATTATTGATGTAAAAGGCTATGATATTCCGGATAAAATGGTTATAGTTTCTGGACATATAGATAGTTGGGATGTCGGTCAGGGTGCTATTGACGATGGGGGCGGTATGATGATTAGTTGGTTTGTGCCTGTTGTTTTAAATTACCTAAAATTAAAACCAAGAAGAACTCTGAGGGCAATACTATGGACGTCTGAAGAAGTAGGCCTTAATGGTGCGAAGGCCTACTTGGAAAGACACAGTGATGAATTAGATAACATAGATTTTATAATGGAATCTGATGAAGGAACATTCAAACCTTTGGGTTTGGAAGTAGCTGGATCTAAAAACGTTACATGCTTAATTAATGAAATTTTACAATTATTTAAACCATGGGATTTAAATAGGCTGAAAGTAGCCAATTCCACAGGATCAGATATTTCAATTTTTATTGATAAGGGCATTCCTGGAGCCTCTCTTTTAAATAAGGATGATCGTTATTTCTGGTATCATCATTCAAATGCTGATACCTTAACTGCCCAAAATAAGTCCGATGTTCTAGACTGCGCTGCGTTTTGGGCTGCAATATCATATCTTATTGCTGAATTACCTGTAGATATTCGCAGAAGTTAA

Protein sequence:

>DPOGS214444-PA
MKDLTNKNGVFDVHTEDVMVPEWKRGLESLKMLRPRLKQLSILGIGLSVPTDGKITAEVIVVSSFEELESIDNAIIKGKIILFNTNFTTYENTIQYRRNSAVKAARKGAVAVLVRSITPISLYTTHTGDLVYEPHVKKIPAAAITIEDADFLQRIHNRGEVIVIEIQMSNELKANISRNLIIDVKGYDIPDKMVIVSGHIDSWDVGQGAIDDGGGMMISWFVPVVLNYLKLKPRRTLRAILWTSEEVGLNGAKAYLERHSDELDNIDFIMESDEGTFKPLGLEVAGSKNVTCLINEILQLFKPWDLNRLKVANSTGSDISIFIDKGIPGASLLNKDDRYFWYHHSNADTLTAQNKSDVLDCAAFWAAISYLIAELPVDIRRS-