Monarch geneset OGS2.0

DPOGS208933
Transcript        DPOGS208933-TA   3156 bp
Protein           DPOGS208933-PA   1051 aa
Genomic position  DPSCF300009 + 63411-72424
RNAseq coverage   367x (Rank: top 32%)
Annotation
Heliconius       HMEL004758         0.0     64.32%
Bombyx           BGIBMGA002404-TA   0.0     74.89%
Drosophila       NnaD-PA            9e-33   27.78%
EBI UniRef50     UniRef50_D7EIR9    6e-130  63.41%  Carboxypeptidase A n=1 Tax=Tribolium castaneum RepID=D7EIR9_TRICA
NCBI RefSeq      XP_967549.1        1e-130  63.41%  PREDICTED: similar to ATP/GTP binding protein-like 5 [Tribolium castaneum]
NCBI nr blastp   gi|91093643        2e-129  63.41%  PREDICTED: similar to ATP/GTP binding protein-like 5 [Tribolium castaneum]
NCBI nr blastx   gi|91093643        4e-125  63.41%  PREDICTED: similar to ATP/GTP binding protein-like 5 [Tribolium castaneum]
Group
Gene Ontology    GO:0006508  1.5e-19  proteolysis
                 GO:0008270  1.5e-19  zinc ion binding
                 GO:0004181  1.5e-19  metallocarboxypeptidase activity
KEGG pathway 
InterPro domain  [195-311]  IPR000834  1.5e-19  Peptidase M14, carboxypeptidase A
Orthology group  MCL16166  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208933-TA
ATGGATCGCAATAATATAATAGAGTGCGGCGGATTTTACTTTATACACAATTTTGACTCAGGGAATTTAGGGCATGTAGAACGAGTGCCCACAGAATTTATTGCTCCAACGTTAAATCCGAAAACAAATGTTTCGGAGACTCCCGATTATGAGTTTAATTTATGGACGCGACCTGATTGCGCTGGCACAGAATTCGAGAATGGCAATCGAACTTGGTTTTATTTTGGCATACAAGCCAGTGAGCCTAATGTACAGGTGCGACTTAACTTGATCAACCTTAACAAACAAGGCAAGATGTATAACCAGGGTATGGCTCCAGTGACACGGACCCTTCCAGGGAAGCCACAGTGGGAAAGGATAAGGGATCGTCCAGTGCATTCAACAGATGACAACACATTTACACTGTCTTTCCGATATAGAACATCAGATAATCCGAAAGCTACAACCTTCTTTGCATTCACATACCCATTCTCATTTGCCGAGCTACAAATAGCTCTGAACTCTATTGATCTTAAAATGTTGCCAGTCCCGCCACCTCAATCACCTGATGATATATATTATTGCAGAGAATGTTTAATATATTCATTAGAAGGAAGGCGTGTAGACTTATTGACAATTTCATCCCACCATGGTATAACAATGGAGCGAGAGGACAGATTAAAGAATTTGTTCCCAGAAAATCAGGAGAGGCCTTTCAAATTCCAAAATAAGAAGGTCATATTTATATCTGCTCGGGTGCATCCAGGAGAAACTCCATCGAGCTTTGTGTTCAACGGATTCCTAAACTTACTACTGACAAGAAACGATCCAATTGCAATCCAATTGAGGAAACTCTATGTGTTCAAAATGATTCCGTTTTTAAACCCAGATGGTGTTGCCAGGGGTCATTACAGAACCGATACTAGGGGGGTTAATTTAAACAGGGTCTATTTGAATCCATCACTTCTCTATCATCCCACTGTGTATGCATCAAGGTCTCTTATAAGATATCACCATTTTGGATTTGAGAAAGATGAAGATAATTGTGAGGATATTAAGAGCTTTGCATCCCGCAGCATACAGAACATTAGTGAAAGTGTTGAGCTGGTTGAAACGAAGAAAAAGAAATCCCCCGGTTCCCCAAACATCAAGGGTGACTTCAAGCGTGACAAGGCAAAGACACAACCAGCAAAGCCGTCAGCTCTGTACTCAGAAGACCACAGCAGGGACGAGTTCAGTGGAGACGCCGCCAACTTAGCCGATCAGGTCCTCGACATGAAGCTCCAAGAGATGCCATCACAAACAGACAATATATCCTCGAATCCGTCTAACGCTAACCTGGAGGAGAGCTCGTGTCTCCTCAACGATAACGTGCTGCGGTCGTGTCTGGGGTCTAACGTTCACCTCAGCACGAGCGAGGAACTCACTATAAACGGCCTCAACCCCCTGAAGCCGCTGAGGGATACTTTGAAGAACAGCATCAGTCTACTCATGGAGTCCAGCTCGTCCGTCGCCGGTGAGAGTATTTCGCAGGAACTTCCTGTAGTGAAGATGGGCTATTGTAAAGTGTGCCGAAGGGACAGGGAGTCGATGCTGTCCGACCTGCCATCGTATAGGAATATCGAGGAGTACGGAGAACATCAGAGACAAAAAATAGACACTAAGGAGCAGAAGGTCGACGACCTTGAAGTTATCGAGGGTTCGGTGAACGTGTTCTTCTGTACGAACTGCTTCAAGCGATACATTGTGACGGAGGGCAACGAGGAAATTGCGACCGCTACGTCTTCAGGTGATTGTGTGGAGGGCCCCCCTCTATCTCCAAGGCCTCCGCCTCCGGAACGCCCTCAGACGCGTTCCCCCGCCGGCAGCACTGGGGACTCGTTGCCGCCAGCGACGGTCCGAAAGGTCGACAAACCGAAGTCAGCTCCTAAGTCGTCTAAGAAGAGGTCCCCGGCCGTGACCGCAACCACGGCCCCCTCCCCCGCCGCAGCACCCACCGTCTTGAGACCGCACAAGGACGT
CGAGTCCGGCCTGTACCTTTATATAGATCTACACGGACACGCCTCTAAGAAAGGCATCTTCATGTACGGCAACCACTTTGAGGACCTGGAGAGTTCGGTGGAGTGCATGTTGCTGCCTCGCATCATGTCGCTCAACAACCTGCACTTCCACTTCTCGTCCTGCAACTTCACCGAGAGGAACATGTATCTGAAGGACCGTCGCGACGGCATGTCCCGCGAGGGCTCGGGTCGCGTGGCCGTGCTGAAGGCCACCGGTCTGGTCCGCTCCTACACCCTGGAGTGCAACTACAACACGGGCCGCCTGGTGAACGTGCTGCCGCCGCCCTGCCGCGAGCCCGCCGCCACCGCCCAGCCCGCGCCCCCGCCACCCAAGTACACGCCGCACATCTTTGAGGAGGTAAACCCGTTCCAGGTCGGGCGATCTCTCGGAGCGTCCATACTTGATCTGACGGGGCAGCATCCTAACTCGCGAATCCCGTGCTCCGAACATCGTAATCTGGCTGCCGTGCGCGACTGGCTCAGGACGCACTCGAGGACCGCGCGCCCTCAGTTGACTATGTCGAGACTGCGGCCGAAGACTTCCTCCCCGACGAGGATGCCGTTGTTCGCGCGCTCCAAGGCCAAGGTGACGGACGAGAGGAAAGAGAACGCGTACATAGCGGCAAAGAGCGACACGGAAAGGCGCCGCAGCCCGCCCATACTGGCACCGCGCTCAGGGCTCGACCTCACAAACCTCAACACCAAGTTCGGCAAGAAAAACGAACCAGCAAAATCGTCATCACGAACACGCTACCTGGCAGACAGCGAGCCGAAACCTAAGACGCTATCCACCAAGAGGCGCAACGTCCTCGCTATCAGGAAACCAAATACAAGCAAGACGCAGATGAGCGGCATTGTGAAGGCGAAGGCGAACCGAAGAGCCGCGGACGATTCAGACGACCGAGCGACATCCGCCAAGCTCGGCAAGCGAGGGTATGTCCGTCCAGGGAGAGCGAGGCGCCAACCGACATCCACTTCTTCATCAGAGGCCGCCGGAGGCTCCAGCTCTTGGGAGGCGGGCGGTTCCCACGAGACAGCCTTGGCCGCTAAGAGGCGGCAGTTCCCGAACCCCGCGCCCTCACACCTCAAGAAGATACGCCTCAAGAACGGCTTGTAG

Protein sequence:

>DPOGS208933-PA
MDRNNIIECGGFYFIHNFDSGNLGHVERVPTEFIAPTLNPKTNVSETPDYEFNLWTRPDCAGTEFENGNRTWFYFGIQASEPNVQVRLNLINLNKQGKMYNQGMAPVTRTLPGKPQWERIRDRPVHSTDDNTFTLSFRYRTSDNPKATTFFAFTYPFSFAELQIALNSIDLKMLPVPPPQSPDDIYYCRECLIYSLEGRRVDLLTISSHHGITMEREDRLKNLFPENQERPFKFQNKKVIFISARVHPGETPSSFVFNGFLNLLLTRNDPIAIQLRKLYVFKMIPFLNPDGVARGHYRTDTRGVNLNRVYLNPSLLYHPTVYASRSLIRYHHFGFEKDEDNCEDIKSFASRSIQNISESVELVETKKKKSPGSPNIKGDFKRDKAKTQPAKPSALYSEDHSRDEFSGDAANLADQVLDMKLQEMPSQTDNISSNPSNANLEESSCLLNDNVLRSCLGSNVHLSTSEELTINGLNPLKPLRDTLKNSISLLMESSSSVAGESISQELPVVKMGYCKVCRRDRESMLSDLPSYRNIEEYGEHQRQKIDTKEQKVDDLEVIEGSVNVFFCTNCFKRYIVTEGNEEIATATSSGDCVEGPPLSPRPPPPERPQTRSPAGSTGDSLPPATVRKVDKPKSAPKSSKKRSPAVTATTAPSPAAAPTVLRPHKDVESGLYLYIDLHGHASKKGIFMYGNHFEDLESSVECMLLPRIMSLNNLHFHFSSCNFTERNMYLKDRRDGMSREGSGRVAVLKATGLVRSYTLECNYNTGRLVNVLPPPCREPAATAQPAPPPPKYTPHIFEEVNPFQVGRSLGASILDLTGQHPNSRIPCSEHRNLAAVRDWLRTHSRTARPQLTMSRLRPKTSSPTRMPLFARSKAKVTDERKENAYIAAKSDTERRRSPPILAPRSGLDLTNLNTKFGKKNEPAKSSSRTRYLADSEPKPKTLSTKRRNVLAIRKPNTSKTQMSGIVKAKANRRAADDSDDRATSAKLGKRGYVRPGRARRQPTSTSSSEAAGGSSSWEAGGSHETALAAKRRQFPNPAPSHLKKIRLKNGL-
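
Consistency check (illustrative sketch):

The record lists a 3156 bp transcript and a 1051 aa protein, which fits a coding sequence of 1051 codons plus one stop codon (3 x 1052 = 3156). The Python sketch below shows one way such a record could be checked: it reads FASTA text whose sequence may span several lines, then translates it with the standard genetic code. The function names `parse_fasta` and `translate` are hypothetical helpers, not part of any database tool, and using the standard code and writing the stop codon as "-" (as in the protein record above) are assumptions.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def parse_fasta(text):
    """Return {record_id: sequence}, joining sequences split over lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]  # ID is the first word of the header
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; unknown codons become X."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # Truncated stand-in for the record: first 27 nt plus a stop codon,
    # split across two lines the way long sequences often are.
    demo = ">demo-TA\nATGGATCGCAATAAT\nATAATAGAGTGCTAG\n"
    seq = parse_fasta(demo)["demo-TA"]
    print(len(seq), translate(seq))
```

Run against the full DPOGS208933-TA text, the translated string can be compared directly with the DPOGS208933-PA sequence, trailing "-" included.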