Monarch geneset OGS2.0

DPOGS200937
TranscriptDPOGS200937-TA1425 bp
ProteinDPOGS200937-PA474 aa
Genomic positionDPSCF300301 + 174816-181912
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0045150.077.43% 
BombyxBGIBMGA000307-TA0.072.69% 
Drosophilasvr-PG3e-7039.52% 
EBI UniRef50UniRef50_E0W2S81e-14460.05%Carboxypeptidase E, putative n=7 Tax=Bilateria RepID=E0W2S8_PEDHC
NCBI RefSeqXP_001807518.15e-15059.50%PREDICTED: similar to Zinc carboxypeptidase family protein [Tribolium castaneum]
NCBI nr blastpgi|1892420169e-14959.50%PREDICTED: similar to Zinc carboxypeptidase family protein [Tribolium castaneum]
NCBI nr blastxgi|1892420161e-14660.94%PREDICTED: similar to Zinc carboxypeptidase family protein [Tribolium castaneum]
Group
Gene OntologyGO:00065081e-73proteolysis
GO:00082701e-73zinc ion binding
GO:00041811e-73metallocarboxypeptidase activity
GO:00041801.5e-15carboxypeptidase activity
KEGG pathwayxla:1001583682e-110 
 K01294 (CPE)maps-> Type I diabetes mellitus
InterPro domain[31-334] IPR0008341e-73Peptidase M14, carboxypeptidase A
[352-437] IPR0147661.5e-15Carboxypeptidase, regulatory domain
[347-433] IPR0089693.3e-12Carboxypeptidase-like, regulatory domain
Orthology groupMCL18991 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200937-TA
ATGGCGATGTATGCGTTTTTCTACTGTGCTCTTTTTCTTGGGGTATCAACTGAATTTGTTTGGAAACACCATAATAATGAGGAGTTATCTTCGATACTGGAAGAAGTTCACGAAAACTGCCCTAATATTACTAGAGTTTATGCCTTAACAGAACCATCAGTGAGGAATGTACCATTGTATGTTATTGAATTTTCTGACACCCCGGGATTTCACCAACCATATAAACCAGAAGTTAAATATGTCGGAAATATACATGGTAATGAGGTTTTGGGACGTGAATTACTTTTGGGCTTGGCATATTACCTTTGTGAAGAATATAATAAACACGATCGTCGTATAAGAAATTTGATTCACAACACTCGCATACATTTATTGCCTTCCATGAACCCTGATGGCTGGCAGTTATCAACTGACACAGGTGGTCAGGATTTTTTGTTGGGACGTAACAACAATCATTCAGTGGATTTAAACAGGAACTTCCCAGATCTGGATGCAATAACATTTGAATTTGAAAGACAAGGCATCAGTCACAACAATCATTTACTCAAAGACCTCACACGTCTTGCAGCACCACTGGAGCCGGAAACTCGAGCTGTTATGAGATGGATAATGTCCGTTCCATTTGTACTGAGTGCAGCCATGCATGGTGGAGATTTGGTAGCAAACTATCCTTATGATGAGAGCAGGAGTGGAGCTCCTGTGTCTGAATATTCAGCCAGTCCGGATGATGAGACTTTTAGGGAGTTAGCTATGACATATGCCGAAGCTCATGCAGATATGGCATCTGCTAATAGACCAGGCTGTCGTTTTGGGGATGAAACTAATGCATACAACTTTGGAAAGCAAGGAGGTGTTACTAACGGAGCAGCCTGGTATAGTCTGAGAGGAGGCATGCAGGATTTTAATTATCTAGCGACGAATGCTTTCGAAGTGACTCTAGAGCTGGGATGCCAGAAGTATCCTTACGAGAAAGACCTGGAAAAGGAGTGGTTTCGTAACAAGGACGCGTTGTTAGCTTATATATGGAAAGCCCATACTGGCATCAAGGGTATTGTGAAAGATGACTCCGGCTTCATACAAAACGCTGTGATATCCGTCGTCAACATAACTGGATCTGTACCACGGCCGATAAGACACGACATTACCAGCGGTATATACGGTGATTACTACCGTCTCCTGACCCCTGGTCACTACGAGGTGACAGCGAGTCACCCCGGGTACTTCCCCGTGTCACGCGTCGTCACCGTCCCCACACACCAGACCTCGGCCAGGATAGTCAACTTCAAACTGGAGCCTACAACGAGCTGGTTCGATGATTATACTTTCGGCGTATACCCTCACGGTCTGAGAGACGGCCAGCCGAGGATTTACAAGCGATCGCTCTACCACAAAGTCGCCAACGCCATGCTGGATAAGACGCACTGA

Protein sequence:

>DPOGS200937-PA
MAMYAFFYCALFLGVSTEFVWKHHNNEELSSILEEVHENCPNITRVYALTEPSVRNVPLYVIEFSDTPGFHQPYKPEVKYVGNIHGNEVLGRELLLGLAYYLCEEYNKHDRRIRNLIHNTRIHLLPSMNPDGWQLSTDTGGQDFLLGRNNNHSVDLNRNFPDLDAITFEFERQGISHNNHLLKDLTRLAAPLEPETRAVMRWIMSVPFVLSAAMHGGDLVANYPYDESRSGAPVSEYSASPDDETFRELAMTYAEAHADMASANRPGCRFGDETNAYNFGKQGGVTNGAAWYSLRGGMQDFNYLATNAFEVTLELGCQKYPYEKDLEKEWFRNKDALLAYIWKAHTGIKGIVKDDSGFIQNAVISVVNITGSVPRPIRHDITSGIYGDYYRLLTPGHYEVTASHPGYFPVSRVVTVPTHQTSARIVNFKLEPTTSWFDDYTFGVYPHGLRDGQPRIYKRSLYHKVANAMLDKTH-