Monarch geneset OGS2.0

DPOGS204048
TranscriptDPOGS204048-TA1377 bp
ProteinDPOGS204048-PA458 aa
Genomic positionDPSCF300138 + 449360-451683
RNAseq coverage282x (Rank: top 39%)
Annotation
HeliconiusHMEL0048411e-12452.53% 
BombyxBGIBMGA004798-TA7e-12952.91% 
DrosophilaCG7025-PA1e-7938.17% 
EBI UniRef50UniRef50_B0WS125e-9139.27%Zinc carboxypeptidase A 1 n=10 Tax=Culicimorpha RepID=B0WS12_CULQU
NCBI RefSeqXP_001851495.18e-9342.15%zinc carboxypeptidase A 1 [Culex quinquefasciatus]
NCBI nr blastpgi|3807138502e-13649.57%carboxypeptidase [Bombyx mori]
NCBI nr blastxgi|3807138504e-13350.00%carboxypeptidase [Bombyx mori]
Group
Gene OntologyGO:00065085e-102proteolysis
GO:00082705e-102zinc ion binding
GO:00041815e-102metallocarboxypeptidase activity
GO:00041802.7e-19carboxypeptidase activity
KEGG pathway 
InterPro domain[121-417] IPR0008345e-102Peptidase M14, carboxypeptidase A
[21-112] IPR0090206e-21Proteinase inhibitor, propeptide
[23-110] IPR0031462.7e-19Proteinase inhibitor, carboxypeptidase propeptide
Orthology groupMCL10178 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204048-TA
ATGGCATTCAAATTAAAATTATTTGTAATTTTATTTACAGTAGTGTGCATAAGTGCAGAATATATCAGTTATAGAGATTACAAAGTTTACAAAATAACACCAGAGTCGGAAAATGAGTTGGCGATTCTTAAAAAGGTTGAGAAAGAAAATGAGTTTTTTTTTTGGATGGAAGTCGGAAAAGTTGGGAGAGATGTAAGGATTATGGTCCGTCCAGATAAACAGAATAAATTTGAAAAGTATATGGCGAATCTTGGACTAAAACCTTTTTTATTCATAGATGATGTACAAAGTTTAATAGACGACCAGTTAAAAAAACCGTCAAGAGCATCTCGACGCTCCGCCAAATATGATTGGACGTATTACCAAAATTTTGAAGAGATTTATGACTGGATGAACGAAACCGCCGCAGAGTATCCCGATATTGTGTCATTAATAGACATTGGAAGAAGTGTTGAAAATCGACCCATTATAGGAATGAAAATTGATTATAAAAAGAAAGAGAATCCAGTTATTGGGGTGTTTGAAGGAACCCTTCACGCTAGAGAATGGATAACTCCGGTCACTCTTACTTGGATTGTAAAAGAGTTTCTTACCAGCCGCGATGAAAAGATACGATTTTTAGCAGAAAATATTGTTTGGCACGTTTTTCCTATTACTAATCCAGATGGGTTCATATATACATTTACCGGTAATAGAATGTGGAGAAAAAACCGGAGCCGAGCTAATTTCACATCCTGCGGTCAATATTTGGATGATGACATGAGCAACGGTGTTGATCTTAATAGAAACTTCGATTTTGTATGGATGGAGGTCGGAGCATCACAAAACCCATGTACTAGCACTTTTGCTGGACCAAGGGCATTTTCTGAACCAGAAAGTACTGCCATAGAAAATCATGTATTAAGATTGAAAAAGGAGGGCAACCTTATGTATTACTTTGCCGTGCATTCTTACGGTCAACTGATCTTGGTTCCATACAGTCATGTTGGAGGTGCTGATGTCCTTGAAGTGTCCAACTACGGAGACCTGTTTGAAATGGCAATAAAAGGAGCTGCAAAATTAACAGAACGTCATAATACATCATACGCTGTTGGGACATCGCTTGATATATTATATCCAGCCTCTGGGACCGGTTTTGACTGGGCAAAGGGAGGGGCAAATATACCACTGGTGTTTTTGTATGAACTTAGAGACCTGGGACAGTATGGTTTTCTTTTACCACCAGAACAAATAATTCCGAACAGCGAAGAAGTATTGGATTCCCTCATCGAGATTGATAGGGTAGCTAGACAAATAGGATATTACTCGGATGGTTATGCTATTAAAATAAGCACAATGTTACTGTTAATAGCATTGATATTTTCTTGTATGATATAA

Protein sequence:

>DPOGS204048-PA
MAFKLKLFVILFTVVCISAEYISYRDYKVYKITPESENELAILKKVEKENEFFFWMEVGKVGRDVRIMVRPDKQNKFEKYMANLGLKPFLFIDDVQSLIDDQLKKPSRASRRSAKYDWTYYQNFEEIYDWMNETAAEYPDIVSLIDIGRSVENRPIIGMKIDYKKKENPVIGVFEGTLHAREWITPVTLTWIVKEFLTSRDEKIRFLAENIVWHVFPITNPDGFIYTFTGNRMWRKNRSRANFTSCGQYLDDDMSNGVDLNRNFDFVWMEVGASQNPCTSTFAGPRAFSEPESTAIENHVLRLKKEGNLMYYFAVHSYGQLILVPYSHVGGADVLEVSNYGDLFEMAIKGAAKLTERHNTSYAVGTSLDILYPASGTGFDWAKGGANIPLVFLYELRDLGQYGFLLPPEQIIPNSEEVLDSLIEIDRVARQIGYYSDGYAIKISTMLLLIALIFSCMI-