Monarch geneset OGS2.0

DPOGS204784
TranscriptDPOGS204784-TA1047 bp
ProteinDPOGS204784-PA348 aa
Genomic positionDPSCF300217 - 132209-134708
RNAseq coverage2x (Rank: top 91%)
Annotation
HeliconiusHMEL0021082e-5235.22% 
BombyxBGIBMGA009476-TA1e-11152.15% 
DrosophilaCG8560-PA3e-3730.53% 
EBI UniRef50UniRef50_Q3T9051e-4330.60%Carboxypeptidase B n=5 Tax=Noctuidae RepID=Q3T905_HELZE
NCBI RefSeqXP_968597.11e-4335.03%PREDICTED: similar to carboxypeptidase B [Tribolium castaneum]
NCBI nr blastpgi|461982867e-4933.70%midgut carboxypeptidase 1 [Trichoplusia ni]
NCBI nr blastxgi|461982864e-5033.70%midgut carboxypeptidase 1 [Trichoplusia ni]
Group
Gene OntologyGO:00065081.4e-55proteolysis
GO:00082701.4e-55zinc ion binding
GO:00041811.4e-55metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[54-334] IPR0008341.4e-55Peptidase M14, carboxypeptidase A
Orthology groupMCL30660 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204784-TA
ATGGTTCCTCCTTCACATTTTGGTTGGTTTGAAAAACAATTGAATAATATGAATATTGAGAAGAATATTTATATAGATGATGTTTATGAATATTTAAAGGAGCATGACACCCGTCAAGCACGAAGAAACGATGAGGACGATGTATTCAGTTTCGACGGCTATCATAGATTTGATCAAGTTTTGAATTATATGAAAGGGTTAAATGGGACTTCTTTACCAGGTGTTGATGTAGAATTCGTTGAAGCTGGATATACGGATGAGAACAGAACTTTGGCATATTTGAGAATAAATTCCAAAACAAGTGAAGAGGGAACACAAAGGCCCATAGTAATTATTGAGGCTGGGGTTAATCCTAGAGAGTGGATAACAATACCGACTGCTCTTAATATTGCAAATAAACTTATTGAAGGAAATCAAACGAAATTAGCACAAAATTTGGAATGGATTATTTTACCAGTTCTCAATCCGGATGGTTATGAATTTACACATAACTCGAATCGACTTTGGACGAAATCAAGAAGCACCAGAAGTAACTTAGGTTTTATATGCCCAGGAGTGAATATTAACAGAAACTTTGACATTGACTGGATGTTTTCTGACTCCAGCACTAGTCCGTGTAGTCATCTGTATGGTGGGATTGAAAGCTTTTCAGAACCGGAGTCCCAAATAATAAGAAAACTCATCGAAGAACATGGTAATCGAATCAAACTATACATTTCCCTACAAAACAATGGAGGATTTGTATCTTATCCCTGGCAATACGAGAGAGCTGCAAGTGGAATGTTCAGACAACACCATTTATTGGGATTAGAAATGATTTCAGCCATAGCAGATAATTACAAGTTAGACATAGGCTCCTTAGCTTTAGGAGATAGGGCTTCTGGAACTAGCAGTGATTATGTTATGAGTAGAAATGTTTTATACACATTCAATATTGATATAAAACAATGCGAGGGTGATGTTCTTGTACCTGAGGCTGAAATAAGACCAATCGCTGAACGGGTATGGAGAGCAGTCGCTGTAGCCGCTGGAAATATGATAAGTTGA

Protein sequence:

>DPOGS204784-PA
MVPPSHFGWFEKQLNNMNIEKNIYIDDVYEYLKEHDTRQARRNDEDDVFSFDGYHRFDQVLNYMKGLNGTSLPGVDVEFVEAGYTDENRTLAYLRINSKTSEEGTQRPIVIIEAGVNPREWITIPTALNIANKLIEGNQTKLAQNLEWIILPVLNPDGYEFTHNSNRLWTKSRSTRSNLGFICPGVNINRNFDIDWMFSDSSTSPCSHLYGGIESFSEPESQIIRKLIEEHGNRIKLYISLQNNGGFVSYPWQYERAASGMFRQHHLLGLEMISAIADNYKLDIGSLALGDRASGTSSDYVMSRNVLYTFNIDIKQCEGDVLVPEAEIRPIAERVWRAVAVAAGNMIS-