Monarch geneset OGS2.0

DPOGS206828
TranscriptDPOGS206828-TA1218 bp
ProteinDPOGS206828-PA405 aa
Genomic positionDPSCF300001 - 3444618-3445835
RNAseq coverage108x (Rank: top 60%)
Annotation
HeliconiusHMEL0095960.074.81% 
BombyxBGIBMGA012773-TA8e-15260.74% 
DrosophilaCG4572-PA5e-10948.88% 
EBI UniRef50UniRef50_G6CI991e-12052.99%Vitellogenic carboxypeptidase n=4 Tax=Pancrustacea RepID=G6CI99_DANPL
NCBI RefSeqXP_969249.13e-12353.32%PREDICTED: similar to salivary/fat body serine carboxypeptidase [Tribolium castaneum]
NCBI nr blastpgi|910794506e-12253.32%PREDICTED: similar to salivary/fat body serine carboxypeptidase [Tribolium castaneum]
NCBI nr blastxgi|910794506e-12253.32%PREDICTED: similar to salivary/fat body serine carboxypeptidase [Tribolium castaneum]
Group
Gene OntologyGO:00065083.5e-132proteolysis
GO:00041853.5e-132serine-type carboxypeptidase activity
KEGG pathway 
InterPro domain[12-404] IPR0015633.5e-132Peptidase S10, serine carboxypeptidase
Orthology groupMCL22307 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206828-TA
ATGTCCCGTGTTTTGTTTACTGAAAAAATCGGCTTTAGAAGTCATGCAGGATTTTTAACTATTAATGCTAAGTATAATTCCAACCTATACTTTTGGTACTTCCCACCTTTTAATGAAAACACTGGAGCACCTGTTGTATTATGGCTTCAGGGTGGTCCAGGAGGAAGTTCATTGTTTGGCTTATTTACAGAAAATGGTCCCCTCATAGCCAGAAAAGATGGATTCTCATTGAGAAAATACCATTGGGCTCATGAAAATTATCTTATTTATATTGACAACCCAGTCGGAACAGGATTTAGTTTTACAGATAATGAAAATGGTTACTGTTCTGACGAAAATTGTGTAGCAAAAGGTTTGTACAACTTTTTGCAGCAATTTTACAAGCTTTTTCCACATTTAAGGAATAATAATTTTTTCATTAGTGGAGAATCATATGCAGGGAAATACCTTCCCTCCTTAGCTATGGAAATTCATCAGCAAAACCATCGAGGGTTGACTAAGATAAACTTAAAGGGACTAGCTTTGGGAAATGCTTACTGTGATCCGTTGAACCAGATGGATTATGGAAATTATCTCTACCAACATGGCATGATAGATGATAAACAAAAATTAGTTTTCCTCAAGATGCAGAAGAAAATATCTGATGAAATCAAGAAACAAAATTGGGCTGAAGCTGGCATTCTAATGAACACATTAATGGATGGAGACCTCACTAACTTCTCGTATTTTAACAACTATACAGGCTTTGATAATTATTACAACATACTAGAGCCTATAGATAAAACAAATGTTAGTATTTTTGAGGCACTTCTAAACAGTGATAAAATAAGGCGCAGTGTACATGTTGGCGGTCTACCATTTCACTCTGGTAAAGATGTTCAAATGCACTTAGCATTTGACATTTTAAAGTCAGTTGCATTATCAATATCAGAACTTCTTTCACATTATCGCCTTATGTTTTACAACGGCCAGTTGGATATTATAGTTGCATATCCATTGACAGAAAACTTCCTACGTAACTTAAATTTTTCATCAGCAGCTGAATATAAAGTGGCTAAAAGAAGAATCTGGAGGGTCGGAGATGAAATTGCGGGATATATAAAAAAGGCTGGTAATCTCACCGAGGTTTTAGTCAGAAATGCTGGTCATATGGTGCCACATGACCAGCCTAAGTGGATGTTTGATCTTATAACACGATTTATTAAGAATCAATTATAA

Protein sequence:

>DPOGS206828-PA
MSRVLFTEKIGFRSHAGFLTINAKYNSNLYFWYFPPFNENTGAPVVLWLQGGPGGSSLFGLFTENGPLIARKDGFSLRKYHWAHENYLIYIDNPVGTGFSFTDNENGYCSDENCVAKGLYNFLQQFYKLFPHLRNNNFFISGESYAGKYLPSLAMEIHQQNHRGLTKINLKGLALGNAYCDPLNQMDYGNYLYQHGMIDDKQKLVFLKMQKKISDEIKKQNWAEAGILMNTLMDGDLTNFSYFNNYTGFDNYYNILEPIDKTNVSIFEALLNSDKIRRSVHVGGLPFHSGKDVQMHLAFDILKSVALSISELLSHYRLMFYNGQLDIIVAYPLTENFLRNLNFSSAAEYKVAKRRIWRVGDEIAGYIKKAGNLTEVLVRNAGHMVPHDQPKWMFDLITRFIKNQL-