Monarch geneset OGS2.0

DPOGS207125
TranscriptDPOGS207125-TA1155 bp
ProteinDPOGS207125-PA384 aa
Genomic positionDPSCF300001 + 3448364-3449518
RNAseq coverage761x (Rank: top 17%)
Annotation
HeliconiusHMEL0095950.083.11% 
BombyxBGIBMGA013085-TA0.080.52% 
DrosophilaCG4572-PA4e-11656.64% 
EBI UniRef50UniRef50_G6CI990.0100.00%Vitellogenic carboxypeptidase n=4 Tax=Pancrustacea RepID=G6CI99_DANPL
NCBI RefSeqXP_001850350.13e-13161.89%vitellogenic carboxypeptidase [Culex quinquefasciatus]
NCBI nr blastpgi|1700455115e-13061.89%vitellogenic carboxypeptidase [Culex quinquefasciatus]
NCBI nr blastxgi|1700455111e-12961.89%vitellogenic carboxypeptidase [Culex quinquefasciatus]
Group
Gene OntologyGO:00065087.6e-128proteolysis
GO:00041857.6e-128serine-type carboxypeptidase activity
KEGG pathway 
InterPro domain[6-365] IPR0015637.6e-128Peptidase S10, serine carboxypeptidase
Orthology groupMCL14164 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207125-TA
ATGTTGGCCAACAACAAAGATGCACCGGTGATCGTATGGCTGCAAGGAGGACCAGGTGCGTCTTCACTATATGGTCTCTTTACTGAAAATGGTCCGCTAAGGGTTCGTAATAATAAATTTGAGAGGAGAAAATATAACTGGGCTCTAAGCCACCACCTTATTTACATTGATAACCCAGTGGGAACTGGTTTCAGTTTTACCAAGGATTCTCGGGGCTACTGTCAAAACGAAACCCAAGTAGGAGAACAGCTTTACTCCACAATTATACAATTCTTTCAGTTGTTCCCAGAATTGCAAGGCAACAAATTTTTTATTACCGGAGAGTCATATGGTGGTAAATATGTACCAGCATTTGCCTATACTATACACAAGAAAAACCCATCCGCAAAATTGAAAATCAATCTCAAGGCCTTGGCAATAGGAAATGGATTAAGTGACCCTGAACACCAATTGGTATACAGCAAATACTTGTATCAAATAGGACTTTTAGATTGGAACCAAGCCCAAGTATTTGCTGATGCTGAAAGCAAAGTTGTTGATTTAATCAAACAGCAGAAATTTGATAAAGCATTTGAGGCCTTTGATACTTTACTAAACGGAGATCTCATTGATGGAAAAAGTGTTTTCTATAACATGACTGGTTTTGAGTTTTATTTCAATTTCCTTCACACCAAGGACTATAAACAGTTTGAAGACTTTGGTCCAATGTTACAAAAAAGTTTTGTTAGAAAGGCAATACATGTAGGAAATATGACATTCAATGATGGGAAACTAGTAGAACAGCATCTAAAACAAGATGTAATGAAGTCAGTTGCACCTTGGATAGCAGAGTTACTCGATCATTACTATGTTGTGGTATACAATGGGCAGTTAGATATAATAGTAGCATATCCAATGACAATTAATTATTTAAGAAATCTCAATTTTACTGGATCTGATGAGTATAAGAATGCCAAAAGATACCAGTGGTATGTTGACGGAGAGCTGGCTGGATATGTGAAACAAGGAGGGAAATTAGTCGAGATTATGGTAAGAAATGCTGGTCACATGGTACCCGGTGACCAACCTAAGTGGGCTCTGGATTTGATTACAAGATTGACTCATGAGAAAACATACAATAGTTTTGATCATAAGCAGTCTCTTGGAAATCTATAA

Protein sequence:

>DPOGS207125-PA
MLANNKDAPVIVWLQGGPGASSLYGLFTENGPLRVRNNKFERRKYNWALSHHLIYIDNPVGTGFSFTKDSRGYCQNETQVGEQLYSTIIQFFQLFPELQGNKFFITGESYGGKYVPAFAYTIHKKNPSAKLKINLKALAIGNGLSDPEHQLVYSKYLYQIGLLDWNQAQVFADAESKVVDLIKQQKFDKAFEAFDTLLNGDLIDGKSVFYNMTGFEFYFNFLHTKDYKQFEDFGPMLQKSFVRKAIHVGNMTFNDGKLVEQHLKQDVMKSVAPWIAELLDHYYVVVYNGQLDIIVAYPMTINYLRNLNFTGSDEYKNAKRYQWYVDGELAGYVKQGGKLVEIMVRNAGHMVPGDQPKWALDLITRLTHEKTYNSFDHKQSLGNL-