Monarch geneset OGS2.0

DPOGS200614
Transcript        DPOGS200614-TA  1254 bp
Protein           DPOGS200614-PA  417 aa
Genomic position  DPSCF300076 - 43598-48116
RNAseq coverage   9x (Rank: top 85%)
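The lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the 1254 bp transcript is coding sequence only, including the stop codon, and that the genomic coordinates are 1-based and inclusive; neither convention is stated in the record, and the function names are illustrative.

```python
def protein_length(cds_bp, includes_stop=True):
    """Number of amino acids encoded by a coding sequence of cds_bp bases."""
    codons, remainder = divmod(cds_bp, 3)
    if remainder:
        raise ValueError("coding sequence length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval."""
    return end - start + 1


print(protein_length(1254))        # 417
print(span_length(43598, 48116))   # 4519
```

Under these assumptions the transcript length agrees with the 417 aa protein, and the gene spans 4519 bp on scaffold DPSCF300076.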
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL014740        5e-114   67.68%
Bombyx          BGIBMGA008975-TA  2e-120   74.52%
Drosophila      CG3108-PA         2e-76    46.20%
EBI UniRef50    UniRef50_Q9W475   3e-74    46.20%    CG3108 n=26 Tax=Opisthokonta RepID=Q9W475_DROME
NCBI RefSeq     XP_002059539.1    3e-76    47.96%    GJ14824 [Drosophila virilis]
NCBI nr blastp  gi|219553192      1e-76    46.47%    molting carboxypeptidase A [Helicoverpa armigera]
NCBI nr blastx  gi|219553192      4e-75    46.47%    molting carboxypeptidase A [Helicoverpa armigera]
Group
Gene Ontology    GO:0006508  2.6e-99  proteolysis
                 GO:0008270  2.6e-99  zinc ion binding
                 GO:0004181  2.6e-99  metallocarboxypeptidase activity
KEGG pathway
InterPro domain  [118-395] IPR000834  2.6e-99  Peptidase M14, carboxypeptidase A
Orthology group  MCL30625  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200614-TA
ATGGAAAAACAAACTCACCCACATAACACAAGTGGCGAAGAATTTTCCGATAATGAAGGCTGCGTTTACGAAAATGATTCTTACGAAAGTATCCTAGATGGAGTCTACAGTAACGAATATATATATGACTCTGAGATGACCATCCAAAATAAATATCAAACAAAACACGAAGTTAATCACGTCAATGTGGATGACAGTCGTTACGGGGTTTATAGACGTCCAATGAAAGTAGGGAGTAAAAGATTTTCATACAATCATCTTTCAAAAGACAAACCAGGTATTCAGAAAAAGGGACAAGAAGATAATTATCATGCCGAAGATCGTAAAGCAAAACGAATGGACTGGAATGATTACCAACGCCTAGATGTTATACACTCGTTCTTAGATGAGTTGGAATACGACTATCCTTCCATATGCACAGTGGGAACAATTGGTTTATCTCGAGAAGACAGAAATCTGAAGATTCTTAAAGTATCTAACAGCGACGCCCACAACCCAGCTGTATGGTTGGACGCAGGTATTCACGCCCGAGAATGGATCGCGCCGGCGGTCGCCACTTACATCGCCAACCACATCGTCCGCAACTTCAGCTCTCTGCCGTCCAGCATAACGAATAAGGACTGGTATTTCCATCCGGTTGTAAATCCTGATGGATATGAATATTCTCACACCGTTGATAGGATGTGGAGGAAAAATAAAGCCTATATTGGAGGAAAACTGGTCGGGGTGGATCTCAACAGGAATTTCAGTTATGGCTGGGGAGGCAAAGGCTCGTCGGAGACTCCTACCAGCGTGTTCTATCGAGGTCCAGAACCGTTCTCAGAACCTGAATCCTGTGCTGTCAGGGACGTGCTTCTTTACTCTGGCATTCCGTTTAAAGTATATATAACTCTACATAGTTATGGCCAAATAATTTTATTCCCGTTTGCCTATAAAGATGAACTTTGTCCCGACTACGTACGGCTCTTAGAAGGGGCAACTGTAATGTCAAAGGCTATCCACGAAAGCAGTGGAAACACATACAAGGTGGGACTATCCAGGGATGTAATGTACGGAGCGGCTGGCACTAGTAATGACTTCAGTTACGGGGTAGCTAAAATACCGTACTGCTATCTTCTAGAACTGAGAAGTAAGAAACATAAGTTCAGATTACCTAAAGAAGAAATAGAAGAGACGGGGAACGAAATACTAAGCTGTATAAAGGCTTTAATGGAGTTTATAGACGAGAAATCTGCAACTGAAAAAACTGAATAG

Protein sequence:

>DPOGS200614-PA
MEKQTHPHNTSGEEFSDNEGCVYENDSYESILDGVYSNEYIYDSEMTIQNKYQTKHEVNHVNVDDSRYGVYRRPMKVGSKRFSYNHLSKDKPGIQKKGQEDNYHAEDRKAKRMDWNDYQRLDVIHSFLDELEYDYPSICTVGTIGLSREDRNLKILKVSNSDAHNPAVWLDAGIHAREWIAPAVATYIANHIVRNFSSLPSSITNKDWYFHPVVNPDGYEYSHTVDRMWRKNKAYIGGKLVGVDLNRNFSYGWGGKGSSETPTSVFYRGPEPFSEPESCAVRDVLLYSGIPFKVYITLHSYGQIILFPFAYKDELCPDYVRLLEGATVMSKAIHESSGNTYKVGLSRDVMYGAAGTSNDFSYGVAKIPYCYLLELRSKKHKFRLPKEEIEETGNEILSCIKALMEFIDEKSATEKTE-
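
The protein sequence above appears to be a direct translation of the transcript, with the trailing "-" standing for the stop codon. A minimal Python sketch of that translation, assuming the standard genetic code (NCBI translation table 1) and using a hypothetical `translate` helper, applied to the first eight and last three codons of DPOGS200614-TA:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered by
# iterating each position over T, C, A, G; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


print(translate("ATGGAAAAACAAACTCACCCACAT"))  # MEKQTHPH
print(translate("ACTGAATAG"))                 # TE-
```

Both fragments match the corresponding ends of DPOGS200614-PA. Translating the full transcript the same way would be expected to yield 417 residues followed by the stop symbol.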