Monarch geneset OGS2.0

DPOGS200623
TranscriptDPOGS200623-TA1569 bp
ProteinDPOGS200623-PA522 aa
Genomic positionDPSCF300076 + 211579-217204
RNAseq coverage160x (Rank: top 52%)
Annotation
HeliconiusHMEL0031455e-15986.67% 
BombyxBGIBMGA008910-TA0.066.15% 
DrosophilaCG3108-PA2e-11548.43% 
EBI UniRef50UniRef50_Q60F930.066.15%Molting fluid carboxypeptidase A n=5 Tax=Neoptera RepID=Q60F93_BOMMO
NCBI RefSeqNP_001036933.10.066.15%molting fluid carboxypeptidase A [Bombyx mori]
NCBI nr blastpgi|1129830460.066.15%molting fluid carboxypeptidase A precursor [Bombyx mori]
NCBI nr blastxgi|1129830460.066.15%molting fluid carboxypeptidase A precursor [Bombyx mori]
Group
Gene OntologyGO:00065086.1e-138proteolysis
GO:00082706.1e-138zinc ion binding
GO:00041816.1e-138metallocarboxypeptidase activity
GO:00041803.6e-13carboxypeptidase activity
KEGG pathway 
InterPro domain[227-507] IPR0008346.1e-138Peptidase M14, carboxypeptidase A
[87-175] IPR0090209.6e-15Proteinase inhibitor, propeptide
[87-172] IPR0031463.6e-13Proteinase inhibitor, carboxypeptidase propeptide
Orthology groupMCL11156 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200623-TA
ATGGCAAAGCTCTGGTGGACGGTCGTGTGCCTCCTCGCATCCTTGGAGCTCTGCACTCCGCTTATAAACGAATTGCAACCGGGTAAAGAATGGCCCAAACGTCAATCAGTAAGACAACCTCAGGACGAGCTAGACAACCCTGATGTCACAACGGTTATAGCTGATACAACGGTAGGAGGTTCAGTAGATATACCCGAAGATGTACAAGAAGATGTTGAAGAGAACATTCAAACAAAAGCTATAGATGTAGAAGACTCCAAAGTAGATTACTCCGGAGCACAACTGTGGAAAGTGGCGACTGATAAGAACGGAGTAAGAGTACTTTTAGGTCGATTGCGTCGTAGAAATCTCATTTCGACGTGGTCGGGGAACCAAACGTACATCGATGTCCTTGTGAAACCCGACGCCGTACAGAACGTTACACGGATATTCAAGAGAGAGAACATTACTTTTGACGTCATTATCGAGGACCTACAAAGGAGAATCAATGAAGAAAACCCTCCGCTCGATGAGAACGAAATTGAGCTACAGGACAGACGAGCAAGATCTTACAATTTTTGGCGTTACAGGGCCACCCGTTTAGGTGTGATAAAAGCCTTCATGGGCTACAGCTCTCTTAAACTTCCAATGGAGTTTTGGCCATATGCCAACTGGGGTCACCGTATGACATGGAAACAATATCACAGATTGGAAGACATTCACGGCTTTATGGATTATTTAGCCAAAACGTATCCCAAGATCGTGAGTGTGAACTCAATAGGAAAATCCTATGAAGGAAGAGACCTTAAAGTTCTCCGTATATCAGATGGCAAGCCTTCAAATAAGGCGGTTTTTATCGACGGTGGTATACACGCTAGGGAATGGATCAGCCCGGCTACGGTTACATACTTCATCAACCAAATAGCTGAAAACTTCGACGAAGAATCCGATGACATAAGGGATATTGATTGGTATTTCTTGCCTGTTGTCAATCCTGATGGATACGAATACACGCATATCAAAGATCGTTTGTGGAGAAAAAATAGAAAGCCGGCAGTTTACGGTGTGAGACAGTGTGTCGGGACTGATTTGAACAGAAATTTCGGTTATCGTTGGGGTGGTAAAGGTTCCTCGAGTAATCCCTGCAGTGAAATATATAGAGGAAATAGAGCTTTTTCTGAACCAGAATCCAGAGCAGTATCGGAATTCATCAAAACAAGTGCAGCTAATTTCTCAGCATACCTGACATACCACAGTTATGGTCAATATTTATTATACCCTTGGGGATATGACAACGCAGTCCCACCAGATCATAAAGAATTAGATCTTGTTGGCAAAAATATAGCAGCGGCTATTCAAGCGACTGGAGGCTCTAAATATTCTGTTGGGTCGTCTAGTGGCCTCCTTTATCCCGCTTCAGGCGGTTCAGATGACTGGGCCAAAGGCCAGGGCATTAAATATGCATACACAATTGAACTTAGCGATACTGGCCGCCATGGATTTGTTTTGCCGACAACCTTCATTGAGCCAGTAGCAAGGGAATCATTGTCAGGCTTAAGAGTGCTTGCAGCCCAATTAAGAAAGAACTAA

Protein sequence:

>DPOGS200623-PA
MAKLWWTVVCLLASLELCTPLINELQPGKEWPKRQSVRQPQDELDNPDVTTVIADTTVGGSVDIPEDVQEDVEENIQTKAIDVEDSKVDYSGAQLWKVATDKNGVRVLLGRLRRRNLISTWSGNQTYIDVLVKPDAVQNVTRIFKRENITFDVIIEDLQRRINEENPPLDENEIELQDRRARSYNFWRYRATRLGVIKAFMGYSSLKLPMEFWPYANWGHRMTWKQYHRLEDIHGFMDYLAKTYPKIVSVNSIGKSYEGRDLKVLRISDGKPSNKAVFIDGGIHAREWISPATVTYFINQIAENFDEESDDIRDIDWYFLPVVNPDGYEYTHIKDRLWRKNRKPAVYGVRQCVGTDLNRNFGYRWGGKGSSSNPCSEIYRGNRAFSEPESRAVSEFIKTSAANFSAYLTYHSYGQYLLYPWGYDNAVPPDHKELDLVGKNIAAAIQATGGSKYSVGSSSGLLYPASGGSDDWAKGQGIKYAYTIELSDTGRHGFVLPTTFIEPVARESLSGLRVLAAQLRKN-