Monarch geneset OGS2.0

DPOGS204330
Transcript: DPOGS204330-TA  2373 bp
Protein: DPOGS204330-PA  790 aa
Genomic position: DPSCF300142 - 81635-87155
RNAseq coverage: 28x (Rank: top 76%)
Annotation
Heliconius: HMEL002317  2e-64  35.65%
Bombyx: BGIBMGA007227-TA  7e-32  31.14%
Drosophila: CG4408-PA  2e-25  30.91%
EBI UniRef50: UniRef50_Q9VCM8  3e-23  30.91%  CG4408 n=21 Tax=Drosophila RepID=Q9VCM8_DROME
NCBI RefSeq: XP_001955443.1  2e-25  31.45%  GF18768 [Drosophila ananassae]
NCBI nr blastp: gi|194745941  5e-24  31.45%  GF18768 [Drosophila ananassae]
NCBI nr blastx: gi|324517406  3e-24  29.23%  Carboxypeptidase B [Ascaris suum]
Group
Gene Ontology: GO:0006508  9.9e-39  proteolysis
               GO:0008270  9.9e-39  zinc ion binding
               GO:0004181  9.9e-39  metallocarboxypeptidase activity
KEGG pathway: tgu:100229153  1e-15
              K01300 (CPB2)  maps-> Complement and coagulation cascades
InterPro domain: [463-723] IPR000834  9.9e-39  Peptidase M14, carboxypeptidase A
Orthology group: MCL25324  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204330-TA
ATGAATTTTGGTATATTTTTGATAATATTAGGTTTTGCTCATTTAGTTTGTTGTTCTAATAGTAATGTCACTAATAATGAGTGCGATGTATGTACATTATCATGGTTCCGTCGTAAACGGGATCTTAAGGAGTACACGCGATGGAAGTCGAACGTAGAGACGACACAAGAATTTATTGTTAAAGTTTTAGAAGACGATGTGTCATTACCTCAAATTAAGAAGTTTGCGGCTATAAACATTGCGCCGTTCACTAAAGAAAGAGAAAAAATTATAATTGGTGGTCCAATAACTCCATCACCTACAACGCCTTCAGATTCAGATAAGTTAAAAGATTATACTCGATTTGGTCCTCCCTCGACCTTGTCTGTTACTGCCATTGATGGAAACATTATCCTTCAGCCCATTAAAATAAACGAAAATGCAATTGCACCGATTCGACCCCCAGGTCGTAAATCCGGATTAGAACCTGCAAAAAACCCACTTCGAGATGAAAGTCCTCCAGCGCGACCCCAATTCTTACCAAAACAAGAGCCATTAATAGCTCGGCAATTCGCTCCAATAGCGCCGGTATCTTTTACTAAACGTAATTCTCTGGGCCCTTTGACTGGTCGTGGAATAGACAAATCGATGTTGTTCAGAGAAACAACATTTCAACATCCCACAAGAACTTCCCATGCAGAGCTAGAAGTCCCAAAAAGGACAACAGAATTTACGACTTCTGTTACATATGACTACAATTATGAATTAAGATCTCAATTTGTTCCAATGCGGACTTCACAAAAGTCGATAATAGATTTACATAGAACAATAACTGACCAAATAGAAACGGAAAATAGTGATTTTATGCCCACGATTGCCAGTAAAAAAATAGCAGCGCAGAAAGGTTCTTCAAAAATAGGACACAAAGAAACAAAAATGGCTTACGATCTACTTTACCACGATGAAGATGAAATAAGCGACCCGAGAAGAAAAGAACATCAAAGAATCACAGAGAGTTATGACGATCTCCATGTAATTGACAACGAGAATGTTACACGAGATGATGCTTTAAATAATGCTGAAGAACGTGAAAGTACCACAGACGACTCCGATTCAAATAATAACGATAAAGACAATTTGGAAACGACTATAAGTGGTTTTCAAGCTGAACAAAAGCTATTTGGTGTGGCGACTATAGAATCAGTTTCTAATAAATGGGATAGCGAGGAAGATGAAGATAAAAGCAAAACAACGCTGAAAGAAAAAAAAGTATTTTTATGTAACATATTAAAAAGAAGACAACTTTCTTTCAGCACTCCTTTGACTTTGTACGAGATAGTAACGCAACTAAAACAATGGGCCGATGAAAGTCCTGTTGCCAAATGGATGGACCTTACCGAAGGAAATTACACGATAATGGAAAATCCGATACACATGATGATGGTGGATGATCCCAGTAGTGGACAAATTATGTCTGCTAAAAAAACCGTTATGATCGTCGCAGGAATCCAAGGAAGAGATCATCATGCTGTTACGGCGGCGATGTACGTTCTGTATCAGTTGATTGAACGCAGTGAAGCACATTCTGATTTGCTTACAAAGTTCCGCTTTTGGATTGTTCCTGTTTTTAATCCAGACGGATATGATTATTCAATGACTTTCCCCCAAAGACGTGAATGGACGAAGAATGTACGACAGTCGTGGGATTCGTGCCAAGGTAGGATTCTATGTAGGACCTGCGAAGAGTTCGGAGTGGGTTGTACGATACGGCCATGTTACGGGGTCAACTTGGATCGCAATTTTGAATATCAATGGATACCCACAGAAGAGTTGCGCTCTGAACATCCGTGTGGAATGCTGTATGCTGGACCTCGTCAACTGAGCGAGGCTGAGACGATAGCGCTCACGCATTTTCTTCACAGTCAGCGCACGCCGATTCACACTTTCATAGCTTTCAAAGAAGGAGATGTTCTAGGAATAATGTACCCTTACTCACACACAAAGAAGAGACGTGCCTTCGA
TCACATTTATAGACACAGAGCCTCGAGAGCTGCAGCCGCTGCTAATAGCATTAGCGGTCGGCCATACGTTGCTGGTCAAACCTCCGAGTTTCTACCGCTTTACGCTGGTGGCGTTGAAGACTGGGTGGATGGTCATCTGGGGATAGACAATACTTATACCGTCATGATGTTCCGACCCTCTGATTCCTATAGTTCAAAACTTATCACGGAGCGCGTCGTTCACGAGGCGTACGCTGCGGTGGATACTTTGCTTCTAGAGAGTGTCGAGACTCCGAGATCGCCGCAAACAGTTTTAACCAGAGCCAAAGCTTCTGCAAACACCTTATTCGCGAATATCTTTATATTATTACCTATGGTTCTGGGCTTCGGTTAA

Protein sequence:

>DPOGS204330-PA
MNFGIFLIILGFAHLVCCSNSNVTNNECDVCTLSWFRRKRDLKEYTRWKSNVETTQEFIVKVLEDDVSLPQIKKFAAINIAPFTKEREKIIIGGPITPSPTTPSDSDKLKDYTRFGPPSTLSVTAIDGNIILQPIKINENAIAPIRPPGRKSGLEPAKNPLRDESPPARPQFLPKQEPLIARQFAPIAPVSFTKRNSLGPLTGRGIDKSMLFRETTFQHPTRTSHAELEVPKRTTEFTTSVTYDYNYELRSQFVPMRTSQKSIIDLHRTITDQIETENSDFMPTIASKKIAAQKGSSKIGHKETKMAYDLLYHDEDEISDPRRKEHQRITESYDDLHVIDNENVTRDDALNNAEERESTTDDSDSNNNDKDNLETTISGFQAEQKLFGVATIESVSNKWDSEEDEDKSKTTLKEKKVFLCNILKRRQLSFSTPLTLYEIVTQLKQWADESPVAKWMDLTEGNYTIMENPIHMMMVDDPSSGQIMSAKKTVMIVAGIQGRDHHAVTAAMYVLYQLIERSEAHSDLLTKFRFWIVPVFNPDGYDYSMTFPQRREWTKNVRQSWDSCQGRILCRTCEEFGVGCTIRPCYGVNLDRNFEYQWIPTEELRSEHPCGMLYAGPRQLSEAETIALTHFLHSQRTPIHTFIAFKEGDVLGIMYPYSHTKKRRAFDHIYRHRASRAAAAANSISGRPYVAGQTSEFLPLYAGGVEDWVDGHLGIDNTYTVMMFRPSDSYSSKLITERVVHEAYAAVDTLLLESVETPRSPQTVLTRAKASANTLFANIFILLPMVLGFG-