Monarch geneset OGS2.0

DPOGS200613
TranscriptDPOGS200613-TA1122 bp
ProteinDPOGS200613-PA373 aa
Genomic positionDPSCF300076 - 51639-53867
RNAseq coverage57x (Rank: top 69%)
Annotation
HeliconiusHMEL0147402e-9554.46% 
BombyxBGIBMGA008976-TA7e-12567.22% 
DrosophilaCG3108-PA1e-6643.46% 
EBI UniRef50UniRef50_Q60F938e-6945.82%Molting fluid carboxypeptidase A n=5 Tax=Neoptera RepID=Q60F93_BOMMO
NCBI RefSeqXP_973407.21e-7245.54%PREDICTED: similar to CG3108 CG3108-PA [Tribolium castaneum]
NCBI nr blastpgi|1892399582e-7145.54%PREDICTED: similar to CG3108 CG3108-PA [Tribolium castaneum]
NCBI nr blastxgi|1892399581e-7044.78%PREDICTED: similar to CG3108 CG3108-PA [Tribolium castaneum]
Group
Gene OntologyGO:00065086.3e-101proteolysis
GO:00082706.3e-101zinc ion binding
GO:00041816.3e-101metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[78-358] IPR0008346.3e-101Peptidase M14, carboxypeptidase A
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200613-TA
ATGGCTAAAAATTTCTCAGGATCCAGTGTTTTAAATGATGATTACTACATAGACACGGATTCGTCGGAAGAAAAAGACAAAGCAGAAAATGGCACGCAAACCAGGTTTCTTAAATGTGGCAAAGGGAAATCAAGCAAAATAACTTTTAAATATATAGGTAAAGGAATCGGTCAAATCGCTCCAAAAGAAAAACCAAAAAGGAAACAACTTTTAGTAATGGACTGGAAAAAATTTCATAGGCTGAGTGTTATCTATTCTTTTTTGGACCATTTAGAAAAAGACTTCCCGGCTATATGTACTGTGTCGGTTATCGGTAAATCAGTTGAGGGAAGGGACATTAAGATGTTAAAGATATCGAACAGCAATGCCAGTAATGCGGCAGTATGGCTTGATGCCGCAATACACTCTAGGGAATGGATCAGCACAGCTGTAGTCACATACCTCGCCGACTTCATCGCCAGGAACTTCCAAGACTTATCCAATTCGGTCACGAATAAAGATTGGTATATAGTACCGGTACTAAATCCAGACGGCTACGAGTACACTCACACACGCGACCGAATGTGGCGTAAGAATAGGGCTCGACGGGACGGTGCCTGTGTGGGTGTCGATCTCAATAGAAACTTCAGCTGTGGATGGGGAAACAATGGTGAGGAGGGGTCCTCAGACAATCCAAACAGTGTGTTTTATAGAGGTCCAGAACCATTTTCAGAACCTGAATCATCAGCAGTGCGGGATACAATATTGAGCTCAGCGACGGCTTTCAAGGTCTTCCTATCATTCCACAGCTATTTCGAACTAATTATATTCCCTTGGGGCTACAAAACAGACCCCTGTCCACACTACTTAGACCTTCTGGAAGGCGCCTCAATAATGGCAAGGGCCATTTATGAAAGCAGCGGAATAGTATACAAAGTGGGTTGTACAAAAGACCTTACATATTATGCTTGTGGGACGAGTATAGACTGGAGCTATGCTATAGCGAAAATTCCATATTCGTATATGGTCGAATTGAGGAGTAAAAAGTACAGGTTTAAGTTGCCCAAAGACCAAATTGAAGAGACATGCGTGGAGATTTGGAATGCAGTTAAAAGTTTAATGGATTATGTGGACCAACCTTGA

Protein sequence:

>DPOGS200613-PA
MAKNFSGSSVLNDDYYIDTDSSEEKDKAENGTQTRFLKCGKGKSSKITFKYIGKGIGQIAPKEKPKRKQLLVMDWKKFHRLSVIYSFLDHLEKDFPAICTVSVIGKSVEGRDIKMLKISNSNASNAAVWLDAAIHSREWISTAVVTYLADFIARNFQDLSNSVTNKDWYIVPVLNPDGYEYTHTRDRMWRKNRARRDGACVGVDLNRNFSCGWGNNGEEGSSDNPNSVFYRGPEPFSEPESSAVRDTILSSATAFKVFLSFHSYFELIIFPWGYKTDPCPHYLDLLEGASIMARAIYESSGIVYKVGCTKDLTYYACGTSIDWSYAIAKIPYSYMVELRSKKYRFKLPKDQIEETCVEIWNAVKSLMDYVDQP-