Monarch geneset OGS2.0

DPOGS200619
TranscriptDPOGS200619-TA1071 bp
ProteinDPOGS200619-PA356 aa
Genomic positionDPSCF300076 + 49874-51470
RNAseq coverage70x (Rank: top 66%)
Annotation
HeliconiusHMEL0147402e-8645.48% 
BombyxBGIBMGA008976-TA3e-11252.99% 
DrosophilaCG3108-PA6e-6542.14% 
EBI UniRef50UniRef50_Q60F933e-6643.05%Molting fluid carboxypeptidase A n=5 Tax=Neoptera RepID=Q60F93_BOMMO
NCBI RefSeqXP_001122133.15e-6940.26%PREDICTED: similar to CG3108-PA [Apis mellifera]
NCBI nr blastpgi|3123759383e-6838.40%hypothetical protein AND_13384 [Anopheles darlingi]
NCBI nr blastxgi|3123759384e-6638.40%hypothetical protein AND_13384 [Anopheles darlingi]
Group
Gene OntologyGO:00065082e-96proteolysis
GO:00082702e-96zinc ion binding
GO:00041812e-96metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[48-328] IPR0008342e-96Peptidase M14, carboxypeptidase A
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200619-TA
ATGCAGATACAGAAAATGCTCCGAGACAGAGACATTTCGTTTGAAGTGACACCTTCCAGTCTAAGCGGATCTCGTAACAGAAGCCCTGTCTTACTGAGCCCAAGACCAAGATCCCCACGACGTGAAATGGATTGGAAGGATTTCTATCCCCTTGAGAAGATTTATAGATTCATAGATAGTCTTGAGGTACAATTTCCTTCAACTTGCACCTCGACTGCCATCGGACGGACTGTTGAAGGGAGAGACATAAAGATGTTAAAAATTTCTAATAGTGATGCTTGTAACACTGGTGTTTGGATTGATGGGGGCATACATGGTCGGGAGTGGATAGCACCAGCCGTTGTGACTTACATTGCTGATCAAATCGCGAAGAACTTCGATAATCTTCCTCAGAGCATAACGAATAAAGACTGGTATTTGCTTCCCATTGTTAATCCAGATGGCTACTATCATACACATAACTCTGATAGAATGTGGAGAAAAAATAGAGCTAAGATAGATAACACATGTTTTGGAGTTGATCTCAATCGTAATTTTGGTTATTACTGGGGTCGCAGTGGACTAGAATGTTCAACAGACGATCCCAGTCATATAAATTACCGTGGCCCTGAACCTTTTTCCGAACCGGAAACGTCAGCCGTGAAGGATATGATACTTTACTCGGGAACACCGTTCAAGATCTTCATATCCCTTCATTCATACAGCGAAGTCATCGCATTTCCCTGGTGTTTTACATCAGAACCGTGCGCTGATTACGTGAACTTACTTGAAGGTGCCACTGCTATGGCTAAAGCAATATATGATGTGAACGGACGAATGTACAAAGTTGGAAATTTCAAGGATCTTATGTATTTCGCAACTGGCACCAGCGTGGATTGGAGCTATGGAACGGCTCGGATCCCGTTCTCCTATCTCATAGAGCTCAGAAGCAAACAGCACAAGTTCCTTTTACCGAAAGAAGAGATTTTAGACTGTTGTAAAGAAATATTTAGTGGCATAAAAGCTTTAGCCGAATTTGTTGACAAGAAAAAATGTTTAAATTGTACCATGTTTTATAATAAAAATTGTTGA

Protein sequence:

>DPOGS200619-PA
MQIQKMLRDRDISFEVTPSSLSGSRNRSPVLLSPRPRSPRREMDWKDFYPLEKIYRFIDSLEVQFPSTCTSTAIGRTVEGRDIKMLKISNSDACNTGVWIDGGIHGREWIAPAVVTYIADQIAKNFDNLPQSITNKDWYLLPIVNPDGYYHTHNSDRMWRKNRAKIDNTCFGVDLNRNFGYYWGRSGLECSTDDPSHINYRGPEPFSEPETSAVKDMILYSGTPFKIFISLHSYSEVIAFPWCFTSEPCADYVNLLEGATAMAKAIYDVNGRMYKVGNFKDLMYFATGTSVDWSYGTARIPFSYLIELRSKQHKFLLPKEEILDCCKEIFSGIKALAEFVDKKKCLNCTMFYNKNC-