Monarch geneset OGS2.0

DPOGS211663
TranscriptDPOGS211663-TA1185 bp
ProteinDPOGS211663-PA394 aa
Genomic positionDPSCF300151 - 288052-292099
RNAseq coverage48x (Rank: top 71%)
Annotation
HeliconiusHMEL0048413e-11050.92% 
BombyxBGIBMGA004797-TA5e-11861.81% 
DrosophilaCG7025-PA5e-7139.26% 
EBI UniRef50UniRef50_B0WS124e-8140.21%Zinc carboxypeptidase A 1 n=10 Tax=Culicimorpha RepID=B0WS12_CULQU
NCBI RefSeqXP_001851496.18e-8240.21%zinc carboxypeptidase A 1 [Culex quinquefasciatus]
NCBI nr blastpgi|3807138504e-10550.99%carboxypeptidase [Bombyx mori]
NCBI nr blastxgi|3807138502e-10251.14%carboxypeptidase [Bombyx mori]
Group
Gene OntologyGO:00065081.2e-109proteolysis
GO:00082701.2e-109zinc ion binding
GO:00041811.2e-109metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[57-354] IPR0008341.2e-109Peptidase M14, carboxypeptidase A
Orthology groupMCL30280 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211663-TA
ATGGTTCCCGCTTATAATAAAATAAGATTCCTTGATATTGTCAAGGAATCCAAGATGGAGGTCATAGAGATTATAACGGACTTACAACAAACGATCGAAGAGCAACTGCGACCAGCAGCAAGGAGTTTAAGATCTGATGAAACGTATTTGTCAATGAACTGGAATCAGTACCACAATCTCGAGGACACTTACAAATGGTTGGACGAAGTCCAAAGAAACAACCCCAGTGTAGTGACCACTGTGGTGATGGGACGCTCGGTTGAAGGAAGAGAAATAAAAGGCCTTAAAATAAACTTCAGGAACAAAACTAATCCGGTCATAGGTTTTCTGACTGGAACACTCCATGCCAGAGAATGGATTACTCCCTCCACTCTGACGTGGATCATTAAGGAGTTCCTGACGTCGAACAACAGAGACATCAGGGCTTTGGCTGAAAATATTGAATGGCATATTTTTCCGATTGTCAACCCTGATGGTTACGTCTACACATTTACAACGAACAGAATGTGGAGGAAGAACCGCAGTAGGTTTGATTCGAAGTCCTGTTCTCATATGAACGCTAGTGACGATATGAGCAACGGAGTCGACCTAAATAGAAACTTCGACTTCGTTTGGATGGGTTCGGGATCTTCAAACGATTCCTGTGCCATCACCTATGCCGGTCCAACAGCCTTCTCTGAACCAGAAACCAGAGCCATTAGTAACTACGCCCTACGCTTGAACGCAAAAGGACAACTCCTTTACTACATTGACTTCCACTCGTTCACTCAACTGATATTAGTACCTTACAGCCATCTGCCAGAGAGCGAGGTCCACACTGTGGAAAACTACGATGACATGTATAAGGTAGCGGTTTCAGCGGCAGACAAAATTCGTGAAAGAAACGGTACCGTCTACCGAGCGGGAATATCGTCTGTGGTCATGTATCCAATGACTGGTACCAGCTTCGATTGGGTCAAGAATAACACGAAAGTGTCGTTCAGCTTCCTCATCGAGCTCAGAGACCTCGGACAGTACGGCTTCCTGTTGCCTGCGGAACAGATCATACCCAACAGCCTGGAGACCATGGACGGACTCATTGAGATGGACAAGACAGTTCAACTCCTTGGATACTACTCGTCTGGTCCATCCAACTTTATACCATCTCTCGGTATCATATTGTGTGCTTTTGTTTTGGTTTATTGA

Protein sequence:

>DPOGS211663-PA
MVPAYNKIRFLDIVKESKMEVIEIITDLQQTIEEQLRPAARSLRSDETYLSMNWNQYHNLEDTYKWLDEVQRNNPSVVTTVVMGRSVEGREIKGLKINFRNKTNPVIGFLTGTLHAREWITPSTLTWIIKEFLTSNNRDIRALAENIEWHIFPIVNPDGYVYTFTTNRMWRKNRSRFDSKSCSHMNASDDMSNGVDLNRNFDFVWMGSGSSNDSCAITYAGPTAFSEPETRAISNYALRLNAKGQLLYYIDFHSFTQLILVPYSHLPESEVHTVENYDDMYKVAVSAADKIRERNGTVYRAGISSVVMYPMTGTSFDWVKNNTKVSFSFLIELRDLGQYGFLLPAEQIIPNSLETMDGLIEMDKTVQLLGYYSSGPSNFIPSLGIILCAFVLVY-