Monarch geneset OGS2.0

DPOGS207768
Transcript: DPOGS207768-TA, 1407 bp
Protein: DPOGS207768-PA, 468 aa
Genomic position: DPSCF300042 - 237891-246404
RNAseq coverage: 82x (Rank: top 64%)
Annotation
Heliconius: HMEL017562; 0.0; 91.37%
Bombyx: BGIBMGA005320-TA; 2e-156; 89.32%
Drosophila: CG4678-PE; 0.0; 66.52%
EBI UniRef50: UniRef50_UPI000206247F; 0.0; 66.23%; UPI000206247F related cluster n=1 Tax=unknown RepID=UPI000206247F
NCBI RefSeq: XP_001942938.1; 0.0; 68.11%; PREDICTED: similar to GA18350-PA [Acyrthosiphon pisum]
NCBI nr blastp: gi|195039526; 0.0; 70.51%; GH12395 [Drosophila grimshawi]
NCBI nr blastx: gi|195039526; 0.0; 70.51%; GH12395 [Drosophila grimshawi]
Group
Gene Ontology: GO:0006508; 1.8e-82; proteolysis
GO:0008270; 1.8e-82; zinc ion binding
GO:0004181; 1.8e-82; metallocarboxypeptidase activity
GO:0004180; 8.4e-20; carboxypeptidase activity
KEGG pathway 
InterPro domain: [61-338] IPR000834; 1.8e-82; Peptidase M14, carboxypeptidase A
[352-428] IPR014766; 8.4e-20; Carboxypeptidase, regulatory domain
[350-432] IPR008969; 2.6e-16; Carboxypeptidase-like, regulatory domain
Orthology group: MCL14937; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207768-TA
ATGTTCCAGTGTACAAAATCCCTCTCAAGGGTTTTCCTGTTGCTACTTATATTTAAAACATCTCACGGACAATATTCTCCCGGATCGACTTTTACAGAAGATGACGCAACCAACCCTAGCGATGTGGAGTCTCGTGATGTCGGTACTGAGCTGCAGTACAAATACCATGACCACGAGGAGATGACACGCTACCTGCGGGCCGTCTCCGCCAGATATCCAGCGCTCACAGCGCTGTACTCCATAGGAAAGTCCGTCCAAGGTCGAGATCTCTGGGTGATGGTGGTGTCGGCATCGCCCTATGAGCATATGATTGGCAAGCCAGATGTCAAATACGTAGCCAATATACACGGCAACGAAGCTGTCGGAAGGGAGATGCTTTTACATCTTATACAGTACTTAGTGACTTCCTACGAGACGGATTCCTACATAAAATGGTTATTAGACAACACACGCATACATTTGATGCCATCGATGAACCCTGATGGTTTCCTCATATCTCGTGAAGGACAGTGTGACACCATTCATGGCAGGCACAACGCCCGTCGCTACGACTTGAACCGTAACTTCCCGGATTTCTTCAAACGGAACACGAAGCAGCCTCAACCGGAGACGGAAGCTGTAAAGGAATGGATAAGCAAGATACAGTTCGTACTATCAGGATCGCTGCACGGCGGCGCCCTGGTCGCCTCCTACCCTTACGACAACACGCCCAGCGCTATTTTCCAAAGCTACGCGCACAGTCCGTCGGTATCTCCCGATGATGACGTCTTCCAGCACTTGGCCCGCGTGTACTCCAGCAACCACGACAAGATGTCCCGAGGAGTCTCTTGCAAATCCGGATCACCTAAGTTTGATAACGGAATCACTAATGGCGCGGCCTGGTATCCACTGACAGGAGGAATGCAAGACTACAACTACCTGTGGCATGGATGTATGGAAATTACTCTAGAGATCTCATGCTGCAAATATCCTTTGGCTCATGAGCTACCGAAATACTGGCAGGACAACAAACAGGCGCTTATAAAGTATCTAGCTGAAGCCCACCGCGGCGCCCACGGGTTCGTGATGGACGAACACGGTAACCCGGTGGAAAAGGCTTCGATCAAGGTCAAAGGACGCGAGGTCACGTTCCATACAACTAAATACGGAGAATTCTGGCGTATACTACTCCCTGGAACTTATAGACTAGAGGTCGGAGCCGATGGATATTTACCACAAGAAGTAGAATTCTTCGTCATAGACAGCCACCCCACTCTGTTGAACGTAACGCTGCATTCCGCCAAGCGTATCGATGGCGGGGGTCCTTACTACCGGCCAGCACCGCGCCCGCCGCCACCGCCCGCGCCGGGTCTGTTCTCAACATTCACTAATTCCATCAACAGTTTCGTATCTAATATATTTGGCTAG

Protein sequence:

>DPOGS207768-PA
MFQCTKSLSRVFLLLLIFKTSHGQYSPGSTFTEDDATNPSDVESRDVGTELQYKYHDHEEMTRYLRAVSARYPALTALYSIGKSVQGRDLWVMVVSASPYEHMIGKPDVKYVANIHGNEAVGREMLLHLIQYLVTSYETDSYIKWLLDNTRIHLMPSMNPDGFLISREGQCDTIHGRHNARRYDLNRNFPDFFKRNTKQPQPETEAVKEWISKIQFVLSGSLHGGALVASYPYDNTPSAIFQSYAHSPSVSPDDDVFQHLARVYSSNHDKMSRGVSCKSGSPKFDNGITNGAAWYPLTGGMQDYNYLWHGCMEITLEISCCKYPLAHELPKYWQDNKQALIKYLAEAHRGAHGFVMDEHGNPVEKASIKVKGREVTFHTTKYGEFWRILLPGTYRLEVGADGYLPQEVEFFVIDSHPTLLNVTLHSAKRIDGGGPYYRPAPRPPPPPAPGLFSTFTNSINSFVSNIFG-
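
The transcript and protein lengths in this record can be cross-checked by translating the nucleotide sequence: 1407 bp is 469 codons, that is, 468 amino acids plus one stop codon. The following is a minimal Python sketch of that check, not part of the original record. It assumes the standard genetic code, and the names CODON_TABLE and translate are hypothetical helpers introduced here for illustration. The stop codon is written as "-" to match the convention used in the protein sequence above.

```python
from itertools import product

# Standard genetic code (assumption: no alternative translation table applies).
# Codons are enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0 (hypothetical helper).

    Stop codons are rendered as stop_symbol; codons containing
    characters other than A, C, G, T are rendered as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven codons and last three codons of DPOGS207768-TA.
print(translate("ATGTTCCAGTGTACAAAATCC"))
print(translate("TTTGGCTAG"))
```

Passing the full DPOGS207768-TA sequence to translate should give a string of 469 characters that can be compared directly with the DPOGS207768-PA entry, including the trailing "-".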