Monarch geneset OGS2.0

DPOGS215632
TranscriptDPOGS215632-TA3600 bp
ProteinDPOGS215632-PA1199 aa
Genomic positionDPSCF300041 - 1879852-1920949
RNAseq coverage92x (Rank: top 62%)
Annotation
HeliconiusHMEL0085420.069.08% 
BombyxBGIBMGA003532-TA0.082.72% 
DrosophilaNnaD-PA2e-9041.05% 
EBI UniRef50UniRef50_E0VQZ70.044.41%Putative uncharacterized protein n=2 Tax=Neoptera RepID=E0VQZ7_PEDHC
NCBI RefSeqXP_624180.20.043.17%PREDICTED: similar to ATP/GTP binding protein 1 [Apis mellifera]
NCBI nr blastpgi|3504130470.042.75%PREDICTED: cytosolic carboxypeptidase 1-like isoform 2 [Bombus impatiens]
NCBI nr blastxgi|3504130470.043.07%PREDICTED: cytosolic carboxypeptidase 1-like isoform 2 [Bombus impatiens]
Group
Gene OntologyGO:00065083.3e-20proteolysis
GO:00082703.3e-20zinc ion binding
GO:00041813.3e-20metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[906-1144] IPR0008343.3e-20Peptidase M14, carboxypeptidase A
Orthology groupMCL16243 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215632-TA
ATGGCGGACGACGCGGGCGACTGCCTGTTCGAGCGGCTCCGACTGCACCAGCAACGCGCCCCGGACGCGACTGAGGTCGCGCGTGCTATCACAGCTAGAATCAACTCGCGTCTCACATCACACGACAAACACATCCGACAGAGCACTCTTGACAAACTATGGAACAAGCAAACTGGTGCGATACAAATGCTTCTATCTATATTAGAGAATTCAAGAGATACAGCGACATCAACCTATATAACATCAATATTTAGAGAAGCTCTCTGCCTTAAACAAGGAAAAGGAAAAAAATGTTCAGTAGCGAATGAAGCCTTGGGGTCGAAAAAGAAGGAAAGCAAAAAAGGCAAAGAAAATAAAACACCACTGAATAAAAAGGCAAACAACGTAGCGCGACAGCAGTGCTCACAACAATTCATCGCCCTAAACGGCACGCAAGCGGTGATACGTACACTACTTGCCACACATAGCAAACGCGACAGCAATGTTAGCACGGAACTAATGCTACAAGACCTCGTCTGGATACTTGCAGCACTTGCTCCAAGAGCAAACAACATACAACGGCAGCAATGTGCGCAGGATTTTGTGGAAATGAAAGGAACACAGACCATTATTAGACGAGTGATATCAACGGTCCATGATAAGAGGGAGCATCATGCTGGGGTGGAATTAGTATTAAATGATCTGGTGTGGATTCTCGCTCCGTTGGGGTCAAAAGATCCAAAATTTGCAATGAAGGTGAGAATGTTAGGATGCGTTAGGACGATGCATCTAATTTTGAAAGGACATTTCACTGACAATAAGTTGATATTTCCACTATTAGTCATCATGAAACAGCTAGCCAAAAATTCCGTAACAACTTCCATTTTAATACGCGACGGAGTCATCGCGACTTACGATCGAGTGTTAATAAGTCTAGGCTTCATTCCTACAGCGAGACTGAGACTCTGTTTGGACGCTATAGACTACTTTAGCAAGAACAAGGTGTGCTGCATGCAGATTGTGAAGACAGGACTGTGTGGAGTGCTAATAAGAGTGTTCGACCGCTGGGACCGCTACGAGGGGCGCATGAGGCTCAAGATATGCGCACACATCCTCCAGACATTACAGCACCTCTGCAACATCAAGGCTGGACGTCGTGCTCTTTGCACCAAGAAGCACGTGCAGACCCTCCACAGGTTCTGTTCCCAGTGTCCTGATGAAATCGAGTTCGACGGACTGTTGGCTAGAGTCTGTTCCGTCATAACATTGTGTCTCAAGCATCAGGCATTACCAGTGCCATCATCTAGTCCAGCTACCTTCAACCTGAACCCTATACTTAAAGGAACAAATACCACATGGCCATGTCACGAAGATGATGATGACGGCGGAAACTCAGACTCGAAGACAATCAATTCGGATTTGGAAGACGATGCTCCTGACAGTGATAACGAGGTCATTGATGACTTCCCAGATATTGATTTCGAAGAAAATGACTTAAAGAATAATGAAGATTTGGAGAAATCACATACAAAAAGTGGGGAGAGCATACAGAGTGCACTGTGGATCAATCCCAATGAGAGAGATATCGAAGACTTAAAGAGATACTATATTTTCTTCAAGGAGTTCGGTTCCTATAACAAGCAAATAAGGTTGGTAAAAAGCCGGTCCAATTCCCGGGGGTCCATACTAGATGACATTTTTATATCTCAAAATAGCGCTAACCGAAGCCAGCCGTCGCCGACAAATCTTTCTTTAACTGCAGTACTTGGGAATACCGACTACGACAGCGCTTTAGGATCATCTCAAACGCTTTCATTCCTGCAAGGATATCATAAGATACACGAAAGCACTTCGACAACATCTTGTTCTTCCTTAAAAATCCATAAAGATATATCAAAATACAGTCCGCTCGAGTCAGTTTATTCAATAATATCATCGAGAGTTAAAAGCATCATTCCATTTGTAAAAGTTGCCTACCCAGATATGACAGGCGGGCAGGGTGCAACACAACCAGAGCCATTAAATAAAATGGAGAGAACAGCTTGCAGAAATAAATTACTCGCTTGCGTCGAGAGAGCAATTAATCCGGAAGCGTATATGAATGAAGTTGTGTATGATCTGGATGCTTTGAACAGTTCGAGCTCAAACGCAGACACGACTTCGCAGAAAAGTTTAAGCAACGAGAGTTTATTTTTAATTAACACCGACGAACAAGAAATAACAAAAGTCAATAGTTTCTCATCGAGACTAAATTTTGAATCGAGATTTGAGTCGGGAAATTTAAGAAAAGCCATACAGGTAGGTCCAAGAGAATATGAATTAATTTTAATGCCAGACGTAAATTCTCCAAAACGGCATCAGTGGTTTTACTTCGAAGTGCGTAATATGCAACAGGGACGGCCCTATATATTTAATATTGTGAATTGTGAGAAATCAGATAGCCAATTCAACTTCGGCATGAAGCCTGTTATGTATTCTGTGAAGGAAGCCGTCCTTGGAAGACCCGGGTGGGTGAGAGCCGGTTCGGACATTTGCTATTACAGGAACAGCTACCACTATTCCAATCAAAGAAACAACAAGTGCTACCTAACAGTTACGTTCAACATCGACTTTCCCCACACAAACGACGTCTGCTACCTCGCTTACCACTTCCCATTCACTTACTCCATGATGATGACTAGAATTTTCCAATGGAGTTCTCAATTGCCTCCTGGCGCTTATCTACGAGCTGAGCCCTTATGTTATACACTTAACAACAACGAAGTTCCTCTGTTGACTATATCAGCTGATGATACTCCGTCCAATCCCATAGTTGACAGGGAGATAGTATTCCTTACGGCTCGAGTCCACCCTGGTGAAAGCAACGCGTCCTGGGTAATGGATGGAACGCTGCGTTTCCTGCTCACAGACACTTCATCCGCAGCGGCCCTCCGTAACAAGTACGTGTTCAAAATCGTGCCGATGCTCAACGTCGAAGGTGTCGTTAATGGCTGCCATCGATGCGGCTTAACTAATGAAGATTTAAATCGACGCTGGTGCAAGCCGAGCCCCGTTTTGCATCCTTCTATTTACCATACCAAGGGCTTAATAGAATATTTGGTGCGTGTTTGGAAGAAACCTCCGGTAGTTTATTGCGACTACCACGGTCATTCGCGCAAGAAGAACGTGTTCTTTTACGGTTGCGCCGGCGCAGAGAGCTGGTGCAGCAACGACCGGCTTGTCCCGGACGAGCCTGTTAAATATCTCATGCTTCCAGCTTTAATGCACCGGCTATCACCGGCGTTCGCTCTTGGTTCGTGTTCCTTTCGTGTTGAACGTGAGCGTGAGAGCACAGCGCGAGTCACTGTGTGGCGCCACCTAGGAGTCACACGGTCCTACACTATGGAAGCATCATTTTGTGGATTTGATAGGGGACCGTTTAAAGGATTTCATCTCAACACCCAGCATCTGCAGAGCGTGGGCAGTGACTTTTGCGAAGCTCTCAACGGTCTCGGAGATACAGCCAACAATGTTGACATACAACTCACTAAAGATCTCAATGGCGAAATAGCAATAGACAGTGAAGCTGGCTCGGGGTCGGACAGCGTGTTGAAAACAGATTCGGATGAAGATTTCGATTAG

Protein sequence:

>DPOGS215632-PA
MADDAGDCLFERLRLHQQRAPDATEVARAITARINSRLTSHDKHIRQSTLDKLWNKQTGAIQMLLSILENSRDTATSTYITSIFREALCLKQGKGKKCSVANEALGSKKKESKKGKENKTPLNKKANNVARQQCSQQFIALNGTQAVIRTLLATHSKRDSNVSTELMLQDLVWILAALAPRANNIQRQQCAQDFVEMKGTQTIIRRVISTVHDKREHHAGVELVLNDLVWILAPLGSKDPKFAMKVRMLGCVRTMHLILKGHFTDNKLIFPLLVIMKQLAKNSVTTSILIRDGVIATYDRVLISLGFIPTARLRLCLDAIDYFSKNKVCCMQIVKTGLCGVLIRVFDRWDRYEGRMRLKICAHILQTLQHLCNIKAGRRALCTKKHVQTLHRFCSQCPDEIEFDGLLARVCSVITLCLKHQALPVPSSSPATFNLNPILKGTNTTWPCHEDDDDGGNSDSKTINSDLEDDAPDSDNEVIDDFPDIDFEENDLKNNEDLEKSHTKSGESIQSALWINPNERDIEDLKRYYIFFKEFGSYNKQIRLVKSRSNSRGSILDDIFISQNSANRSQPSPTNLSLTAVLGNTDYDSALGSSQTLSFLQGYHKIHESTSTTSCSSLKIHKDISKYSPLESVYSIISSRVKSIIPFVKVAYPDMTGGQGATQPEPLNKMERTACRNKLLACVERAINPEAYMNEVVYDLDALNSSSSNADTTSQKSLSNESLFLINTDEQEITKVNSFSSRLNFESRFESGNLRKAIQVGPREYELILMPDVNSPKRHQWFYFEVRNMQQGRPYIFNIVNCEKSDSQFNFGMKPVMYSVKEAVLGRPGWVRAGSDICYYRNSYHYSNQRNNKCYLTVTFNIDFPHTNDVCYLAYHFPFTYSMMMTRIFQWSSQLPPGAYLRAEPLCYTLNNNEVPLLTISADDTPSNPIVDREIVFLTARVHPGESNASWVMDGTLRFLLTDTSSAAALRNKYVFKIVPMLNVEGVVNGCHRCGLTNEDLNRRWCKPSPVLHPSIYHTKGLIEYLVRVWKKPPVVYCDYHGHSRKKNVFFYGCAGAESWCSNDRLVPDEPVKYLMLPALMHRLSPAFALGSCSFRVERERESTARVTVWRHLGVTRSYTMEASFCGFDRGPFKGFHLNTQHLQSVGSDFCEALNGLGDTANNVDIQLTKDLNGEIAIDSEAGSGSDSVLKTDSDEDFD-