Monarch geneset OGS2.0

DPOGS212195
TranscriptDPOGS212195-TA3360 bp
ProteinDPOGS212195-PA1119 aa
Genomic positionDPSCF300323 - 274465-281996
RNAseq coverage0x (Rank: top 98%)
Annotation
HeliconiusHMEL0068300.092.49% 
BombyxBGIBMGA000989-TA0.088.87% 
DrosophilaNnaD-PA0.068.27% 
EBI UniRef50UniRef50_E0VPE00.057.25%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VPE0_PEDHC
NCBI RefSeqXP_002427984.10.057.25%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420146190.057.25%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|2700059670.052.82%hypothetical protein TcasGA2_TC008100 [Tribolium castaneum]
Group
Gene OntologyGO:00065083.1e-34proteolysis
GO:00082703.1e-34zinc ion binding
GO:00041813.1e-34metallocarboxypeptidase activity
KEGG pathway 
InterPro domain[356-583] IPR0008343.1e-34Peptidase M14, carboxypeptidase A
Orthology groupMCL11295 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212195-TA
ATGTTATGGTGGAAACAACTGGAGTGCGTGCAAGAAAACGTTCTAATGCTAGAACGAGAAGAACAGCAGAAAGAACACGTCAACAGCACGACAATTACTTTAAGACTTCGTACTAGGGATCCGGTGTTAGAAGATAATTTATATTTAGCTAACATAGATCAAGTTAAATCATTACAAGCGTCTTTGTTTCCGATTTGTAATAAAGGAACCTTTATTACAAACTTTCTACAAAACAATATAAAGACAAATCAGTTGGAAATCAACACAGATGTTAAAACTTTTAAAACTACCGCCAAATTAAAGGAGCCGCGGGAATTATTTGCTCTCCCAAAAGAATTGGATTGTCCGCAACAAGCTCCTAGGTGGCCAACCGAATGTCAAGTTGTAGAAGAAAGAATTCAACATATAACATGGAGTCCAGCGTCACCAGAACCATATTACGTGTCCAGTGGCAAAGAACTCAAGCCACAGCCGGTCGGTGAGGAGGCAGGCACGGTTATATATCAATACTATCCCATGAGTGCCGTTAACTACTTCAGTCGTTCTACTGTAGGAGGGTCCCGTCTGTTCCTGTCCGCTTGCACGACAGCCGGTGGTGACGACGAATTACGTTTCGAGTCACGTTTCGAGAGCGGTAACTTAGCGAAGGCTGTTAAAATCACGTCCGCATACTACGAACTACACCTCCGTACTGACCTTTACACGAATAGACATATGCAATGGTTTTATTTCAGAGTAACCAACACGAGAAAGCAAACTATGTACAGGTTTTCCATTGTAAACCTATCTAAACCTGAAAGTTTGTATAATGAAGGCATGCGACCTTTATTATATTCTACGAAGGACGCGCAATTACATTCGATCGGCTGGAGACGCTGTGGTGACAACATTGCTTATTACAAAAATGACTCCATATGCGAGGAAGAAGAACAATTTCCGAGCTATACATTAACATTTAATATAGAATTTCCGCACACAGACGACGCTGTATACATCTCACATTGTTATCCTTACACATATTCCGATTTACAAGAGTATTTATCAAGATTACAAGCTCATCCGGTGAAATCTACTTACTCTAAACTGAGACTTCTGTGTAGAACGCTAGCTGGAAATAATGTATATTACCTCACAGTAACCTCTCCCCAAAATACAAATGAATTTGAACAAAAGAAAAAGAAAGCTGTAATTATAACAGCAAGAGTGCATCCAGGTGAGACGCCTTCGTCGTGGATGATGAAAGGGTTTATGGACTTCCTTACCGGGGACACCAACCAGGCGAGAGAACTACGAGAGAAATTTATATTCAAACTTGTTCCAATGTTGAATCCTGATGGTGTAATTGTTGGAAATAATCGCTGCTCACTAACTGGGAAAGACCTGAACAGACAGTACCGCACAGTAATAAGGGAAACATACCCTTCGGTGTGGCATACCAAAGTTATGATTCGGAGGTTACAGGAGGAATGCGGGGTAGCTATGTTTGTGGATTTGCACGCTCACTCCAGGAAGCATAATATATTCATATACGGATGCGAGAGTCGAAAGAACTCAGACAAACGATTACAGGAACAGGTCTTTCCACTTATGTTGCATAAAAATGCAGCCGATAAATTTTCCTTCGAGAATTGTAAGTTTCGAATTCAACGCAGTAAAGAAGGAACAGCTCGTGTAGTAATTTGGATGTTAGGAGTAGCCAATAGCTACACTATGGAAGCTTCATTTGGAGGGTCAGAACTAGGCAGTAGAATGTCCACCCACTTTTCAGCCCAAGACTACGAAAGTTTGGGTAGAACATTCTGTGAAACGTTGCTTGATTTCTGCGATGAAAACCCGAGCAAGGAAAGATTAAGAACCAAGATAGTCACACGTTTACTAAAAGAGGGATCCAACGCCGATGAACCTACCAACATTGATCTTTCTGATTATTCCAGTGATGAAGGTGACACATCAAGCAGTAGTTCCGAAGCGGGTGTAATAGGAGGATCCAGCAAGACTACACAACTCGCACCACCACCGTCGCCTATTCTTCCTGACATTAATAGAAATGCTATTGAAAAATCCAGAAAACAGCCAGTGCTAGAGAAAATACCAAAAATTGAAAAGAAGAAGACGAAAAGGGAAACTTTACGGGTCTCCAGAGCTACTATTGATATGACGTCGGACGCTATGACAGACGCATCATCAGATTGTGATTCGTTTGAAGAAAATCTCAGTCCACTAAGAAGAGTTAGAAAATTGTTACAGCCGCCGGGGAAAGCAAAAACTAGAAGAAAAAAGAAAATGCCACCCGAGCTAAAAATATTTCGTGCACCAACAAGTGACTCGCCCTTAAATTCAGACAAAGAGAAGAAGTCCACGAAATGTATAAGACCGAGGAGTTTGTCAATGATAACGGAACCGTTAGAACAAACAAAGTTAAAACCGGCGTCTTGGCATCAATTTCGCCATTTACCGACACATAAGTCTATAATTGAACAAAAAAGTAGGAATCTTCAGCCTGCTGAATTACAAGTGAAATTAAATGTCTTAAAGAAAAGTATATGGACTGGAATACCTGATGATGAGAAGGGACCTCTCTCATGGGGCATTTCTAGTTTTGCTACAAATTCATATTTCACGGACAGTGAAGCCCTCCTTAGATCATGTTCCAAGAAACTTGAAGAATTAGAAGGTGAAAGAAAAAAGAGAAAAGATGATAAAAAGAAGAAAAAAACTAAGAAAGTGTCTATAAAAGTTCCCAGCCCTGAAAATATATTGGAACCAATTATAAAAATGCCCAAAAGTAACAAGAAAAGGGGGAAATTGAAACATACAAAATCAGAAAATTCCAATCAAATTTATAGCGCAAGCTTTACTGAAATACCAAGGAATACTCAGAAAACTCAAAGTAAACAGGCAAAGACATTCCGAAAAGGTATGTTTGTTGCAACAGCCATCCAAACAAAACAACCAAATAGTAAATCGGCAAGGACAGATAATTCAGAATCTGATGAGTCTATACAAACAACCAAGAGGGTTAAGAAAAAGAATAGGGTAGTTAAAAATAAATTGCAAAAAGACTGGGTTAAATTATTTGAAGATGAACAGAAAGAAGATGACACTGGCGTTGTTGCTGGGAAGGCTATCACTCTCAGTTCACCGGCTGTGGAGAATTGTAGCGAATTGCTTACTCTTACTCCTGTAGAAGTATATCCAGGAGTTTGTGGTTCTCCAGTAATGGGGTTAAAACTACCGCATCGTGTTTTTGCCAAAATACATGAGCTTATAGCTATTAAATATGAATTGAACAGGACGGGCCAAAGCAATGTCGCAACTATGTCTGTAGTTTTGGCAGTTGTAAACAACATGTTTATTTAG

Protein sequence:

>DPOGS212195-PA
MLWWKQLECVQENVLMLEREEQQKEHVNSTTITLRLRTRDPVLEDNLYLANIDQVKSLQASLFPICNKGTFITNFLQNNIKTNQLEINTDVKTFKTTAKLKEPRELFALPKELDCPQQAPRWPTECQVVEERIQHITWSPASPEPYYVSSGKELKPQPVGEEAGTVIYQYYPMSAVNYFSRSTVGGSRLFLSACTTAGGDDELRFESRFESGNLAKAVKITSAYYELHLRTDLYTNRHMQWFYFRVTNTRKQTMYRFSIVNLSKPESLYNEGMRPLLYSTKDAQLHSIGWRRCGDNIAYYKNDSICEEEEQFPSYTLTFNIEFPHTDDAVYISHCYPYTYSDLQEYLSRLQAHPVKSTYSKLRLLCRTLAGNNVYYLTVTSPQNTNEFEQKKKKAVIITARVHPGETPSSWMMKGFMDFLTGDTNQARELREKFIFKLVPMLNPDGVIVGNNRCSLTGKDLNRQYRTVIRETYPSVWHTKVMIRRLQEECGVAMFVDLHAHSRKHNIFIYGCESRKNSDKRLQEQVFPLMLHKNAADKFSFENCKFRIQRSKEGTARVVIWMLGVANSYTMEASFGGSELGSRMSTHFSAQDYESLGRTFCETLLDFCDENPSKERLRTKIVTRLLKEGSNADEPTNIDLSDYSSDEGDTSSSSSEAGVIGGSSKTTQLAPPPSPILPDINRNAIEKSRKQPVLEKIPKIEKKKTKRETLRVSRATIDMTSDAMTDASSDCDSFEENLSPLRRVRKLLQPPGKAKTRRKKKMPPELKIFRAPTSDSPLNSDKEKKSTKCIRPRSLSMITEPLEQTKLKPASWHQFRHLPTHKSIIEQKSRNLQPAELQVKLNVLKKSIWTGIPDDEKGPLSWGISSFATNSYFTDSEALLRSCSKKLEELEGERKKRKDDKKKKKTKKVSIKVPSPENILEPIIKMPKSNKKRGKLKHTKSENSNQIYSASFTEIPRNTQKTQSKQAKTFRKGMFVATAIQTKQPNSKSARTDNSESDESIQTTKRVKKKNRVVKNKLQKDWVKLFEDEQKEDDTGVVAGKAITLSSPAVENCSELLTLTPVEVYPGVCGSPVMGLKLPHRVFAKIHELIAIKYELNRTGQSNVATMSVVLAVVNNMFI-