Monarch geneset OGS2.0

DPOGS210313
TranscriptDPOGS210313-TA3666 bp
ProteinDPOGS210313-PA1221 aa
Genomic positionDPSCF300025 - 983413-989580
RNAseq coverage310x (Rank: top 36%)
Annotation
HeliconiusHMEL0087120.079.09% 
BombyxBGIBMGA011879-TA0.081.01% 
DrosophilaCG3790-PA2e-17952.05% 
EBI UniRef50UniRef50_Q17PU30.050.57%Oligopeptidase n=4 Tax=Endopterygota RepID=Q17PU3_AEDAE
NCBI RefSeqXP_971989.20.050.90%PREDICTED: similar to oligopeptidase [Tribolium castaneum]
NCBI nr blastpgi|1892358130.050.90%PREDICTED: similar to oligopeptidase [Tribolium castaneum]
NCBI nr blastxgi|1892358130.050.90%PREDICTED: similar to oligopeptidase [Tribolium castaneum]
Group
Gene OntologyGO:00065085.2e-118proteolysis
GO:00042225.2e-118metalloendopeptidase activity
GO:00550852.4e-35transmembrane transport
GO:00160212.4e-35integral to membrane
GO:00228572.4e-35transmembrane transporter activity
KEGG pathway 
InterPro domain[759-1213] IPR0015675.2e-118Peptidase M3A/M3B
[1043-1215] IPR0240772e-86Neurolysin/Thimet oligopeptidase, domain 2
[139-528] IPR0161961.4e-51Major facilitator superfamily domain, general substrate transporter
[893-1042] IPR0240793.6e-36Metallopeptidase, catalytic domain
[138-519] IPR0058282.4e-35General substrate transporter
Orthology groupMCL12015 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210313-TA
ATGGATTTTGACGCCATACTCGAAGATGTCGGAACCTTTGGACGGTACCAGAAATTGGTTGTCTATTTCATTCTACTGCCAGCAGTCATACCGTGCGGTTTTCACGCGTATGCTCAACTTTTCATGGCTTCAGATGTGAAGCACTGGTGTAATGTCCCTGATCTTGAATCACTTAATAACTTAGACTTGATTAAGAATCTCAGTATACCTCTCGAATTGAAAAACGGAGAATTCGAATATTCTCAATGCAACATGTATAAATTGAACTACAGCAATGTTATAGAAAATTACCCGAATATAGAAACGCCAGTTTTAGCTGAGATTGTTCCATGTATGGACGGGTGGACTTTTGAAACCAATCAAACAACCGTAGTTTCGGAGTGGAGTTTAGTGTGCAATAAAGACTTTTACCCAACTCTTGGTCTCGTGTTACTGGCGGTCGGTGGGATCATCGGAAATTATATATTCGGCTACCTCCAGGATACCTTTGGAAGAAAGCCGTCTTTCTTTATTTATCTTATAATTGAATCCATTTTCGGAATAGCCACCGCATTCGCTCAAAATTATTACATTTGGATTGTATACAGATTTGGAGTCGGATTTACAGTACCAGCAATAATGGCCACTCCTTACGTACTAGCTATAGAACTAGTAGGACCTAAATGTAGAACTTTATGTACCATTCTCTCCAATATCGCTTACTCTTTGGGGCTTATTTTGCTATCTGCTGTAGTCTACTTAGTTCGTGACTGGCGTCACCTAGCTTTAGCCACCACTATGCCGTTTCTAGTGTTTTTCCTTTATCTATGGCCAATGCCAGAAAGCCCTAGGTGGCTATTAGCTAGAGGTCAATTCGAAAAGGCTGAAGTTATTTTAAAGAAAATGGCCAGGATAAATGGAAAATCTCTACCAGCAAATTACATGGTGCATCTACGACGTAAATACGAATCTGACAAGTTGAAACAGGACTTAGAAAAAGAAAAGTCCAGAAAATATGGAATCCTAGATCTGTTTAGAACTCCCAACTTGAGGAAAAAAACGATTATCATCACATTTATTTGGTTCACTAACACCAGCGTGTATGTGGGTCTTTCGTATTACGCACCTGTTCTAGGGGGTGACGAATATTTGAATTTCTTCTTAGCTGGTGTGGTGGAACTACCCACATATATGTTTCTCTGGCCATCAATGGAAAGACTTGGAAGAAGATGGACGCTTTGTATGAGCATGGTTGTAGGAGGAATCGCATGTTTGACAACATTTTTAGTTCAACATGAAACAAACGTGACCCTCGCTCTCTATTGCGTCGGCAAAATGGGTATCTCCTCAGCGTTCGTGGTGCTGCCACTAATGGCCTCGGAACTGTATCCTACCGTTGTCAGAGGTCTCGGTATGAGTCTGAGTTCCGTTTTGGGGATGTTAGGACCAATATTTATTCCACTAGTTAATTATTTGGGATCCGACATCATGGTCCTACCCCTAATAATTATGGGAGCGTTGTTAGTAGCTGGTGGCATAGCAAGTTTACTTCTACCAGAGACATTAAATCAGCACTTACCCCAGACTTTAGAAGATGGCGAAAAGATGGGAATTCCAGAGATTGGGGAGGATGGGGAAAAAGACACCCTTCTGTCAAACAATGGCTTGCCCGAGTTCAATGATATAACAATTGAAAAATGCATTGCAACTATCAGTAAACAAAGTATGGAATATGAAAATGGAGTCAGACAAATTGAGGAGTCCAGCGCGAACTGCAAGAACGCATTTGTTGAAATATTCCAGCCTTTAGAGAAACTGGACAACCAATTAGAACTAACGTGGGGTATGGCAAAAACATTATATCTAGGGAACAGCTCCCTCATGCCAACAAAGTCATACATACAAATACATCAAAGGGCACATAAAGCGAGGTTGGCAAAGTTTAACAGCATTCCCATATTCCATGCAGCTGTGAATGAAAAAAACCGTTTGAAAAAATTAACAGATGAAGAGCAACGGCTCTTAGATAAATATATACTTGAAGGCAGACTCAACGGTTTGGACATTAAAGGAGTGAAGCGTGAGAGACTCAACAATGTGCTGAATAATCAGCAGAAGGAGAAAAACATTATAAGGGAGAAAGTAAATGCTGCAACTAAACTGTTTAGCTGTTTTATCAATGACGAACAAGTGGCCAAAGAATTTCCTGAAAACCTTGCTATGAGTATGTCCGCCAGTAATGACGCTAGCAAAGGTCCTTGGAAATTGACTCTGCAACCGCACATTTATGAACCCTTCATGCAGTACTGTCCAGATAGAAGTTTACGTTGGAACGCATGGCAGGCCCATGTTCAACGATGTTCCGGTTATGGCAATAAGGAATTGGAAACTAGTACGCATATTGAGAAATTACGTGATTCCAGAGCCGAGCAAGCAAAAATCTTGGGTTTTGACACCTTTGTTGATATGAGTATGGAAACCAAAATGGCTGGCTCCATTGAGAATGTTACAAATATGCTTGACACGCTTTTAAATAATGCTCGTCCAATGCAAGATGTTGAGCTGGAAATGTTAGAACATTTTGCAAAAGAAAGGGGATTTGACAATAAACTACAGCTCTGGGATATTCCCTACTGGCAAAGGAAGCAGAAATGGTCATTATACAACTTCGATGAGAATAAAATCCGGGAATACTTCCCACTACCCAAAGTAATAAACAGTCTGTTCAATTTATGCAGCACACTCTTCAGGATTCAGATCGTTGAAAGACCAGAGGCGCATACTTGGCACAAGGACGTAAAATTCTATGATATTTATGACGAAGGCAGTAACGAACCGATTTCAGGGTTGTACTTCGACCCTTACGCTAGGCAGGATGAGAAAATACGTGTGTACGACGACGCGGGTTGGCATGTGTCCATACGAAATAAAAGCACATTCACCAGCACAAACCCACTTTCGGCTCTAATCTTCAACTTCCAACCGCCATCAGACGGGAAGCAATCCCTACTTTCATTTAAAGAAGTCAATGTATTGTTTCAACGGTTTGGACATTCTTTGCGGCATTTACTGGCTAGAGCCAATTATTCCGAGGTTGCGGGTCTCTCGAACGTTGAATGGGACGCTGCTGAAGTTTGCGGCCAGGTGATGACGCACTGGCTCTACCATCCGCACACTATCCGAGCAATCAGTGGCCATTACAGAACTGACGAACCTCTTCCGGATGACAGTGTCCAAAACCTGCAGAACGTCCGCAAACACATGTCGGGTTACGATTTGTGCCAGGAGCTATATCTATCACGCCTAGATCTAGATTTACACTCGAAAACAACTTTCTGGAGGGATATCGTGCGAGAGTTGTGGCCCAAATATATAGCACTGCCATTTGACAAATACAATTCACATATACTTTCGTTTACCAAGATTTTCTCTGAGGAATGGGGAGCGGCCTACTATTGCCACTTGTGGTCTAAGATGATCGCTGCCGACATTTACAGTGCCTTCGAAGAGGCCAGGGACACGGATCAAGATATACTGGAAGTGGGTAAGAGATACAAAGACACTTTCCTCACGACCGGCGGCAGCTGCCATCCTAGCGAGGTCTTCAGAAGGTTCCGAGGAAGGGATCCGTCACCACAAGCATTATTAAACAACCTTGGACTCAACCAAAAAGTCTTAGAGCAATAG

Protein sequence:

>DPOGS210313-PA
MDFDAILEDVGTFGRYQKLVVYFILLPAVIPCGFHAYAQLFMASDVKHWCNVPDLESLNNLDLIKNLSIPLELKNGEFEYSQCNMYKLNYSNVIENYPNIETPVLAEIVPCMDGWTFETNQTTVVSEWSLVCNKDFYPTLGLVLLAVGGIIGNYIFGYLQDTFGRKPSFFIYLIIESIFGIATAFAQNYYIWIVYRFGVGFTVPAIMATPYVLAIELVGPKCRTLCTILSNIAYSLGLILLSAVVYLVRDWRHLALATTMPFLVFFLYLWPMPESPRWLLARGQFEKAEVILKKMARINGKSLPANYMVHLRRKYESDKLKQDLEKEKSRKYGILDLFRTPNLRKKTIIITFIWFTNTSVYVGLSYYAPVLGGDEYLNFFLAGVVELPTYMFLWPSMERLGRRWTLCMSMVVGGIACLTTFLVQHETNVTLALYCVGKMGISSAFVVLPLMASELYPTVVRGLGMSLSSVLGMLGPIFIPLVNYLGSDIMVLPLIIMGALLVAGGIASLLLPETLNQHLPQTLEDGEKMGIPEIGEDGEKDTLLSNNGLPEFNDITIEKCIATISKQSMEYENGVRQIEESSANCKNAFVEIFQPLEKLDNQLELTWGMAKTLYLGNSSLMPTKSYIQIHQRAHKARLAKFNSIPIFHAAVNEKNRLKKLTDEEQRLLDKYILEGRLNGLDIKGVKRERLNNVLNNQQKEKNIIREKVNAATKLFSCFINDEQVAKEFPENLAMSMSASNDASKGPWKLTLQPHIYEPFMQYCPDRSLRWNAWQAHVQRCSGYGNKELETSTHIEKLRDSRAEQAKILGFDTFVDMSMETKMAGSIENVTNMLDTLLNNARPMQDVELEMLEHFAKERGFDNKLQLWDIPYWQRKQKWSLYNFDENKIREYFPLPKVINSLFNLCSTLFRIQIVERPEAHTWHKDVKFYDIYDEGSNEPISGLYFDPYARQDEKIRVYDDAGWHVSIRNKSTFTSTNPLSALIFNFQPPSDGKQSLLSFKEVNVLFQRFGHSLRHLLARANYSEVAGLSNVEWDAAEVCGQVMTHWLYHPHTIRAISGHYRTDEPLPDDSVQNLQNVRKHMSGYDLCQELYLSRLDLDLHSKTTFWRDIVRELWPKYIALPFDKYNSHILSFTKIFSEEWGAAYYCHLWSKMIAADIYSAFEEARDTDQDILEVGKRYKDTFLTTGGSCHPSEVFRRFRGRDPSPQALLNNLGLNQKVLEQ-