Monarch geneset OGS2.0

DPOGS209428
TranscriptDPOGS209428-TA5766 bp
ProteinDPOGS209428-PA1921 aa
Genomic positionDPSCF300449 - 11268-48544
RNAseq coverage158x (Rank: top 52%)
Annotation
HeliconiusHMEL0034067e-16067.40% 
BombyxBGIBMGA001638-TA0.068.27% 
DrosophilaApepP-PA3e-16247.66% 
EBI UniRef50UniRef50_F2YHL80.052.41%Aminopeptidase P-like protein (Fragment) n=1 Tax=Ostrinia nubilalis RepID=F2YHL8_OSTNU
NCBI RefSeqXP_974698.11e-16547.78%PREDICTED: similar to X-prolyl aminopeptidase (aminopeptidase P) 1, soluble [Tribolium castaneum]
NCBI nr blastpgi|3264544820.052.41%aminopeptidase P-like protein [Ostrinia nubilalis]
NCBI nr blastxgi|3264544820.052.41%aminopeptidase P-like protein [Ostrinia nubilalis]
Group
Gene OntologyGO:00099874.6e-59cellular process
GO:00167873.4e-20hydrolase activity
KEGG pathway 
InterPro domain[199-472] IPR0009944.6e-59Peptidase M24, structural domain
[1418-1557] IPR0005873.4e-20Creatinase
Orthology groupMCL12916 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209428-TA
ATGAGTCTACAAAGGTTGACGGCGCTACGAGCGCTGATGGCTGGACATCCGACAGCCTTAGCTGCATATATAATACCTACTGCGGATGCTCATAATTCGGAGTACATATCACCGGCGGACGCTCGTAGGGAGTGGATATCAGGGTTCACGGGTTCAGCCGGAACAGCCGTGGTTACAGCCAACAAGGCCCTGGTGTGGACTGATGGTAGATATTACACGCAATTTGAGAAGGAAGCCGATCTCACGATGTGGACTCTAATGAAGCAATCTTTGCCCGAAACTCCAACTATGGAGAAGTGGCTGGCGAGCAATCTGATAGCCGGTTCTGTTGTGGGGGTCGACCCCCACACTATGACGAGAGAGGAATGGACCCCCTTACAGGGGGTCAAGATAGTTGGTCGGCCGTACGATGACGTCATTGAGGGCTTGAGTAATTTGGCTCGCGAGTTATCCAATATGGGTGACGGTGAGCATTCTGTGTGGATATCAAACGAAGCGAGCGAGGCGGTCCACAGAGCTGTGTCCGGGGAAGGCGTGTTGAAAAACCCTCTTAATCTGATATCAGAAGTGTCTCCCGTGGCTTTAGCGAAGTTGGTGAAGAACGACGTCGAGCTCGAGGGTTTCCGTAAATGTCACATCCGGGACGGTACAGCCGTCTGTAGATTCTTCAGATGGCTCCACCAGGAGGTGGACTCCGGAAATAAGATCACGGAAGTGGAAGCCGCTGAGAGATTATTGGAGTTCAGGAAGGATGAAAAAGACTTCATGGGCCCCTCCTTCGAGACCATATCCGGGGCTGGTGAAAACGGCGCCGTCATACATTATACTCCATCATCAGACTCGCCCAGGATCATAACGGCTGATGACGTGTACCTCCTGGACTCCGGCGGACAGTACAAGGACGGTACGACTGATATCACCCGCACTCGTCACATGAGTGACCCCACAGACCTTCAGAAGGAAACCTTCACTAGAGTGCTCAAGGGTCAGATTGCTATCGGCGCTGCTTTGTACCCCGTTGGGGTAAAGGGTAACGTCTTAGACTCGTTGGCACGTAAGTATCTGTGGGACGTCGGTCTGGACTATGCGCATGGGACTGGCCATGGGGTAGGGCATTTCCTGAACGTCCACGAGGGTCCCTCGGGGATCTCTTGGCGGCCGTACCCCCACGACCCGGGACTAAAGATGGGTCAGATATTGAGTAACGAACCCGGTTACTACCGGGTCGGGGAATTCGGTATCCGGATAGAGGATCTAGTCGAGACTATCAGCGTCACAAACGACACGAACCACCCGAGGGCCAAAGATCTTCTGGGTGACTACAACGGGCGCGGCGTGCTGGGTTTCAACACGATAACTCTGGTACCGAATCAGAGGAAGTTCATCAAAACTGAGCTGCTGGATGACTTCGAGTGTGAAAAATTGAAAGACAGATACGAATCCGCGACTAGCGAGACGTTACGGTTCGGTCACATTACAAACCGAGCCGTCGCGTCCGTACTGGGATCTGAAATAAAAATTCTATGCGCTAAATTCACACAAACCAATGCTGCTGGAGTAACGGCGAGAAAAAAAGCAATGAGTCTACAAAGGTTGACGGCGCTACGAGCGCTGATGGCTGGACATCCGACAGCCTTAGCTGCGTATATAATACCTACTGCGGATGCTCATAATTCGGAGTACATATCACCGGCGGATGCTCGTAGGGAGTGGATATCAGGGTTCACGGGTTCAGCCGGAACAGCTGTGGTTACAGCCAACAAGGCCCTGGTGTGGACTGATGGTAGATATTACACGCAATTTGAGAAGGAAGCCGATCTCACGATGTGGACTCTAATGAAGCAATCTTTGCCCGAAACTCCAACTATGGAGAAGTGGCTGGCGAGCAATCTGATAGCCGGTTCTGTTGTGGGGGTCGACCCCCACACTATGACGAGAGAGGAATGGACCCCCTTGCAGACGGCGCTGTCTAAGGCAAAAATGCAACTAGTTGCTGTAGAGAGTAATTTGGTTGATAAAGCCAGGATCTCACTGGATGATCCTCCGCCGAAGAGACCCCAAAATGATATTATACACCTGCCTTTAGAATACACTGGAAAGACTGCTGGTGAAAAAATCCATGATCTGAGAGTAGGGATGCTGGAGAAGAAAGCTTCAGCTCTCGTTATAACAGCCCTCGATGAAGTCGCCTACACACTGAATCTGAGGGGTAGCGATATAAGATACAATCCAGTTTTTTTCTCGTATCTATTGCTGACCCCCGACACGGTGACGCTGTTCTGGAGTGGGGGTCGCATTCCGGACGACATAGAACGCAACTTATCTGACGAGGGGGTCAAGATAGTTGGTCGGCCGTACGATGACGTCATTGAGGGCTTGAGTAATTTGGCTCGCGAGTTATCCAACATGGGTGACGGTGAGCATTCTGTGTGGATATCAAACGAAGCGAGCGAGGCGGTCCACAGAGCTGTGTCCGGGGAAGGCGTGTTGAAAAACCCTCTTAATCTGATATCAGAAGTGTCTCCGGTGGCTTTAGCGAAGTTGGTGAAGAACGACGTCGAGCTCGAGGGTTTCCGTAAATGTCACATCCGGGACGGTACAGCCGTCTGTAGATTCTTTAGATGGCTCCACCAGGAGGTGGACTCCGGAAATAAGATCACGGAAGTGGAAGCCGCTGAGAGATTATTGGAGTTCAGGAAGGATGAAAAAGACTTCATGGGCCCCTCCTTCGAGACCATATCCGGGGCTGGTGAAAACGGCGCCGTCATACATTATACTCCATCATCAGACTCGCCCAGGATCATAACGGCTGATGACGTGTACCTCCTGGACTCCGGCGGACAGTACAAGGACGGTACGACTGATATCACCCGCACTCGTCACATGAGTGAGCCCACAGACCTTCAGAAGGAAACCTTCACTAGAGTGCTCAAGGGTCAGATTGCTATCGGCGCTGCTTTGTACCCCGTTGGGGTAAAGGGTAACGTCTTAGACTCGTTGGCACGTAAGTATCTGTGGGACGTCGGCCTGGACTATGCGCATGGGACTGGCCATGGGGTAGGGCATTTCCTGAACGTCCACGAGGGTCCCTCGGGGATCTCTTGGCGGCCGTACCCCCACGACCCGGGACTAAAGATGGGTCAGATATTGAGTAACGAACCCGGTTATTACCGGGTCGGGGAATTCGGTATCCGGATAGAGGATCTAGTCGAGACTATCAACGTCACAAACGACACGAACCACCCGAGGGCCAAAGATCTTCTGGGTGACTACAACGGGCGCGGCGTGCTGGGTTTCAACACGATAACTCTGGTACCGAATCAGAGGAAGTTCATCAAAACTGAGCTGCTGGATGACTTCGAGCTCAAATACATAAATTCCTACCACAAGAGAGTGTTGGATACTCTCGGACCGATACTGAAGAACCGAGGCCTGATGGAAGATTATGCCTGGCTGGAGAAGGAATGTTCTCCATTGATTGCAGCACAGCACCTGGTGATTGGGGACCTCATTTCCTTCGATGCAAACACTTATTCTCTATACCCTGAAAGTCATATACCCGGTAACATAAATGGCGGGAAACTCAGTGGATCTCAAAGCTTGGAGCGCGTGTCAGCTGTTAGGAATGTTATGGCAGAGAGAGGAATCGACGCTTTTATAGTACCTACGTCTGACGCGCATAACTCTCAATATATAGCGCCTACGGATGCTAGACGGGAGTGGCTCTCAGGTCTGTCGGGGTCCGCCGGTACAGCCCTCGTAACAGCCGACCACGCCTTACTGTGGACTGACGGCAGATACTTCACGCAATTCGATATGCAAGTTGATCCTCGTATTTGGACTCTCATGAGGATCGGTACTGATGTAACGATCGAGAGTTGGCTAGCGTCTAACATGAGAGGTTCAAGAGTTGGTATTGATCCAACGACCTACACACGCAGTTCTTGGACAACTTTGGAGAATGCAGTGCGTAACGCAAACATAAGTATTGTACCAATTTACGATAACACGGTGGACGAGGCTAGAAGAAGGGTTTCGGACCCCCCTCCCGCCAGGCCTAACGAGCCGTTGTTGGCGCTCACAGTAAATTTCACGGCTCATAGACCTCATAAATTTCCCTACAGTGCAGCACAGCACCTGGTGATTGGGGACCTCATTTCCTTCGATGCAAACACTTATTCTCTATACCCTGAAAGTCATATACCCGGTAACATAAATGGCGGGAAACTCAGTGGATCTCAAAGCTTGGAGCGCGTGTCAGCTGTTAGGAATGTTATGGCAGAGAGAGGAATCGACGCTTTTATAGTACCTACGTCTGACGCGCATAACTCTCAATATATAGCGCCTACGGATGCTAGACGGGAGTGGCTATCAGGTCTGTCGGGGTCCGCCGGTACAGCCCTCGTAACAGCCGACCACGCCTTACTGTGGACTGACGGCAGATACTTCACGCAATTCGATATGCAAGTTGATCCTCGTATTTGGACTCTCATGAGGATCGGTACTGATGTAACGATCGAGAGTTGGCTAGCGTCTAACATGACGAGAGGTTCAAGAGTTGGTATTGATCCAACGACCTACACACGCAGTTCTTGGACAACTTTGGAGAATGCAGTGCGTAACGCAAACATAAGTATTGTACCAATTTACGATAACACGGTGGACGAGGCTAGAAGAAGGGTTTCGGACCCCCCTCCCGCCAGGCCTAACGAGCCGTTGTTGGCGCTCACAGTAAATTTCACGGGGAGAGCGTCGAGTGAGAAAATATCGAGTCTAGTGGCTCAGACCCGTGCGAAAGGCGCTTCGGCGTTAGTGCTTACCGCGCTGGATGACATTGCGTACGTCTTAAACATCCGCGGCTCAGACATCCCATACAACCCCGTTTTCTTCTCCTATTTGGTCGTCCAAGTTGATTCGGTGATAATTTATACCGAGGAGGAGTTTTACATGGGACCCTCGTTTGCGACTATCGCCGGCGCCGGTCCCAACGGGGCAGTGATACATTACAAGCCGTCGCGTGACGCTGAACAAACCGTTATCGGCAGGAACGACATGTTGTTGGTGGATTCCGGAGGGCAATACATGGATGGCACGACAGATATCACACGCACGCGACACATGGGAACTCCCACGGATATACAAAAACAGACATTCACCAGGGTTTTGAAGGGTCAGATAATGTTGGCCACAGCCGTCTTCCCTAGAGGCACTCTAGGTAGGGATTTGGAGACGTTTGCTAGACGGTATCTGTGGGACGTCGGTTTGAACTACGCCCATGGCACGGGACACGGCGTCGGACACTTCCTGAACGTGCACGAGGGACCCGTCGGCATTATGAACATGGCCAGCGATCCGGGGCTAGAACCCGGGATCATTATGAGTAACGTCGAAGGAATTATGAAAAAAGAGCAGTACAAACAAATATTGGAGTGTAATGCTGTACCATCGGGACTTCGGCTAGCGAAGAACATTATCGGCAATTTCGCCGGTCGCAGTCCGACAGCGTTTTATACGATATCCCTCGCTCCACACCAGACCAGCTGCTTGGACGTCGACATCATGAGCGATGACGAGATAAGATACTTAAACGAATATCACGCGCGGGTGCTATCTACCCTGGGACCGATATTACGAGGACGTGATTTAGACAAAGACTACGAATGGCTGGAGAGAGAATGCGCTCCGATACAAAGAAGCTCGGCTGTTCTGGTCAAATCATCACCATTCCTCCTCATCATCATCACCCGTCTATGGTACATACCATAG

Protein sequence:

>DPOGS209428-PA
MSLQRLTALRALMAGHPTALAAYIIPTADAHNSEYISPADARREWISGFTGSAGTAVVTANKALVWTDGRYYTQFEKEADLTMWTLMKQSLPETPTMEKWLASNLIAGSVVGVDPHTMTREEWTPLQGVKIVGRPYDDVIEGLSNLARELSNMGDGEHSVWISNEASEAVHRAVSGEGVLKNPLNLISEVSPVALAKLVKNDVELEGFRKCHIRDGTAVCRFFRWLHQEVDSGNKITEVEAAERLLEFRKDEKDFMGPSFETISGAGENGAVIHYTPSSDSPRIITADDVYLLDSGGQYKDGTTDITRTRHMSDPTDLQKETFTRVLKGQIAIGAALYPVGVKGNVLDSLARKYLWDVGLDYAHGTGHGVGHFLNVHEGPSGISWRPYPHDPGLKMGQILSNEPGYYRVGEFGIRIEDLVETISVTNDTNHPRAKDLLGDYNGRGVLGFNTITLVPNQRKFIKTELLDDFECEKLKDRYESATSETLRFGHITNRAVASVLGSEIKILCAKFTQTNAAGVTARKKAMSLQRLTALRALMAGHPTALAAYIIPTADAHNSEYISPADARREWISGFTGSAGTAVVTANKALVWTDGRYYTQFEKEADLTMWTLMKQSLPETPTMEKWLASNLIAGSVVGVDPHTMTREEWTPLQTALSKAKMQLVAVESNLVDKARISLDDPPPKRPQNDIIHLPLEYTGKTAGEKIHDLRVGMLEKKASALVITALDEVAYTLNLRGSDIRYNPVFFSYLLLTPDTVTLFWSGGRIPDDIERNLSDEGVKIVGRPYDDVIEGLSNLARELSNMGDGEHSVWISNEASEAVHRAVSGEGVLKNPLNLISEVSPVALAKLVKNDVELEGFRKCHIRDGTAVCRFFRWLHQEVDSGNKITEVEAAERLLEFRKDEKDFMGPSFETISGAGENGAVIHYTPSSDSPRIITADDVYLLDSGGQYKDGTTDITRTRHMSEPTDLQKETFTRVLKGQIAIGAALYPVGVKGNVLDSLARKYLWDVGLDYAHGTGHGVGHFLNVHEGPSGISWRPYPHDPGLKMGQILSNEPGYYRVGEFGIRIEDLVETINVTNDTNHPRAKDLLGDYNGRGVLGFNTITLVPNQRKFIKTELLDDFELKYINSYHKRVLDTLGPILKNRGLMEDYAWLEKECSPLIAAQHLVIGDLISFDANTYSLYPESHIPGNINGGKLSGSQSLERVSAVRNVMAERGIDAFIVPTSDAHNSQYIAPTDARREWLSGLSGSAGTALVTADHALLWTDGRYFTQFDMQVDPRIWTLMRIGTDVTIESWLASNMRGSRVGIDPTTYTRSSWTTLENAVRNANISIVPIYDNTVDEARRRVSDPPPARPNEPLLALTVNFTAHRPHKFPYSAAQHLVIGDLISFDANTYSLYPESHIPGNINGGKLSGSQSLERVSAVRNVMAERGIDAFIVPTSDAHNSQYIAPTDARREWLSGLSGSAGTALVTADHALLWTDGRYFTQFDMQVDPRIWTLMRIGTDVTIESWLASNMTRGSRVGIDPTTYTRSSWTTLENAVRNANISIVPIYDNTVDEARRRVSDPPPARPNEPLLALTVNFTGRASSEKISSLVAQTRAKGASALVLTALDDIAYVLNIRGSDIPYNPVFFSYLVVQVDSVIIYTEEEFYMGPSFATIAGAGPNGAVIHYKPSRDAEQTVIGRNDMLLVDSGGQYMDGTTDITRTRHMGTPTDIQKQTFTRVLKGQIMLATAVFPRGTLGRDLETFARRYLWDVGLNYAHGTGHGVGHFLNVHEGPVGIMNMASDPGLEPGIIMSNVEGIMKKEQYKQILECNAVPSGLRLAKNIIGNFAGRSPTAFYTISLAPHQTSCLDVDIMSDDEIRYLNEYHARVLSTLGPILRGRDLDKDYEWLERECAPIQRSSAVLVKSSPFLLIIITRLWYIP-