Monarch geneset OGS2.0

DPOGS214132
Transcript | DPOGS214132-TA | 3765 bp
Protein | DPOGS214132-PA | 1254 aa
Genomic position | DPSCF300014 - 1408346-1423472
RNAseq coverage | 628x (Rank: top 20%)
Annotation
Heliconius | HMEL011365 | 0.0 | 74.48%
Bombyx | BGIBMGA006180-TA | 0.0 | 68.23%
Drosophila | CG11034-PB | 2e-125 | 38.12%
EBI UniRef50 | UniRef50_E0VJP6 | 1e-144 | 50.41% | Protein anon-37Cs, putative n=4 Tax=Neoptera RepID=E0VJP6_PEDHC
NCBI RefSeq | XP_974097.1 | 4e-146 | 49.40% | PREDICTED: similar to polyamine oxidase [Tribolium castaneum]
NCBI nr blastp | gi|380012135 | 4e-149 | 51.33% | PREDICTED: spermine oxidase-like [Apis florea]
NCBI nr blastx | gi|380012135 | 1e-146 | 51.44% | PREDICTED: spermine oxidase-like [Apis florea]
Group
Gene Ontology
GO:0016020 | 2.1e-75 | membrane
GO:0006508 | 2.1e-75 | proteolysis
GO:0055114 | 9.9e-60 | oxidation-reduction process
GO:0016491 | 9.9e-60 | oxidoreductase activity
GO:0008236 | 1.3e-44 | serine-type peptidase activity
KEGG pathway 
InterPro domain
[99-460] | IPR002469 | 2.1e-75 | Peptidase S9B, dipeptidylpeptidase IV N-terminal
[786-1232] | IPR002937 | 9.9e-60 | Amine oxidase
[546-726] | IPR001375 | 1.3e-44 | Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology group | MCL10744 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214132-TA
ATGATAGCTACAAAATATAATCTGGTAGTCCTAGCAACGGCATTGGTGTACTTCCTTGCCGATTCTGTTGCTTCACCTCAAGGACTCCTGAAGACATTCACTTTGGAGGAACTGGTGCCTTTGCAACATGAGTTCTTTCCTGACAGAGTGGCTGTACAATGGATATCAGATACGGAATATATTATAGCGGAACCAGATTCAGTAAATAAATATGACGCCATTACCGACACACACAGCACAATACTGGATAAGAAGGAATTGCTCAACATGAGCCAGTTTTCGGTGTCCTCGTTCTCGAACGATCAAAAATATGTACTACTAGTACTTACACCAAGTAGAAAGAAGATTTATAGATATTCAACATTGGCTGAATATTCGCTGTATGATCTTGAGAAGAATAAAATCGCTAACATCGCCCACGGACCGCTCCAGGTTGTTGTATGGGGCAGTGACAAGTCTTTAGCCTACGTTGAAGACAACAATGTGTACTATATACCTGACGTGGCTCAACCCGATGTTGTGACAGCACTGACGAAAGATGGCGTCCCAGGAGAAATATATCATGGTGTGACAGATTGGATTTATGAAGAGGAAGTGTTTAATGCAGCGGAAGCAATGTGGTTTTCACCTCATGGGACCTACTTGGCCGTAGCAACCTTCAATGACACTCAAGTGGAGTCCGCCTTATACCCCTACTACGGGGAACCGTCCGATTTTAATAGTCAATATCCTTTACTCGTTCATTTTAAATATCCTAAGGCAGGTCGCACGAATCCAGATGTGCAACTGCGTGTGTTCAATCTCAATGACACGTCCAGTGAGCCGATGATGATTCCAGCCCCTGTAGATATTGTGGGCCTAGATCACATTTTGGGGAGGGTCAATTGGGCTACTGATCAAAATCTCGTCGTTCTATGGCTTAACAGACGACAGAGTATTAGTGTTTTAGTGAACTGCAATCTAAAAGAGAACAAATGCAATATAGTGAAACAGCATAATGAACCCAATGGTTGGATTGATATTAACGAACCGTTTTTCGATAAAACAGGAAAGAAAATGTTAGAAATTCAACCCATGCATTACGAAGATCAGAGATTTATGCATGTAGCACATTTTGATTTCGAAACTCAAGAAACGACCGATTTGAGTCCAGGAAATTCCACAGTCACAGAAATATTGGGATGGGATCAGAAATCAGACATTGTTCTGTATATTGTATCCCCGGGAAATGAACCTTGGCAAAGACAACTGTGGGGTGCCTCTAAAGGAATCAATAGATGCATTTCGTGCACCAAACCGACTTGTCACAACGTTGACGGTATGTTTTCACCGGCAGGTAGCTATGGAATTGTATCGTGCAGTGCCGTAAATGTACCTCCAGTTACATACTTTTTCAAAAGCCAGAATAGAGGCTTTAAGATCATAACGGAAAACTCGAAATTGCTTGAAAAATTGAGTCGTTATAAAATGCCTTTGGTCTTATTTAACAAGATATCGTTAGAAGAGGATACGATGGCTCATATCAAGTTGTTGTTGCCACCTGAAATGAAACCAGGGAAGAAGTATCCTATGATAGTGAGGTTATACGCTGGACCCGGAACAACTAGAGTCAAAGACACCTATGATCTTGAATACTACAATCTTTATTTAAGCGGCAATCGTAGTTTCATAGTAGCGTCGATCGATGTAAGGGGTTCGGGCGCGATGGGTGTGGAGGCGATGCACGCCCTCAACAACGCTCTTGGGACCGTTGAAATTACCGATACTTTAACAGCTATCAGACGACTTGTGAGTATGTATTCGTTCATTGATACCGACCGTATTGGAGCTTGGGGATGGAGTTATGGTGGTTACGCTACCACTATGATGTTGATCAGAGACCATGACAAGATAGTGACGTGTGGCGCTGCTGTCGCTCCAGTTACTTCGTGGCTATATTATGATACAATTTACACGGAGAGGTATATGGATACACCTCAAAACAACCCAGTGGGCTATGA
AAACTCAGACCTGATGATGCAAGCTGAAAAACTCCGAGACCGCCGTTATCTTTTAGTACATGGCACTGGTGATGACAATGTTCACTACCAACACAGCTTGCAACTAGCCAAGGTGCTGCAAAGAGCTGACATTGCATTTGAACAAATGAGTTATACTGATGAAAATCATTCTTTGCGAGGTATCTTCGAAAACATGGCGGCGGATACAGATAAAATGGTAGTACTGCTGTCGGACAAACCGGGTATGCTGTACGACTGTGGGCCAGATTTACGTGACAGGGGCGTGTGTGGAATAGATCCATTCGATCCGAACAAATGTTTCCAAGAACCACGCGTGGTCATCATAGGAGCGGGTATGGCCGGACTCTCTGCCGCCTCAAGACTATCACAACGTGGCATCAATAATCTTGTTGTGCTTGAAGCTTATGAAAGACCAGGAGGCCGCATTCACTCTTGTTGGTTGGGAGATGTTGTTGCTGAGCTCGGCGCTGATTTGGCAAATAGTGATTATTTTACTCATCCTGTATACAACCTCTCTGCCGCAGAAAAACCTCCCCGTCCTGGTGTACCGGGTTCAGAACATACACGTGGACTGTTTAATAGTATTGTTACAAAAAAAGTGCCATATCCACCAACCGTATCTGCATATTATAAATTTCGCCAAATTGAAGAAGAAGCTAGTAATATTTTTTGCCTTGGAGGAAGCAAACAGCATGGATCATTAATTAATTTTATGAGTATAAGAATTCAACAGGAACTTCATGAATATCCAGAAGAACAGCAACATGATGCGGCTCGAATAATGTTTGGACTTACCCATATGATGAATGCTCGTTGTGGTGACGATACGGCAATGCTTTGTGCGGATCACACTGGCTGTTTTATGAACATGCCAGGAGGAGATGTGCGGGTGCCGTTGGGGACAATAGGCACGCTTGCACCACTGTTACGTCAAATACCCGAAGGTGCAATACGGTACTGTAAACCCGTGAACTGTGTATATTGGGGAACTTGCATCAAATCAGGATATCGATCTACAGTTTGTACAACTGATGGAGATGAATTCCCTGCAGATTATGTTATTATTACAGCTTCTATTGGAGTTCTCTATTCAAATTCAACAAGACTTTTTTGCCCATCACTCCCCGCTTCTAAAATAGACGCTCTCAGATGCTTCGGATTCGGGTACTGTAATAAAATTTATTTAGAGTATTGCCGTCCATTTTGGTTTTGGCATAATGGAAGCTTAGATTTTGATTACACTTATGAAACTTTATCTCATCGTAATGATTGGACACGAGGTATTACAGCAATACGTGTGGTGCCAAATAGTAAACATGTAATAAGCGTTCTTGTATTTGGTAAAGAAGCGTTGACACTAGAGGGACTTTGCGATAAAGACGTCGCAGAAGGAGTTACTGACCTTTTAAAAACATCGACGGGGAATCGTTATATTCCCTATCCGATTACAATTTTGCGATCTCATTGGGTTTCTGATCCATATTTTCAAGGTGTGTTTTCTTATGAAGGGAAGTGTACAGATGGAGAAGCACAAAGGGCTTTGGCGTGTCCTTTACCCGGTCCCAGTGAATCAATTCCACCTATTCTTCTATTTGCCGGAGAAGCAACTGTTCCAGCACATTATGGCACAATTGATGGTGCCAGAATAAGTGGAGTAAGGGAAGCGGAACGAATTGTACAATTAACAAAGCAATTCGGAGGACCACCTCTGCCAACTTCAATAAATCCCGTGTGTTGTGGCTGA

Protein sequence:

>DPOGS214132-PA
MIATKYNLVVLATALVYFLADSVASPQGLLKTFTLEELVPLQHEFFPDRVAVQWISDTEYIIAEPDSVNKYDAITDTHSTILDKKELLNMSQFSVSSFSNDQKYVLLVLTPSRKKIYRYSTLAEYSLYDLEKNKIANIAHGPLQVVVWGSDKSLAYVEDNNVYYIPDVAQPDVVTALTKDGVPGEIYHGVTDWIYEEEVFNAAEAMWFSPHGTYLAVATFNDTQVESALYPYYGEPSDFNSQYPLLVHFKYPKAGRTNPDVQLRVFNLNDTSSEPMMIPAPVDIVGLDHILGRVNWATDQNLVVLWLNRRQSISVLVNCNLKENKCNIVKQHNEPNGWIDINEPFFDKTGKKMLEIQPMHYEDQRFMHVAHFDFETQETTDLSPGNSTVTEILGWDQKSDIVLYIVSPGNEPWQRQLWGASKGINRCISCTKPTCHNVDGMFSPAGSYGIVSCSAVNVPPVTYFFKSQNRGFKIITENSKLLEKLSRYKMPLVLFNKISLEEDTMAHIKLLLPPEMKPGKKYPMIVRLYAGPGTTRVKDTYDLEYYNLYLSGNRSFIVASIDVRGSGAMGVEAMHALNNALGTVEITDTLTAIRRLVSMYSFIDTDRIGAWGWSYGGYATTMMLIRDHDKIVTCGAAVAPVTSWLYYDTIYTERYMDTPQNNPVGYENSDLMMQAEKLRDRRYLLVHGTGDDNVHYQHSLQLAKVLQRADIAFEQMSYTDENHSLRGIFENMAADTDKMVVLLSDKPGMLYDCGPDLRDRGVCGIDPFDPNKCFQEPRVVIIGAGMAGLSAASRLSQRGINNLVVLEAYERPGGRIHSCWLGDVVAELGADLANSDYFTHPVYNLSAAEKPPRPGVPGSEHTRGLFNSIVTKKVPYPPTVSAYYKFRQIEEEASNIFCLGGSKQHGSLINFMSIRIQQELHEYPEEQQHDAARIMFGLTHMMNARCGDDTAMLCADHTGCFMNMPGGDVRVPLGTIGTLAPLLRQIPEGAIRYCKPVNCVYWGTCIKSGYRSTVCTTDGDEFPADYVIITASIGVLYSNSTRLFCPSLPASKIDALRCFGFGYCNKIYLEYCRPFWFWHNGSLDFDYTYETLSHRNDWTRGITAIRVVPNSKHVISVLVFGKEALTLEGLCDKDVAEGVTDLLKTSTGNRYIPYPITILRSHWVSDPYFQGVFSYEGKCTDGEAQRALACPLPGPSESIPPILLFAGEATVPAHYGTIDGARISGVREAERIVQLTKQFGGPPLPTSINPVCCG-
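
Length check (illustrative sketch, not part of the original record): the transcript and protein lengths listed above can be cross-checked with simple arithmetic. The sketch assumes the transcript is coding sequence only, with no UTRs, which fits the nucleotide sequence starting with ATG and ending with TGA; the function names are hypothetical.

```python
# Lengths copied from the record above.
TRANSCRIPT_BP = 3765  # DPOGS214132-TA
PROTEIN_AA = 1254     # DPOGS214132-PA


def expected_cds_length(protein_aa: int) -> int:
    """Return the CDS length in bp: three bases per residue plus one stop codon."""
    return 3 * (protein_aa + 1)


def is_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True when the transcript length matches a UTR-free CDS for the protein."""
    return transcript_bp == expected_cds_length(protein_aa)


if __name__ == "__main__":
    print(expected_cds_length(PROTEIN_AA))
    print(is_consistent(TRANSCRIPT_BP, PROTEIN_AA))
```

A mismatch would only mean the transcript model includes untranslated regions or the record lengths disagree; it would not by itself indicate an annotation error.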