Monarch geneset OGS2.0

DPOGS201447
TranscriptDPOGS201447-TA6708 bp
ProteinDPOGS201447-PA2235 aa
Genomic positionDPSCF300006 - 797122-807785
RNAseq coverage380x (Rank: top 31%)
Annotation
HeliconiusHMEL0155010.052.48% 
BombyxBGIBMGA002696-TA0.046.35% 
DrosophilaCda5-PB0.073.85% 
EBI UniRef50UniRef50_Q16JD90.078.57%Putative uncharacterized protein n=2 Tax=cellular organisms RepID=Q16JD9_AEDAE
NCBI RefSeqXP_001663505.10.078.57%hypothetical protein AaeL_AAEL013367 [Aedes aegypti]
NCBI nr blastpgi|1571356010.078.57%hypothetical protein AaeL_AAEL013367 [Aedes aegypti]
NCBI nr blastxgi|1571356010.037.21%hypothetical protein AaeL_AAEL013367 [Aedes aegypti]
Group
Gene OntologyGO:00059755.7e-43carbohydrate metabolic process
GO:00038245.7e-43catalytic activity
GO:00168104.3e-16hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
KEGG pathway 
InterPro domain[1848-2170] IPR0113305.7e-43Glycoside hydrolase/deacetylase, beta/alpha-barrel
[2075-2167] IPR0025094.3e-16Polysaccharide deacetylase
Orthology groupMCL15663 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201447-TA
ATGTACAGTCACGAGCTCCAAACATGTGATTGGCCGCGTAACGTAGGCTGCGACGCCACCGGTGCCGTCATAGCTGACGATTACGAACGGCTAAACGAAAGACAGCCTCCACCTCCTACATCTCGAAGAAATCCACCTCCACCTCCTCCGTCCAGAGCACAGCCCCATCCAGTTATTACTTCAAGAGGGCAACCAAAATTCAACCGACAAGAATATGAAAAACAACAACAGTTATATGCAGAAGTAGATGACTTACCTCCTGTGGAAGAAATTGAGAATGATAGGCAGCAAAGGGTATACAGAGGACAACCATCAACTATTGGACAAGTTCAAAAAGATAGGGACGGTTACCATGGATCGCAGGGTGTTAGTGCTGGGAGAACACTTAACTCTAATATTATTCCTTCCTCAATTGCGCAAAATAGTAAAATTGGATCGTTTTCTTTTGGGACGCAACTAGAAGACAGAAGAACAGCCACTGCCACCCAAACTCCTCAGTCTTATAGAGAAGATATATATGACGTTTTGACTGACTCTTCTGACTTAACTAAAGACTTTACGACTGGTGAAAGGACTATTAAAGATATAAGTGATAACACTTTATCGAGGAAAAGAAGAGATGTTGAATCCCATTTAAATGCATCTTCTGTACCTGATAAAATTTCTAGTAGGAACGATAATGACCAAGAAATGGAATATATAGAGTTTGACCCAGAATCTGATGAAGATGAATACCAAGATGATGAATTGGATGATAGTGAAAGAGGGAAGAGACAAATTAGGTTTTATATAAAAGAGGGACATAAGGTTCCTCTTAAGTTTACCAGTTCTAAACCTGTGAAATTTGTAAACTTGCGGCATAATAATCAAAATCCCAACATTCATAATCCACGCTACCATACAAATCATAATAGTGGATTTAATGATAATACTTATACTTCATTTGTTGTGACTAATAATAACAACCATTATCCAGTAAGTGGCCACGTCAATTATCAAACTGTTGATCATTATCAAAATCAAAGGCCATTTAAAGCAAGTTTACCTGATTTATCAAAACCTAAATATTTACCCAATAATGCAGGAACTCAAATTATTACAAAAAGTCCTCCTATTTCTTCTTTAAATAACAATCAAAATCCATTTGCTTCTTTAGCTGGTGGGTTTTATAACAATGCGTTGAAAAATGATCATAATACAATATCGCAAGGACAAATATCCTCAGTGAAACCAAATTTATCTTCACCACATGATTTAACGCATTTCCCCAGTACAATTGTTACTGGAAGACCTTTATCATCTTCTACTCAGAGTTTAAATTATAATGTAAAAAATGATGAAAATAAAAAACTAAATAATGTTTACAAGAACAATAATAACAAATTTACAAAAGACGAAGACTATAATGAAGATGAAGACTATTCTGATGAAGAAGCAGAATCTTCAGAAGAAGATGAGGAAGATCACAAACCAAATTTCTCACCACCTATTACTGTCCCTTATAGTTTTAATCATCCCAGAAATAAATATGCTAACATTGATAATCCATTTGCCCGACCGAATTTTAATTTTGATGAATTTTTGGCTAAATTAAGAGATGATCAATATTCGGTTATCGGACTATCAACTCAAAAACCAAAAGCTTTACAAAATAATGATGTACAAACAGATTCACCTATAAATACAATTCCATCTATTAATAGTCACAAAATATCATCTTTTAAAGGTATAAGTACTCCAAAGCCATTTACAATGTCTGACGTACCGCAAAATTCAACATATGCTATAAATGAAAACATAAAAAATTTCGCTCCCCAACATGGCTCTGATTACGTTTTGAGGTCACAAATAACTAATAACCCATACTTTCAACACAATCCAAAAAATGTTAATAGACTTCCACAAAAGGATGCTGGTATACCTTTAGAGACTTTAAAACCGAAATTAAAGCTACCTAACTTCCAGGATAATAGACCACTTTCTATAAATTACAATTTCAATACTCCAGCAGAAGGTAGTAATCAACCGTCAAATACAATAAGGCCAATTGTCACTCCGACATCTTATTATAGCACTCCAAACAACAACAATAAATTACCACTACAATCTCATCATATAAATAATGCAAAACCGTTTTTGGTATCAACATCGCCACCTTTCAATAGATATGTGTTAAGCATTTCGCAGTCTACTTCGAGACCTGCTACATTTGTAAATCAGCAAGTTTCTCCTGTACAAAATTACTGGAAAAAACCATCTATTGCTTTCACACCTACAACACCATCTTCGATTAGTCCAAATACAGTCACAGAAATTGCAAAATGGACAAAATTGTATTCACAAGCAACACAATCATCTACAATAATACCATTGAGTGGTAAAAATATAGTTGCTGATGTAAGTACTAAAGCTCCGCCAAAGCGTAAACCTATACCGAAACCTTCGCCGGAAATGAATGATTACTATTATGATGACGAAGATGAACAGTATTATTATGAACCAATCGTTAAGCCTAAATATATGCCAAGCTCCGAAGTTATGCCTCAAAGACCGCCTATGGCACAAAACTATGAAGAATACGACGATTCCAATGAACAACTTGAAATTCATACAGATTCAAAGATTCAAGAACATCAAAAAATACCTAGTAGTCAAAATAATTTTAAAGTAGAAAGTGCAACAAAAAACCACAATGATGTATCCGTTGTCACCAAGTCACCATATAAACAATCAAACAAAATTATTAACGGCAAAATTCCCGTACCAGTAATGGTTGATTATGACGATTCAACAAATTCCATGTCTCATAATAGTCGCAATCGAACGTATTACTTAAGGAAGCCAAATAAGCCTGAGAATAATCCAAATACATTAAAACCTCCTAAGTATTTGAATCAGACAACCTTGCGGCCTTATACTGTCAGACATAGATTGGCAATGCCAACGACTGAAAAAAATCAGGTTAATCAAGATGTAGAAAATAAACAAATGAGAGGAAGAATACGACATCACAATATAGTTGCGGAAATGAAATTGACTACTCCTCATGACAGCTTTAAACAAGAGACTCGAATTACTAAGACTGGTCATGACGATAAAACGAACAGCCTGGAACCCACAGAGAGCGTTACACCTTCCTCGTATTCTCCAAGTCCGCGACCAAAAATGCTTTATAACGGTTCTCAGACTTACAGTCCCGATCAGTATGATCCTTATTACGCTGTATATGATGAAGACGGTGAACTGTACAAGGATACAGACTATGTGCAGCAATATAACTCAGCTTCACTCCGACCAGCAGTTCAGCAAACGTACAGAGGCACTCCGCCCTCACGTCGGCCAGTAGAGACCTATTCAGCAAGACCTGTCGCAGATGATTACGACGATGCTCTTATTCAAGGACCTATTATAAATCAAAACCAATACCAGACATCTGTTCGTCAACCGGCAAGGGGTGAAGGTAACGAATTGGGTTATGATCCTATACCAAGCAGCGTGAGGACGACTATTTATGAAGCCACTTTCCCAAGCACGAATCCAACAACAACTAGCACCAGCACTACTACCACTACCACTACCACTACCACAACCACAAGACGACCAACAACCGCACCTTACACGGAAGCAATGACCCCGTCTCGTTATTCCCCAAGATCATCCACTGATGAGAATCGTAGTCGCCTCCCTGCTTCCTCTACCGTCCCAAATGCCATTGATGGATTTGTTACATCTATTCCTTCTACATTATCAACCTCCATAAACTTATCCTCTCACGCTAAACCTTTTGAAAAGACGGATAATGCACATAGGTTACTTGAACCGTCGACTGAAATAACGACTCGTAGACCAACTACTCGTGTGACTGAAAACAATAAAAATTTCAGACTTTATTATGATATCAATGAGAAAGACGTTACCCAAACCGAAATAAATAAAGGACTCCGAAATGGTGAACAAGCCAGCGAGGAAGTTGATATCGTAAATATAGCATCTCATCAGAAAGAAAATGACTTCGAAGGCCCAGCTGATAACTATATGTCCTCATCTGTACGAACCCCTACATATAGGCCCAGAATTCGATTTAGTAATAGTGCCACACCAACAACGACAACAAAAGAAGAAGTTTTTAAGCGACCGGTATCGGAATCAACAACAAGTTCAATAAGCCCTAAAACTTTCACCGATAGTAGGTTAGACCGAGCAAATGAGAATAGTGAAAATAAACATTATGTTGACAGTAATAAATCTCAAGGCTTCAAACAAGTGGATTATATCATTCAAGATGAAGACTTAAAATACAAATATCCAATAGTTTCACCGTATAAAACATTAGATAATTTAAGAAATACTGGTAGAGACTATCTGTCTAGAGATGATAAAATAGATATTCCTTCATCAACTATGTCAACTGGTACTTCAAGTTTAAGATCAGTAACAACAACATCATCAGACAGAGTTACAAGAAAAAAACCTTCCTACTATCTTTACAATATCAAGGATGATGAAAATGAGCAAACTACGGAAATATATAAAAGTGGTGTGAAACAAAAAAATCGAATCTTATTTAAAGATGCGCATAGTAGCACACCAAAAATTGTAGAATTTATTTCACCTTCTACAACATTGGAGACAAATGAAGAGGTGGTAAATATAGGATTTAAAAATAAAAAACAAAATGCTAGCGTTCAAAAGCCATCTCGGAATAGTTTTAAACACGTCAGCATTTTGACAGAACCTAGTGTTCTTAGAAACATTTCCCCTTCTGTTGATAATCCCACTACAGTTATCAATCATGTTCCAGAATTGAAATATGATTCAGTTCAAACTGCAAATCCTACAATTTCTCTTTCATCAGAAATTCCAATGATGGATGCAGCGACAGAAGGTGCTTTCAGAGATATAATACCAAATATTGACTATGAATTAACAACGAAATCTCATTCATTAAAATCCAAATTTAGTAGCTTAATGAATAAAGGATCCAGAGACGAAGAACCGAGATCTCATAAGTTTAAGAAAATAATTGAAACTGAGATAAAACCAGTTGACAGTAATATTGATCATTATTCTGAAAGAAGTAATTCTAACCTAGAAACTAACATAGATAGCAAAGATTTAACGACAGAAACTCAAATAAGACTTACAAATGACTTTGTTAATGACATTGCCTCTACGACAAGCACAACTAAATCAACTACAACTTTTAAAATTGACGTTGAGAACAAGCAGACCTCTAAGAGCTTATCCTATCCAACTCGAGCCTCTCGTATCAACCCTGCCATAAAGTTAGCCGCGGCAAGTGTTGGAGGTGGGCGCAGGAGTTACCAATCGCCATCCAATTGTTCATCAGACAACAGTCTGCAAGTGAATCCAAAATGCAACGAAATCAAATATCCGAGGCCCACAAGCACAAGAGGTCGAGGTTCGGCACATTTTTCAACTTCTGGTGGATCTGAGGCTCCACAGCAGACTCCTAACAGAGGAACACCTCCAACTCGCAGTCGTCCTACGTTAAAACCCTCAACAGCCATAGTTACAAAGACTGTGGATATCAATATTTACGCTCATCCACCATCGCGCCCCGCTCCTGTTTACCCACAACCGACACCTGACAAGACAGCTGCCAAATGTAGAAAAGATGTATGTCTTCTACCAGATTGTTTCTGCGGCGGAAAAGACATTCCTGGCGAATTGCCGGTGGATAAGGTGCCTCAAATTGTTTTGCTGACTTTCGATGATTCCGTAAATGATTTGAACAAGGGCTTGTACACGGATCTATTTGAAAAAGGACGGGTTAACCCAAATGGTTGCCCTATAACAGCTACCTTTTATGTATCTCACGAATGGACGGATTACAGTCAAGTTCAAAACTTATACTCGGCTGGACATGAAATGGCATCTCACACAGTATCTCATAGTTTTGGAGAGCAATTCTCTCAGAAAAAATGGAACAGAGAAGTCGGAGGTCAAAGAGAGATTTTGGCAGCGTACGGTGGTGTTAAACTCGATGATGTTAGAGGAATGCGTGCACCTTTCTTATCTGTAGGAGGAAATAAAATGTTCAAAATGTTGTACGACTCCAACTTTACATACGATTCATCATTGCCAGTATATGAAAACAGACCACCGAGTTGGCCTTATACTTTGGACTATAAACTTTTCCACGATTGCATGATACCACCTTGTCCCACCAAATCTTATCCAGGAGTTTGGGAAGTTCCTATGGTCATGTGGCAAGATTTGAATGGTGGCCGTTGTTCTATGGGCGATGCTTGTGCCAATCCGCCGGATGCAGAAGGTGTTTACAAAATGATTTTGAAAAATTTCGACAGACATTATACCAGTAACAGGGCTCCTTTTGGTCTCTTCTATCATGCAGCTTGGTTCACTCAACCTCACCACAAAGAAGGTTTCATCATGTTCCTAGACTTCATTAATAAAATGAATGATGTTTGGATTATCACAAACTGGCAAGCCTTGCAGTGGGTGCGAGACCCCACCCCAATATCCAGATTAAACAATTTCCAACCGTTCCAGTGCAATTATGCGGATCGGCCGAAAAAATGCAACAATCCTAAGGTTTGCAACTTGTGGCATAAATCCGGAGTAAGGTATATGAGGACATGTCAACCCTGTCCTCCAATTTATCCTTGGACTGGAAAAACTGGCATCTCATCATCGCGCATTGACAACGAAATTGAAGAATAG

Protein sequence:

>DPOGS201447-PA
MYSHELQTCDWPRNVGCDATGAVIADDYERLNERQPPPPTSRRNPPPPPPSRAQPHPVITSRGQPKFNRQEYEKQQQLYAEVDDLPPVEEIENDRQQRVYRGQPSTIGQVQKDRDGYHGSQGVSAGRTLNSNIIPSSIAQNSKIGSFSFGTQLEDRRTATATQTPQSYREDIYDVLTDSSDLTKDFTTGERTIKDISDNTLSRKRRDVESHLNASSVPDKISSRNDNDQEMEYIEFDPESDEDEYQDDELDDSERGKRQIRFYIKEGHKVPLKFTSSKPVKFVNLRHNNQNPNIHNPRYHTNHNSGFNDNTYTSFVVTNNNNHYPVSGHVNYQTVDHYQNQRPFKASLPDLSKPKYLPNNAGTQIITKSPPISSLNNNQNPFASLAGGFYNNALKNDHNTISQGQISSVKPNLSSPHDLTHFPSTIVTGRPLSSSTQSLNYNVKNDENKKLNNVYKNNNNKFTKDEDYNEDEDYSDEEAESSEEDEEDHKPNFSPPITVPYSFNHPRNKYANIDNPFARPNFNFDEFLAKLRDDQYSVIGLSTQKPKALQNNDVQTDSPINTIPSINSHKISSFKGISTPKPFTMSDVPQNSTYAINENIKNFAPQHGSDYVLRSQITNNPYFQHNPKNVNRLPQKDAGIPLETLKPKLKLPNFQDNRPLSINYNFNTPAEGSNQPSNTIRPIVTPTSYYSTPNNNNKLPLQSHHINNAKPFLVSTSPPFNRYVLSISQSTSRPATFVNQQVSPVQNYWKKPSIAFTPTTPSSISPNTVTEIAKWTKLYSQATQSSTIIPLSGKNIVADVSTKAPPKRKPIPKPSPEMNDYYYDDEDEQYYYEPIVKPKYMPSSEVMPQRPPMAQNYEEYDDSNEQLEIHTDSKIQEHQKIPSSQNNFKVESATKNHNDVSVVTKSPYKQSNKIINGKIPVPVMVDYDDSTNSMSHNSRNRTYYLRKPNKPENNPNTLKPPKYLNQTTLRPYTVRHRLAMPTTEKNQVNQDVENKQMRGRIRHHNIVAEMKLTTPHDSFKQETRITKTGHDDKTNSLEPTESVTPSSYSPSPRPKMLYNGSQTYSPDQYDPYYAVYDEDGELYKDTDYVQQYNSASLRPAVQQTYRGTPPSRRPVETYSARPVADDYDDALIQGPIINQNQYQTSVRQPARGEGNELGYDPIPSSVRTTIYEATFPSTNPTTTSTSTTTTTTTTTTTTRRPTTAPYTEAMTPSRYSPRSSTDENRSRLPASSTVPNAIDGFVTSIPSTLSTSINLSSHAKPFEKTDNAHRLLEPSTEITTRRPTTRVTENNKNFRLYYDINEKDVTQTEINKGLRNGEQASEEVDIVNIASHQKENDFEGPADNYMSSSVRTPTYRPRIRFSNSATPTTTTKEEVFKRPVSESTTSSISPKTFTDSRLDRANENSENKHYVDSNKSQGFKQVDYIIQDEDLKYKYPIVSPYKTLDNLRNTGRDYLSRDDKIDIPSSTMSTGTSSLRSVTTTSSDRVTRKKPSYYLYNIKDDENEQTTEIYKSGVKQKNRILFKDAHSSTPKIVEFISPSTTLETNEEVVNIGFKNKKQNASVQKPSRNSFKHVSILTEPSVLRNISPSVDNPTTVINHVPELKYDSVQTANPTISLSSEIPMMDAATEGAFRDIIPNIDYELTTKSHSLKSKFSSLMNKGSRDEEPRSHKFKKIIETEIKPVDSNIDHYSERSNSNLETNIDSKDLTTETQIRLTNDFVNDIASTTSTTKSTTTFKIDVENKQTSKSLSYPTRASRINPAIKLAAASVGGGRRSYQSPSNCSSDNSLQVNPKCNEIKYPRPTSTRGRGSAHFSTSGGSEAPQQTPNRGTPPTRSRPTLKPSTAIVTKTVDINIYAHPPSRPAPVYPQPTPDKTAAKCRKDVCLLPDCFCGGKDIPGELPVDKVPQIVLLTFDDSVNDLNKGLYTDLFEKGRVNPNGCPITATFYVSHEWTDYSQVQNLYSAGHEMASHTVSHSFGEQFSQKKWNREVGGQREILAAYGGVKLDDVRGMRAPFLSVGGNKMFKMLYDSNFTYDSSLPVYENRPPSWPYTLDYKLFHDCMIPPCPTKSYPGVWEVPMVMWQDLNGGRCSMGDACANPPDAEGVYKMILKNFDRHYTSNRAPFGLFYHAAWFTQPHHKEGFIMFLDFINKMNDVWIITNWQALQWVRDPTPISRLNNFQPFQCNYADRPKKCNNPKVCNLWHKSGVRYMRTCQPCPPIYPWTGKTGISSSRIDNEIEE-