Monarch geneset OGS2.0

DPOGS215059
TranscriptDPOGS215059-TA3018 bp
ProteinDPOGS215059-PA1005 aa
Genomic positionDPSCF300208 + 233325-253244
RNAseq coverage526x (Rank: top 24%)
Annotation
HeliconiusHMEL0048283e-16337.50% 
BombyxBGIBMGA006874-TA2e-16237.51% 
DrosophilaCht7-PA0.071.26% 
EBI UniRef50UniRef50_Q7QIJ10.073.95%AGAP006898-PA n=8 Tax=Arthropoda RepID=Q7QIJ1_ANOGA
NCBI RefSeqXP_002425481.10.079.62%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3852584750.088.69%group III chitinase-like protein [Plutella xylostella]
NCBI nr blastxgi|3852584750.088.69%group III chitinase-like protein [Plutella xylostella]
Group
Gene OntologyGO:00060321.8e-153chitin catabolic process
GO:00045681.8e-153chitinase activity
GO:00038248.2e-122catalytic activity
GO:00431698.2e-122cation binding
GO:00059758.2e-122carbohydrate metabolic process
GO:00045535.3e-114hydrolase activity, hydrolyzing O-glycosyl compounds
GO:00060301.1e-17chitin metabolic process
GO:00080611.1e-17chitin binding
GO:00055761.1e-17extracellular region
KEGG pathwayphu:Phum_PHUM2037400.0 
 K01183 (E3.2.1.14)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[109-454] IPR0115831.8e-153Chitinase II
[433-477] IPR0137818.2e-122Glycoside hydrolase, subgroup, catalytic core
[111-454] IPR0012235.3e-114Glycoside hydrolase, family 18, catalytic domain
[109-483] IPR0178531.1e-103Glycoside hydrolase, superfamily
[931-1003] IPR0025571.1e-17Chitin binding domain
Orthology groupMCL13685 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215059-TA
ATGTGCCTGAGTCTGAATATCCTGGATTATATACACGTGGTCGATGATATAAAAAAGTTTTATCCTAAAATTATGACTCCCTCCGAGCTGCCACCGAGACTCCTTCGGTATATTATGACCTTTCTCTTCATAATCTCAGTAATAGCACCGTCCTCAGATTCATCAAGCGTCCGCAGAAGATTACGGAAACCGAGCAAAGTATCCACTTCAGTAACAACCAGTGTATCAAGATCCACGGATCAGGTGATATCGGCTAGCGTGAACAGGCCAAAGATCCGTGGCCGGCCCAGTGTCGCTAGCCGCAAGTCATCTGCGGCCATAGACAATTCTATCGTCTGTTACTACACGAACTGGTCTCAGTACAGAACAAAAATTGGTAAATTCACACCCGAGGACATACAGCCAGACCTTTGCACTCACGTCATTTTCGCCTTCGGATGGATGAAGAAAGGAAAGCTCGGCTCCTTCGAATCCAATGACGAGACCAAGGATGGCAAAGCTGGGCTTTACGACAGAATCAACGAACTAAAAAAAGCTAACCCGAAATTGAAGACTCTTCTGGCTATAGGCGGCTGGTCCTTCGGTACTCAAAAGTTTAAGGATATGACAGCGACTCGCTACTCGAGACAGACATTCATATACTCAGCGATACCGTACCTCAGGGATAGGAACTTTGACGGTTTGGATGTGGACTGGGAGTACCCTAAAGGCGGGGACGACAAGAAGAACTACGTGTTGTTGTTAAAAGAATTACGCGAAGCATTCGAAGCGGAAGCGCAAGAAGTGAAGAAGCCACGTCTTCTCCTGACCGCTGCTGTTCCAGTCGGACCTGACAACATCAAAAGTGGTTATGATGTTCCCGCTGTTGCTAGTTACTTGGACTTCATAAACGTCATGGCTTACGATTTCCACGGTAAATGGGAGAGGGAGACAGGACACAACGCACCTCTTTACTCTCCATCATCAGATTCAGAGTGGAGAAAACAGCTGTCAGTAGACCACGCCGCCCACTTGTGGGTCAAACTTGGTGCTCCCAAGGAGAAACTGGTTATTGGGATGCCAACCTACGGGAGAACCTTCACCCTGTCCAACATGAACAATTTTAAAGTAAATTCCCCGGCGAGTGGTGGCGGTAAGGCTGGGGAGTACACAAAGGAGGCGGGCTTCCTGGCGTATTACGAGGTTTGTGAAATTTTGAAAAATGGTGGAGCTTATGTTTGGGATGAAGAAATGAAAGTTCCCTACGCCATTCAAGGCGACCAATGGGTCGGCTTTGACGACGAAAGGTCCATAAGAAACAAAATGAGGTGGATCAAAGACAATGGTTTCGGTGGTGCCATGGTGTGGTCCGTTGACATGGATGACTTCTCTGGGTCCGTCTGCGGCGGAGACGTGAAGTATCCTCTGATTGGTGCTATGAGGGAGGAACTTCGCGGCATATCTCGTGGTAAAGACAAGAAGGACGTCGACTGGTCTAAAGTAGCCGCCAGCGTCATCGTGGAAGTGACAGAGAAACCTTCACCCATCAAACTCAGCCTGTCGGAGATACAGGAGAAGCTTAGCAAAATAAAGAAACCCACCAAGACTCACGTTATTAAAAATAACAACGCAGTATCGATTGACAAAAACAGGCGTGAGCCACAAATCCTTTGCTACCTGACATCATGGTCGTCTAAACGGCCCAGTGCTGGTCGTTTTATGCCAGAGAACGTTGATCCCACACTCTGTACCCATATTATATACGCCTTCGCTACACTCAAGGATCACAAACTCTCTGAAGGCGACGAAAAGGACTCAGAAATGTACGATAAAGTGGTTGCTTTGAGAGAAAAGAACCCGAACTTGAAGGTATTATTGGCAATCGGAGGCTGGGCGTTCGGTTCGACACCCTTCAAAGAACTCACGTCCAACGTGTTCCGCATGAATCAATTTGTTTACGAAGCCATCGAGTTTCTGAGAGACTATCAGTTCAACGGGCTGGATATAGACTGGGAATATCCAAGAGGTGCTGACGACAGGGCCGCATTTGTTTCTTTACTTAAAGAACTTCGGTTGGCCTTTGAGGGAGAAGCGAAGACTTCAGGCCAACCTCGCCTGTTGCTGTCTGCTGCCGTTCCAGCCTCCTTTGAGGCTATTGCAGCTGGATACGACGTTCCTGAAATATCTAAATATTTGGATTACATAAACGTGATGACATACGACTTCCACGGTCAGTGGGAACGTCAAGTTGGTCACAACAGCCCTCTTTTCCCGCTCGAAAGCGCCACCAGTTACCAGAAGAAACTCACCGTGGACTATTCAGCTCGCGAGTGGGTTCGTCAAGGTGCACCCAAGGAAAAGCTAATGATAGGAATGCCGACATATGGAAGGTCATTCACTTTAATCAATGAGACTCAATTTGATATCGGAGCACCCGCCTCTGGAGGTGGTCTGACGGGTCCATTTACAAACGAGGCTGGCTTCATGTCGTACTATGAGATCTGCGATTTCCTCCGTGAAGACAACACCACTTTGGTCTGGGACAACGAACAGATGGTACCCTTCGCGTACCGACAGGATCAGTGGGTTGGATTTGATGATGAAAGATCACTCAAGACTAAGATGGCGTGGCTCAAAGAGGAAGGTTTTGGTGGTATCATGGTGTGGTCTGTGGACATGGATGACTTCCGAGGCTCGTGCGGTACCGGCAAGTACCCGCTCATAACCACCATGAAGCAGGAGCTCGGAGACTATAAAGTTAAACTGGAATATGACGGCCCCTATGAGTCATCCAACCCTAATGGCCAATATACTACTAAAGATCCTACCGAAGTGGTTTGCGAAGAGGAAGATGGCCACATTTCCTATCACCCAGATAAGGCTGACTGCACCATGTACTACATGTGTGAGGGTGAACGCAAACACCACATGCCGTGCCCATCGAACCTAGTCTTCAATCCCAACGAGAACGTCTGCGATTGGCCTGAAAACGTCGAAGGTTGTACTCACCACACCCAAGCACCACCTGCTAAAAGGTAA

Protein sequence:

>DPOGS215059-PA
MCLSLNILDYIHVVDDIKKFYPKIMTPSELPPRLLRYIMTFLFIISVIAPSSDSSSVRRRLRKPSKVSTSVTTSVSRSTDQVISASVNRPKIRGRPSVASRKSSAAIDNSIVCYYTNWSQYRTKIGKFTPEDIQPDLCTHVIFAFGWMKKGKLGSFESNDETKDGKAGLYDRINELKKANPKLKTLLAIGGWSFGTQKFKDMTATRYSRQTFIYSAIPYLRDRNFDGLDVDWEYPKGGDDKKNYVLLLKELREAFEAEAQEVKKPRLLLTAAVPVGPDNIKSGYDVPAVASYLDFINVMAYDFHGKWERETGHNAPLYSPSSDSEWRKQLSVDHAAHLWVKLGAPKEKLVIGMPTYGRTFTLSNMNNFKVNSPASGGGKAGEYTKEAGFLAYYEVCEILKNGGAYVWDEEMKVPYAIQGDQWVGFDDERSIRNKMRWIKDNGFGGAMVWSVDMDDFSGSVCGGDVKYPLIGAMREELRGISRGKDKKDVDWSKVAASVIVEVTEKPSPIKLSLSEIQEKLSKIKKPTKTHVIKNNNAVSIDKNRREPQILCYLTSWSSKRPSAGRFMPENVDPTLCTHIIYAFATLKDHKLSEGDEKDSEMYDKVVALREKNPNLKVLLAIGGWAFGSTPFKELTSNVFRMNQFVYEAIEFLRDYQFNGLDIDWEYPRGADDRAAFVSLLKELRLAFEGEAKTSGQPRLLLSAAVPASFEAIAAGYDVPEISKYLDYINVMTYDFHGQWERQVGHNSPLFPLESATSYQKKLTVDYSAREWVRQGAPKEKLMIGMPTYGRSFTLINETQFDIGAPASGGGLTGPFTNEAGFMSYYEICDFLREDNTTLVWDNEQMVPFAYRQDQWVGFDDERSLKTKMAWLKEEGFGGIMVWSVDMDDFRGSCGTGKYPLITTMKQELGDYKVKLEYDGPYESSNPNGQYTTKDPTEVVCEEEDGHISYHPDKADCTMYYMCEGERKHHMPCPSNLVFNPNENVCDWPENVEGCTHHTQAPPAKR-