Monarch geneset OGS2.0

DPOGS201894
TranscriptDPOGS201894-TA1617 bp
ProteinDPOGS201894-PA538 aa
Genomic positionDPSCF300191 + 431666-434217
RNAseq coverage804x (Rank: top 16%)
Annotation
HeliconiusHMEL0147432e-9262.55% 
BombyxBGIBMGA006067-TA4e-17254.92% 
DrosophilaMal-A4-PA1e-4726.08% 
EBI UniRef50UniRef50_O160981e-4428.19%Maltase 1 n=30 Tax=cellular organisms RepID=MAL1_DROVI
NCBI RefSeqXP_002049664.13e-5129.16%GJ21717 [Drosophila virilis]
NCBI nr blastpgi|1954303566e-4927.96%GK21507 [Drosophila willistoni]
NCBI nr blastxgi|3389745004e-4027.04%maltodextrin glucosidase [Bradyrhizobiaceae bacterium SG-6C]
Group
Gene OntologyGO:00431696.2e-42cation binding
GO:00059756.2e-42carbohydrate metabolic process
GO:00038246.2e-42catalytic activity
KEGG pathwaydme:Dmel_CG86938e-46 
 K01187 (E3.2.1.20, malZ)maps-> Starch and sucrose metabolism
    Galactose metabolism
InterPro domain[181-420] IPR0137816.2e-42Glycoside hydrolase, subgroup, catalytic core
[19-464] IPR0159021.6e-40Alpha amylase
[18-420] IPR0178532.3e-38Glycoside hydrolase, superfamily
[42-346] IPR0060471.2e-20Glycosyl hydrolase, family 13, catalytic domain
[27-370] IPR0065892.5e-07Glycosyl hydrolase, family 13, subfamily, catalytic domain
Orthology groupMCL25201 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201894-TA
ATGTTGGTGGTAGCGTTAGTTTTCTTTTCACTGGGAGCCTCGCTCGCCGACCCCTGGTGGAGCAAGGCTATCTACTACAGATTACTAGTGGACTCATTTAAGGACCTCGACGGAAACGGCCTTGGAGATTTAGAAGGAGCTACAAAACAGCTAAGTTACGTACGCGCTATCGGAGCTGATGCTGTAATCTTATCTCCCATCTCCGAGAAAAGCTTGGACTGTAATTCACCCGGGACCATCGATTTTGTTAACATTGAAACAAGATATGGTACTATCGATAACTTTAATGCACTGCTGGCTAAAGCAAATAAATTAGAGCTCAAAGTTTTGATAACATTACAACTACAAACTATAAGTTCAAACTCAATTTTATTTAATTCAAGCGCCGAAAGAAAGACTGGCTTTGAAGATGGAATTGTATGGATCAGTGGAGCAGCTGAGGAAGCACCGGCCAGTCGAGCGTTTAGAAACTGGACGTGGCATGAATACAGAGGAGCTTATTATGCAACTGTTAACAAAGAGGCGATATTAAATATTTGCTCTGAAAGTGTCGTTGCGATGCTATCTGAAGCTATGTGCGAGTGGTTGAAGAGGGGAGCCGCTGGAGTTCTGCTTAATCCTGACTTTTTAATGAATTACGAATGTGGTCAAAAACTTGTCAAAAGGATAGCGAAAGAGGCCGTGGCCTGCTCGGGGAATCAACATTATGATCTGCCGGTTATTTTAGTAGAATCCTCTCTGGAGGCGGAAGTCGCTACTAAATATTACGCTGAGGGCGGTGTAGGTGCCAATAGCGTAATAAGTTATGCCTTTTCTACAAAAGCCAAACGGCCGGCTGTTGGCTTGGCGCTCGACCTGCACGCAGCGCTGTTGTATTCACCACAAGATACGACCCCCGCTTGGCTCACCAGCACATCCGACGGAAACCGGATAGCAACTCGTTATGGCAGCGAAATGGTTGATGCTATAATACTATTAACTTTGATTCTTCCTGGCTCTGTTATCATTCAACAAGGAGATGAGCTAGGAGTTGCAGACACCATACTTGATTGGACAAATACTACCAATTGTTGGCCAATGAATTACATACCTTCGGCTGCTCCTTTTCCATGGGATAATAGCCCTAAAGCTAATTTTACTACGGGAGAACCCTGGATGCCGCTGCCTCCAAATTACAGGTATAAGAACGCAAAATCTGAATATGGTAATGAATTGAGCCATGTTAGTGTAATGAAGATAGCATCTGCAATGAGAAAATCTGCAGCTATTGGGCCACACGTTGAGATAAAAGTATTAAAAGATGCGCTAGCGATATTGCGGTGGGGTGGGGGCGGATCACTTTTAGTCGTATCTAATCTTGGTACCGGGTCGACTGAGGCTCAGCTTTCCGAGATTCCTGGACTACCAGCTGAAATGACGGTAGCATCAAGTTCTGGAGGATCTAGTTTATCTCTCGGCAACCATATCACAGTAGGAAAGACGTTAAAATTGTCGCATGGGGAAACCGTGCTCCTGGTTGGACCTCCGAGACACTGTGGGGGACCGGGACCAGTGGATAAAATTACGAGCAAACTCTCTGAGGGATGGCAGAAAATCAACAAGTACTTCAACCTCTGA

Protein sequence:

>DPOGS201894-PA
MLVVALVFFSLGASLADPWWSKAIYYRLLVDSFKDLDGNGLGDLEGATKQLSYVRAIGADAVILSPISEKSLDCNSPGTIDFVNIETRYGTIDNFNALLAKANKLELKVLITLQLQTISSNSILFNSSAERKTGFEDGIVWISGAAEEAPASRAFRNWTWHEYRGAYYATVNKEAILNICSESVVAMLSEAMCEWLKRGAAGVLLNPDFLMNYECGQKLVKRIAKEAVACSGNQHYDLPVILVESSLEAEVATKYYAEGGVGANSVISYAFSTKAKRPAVGLALDLHAALLYSPQDTTPAWLTSTSDGNRIATRYGSEMVDAIILLTLILPGSVIIQQGDELGVADTILDWTNTTNCWPMNYIPSAAPFPWDNSPKANFTTGEPWMPLPPNYRYKNAKSEYGNELSHVSVMKIASAMRKSAAIGPHVEIKVLKDALAILRWGGGGSLLVVSNLGTGSTEAQLSEIPGLPAEMTVASSSGGSSLSLGNHITVGKTLKLSHGETVLLVGPPRHCGGPGPVDKITSKLSEGWQKINKYFNL-