Monarch geneset OGS2.0

DPOGS214723
TranscriptDPOGS214723-TA2532 bp
ProteinDPOGS214723-PA843 aa
Genomic positionDPSCF300022 - 82904-95264
RNAseq coverage1171x (Rank: top 11%)
Annotation
HeliconiusHMEL0085915e-11449.23% 
BombyxBGIBMGA005068-TA4e-16252.15% 
DrosophilaCD98hc-PA5e-4727.81% 
EBI UniRef50UniRef50_UPI00022475262e-6732.42%UPI0002247526 related cluster n=2 Tax=unknown RepID=UPI0002247526
NCBI RefSeqXP_973672.16e-6732.80%PREDICTED: similar to CD98hc amino acid transporter protein [Tribolium castaneum]
NCBI nr blastpgi|3454907838e-6732.42%PREDICTED: maltase 1-like isoform 2 [Nasonia vitripennis]
NCBI nr blastxgi|3454907831e-7231.85%PREDICTED: maltase 1-like isoform 2 [Nasonia vitripennis]
Group
Gene OntologyGO:00431694.5e-23cation binding
GO:00059754.5e-23carbohydrate metabolic process
GO:00038244.5e-23catalytic activity
KEGG pathway 
InterPro domain[289-579] IPR0159022e-42Alpha amylase
[298-577] IPR0178534.2e-35Glycoside hydrolase, superfamily
[444-577] IPR0060474.5e-23Glycosyl hydrolase, family 13, catalytic domain
[612-645] IPR0137813.1e-08Glycoside hydrolase, subgroup, catalytic core
[75-143] IPR0065784.8e-06MADF domain
Orthology groupMCL15614 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214723-TA
ATGTCGGAGACGCGTAAAAACCATCTCACTATAGATGGTGCACAATCCAAAGAAGAGGACCACGTCGCCTCTTATAAACCTATCCCGGAATCCGATACAGTACATGAAATACAATATATTGAGGCTGACACTGAATACATGCAGTTGAATCAGGCCGATATTATGGAGACTTCAGATGACCCCGACGATCACTGGGATCACGAGAGTACGGCTGGCATGCTGGCTCTGTACTTGCAAAATATAGAAAAGTTTAGAAATCCAAAAATTAGGAAAAAAAATGTTTGGGTGGATATTGCAAATGCAATCGGCAAGGGTCCGGACAGCTGCGACAAAAAGTTCAGAAATCTCAAACAGACATACATAAGGTTACTAAGAAAGAAGAATCGTAGTGGGGTGACAGCATTCAAATGGCCATATTTTGACATATTCGAAGAAATTTATAGCATTGATGGAGAATATCAACCAGAAATCAAACAGAAGATAACAGACGGCAACAGTGACAGAATTACTAAAGTTTTGATGGGAATGGATTCTTCTTCTTCCAAACACGAAGAAACTTTTGAAAATGGTGAATCTTCTAATGGACAAAATGATGAAGTGAAAAGAAGATTATTGAGAAAGAGGAATGCAGAGTTTAGAAAAGTTACACTAGAAATGAGAGATAGGCAGAGAGTTGTAGAAGAGAAATTAGATAGATTAATAAATATAGTCGAAGAGTCCAATAACATTCAGAGAGAAAGGAATAGCGGTAATGGGTTTGATTATGGCTATTCGCTCCTCAGTGGACCGCCGCGCCGCGCCGCTACTGCCGCGAAATTCCGTACATCAAAGTCGAGTCTGCACAAGTCCAGGGACAAGGTCTCCGCAGACGAGGCCGAAGAGAGACTACTACAGAAGGAGGAGGAGGCGAAGATAACGACCAGGGTCGACATGGCGGACGCCAAGTTTGTGGTCGAGGATCACAGGAACGGGGACGCCAAGATTGAGCTGGACGCGAACAAGAGGTTCACGGGCCTGACCAAGGAGGAGCTGATGAAGTACGCGGACGACCCGTTCTGGGTCCGCCTCCGCTGGTTCATGTTCGTGTTGTTCTGGTCTCTGTGGCTGTGTATGCTGGCCGGGGCCATAGCCATCATTGTGAGAGCTCCCAGGTGTGTCGCGCCGGAGCCTAAGACCAGGTACGAGACAGGTCCTCTAGTTGATCTGGACCTCGCTGACTACACCACGGCGGAGTCTCATCTAGACACTCTCCAGCAATACCAGGTGTCTGGGCTGTTCGCCTCCGCCTGTCAGTCTACCTACGTGGTGCTCGAAGACAGCTCCTGTCTAGACAAGTTCAAACAGTTTGCTGATAAAGCCAAGAACTATGGAATCAAGGTCATAGTAGACCTGACAGCCAACTTCGTGTCCACCAGTCATCCTTGGTTCCAGCAGAGTGAGAACCGTTCAGAGCAGTTCTCGGAGTACTTCATCTGGGTGAAGAGTGATGAACATGATCCCGAACTCAACACCACCATACCCAAACCACCTAATGATTGGGTGTCCACAGTGAACACTGGTGCATGGTCTTGGAGTGAGAGAAGGAAGGAGTTCTATCTTCACCAGTATGGCGAGGGACTCGCTGACCTCAACTTCCACAACCCTAATGTAGTCAAACAGTTCGATGAGGTCATCAGACTGTGGATGAAGGCCGGAGCCGGTGGCATCAGGTTGCACAACGTCCGTCAGCTGTTAGTGAGCAGTCCTCCTCTGTCTGAGCTGCCTCACACGGGCGCCGGGAGCACGCCGGGGGCGGACCACTCGCAGTACCCCTTCTGGAGACACTCTCGGACCTCGGACCAGCCACAGCTGGATTCGCTGCTGGCTCACTGGTCATATATCGTGGAGCAAGCTTCCTCTGAGCCGACGGTGTTCACGTTAGCGGAACCTTCCCGGCCGGAGCTGTTCATGCTGCAAAGGAACACGAGTTGTCTCCGGCCCGCCAGCGGAGCACCCGTCGACCTGGCGCGGCCCGGGGCGGCCAAGCTCCTCGCTGAGCGACTGTCACGCTGGCCCGCCATACAGTTGACTGATGATAAGCCGGACGAGGAGACGGCCGTGTTTTCCATGCTGCTGCCGGCCGCACCTGTCATGGTCTTGGAACAACTGGCTGGGGATGACAATGATACTACCCCCAGCGAGAGTTTGAAGCACGCGATATCACTGCGTACCGACGCCAGTGTGCAGCACGGAGCGTTGGTTGTGACTGACGCACCCGTTCACAACTCCAGCGACATGATGCTGGCCGTCGCCAGATGGAAGGCGGACCACTCCGGCTACGTGTCGGTGTATAACCCCGGCGCCTCTGGTCTCGTGTCTCTGTCTTCAGTCCGCTCTCTGCCGTCTTCCCTCGCGGTGCATCACGTGTCGAGGAACACCAAGCTCGCCTCCAATTACACCAGTAACCAGGCCGTGGAGACGGCGAGCGTGTTCGTCCCGGGCAAGTCGGCGGTGATCTTCTCGTACGTGCCGAAAGATGGCGCTGAAAACTGA

Protein sequence:

>DPOGS214723-PA
MSETRKNHLTIDGAQSKEEDHVASYKPIPESDTVHEIQYIEADTEYMQLNQADIMETSDDPDDHWDHESTAGMLALYLQNIEKFRNPKIRKKNVWVDIANAIGKGPDSCDKKFRNLKQTYIRLLRKKNRSGVTAFKWPYFDIFEEIYSIDGEYQPEIKQKITDGNSDRITKVLMGMDSSSSKHEETFENGESSNGQNDEVKRRLLRKRNAEFRKVTLEMRDRQRVVEEKLDRLINIVEESNNIQRERNSGNGFDYGYSLLSGPPRRAATAAKFRTSKSSLHKSRDKVSADEAEERLLQKEEEAKITTRVDMADAKFVVEDHRNGDAKIELDANKRFTGLTKEELMKYADDPFWVRLRWFMFVLFWSLWLCMLAGAIAIIVRAPRCVAPEPKTRYETGPLVDLDLADYTTAESHLDTLQQYQVSGLFASACQSTYVVLEDSSCLDKFKQFADKAKNYGIKVIVDLTANFVSTSHPWFQQSENRSEQFSEYFIWVKSDEHDPELNTTIPKPPNDWVSTVNTGAWSWSERRKEFYLHQYGEGLADLNFHNPNVVKQFDEVIRLWMKAGAGGIRLHNVRQLLVSSPPLSELPHTGAGSTPGADHSQYPFWRHSRTSDQPQLDSLLAHWSYIVEQASSEPTVFTLAEPSRPELFMLQRNTSCLRPASGAPVDLARPGAAKLLAERLSRWPAIQLTDDKPDEETAVFSMLLPAAPVMVLEQLAGDDNDTTPSESLKHAISLRTDASVQHGALVVTDAPVHNSSDMMLAVARWKADHSGYVSVYNPGASGLVSLSSVRSLPSSLAVHHVSRNTKLASNYTSNQAVETASVFVPGKSAVIFSYVPKDGAEN-