Monarch geneset OGS2.0

DPOGS214061
Transcript         DPOGS214061-TA   1878 bp
Protein            DPOGS214061-PA   625 aa
Genomic position   DPSCF300171 - 33599-36652
RNAseq coverage    277x (Rank: top 39%)
Annotation
Heliconius         HMEL008272         0.0      72.56%
Bombyx             BGIBMGA010398-TA   3e-37    69.31%
Drosophila         Mal-A6-PC          1e-17    35.43%
EBI UniRef50       UniRef50_Q5TND7    2e-108   36.28%   AGAP009127-PA n=4 Tax=Culicidae RepID=Q5TND7_ANOGA
NCBI RefSeq        XP_553055.2        3e-109   36.28%   AGAP009127-PA [Anopheles gambiae str. PEST]
NCBI nr blastp     gi|158299870       6e-108   36.28%   AGAP009127-PA [Anopheles gambiae str. PEST]
NCBI nr blastx     gi|158299870       4e-109   36.60%   AGAP009127-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology      GO:0043169   2.4e-39   cation binding
                   GO:0005975   2.4e-39   carbohydrate metabolic process
                   GO:0003824   2.4e-39   catalytic activity
KEGG pathway       sha:SH0072   2e-19
                   K01182 (E3.2.1.10)   maps-> Starch and sucrose metabolism
InterPro domain    [28-624]    IPR015902   9.9e-64   Alpha amylase
                   [180-532]   IPR013781   2.4e-39   Glycoside hydrolase, subgroup, catalytic core
                   [179-533]   IPR017853   2.2e-36   Glycoside hydrolase, superfamily
                   [206-278]   IPR006047   1.6e-18   Glycosyl hydrolase, family 13, catalytic domain
                   [191-485]   IPR006589   2.7e-08   Glycosyl hydrolase, family 13, subfamily, catalytic domain
Orthology group    MCL16308    Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214061-TA
ATGGATAATTTAGCTGAAAATAGTACAAAAAAGGGGGGTACTGATCTTGAAACTCTAGATTATCTCCGCGGTGTGAAATCTTCAACGTGTCTACTTCTTCCCATGACTCCGAGCCCTACCCAGTTGGATTTTAAACATCCACTCTCCGAAGAAATGACTGAAGGTGCTTTTTTGACACTGAATGATGATCCAAAAATAGTTGATCTTCAATGCTCTCCTGATCCTTGTAGTGGAGATTCAAGTTCATCAGCAGATTCTAACTCGGTAGTCCAAGATCCTGTCAGTGCTCAACTTATCAATAACATAAGCATGTTGGACTATCAGACTTTGAGTAAAAATGGCGACATTATTGGACAGCCTGAAATCTGTAAACTAAACGGAAGCTTAAATGTGAGCAATAGGAAACTACCACATTTTGTTAACTGGAATTGGTGCATTATAAGGAAGGTTCTACTATGGTTTGTTGTTTCCGGACTTGTTGCATGCACTGGTACTATTATAGCTATGGTTATCAATATACCAAAAGAATGTAACCCAGATCTACCCTGGTATCAAGGTAAGGTATTTTATGAAATATTCCCTGCCAGTTTCAAGGACTCAAACAATGATGGCATGGGTGACTTGAAGGGACTTATCAAGAAGTTGGATTACATAAAAGATTTAGGCGGCTCATCTATCCGTTTGAATTATATATTTGAGGCGCAAAATTATCCTGAAAATTATTATAACACTACATCCCTCCTACAAATTGACCGCAGTTTAGGAGTTCTGAAGGACTTTCAAGAGCTGGTGACCGAGGCCCATAAAAGAAACATGGGAGTTATCTTGGATATACCAGTTTTGAGCATGGCTGAAACTCTTAATAAGTATGATGAAAATGATACCTTTGTATTTTCAATAGACCCTCAAGAAAGTAATTTTGACGCAACGTCTGCAGCGATTGCATATTGGTCTCGTGCACAAAATGTCGACGGATTTTATTTGAAGAATCTGGAGAAATTTGTTGATGATGTTAATTTCGGAAAATCGCTTCAGGTTTGGAAACAAATATTGGGTTACGGGAAAATATTTATAGCCAGTGAAGAAGCGTTAAATATGGCAAAAGATACAAGTCTAACAGTGCTTTTGAGTAGGATTGACCTTATTGATGTTCATTTGGATTTACAAAAAGGTATTGATGGTCTTAAGAAACATATTGAAGGCTTAGTACCCGGTATCCTATGGGACAAGCCTCATTATCCTTGGATTCAATGGAACATTGGAAATGTTAATAGTGAAAGGATATCTAGTAAACACCAAAATAACACATTAGTTTTAACTGCACTTGAGTTGGTTCTCCCGGGCACTGTCAGTATTTTTTACGGTGATGAAGTAAGTCTTGGAGGTCTTTCAGAAAATGAAATGGAAGGAGATTTTCATGAACATGAGCACATTCACAACTTAATACCGATGTCTTTCAATGGCGAAGACAAAGTTGATAATAACAGTCCCGCGTCTATCTTGCCTTGGAATTCTAAATCCGTATTAGAACCGCAGTATCAAAACTTGAACGTTGTGAGATCTTTGATACGTTTAAGATCAACCACACCAACCATATACTTAAAATCAATCTACAAAGAGGGTAGGATACAAAGAAGTATGGAAATACGTGAAACTGAAGGTAACCTCATTGTTATTGAGCGTTGGTTTCCACGCAGAAATACATGTGTATTCGTAGGCAATCTGGGTAACAAGCCGATTACTACTGATTTGTCATCCATGTTCTACGGTGGAATTGTAATAGGAAGCACAAATATGTCCTTAGTGGGTGAAGCTTTGTATTTGGAAAAAGTCACGTTTGAGCCCTTTTCAGCTATTATATTAAAATTGGAGAAATAG

Protein sequence:

>DPOGS214061-PA
MDNLAENSTKKGGTDLETLDYLRGVKSSTCLLLPMTPSPTQLDFKHPLSEEMTEGAFLTLNDDPKIVDLQCSPDPCSGDSSSSADSNSVVQDPVSAQLINNISMLDYQTLSKNGDIIGQPEICKLNGSLNVSNRKLPHFVNWNWCIIRKVLLWFVVSGLVACTGTIIAMVINIPKECNPDLPWYQGKVFYEIFPASFKDSNNDGMGDLKGLIKKLDYIKDLGGSSIRLNYIFEAQNYPENYYNTTSLLQIDRSLGVLKDFQELVTEAHKRNMGVILDIPVLSMAETLNKYDENDTFVFSIDPQESNFDATSAAIAYWSRAQNVDGFYLKNLEKFVDDVNFGKSLQVWKQILGYGKIFIASEEALNMAKDTSLTVLLSRIDLIDVHLDLQKGIDGLKKHIEGLVPGILWDKPHYPWIQWNIGNVNSERISSKHQNNTLVLTALELVLPGTVSIFYGDEVSLGGLSENEMEGDFHEHEHIHNLIPMSFNGEDKVDNNSPASILPWNSKSVLEPQYQNLNVVRSLIRLRSTTPTIYLKSIYKEGRIQRSMEIRETEGNLIVIERWFPRRNTCVFVGNLGNKPITTDLSSMFYGGIVIGSTNMSLVGEALYLEKVTFEPFSAIILKLEK-
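
Consistency check (sketch):

The transcript and protein lengths listed above can be cross-checked against the two sequences. The following is a minimal Python sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) and that the transcript is a complete coding sequence ending in a stop codon. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced here for illustration. Only the first 27 and last 15 nucleotides of DPOGS214061-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Using this table for a monarch nuclear gene is an assumption of this sketch.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS of the given length, excluding the stop codon."""
    return cds_length // 3 - 1


# First 27 and last 15 nucleotides of DPOGS214061-TA, copied from the record.
cds_start = "ATGGATAATTTAGCTGAAAATAGTACA"
cds_end = "AAATTGGAGAAATAG"

print(translate(cds_start))           # start of the translated protein
print(translate(cds_end))             # end of the protein, including the stop
print(expected_protein_length(1878))  # residues expected from an 1878 bp CDS
```

Under these assumptions, an 1878 bp coding sequence corresponds to 626 codons, that is, 625 residues plus a stop codon, which agrees with the 625 aa listed for DPOGS214061-PA. The trailing `-` in the protein sequence appears to mark the stop codon that this sketch writes as `*`.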