Monarch geneset OGS2.0

DPOGS205026
TranscriptDPOGS205026-TA999 bp
ProteinDPOGS205026-PA332 aa
Genomic positionDPSCF300288 + 123463-134459
RNAseq coverage342x (Rank: top 34%)
Annotation
HeliconiusHMEL0051213e-11271.54% 
BombyxBGIBMGA010360-TA1e-8155.68% 
Drosophilagalectin-PA2e-2329.84% 
EBI UniRef50UniRef50_D2A2V87e-2731.10%Putative uncharacterized protein GLEAN_07619 n=2 Tax=Tribolium castaneum RepID=D2A2V8_TRICA
NCBI RefSeqXP_971732.13e-2933.21%PREDICTED: similar to galectin (AGAP008844-PA) [Tribolium castaneum]
NCBI nr blastpgi|910805155e-2833.21%PREDICTED: similar to galectin (AGAP008844-PA) [Tribolium castaneum]
NCBI nr blastxgi|910805151e-2633.21%PREDICTED: similar to galectin (AGAP008844-PA) [Tribolium castaneum]
Group
Gene OntologyGO:00055298.2e-27sugar binding
KEGG pathway 
InterPro domain[45-172] IPR0089857.6e-28Concanavalin A-like lectin/glucanase
[46-172] IPR0133203.8e-27Concanavalin A-like lectin/glucanase, subgroup
[53-180] IPR0010798.2e-27Galectin, carbohydrate recognition domain
Orthology groupMCL15145 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205026-TA
ATGCCAATTGAAGGTTTAAAATGTGTTAATTGTATTAATATGTCTGGAATAGAGGATGGCGAACTAGCTAGGGACTTTGAATATGACTTTGCTGAAGATGCTTTGGACTTTGAGAATCAGTTATCAGAAGCTCGGGATGTCAAGTTCACGCAGAATCTAACGGAACCGCTCTCTATTGGATCTCATATTATTTGTACTGGAACTCCTAGCGACGATCTGCCATGGTTCGCAGTAAACATAGGTTGTGGGGACCCGTCTCGAAGCGATATAGCGATTCATTTCAATGTACGACTGCCACAGTGTTACGTGGTGCGAAATACAAAGAGGCATGACAAATGGGGTATGGAAGAAACCACGGCATATAGGTCGTTCCCCTTTAAAATAGACCGTCCGTTTACGATTGAGGTTCTAATAGACGAAAAGGAGGCTTTATGGGCCATTGATGGCGAGCATTACTGCAGTTACGCCCACAGAAATCCGAGCCCACTGAACGCCACGTGGGTTCAAGTGACAGGAATACGAGACGCTGCCTTGAAAATACAGAAAACTGACATATACCCAACTCTATCGCCGCCTCCACTAGCAGTACCAATAAAATCTGACTCAAATGACACCATAGAAGACGAACCGAAATGGCATCCAAATGTCATAGCGACTATTGCTGACGGAATCCCGGAGGGACACCAAATCGTTATACGCGGACGACTACGTCCCATGCTACACTCGTTCACTATAGACTTGTTGGATGTAGCTCGTGAATGGCCACGTGGGAACATACTTCTTCATGTGAACGTCCGAGCTCATGTGCAATCACAGATGTCCAGGCAGCTAGTCGTATTGAACGCGTGGCTTGGTGCGTGGGGACAGGAGAGGAGACAGAGAACAGCTAAACTGATTCCTGGTACTGAGAAACCGCAATTACATCAGACAACTTCTAGCAAAAACCCACTACAACAAGATACAGATAGGGACCATCCATTGTTCACTCTTGGTGTTTAA

Protein sequence:

>DPOGS205026-PA
MPIEGLKCVNCINMSGIEDGELARDFEYDFAEDALDFENQLSEARDVKFTQNLTEPLSIGSHIICTGTPSDDLPWFAVNIGCGDPSRSDIAIHFNVRLPQCYVVRNTKRHDKWGMEETTAYRSFPFKIDRPFTIEVLIDEKEALWAIDGEHYCSYAHRNPSPLNATWVQVTGIRDAALKIQKTDIYPTLSPPPLAVPIKSDSNDTIEDEPKWHPNVIATIADGIPEGHQIVIRGRLRPMLHSFTIDLLDVAREWPRGNILLHVNVRAHVQSQMSRQLVVLNAWLGAWGQERRQRTAKLIPGTEKPQLHQTTSSKNPLQQDTDRDHPLFTLGV-