Monarch geneset OGS2.0

DPOGS202082
Transcript        DPOGS202082-TA  987 bp
Protein           DPOGS202082-PA  328 aa
Genomic position  DPSCF300116  2969-6878
RNAseq coverage   1178x (Rank: top 11%)
Annotation
Heliconius      HMEL003149              1e-151  81.29%
Bombyx          BGIBMGA010839-TA        7e-163  85.22%
Drosophila      CG5510-PA               2e-110  62.88%
EBI UniRef50    UniRef50_UPI0001791AA5  2e-113  58.72%  UPI0001791AA5 related cluster n=1 Tax=unknown RepID=UPI0001791AA5
NCBI RefSeq     XP_001602865.1          2e-138  68.71%  PREDICTED: similar to vesicular mannose-binding lectin [Nasonia vitripennis]
NCBI nr blastp  gi|315452155            1e-164  85.53%  vesicular mannose-binding lectin-like protein [Antheraea pernyi]
NCBI nr blastx  gi|315452155            4e-163  85.53%  vesicular mannose-binding lectin-like protein [Antheraea pernyi]
Group
Gene Ontology    GO:0016020  3.6e-181  membrane
KEGG pathway     nvi:100119012  5e-138
                 K10082 (LMAN2, VIP36) maps-> Protein processing in endoplasmic reticulum
InterPro domain  [1-328]   IPR005052  3.6e-181  Legume-like lectin
                 [31-262]  IPR013320  1.6e-97   Concanavalin A-like lectin/glucanase, subgroup
                 [28-258]  IPR008985  2.3e-74   Concanavalin A-like lectin/glucanase
Orthology group  MCL11578  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
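
The lengths and coordinates listed in this record can be cross-checked with simple arithmetic. The following Python sketch uses only numbers copied from the record. The function names are illustrative and not part of any MonarchBase tooling, and it assumes that all coordinates are 1-based and inclusive, which the record does not state.

```python
def codon_count(transcript_bp: int) -> int:
    """Number of complete codons in a coding transcript of the given length."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record.
transcript_bp = 987
protein_aa = 328

codons = codon_count(transcript_bp)

# One codon more than the residue count is what a terminal stop codon would give.
has_terminal_stop = codons == protein_aa + 1

# Scaffold interval on DPSCF300116 (assumed 1-based inclusive).
genomic_span = span_length(2969, 6878)

# InterPro domain intervals on the protein (assumed 1-based inclusive).
domains = {
    "IPR005052": (1, 328),
    "IPR013320": (31, 262),
    "IPR008985": (28, 258),
}
domain_lengths = {name: span_length(*iv) for name, iv in domains.items()}

print(codons, has_terminal_stop, genomic_span, domain_lengths)
```

Under these assumptions the genomic interval is considerably longer than the 987 bp transcript, which would be consistent with introns, although the record itself does not list the exon structure.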

Nucleotide sequence:

>DPOGS202082-TA
ATGTGTTTGTGTATTGTAAGAACTATTTTTCACCAATTCCTTATCGTATTCCTTTTATTAACTCCCGTATTAGCAGAATGGAATACGAGAGATTATATACGTCGAGAGCATTCACTGACAAAACCGTACCAAGGCAGCGGTATGTCTGTTCCGTACTGGGACTTCCTAGGTAGCACAATAGTTACAACAAATTACGTCCGTTTGACGCCAGATCTCCAATCTAAGGCTGGCGCTATTTGGAACACCGTGCCGTGTATTACAAGGAATTGGGAGATTCAAGTGCAGTTTAAGGTGCACGGTCGCGGTAAAGATCTGTTTGGTGATGGTCTGGCTCTCTGGTATGTTAAAGATAGGATGCAACCGGGACCAGTTTTTGGCAGCAAGGATTACTTCCAAGGACTCGCAATCATATTAGACACGTATAGCAACCATAACGGCGCCCATAACCACCAACATCCGTATATATCGGCCATGATAAGTAATGGCACATTACACTACGACCATGATAGAGACGGTACACACACACAACTAGCGGGATGTGAAGCAAAATTCAGGAACTACAACCACGATACACACCTCTCAATTATATATAAAGATGACACACTTAAAGTGTCAATGGATTTAGAAGGTAAGAACGCTTGGAAGGAGTGCTTCACAGTTGAAAATGTACTTCTGCCGACTGGCTATTTCTTTGGTGCATCGGCTACCACCGGTGACTTGAGTGACAACCATGACATAATAGCCATTAAGATGTACGAATTGGACCTACTTGATACGCAAAAGGAAGAAGACAGATCCCATATAATACCATCAGCGGCTACATTTGAAGCTCCTCGTGATAGAGCCGAGGACCCTAAACCGGCCATGTCTGGCTTCAAAACCTTCCTTTGGATGATGTTTATTGCCATCGTCATCATAGTACTAGTTATTCTAGGTATTATGTGGTACCAGAAAAGACAAGAACATTCTAGAAAGAGACTTTACTAA

Protein sequence:

>DPOGS202082-PA
MCLCIVRTIFHQFLIVFLLLTPVLAEWNTRDYIRREHSLTKPYQGSGMSVPYWDFLGSTIVTTNYVRLTPDLQSKAGAIWNTVPCITRNWEIQVQFKVHGRGKDLFGDGLALWYVKDRMQPGPVFGSKDYFQGLAIILDTYSNHNGAHNHQHPYISAMISNGTLHYDHDRDGTHTQLAGCEAKFRNYNHDTHLSIIYKDDTLKVSMDLEGKNAWKECFTVENVLLPTGYFFGASATTGDLSDNHDIIAIKMYELDLLDTQKEEDRSHIIPSAATFEAPRDRAEDPKPAMSGFKTFLWMMFIAIVIIVLVILGIMWYQKRQEHSRKRLY-
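
The protein sequence above appears to be the conceptual translation of the transcript, with the trailing `-` standing for the stop codon. As a minimal sketch, assuming the standard genetic code applies, the Python below translates a coding sequence and checks the first and last few codons of DPOGS202082-TA against the ends of DPOGS202082-PA. The `translate` helper is hypothetical and written for illustration only.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the '-' used in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First seven and last six codons of DPOGS202082-TA, copied from the record.
start_fragment = "ATGTGTTTGTGTATTGTAAGA"
end_fragment = "AGAAAGAGACTTTACTAA"

print(translate(start_fragment))
print(translate(end_fragment))
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein entry would extend the same check to the whole record.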