Monarch geneset OGS2.0

DPOGS201735
TranscriptDPOGS201735-TA1521 bp
ProteinDPOGS201735-PA506 aa
Genomic positionDPSCF300269 + 222630-230564
RNAseq coverage1724x (Rank: top 7%)
Annotation
HeliconiusHMEL0158880.078.71% 
BombyxBGIBMGA001353-TA1e-8765.62% 
Drosophilaergic53-PB1e-16956.91% 
EBI UniRef50UniRef50_Q9V3A82e-16756.91%Ergic53, isoform A n=29 Tax=Arthropoda RepID=Q9V3A8_DROME
NCBI RefSeqXP_971530.13e-17557.99%PREDICTED: similar to AGAP005404-PA [Tribolium castaneum]
NCBI nr blastpgi|910859656e-17457.99%PREDICTED: similar to AGAP005404-PA [Tribolium castaneum]
NCBI nr blastxgi|910859656e-17057.65%PREDICTED: similar to AGAP005404-PA [Tribolium castaneum]
Group
Gene OntologyGO:00160201.8e-211membrane
KEGG pathwaytca:6601839e-175 
 K10080 (LMAN1, ERGIC53)maps-> Protein processing in endoplasmic reticulum
InterPro domain[1-506] IPR0050521.8e-211Legume-like lectin
[26-256] IPR0133206.5e-95Concanavalin A-like lectin/glucanase, subgroup
[24-255] IPR0089851.3e-74Concanavalin A-like lectin/glucanase
Orthology groupMCL12617 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201735-TA
ATGTCGTCGTATAGTTTAAATTTATTATTGTTGACGGTGTTTGGTATATTTGTGACTTCGAACACCCAGACGATACATAAACGATTTGAATACAAATATTCGTTCAAACCGCCTTATTTAGCACAAAAAGATGGTTCTGTGCCGTTTTGGGAGTACGGAGGCAATGCGATCGCGTCTGGTGAGAGCGTCCGGTTGGCGCCGTCTCTGAGGAGTCAGAAGGGCGCTATATGGTCCAAGCATCCGATCAACTTCGACTGGTGGGAAGTGGACATCATGTTCAAGGTCACCGGCAGGGGAAGGATAGGAGCCGACGGGCTGGCCTTCTGGTACGTGACGAAGCGCGGGGAGTACACCGGCGAGGTGTTCGGCTCCTCGGACCGCTGGAACGGTCTGGGGATCATCTTCGACTCCTTCGATAACGACAACAAACACAACAACCCTTACATCATGGCGGTCCTGAACGACGGCACTAAGAGCTTCGACCACAAGAGCGATGGTTCCAGCCAGCTGTTATCCGGCTGCTTGCGGGACTTCAGGAACAAGCCCTTCCCGACCCGCGCCCGGATAGAATACTACGCGAACACACTCACAGTCTACTTCCATAACGGTCTAACCAACAACGAGGCGGACTATGACTTGTGTTTCCGAGCTGAGAATGTCCAGTTGCCTCGTGGAGGGTTCCTGGGCGTCTCCGCAGCCACCGGCGGGCTGGCGGACGACCATGACGTCATACACCTGCTGACGTCATCGCTGCACTCCACACAGCAGGGAGGTGAGGGACAGCAAATAAACAGTGCAGAGCAAGCCAAGCTGTCCCAGGAGTACCAGGAGTACCAGAAGAAGCTGGAGCAGCAGAAAGAGGATTACAGGAAGGAACATCCCGATGAGGTCCGAGACAAGGACGGTGAGTTTGATGACTGGTTTGAGTCTGATGGACAACGGGAGCTCAGACAGATCTTCGCCAGCGTTGGGCATGTGCAGGACGGGGTCAGGGAACTCAGCAAGAAGATGGACGAGGTTATTGGCAAACAGACAAACTCGCTGTCCATGTTGTCAGCCGTGTACAGTCAGACACAGACGATGCAGGTCCAGCAGCCTGGACAAGCTGGCCAACCACCTGTACAGCAGATGCCGATGCTGCCCATCACCAGACACGACTGGGACCAGCTGATGGCCAACAACCAGCTCGTCATCAACACCATCGCCGAGCTCAAGGGTTTCATAATAGACGTGTCGCGTAAGACGGACAGCGTGGTGGGCGGCGTGGCGGGCGGCGCGGCGGGCGGCGCTCTAAACCAACAGGTGGTGAACGAACTGAGGGAGGGGATCAATCATGTTAAAAACAATGTGGCGGGAGTCGCACAGAGGCTGTCGTCCGCTCCGCCGCAGCCCGCGTGTCCGTCTGTGTCGTGTGTGTCCACCACCATGCTGCTGACGGTAGTGGCGTCGCAGCTGGCCGTCATGTTCCTGTATTCGTTGTACAAAGAAAGGAAAGAGGCGCAGGCTAAGAAATTCTTCTGA

Protein sequence:

>DPOGS201735-PA
MSSYSLNLLLLTVFGIFVTSNTQTIHKRFEYKYSFKPPYLAQKDGSVPFWEYGGNAIASGESVRLAPSLRSQKGAIWSKHPINFDWWEVDIMFKVTGRGRIGADGLAFWYVTKRGEYTGEVFGSSDRWNGLGIIFDSFDNDNKHNNPYIMAVLNDGTKSFDHKSDGSSQLLSGCLRDFRNKPFPTRARIEYYANTLTVYFHNGLTNNEADYDLCFRAENVQLPRGGFLGVSAATGGLADDHDVIHLLTSSLHSTQQGGEGQQINSAEQAKLSQEYQEYQKKLEQQKEDYRKEHPDEVRDKDGEFDDWFESDGQRELRQIFASVGHVQDGVRELSKKMDEVIGKQTNSLSMLSAVYSQTQTMQVQQPGQAGQPPVQQMPMLPITRHDWDQLMANNQLVINTIAELKGFIIDVSRKTDSVVGGVAGGAAGGALNQQVVNELREGINHVKNNVAGVAQRLSSAPPQPACPSVSCVSTTMLLTVVASQLAVMFLYSLYKERKEAQAKKFF-