Monarch geneset OGS2.0

DPOGS212545
TranscriptDPOGS212545-TA1431 bp
ProteinDPOGS212545-PA476 aa
Genomic positionDPSCF300315 + 190557-198133
RNAseq coverage1866x (Rank: top 7%)
Annotation
HeliconiusHMEL0145385e-14454.11% 
BombyxBGIBMGA008133-TA4e-11444.33% 
DrosophilaCG7166-PB8e-3225.77% 
EBI UniRef50UniRef50_E0VTY52e-3327.67%Limbic system-associated membrane protein, putative n=6 Tax=Neoptera RepID=E0VTY5_PEDHC
NCBI RefSeqXP_311526.45e-3730.10%AGAP010422-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582899019e-3630.10%AGAP010422-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582899012e-3629.68%AGAP010422-PA [Anopheles gambiae str. PEST]
Group
KEGG pathwaydre:304478e-21 
 K06491 (NCAM)maps-> Cell adhesion molecules (CAMs)
    Prion diseases
InterPro domain[158-256] IPR0137832.3e-17Immunoglobulin-like fold
[266-344] IPR0130983.3e-13Immunoglobulin I-set
[176-242] IPR0035982.5e-10Immunoglobulin subtype 2
[170-253] IPR0035996.4e-10Immunoglobulin subtype
[60-147] IPR0131063.1e-08Immunoglobulin V-set
Orthology groupMCL22176 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212545-TA
ATGTTTGGTTTACAATCCAAATCCATGGATGAAGCGAGACACTCTATGAAGAGACGTGAAACAGATGATGTGAACTATGACGATGTTTTAGCTGATGAGGCCACTGATAATGAAGAACAAAATGATGCTGAAGGCAACGATGTCGACGAAGACGAGCCGGTCGACGCCATAATAGAGACGCCTCCCAAAAATTACAGCGCAATAATCGGACAGGACGTCAGACTGGAGTGTAAAGTATCTCCTTCGTTTGGTGTCGTGGTCCAATGGAGCAAAGACAATAACAAATACTTCCTGGGGACTCTAAAGGCTGTTGAGCAAGACCTCAACACGTACGGAGTGGACTCAGAGAGGTTTGCCATCGCCGCTAATTCCACGGACCTGTTGATCCGCGGCGTCCGACCGGAAGACGCCGGCTCGTTCGCCTGCACCCTGATGCAGTTCAAACCGGAAGTCATCAAACACCAGCTCTCCGTACTGGAATACCCGAAGATTGTCAGTTTCACGGCGACCAACGGCGGCTCAGTGATGGAGGGCTCGTCCGTGTCCCTGACGTGCGAGGCGGTCGGCTCCCCGCCGCCGCAGGTCGTGTGGTCGCGGGACGTGGGCGGCGTGAACGAACGTCTGCAAGAAAAGGACGGAGAATTCATCGGATACTCAGTTTATATTAAAAATATCAAACGCGAGCAATCTGGAAAGTATTACTGCTATGTGTTTAACGGTATCGGAGCGAACCAGGCTGAGGTCACGGTTAATGTCAGAGGTAAACCCCGCGTTCATGTCCACAACACTATTGTTAATTCGGCAATCAATGTGGAAGCTGTCCTGCAATGCACCGTCCACGATGAACCTGGTGCTCACATCCGTTGGTACAAAGACGGTCAACTGATCGAGAGCATCTCCCGCCAGTACGATATAAGCACACGCGGGTCACACTCTAATCTCACAGTGCTGCCAACGTCCGACAGCGACTTCGGCACCTTCACATGCGAGGCGGAGAACGAGTTCGGTTCACACAACCGCTCCATATCCCTCGTCCAATCACCCGTGATACACTCACTGGAGGCTGACGGCTCGAGGATAGGGGTCACCATAACCAGCAGACGACCACTGCACACCGTCGAGCTGCAAGTACGGGAGCTGGGTGGGGACGGTGAATGGCGCACCTTCAACATCCCGGTGCCGACGTCGTCTTCGCACGAGTACTTTATAGCCTACACGTTGGACGAGCTGGAGGCCGGCAAGTACGAGGCGGTGGTGAGGGCCGAGAACGACCACAGCTGGAGCGAGCACTCCGACTCCACCCTGCTTGATATAGAGGCGAAGCCGGAGTACATTCAGCATGCTTCAGTGTACAGAGGCAATTCAGCGCATTCCGTGAAATCCGTGGCCCTCACGACTACCCTCATGTATCTTCTGGTACGGACGCTGTAA

Protein sequence:

>DPOGS212545-PA
MFGLQSKSMDEARHSMKRRETDDVNYDDVLADEATDNEEQNDAEGNDVDEDEPVDAIIETPPKNYSAIIGQDVRLECKVSPSFGVVVQWSKDNNKYFLGTLKAVEQDLNTYGVDSERFAIAANSTDLLIRGVRPEDAGSFACTLMQFKPEVIKHQLSVLEYPKIVSFTATNGGSVMEGSSVSLTCEAVGSPPPQVVWSRDVGGVNERLQEKDGEFIGYSVYIKNIKREQSGKYYCYVFNGIGANQAEVTVNVRGKPRVHVHNTIVNSAINVEAVLQCTVHDEPGAHIRWYKDGQLIESISRQYDISTRGSHSNLTVLPTSDSDFGTFTCEAENEFGSHNRSISLVQSPVIHSLEADGSRIGVTITSRRPLHTVELQVRELGGDGEWRTFNIPVPTSSSHEYFIAYTLDELEAGKYEAVVRAENDHSWSEHSDSTLLDIEAKPEYIQHASVYRGNSAHSVKSVALTTTLMYLLVRTL-