Monarch geneset OGS2.0

DPOGS202145
Transcript        DPOGS202145-TA    1434 bp
Protein           DPOGS202145-PA    477 aa
Genomic position  DPSCF300193 + 358579-385542
RNAseq coverage   35x (Rank: top 74%)
Annotation
Heliconius      HMEL012261        5e-65   40.06%
Bombyx          BGIBMGA006372-TA  4e-73   43.92%
Drosophila      CG14521-PA        9e-92   50.77%
EBI UniRef50    UniRef50_E0VYZ0   3e-106  55.56%  Lachesin, putative n=3 Tax=Neoptera RepID=E0VYZ0_PEDHC
NCBI RefSeq     XP_002431334.1    6e-107  55.56%  lachesin precursor, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242021806      1e-105  55.56%  lachesin precursor, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|270002629      4e-109  59.19%  hypothetical protein TcasGA2_TC004955 [Tribolium castaneum]
Group
KEGG pathway     xtr:100127726  4e-30
                 K06775 (NEGR1) maps-> Cell adhesion molecules (CAMs)
InterPro domain  [272-345]  IPR013783  3.3e-20  Immunoglobulin-like fold
                 [252-345]  IPR013098  2.4e-11  Immunoglobulin I-set
                 [166-237]  IPR003598  4.5e-10  Immunoglobulin subtype 2
                 [56-151]   IPR003599  2.2e-07  Immunoglobulin subtype
Orthology group  MCL34433  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202145-TA
ATGCTTCTAACGGTTTACGGCGTGATACTTCTCTGGAGAATGGACAGCATGGTTGAGAGCACTAAAGTTGTCAAGAGCCACATGCGATATCAAACAACATCGCCACTAAGTAGGGTTGGTCCAACAGTATCTCTGCGCGATGTTGAACCAAATTTTGTTGGTCCCATCGACAATGTGACTGTTGCTTTGGGGAGAGAAGCTGTTCTCACTTGCTCGGTATCGGACATAGGAGACTACAAGGTGGCATGGATCAGAGCTGATGACCAAACAATCTTGACATTGCACACCCGTTTAGTAACTCACAGTTCAAGATACGCTGTCACTAATGATTCTCCTGGCTCCTGGCAACTCCACATCAGACCACTTAAGGTTGAGGACCGAGGCTGTTACATGTGCCAAATAAATACAAGTACTATGAAGAAACAAATAGGTTGTGTTGATGTATTAGTTCCACCAAATATAGTTGATGAAGGTACGAGTGGAGACATGGTAGCTAGAGAAGGTACAGATGTAAGCATATCTTGCAAAGCCGATGGAAGACCATTGCCTCGTATCCTGTGGAGAAGAGAAGACGGAGCGAATATACAATTGAGGAATGACGCCGGAAAGCTTCATAAAGTTGATATGTACACCGGATCTTCACTTAACCTTACAAAGGTGGAGAGGAGGCAAATGGGTGCGTACTTATGCATAGCATCCAATGACGTGCCGCCTTCAGTTAGCAAGAGGATTATGCTTAGCGTCAATTTTGGCCCATCCATATTGATAGCTACAAAAGTGATTGGAGTGCCAACGGGCTCTCAAACTGAGCTCCAGTGTCTTGTTGAAGCCTATCCTCCCGCTATCAATTACTGGCTGAAAAGTGGAGAAGAAATGATTCTTTCCGGTGAAAAACATGACATTCGCGAGGTTCGGCTCTCGGCATATGAAATACGTACAATATTAACTATATCAGACTTCAGTAGCAATGACATTGGTACTTACACATGTGTTGCAACAAATACAATAGGAAAGGCCGAGGGCACATTAAGACTATACGAAATTAAGATAACTACAACAACGACCACAACAACAACAACTACAACTACTACTACGACCACCACCACCACACCAATCCCGTCAACAACAGAGTTGTTGCCAGAAACACCTCCACCTATCGTAGTGCCATTACAGCCAGCTGAACAAAAATTTTATACGCCAGATGTTACGACATATGATACCATGCAAAATGTAATAGAACAATCTGTCCTCGACAACAACTGGTTACCGACTGCGGAGTCTTCGAGCCACCGCCATTATGCGCCAGAGTTCCCTACAGTTGCTGTCATTTCAGCATTGCCACAAATTTTGCTATGCAATACTCGTAGTCTCAAATTGCTCCTTACTATTGTAATGGCTAGTTACTTTATATTAATAGGTCAAGGTGCAAGATGA

Protein sequence:

>DPOGS202145-PA
MLLTVYGVILLWRMDSMVESTKVVKSHMRYQTTSPLSRVGPTVSLRDVEPNFVGPIDNVTVALGREAVLTCSVSDIGDYKVAWIRADDQTILTLHTRLVTHSSRYAVTNDSPGSWQLHIRPLKVEDRGCYMCQINTSTMKKQIGCVDVLVPPNIVDEGTSGDMVAREGTDVSISCKADGRPLPRILWRREDGANIQLRNDAGKLHKVDMYTGSSLNLTKVERRQMGAYLCIASNDVPPSVSKRIMLSVNFGPSILIATKVIGVPTGSQTELQCLVEAYPPAINYWLKSGEEMILSGEKHDIREVRLSAYEIRTILTISDFSSNDIGTYTCVATNTIGKAEGTLRLYEIKITTTTTTTTTTTTTTTTTTTTPIPSTTELLPETPPPIVVPLQPAEQKFYTPDVTTYDTMQNVIEQSVLDNNWLPTAESSSHRHYAPEFPTVAVISALPQILLCNTRSLKLLLTIVMASYFILIGQGAR-
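
Consistency check (illustrative):

The transcript and protein lengths in this record can be cross-checked against each other: 477 residues plus one stop codon account for 478 codons, or 1434 bp. The sketch below is a minimal example rather than part of the original record. It assumes the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The names `translate`, `CODON_TABLE` and `stop_symbol` are hypothetical helpers introduced only for this illustration.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: the standard code applies).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the "-" that
    ends the protein sequence in this record (an assumed convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[p:p + 3]] for p in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# Lengths reported in the record: 1434 bp transcript, 477 aa protein.
transcript_bp = 1434
protein_aa = 477
lengths_consistent = transcript_bp == (protein_aa + 1) * 3

# First and last 12 nucleotides of DPOGS202145-TA, copied from the record.
first_codons = translate("ATGCTTCTAACG")  # "MLLT"
last_codons = translate("GGTGCAAGATGA")   # "GAR-"
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS202145-PA sequence extends the same check to every codon.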