Monarch geneset OGS2.0

DPOGS203103
Transcript: DPOGS203103-TA; 1365 bp
Protein: DPOGS203103-PA; 454 aa
Genomic position: DPSCF300391 + 68477-79529
RNAseq coverage: 2109x (Rank: top 6%)
Annotation
Heliconius: HMEL014247; 5e-114; 66.46%
Bombyx: BGIBMGA011153-TA; 0.0; 80.45%
Drosophila: emp-PB; 0.0; 72.48%
EBI UniRef50: UniRef50_Q9W0X0; 0.0; 72.48%; Epithelial membrane protein, isoform B n=36 Tax=Neoptera RepID=Q9W0X0_DROME
NCBI RefSeq: XP_001660151.1; 0.0; 69.57%; epithelial membrane protein [Aedes aegypti]
NCBI nr blastp: gi|270037307; 0.0; 69.57%; epithelial membrane protein [Aedes albopictus]
NCBI nr blastx: gi|270037307; 0.0; 69.57%; epithelial membrane protein [Aedes albopictus]
Group
Gene Ontology: GO:0016020; 4.1e-208; membrane
 GO:0007155; 4.1e-208; cell adhesion
 GO:0004872; 8.8e-05; receptor activity
 GO:0005764; 8.8e-05; lysosome
KEGG pathway: bfo:BRAFLDRAFT_126670; 1e-68
 K12384 (SCARB2, LIMP2, CD36L2) maps-> Lysosome
InterPro domain: [2-433] IPR002159; 4.1e-208; CD36 antigen
Orthology group: MCL14013; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203103-TA
ATGACCTTCGAGTGGTGGGCTCGGCCCCCGGTTCGTCCGTTTATCCGCGTGTATGTTTACAACGTGACCAACGCCGATGAGTTCTTGAACAACGGTTCCAAGCCGATCCTAGACGAACTCGGACCCTACGTCTACTCCGAAGAGTGGGAGAAGGTGAATATAACGGACAATGAGAACGGGACCCTAAGCTTCCACTACAGGAGGACGTACACCTTCATGCCGGAACTGAGCTCCGGTCCTGACGACGACTCGGTCGTTGTGCCAAATATTCCTATGTTGAGCGCAACCTCACAATCGAAGCACGCGGCTCGTTTTCTCCGGCTAGCCATGGCATCCATCATGGACATCCTCAAGATAAAACCCTTCGTGGAAGTCTCTGTGGGTCAACTGCTGTGGGGATACGAAGATCCGCTCCTTAAACTGGCCAAGGACGTCGTCCCCAAAGAACAGAACCTACCGTACGACGAGTTTGGTCTCTTCTATGGGAAAAACGGCACGTCCCCGGACCCGGTGACCATGTTCACTGGTAGCGAGGACATCAGCAAGTACGGTATAATCCAACGCTACAACCACCGCGAGCGTCTCCCGCACTGGACCACGGACGAGTGCAACAGCCTCGCCGGCTCTGATGGCTCTATATTCCCTCCCCACATCACCAGGAACGACACGCTGGCTGTCTACGACAAAGACATGTGCCGGCTTCTACCTCTGAGGTATCTCAAAGACGTGGAATCAGCCGCCGGCGTGGCCGGGTACCGGTTCACTCCGCCCGAGGACGTGTTCGCCGAGAACGAACACAACAAGTGTTACTGTCCCGCGGGACCCCCCTGCGCCCCCAACGGCCTGTTCAACGTGTCGCTGTGTCAATACGATTCTCCCGTCATGTTGTCCTTCCCTCACTTCTACTTGGCCGATGAATCATTTCGTGAAGCTGTCGAGGGGATCTCCCCACCGGATGCCGAGAAACATAGATTATATATTGATGTGCAACCGGAGATGGGCACAGCTATGAGAGCTCGCGCGAGGATCCAGATCAACCTGGCCGTGTCCCAGGTGTTGGACATCAAGCAGGTCGCCAACTTCCCAGACATCGTCTTCCCCATACTGTGGTTCGAGGAGGGTATAGACGAGTTGCCCGAGTCGGTGTCCTCCATGCTGCGGCTGGCGACCAAGCTGCCTCCCATAGCCAGGGCGGCGCTGGGAGGGGGGCTCACGGCACTGGGCGCTCTGCTGGTGCTGCTGGCGGTCACCTGCCTCATACGATCCTCTCATCGTCAGAGCACGCTCCGCCTGGAAGGTCACGCGGTGGCCAAACCTCCGCCGGCCAACAACAAAGAGAACGGCTACGAACTGAACAGAAGGTAA

Protein sequence:

>DPOGS203103-PA
MTFEWWARPPVRPFIRVYVYNVTNADEFLNNGSKPILDELGPYVYSEEWEKVNITDNENGTLSFHYRRTYTFMPELSSGPDDDSVVVPNIPMLSATSQSKHAARFLRLAMASIMDILKIKPFVEVSVGQLLWGYEDPLLKLAKDVVPKEQNLPYDEFGLFYGKNGTSPDPVTMFTGSEDISKYGIIQRYNHRERLPHWTTDECNSLAGSDGSIFPPHITRNDTLAVYDKDMCRLLPLRYLKDVESAAGVAGYRFTPPEDVFAENEHNKCYCPAGPPCAPNGLFNVSLCQYDSPVMLSFPHFYLADESFREAVEGISPPDAEKHRLYIDVQPEMGTAMRARARIQINLAVSQVLDIKQVANFPDIVFPILWFEEGIDELPESVSSMLRLATKLPPIARAALGGGLTALGALLVLLAVTCLIRSSHRQSTLRLEGHAVAKPPPANNKENGYELNRR-
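
Consistency check (sketch):

The reported lengths fit a complete coding sequence: 1365 bp = 3 x (454 aa + 1 stop codon). The Python sketch below shows one way to run the same check on FASTA records like the two above. It assumes the trailing "-" in the protein record marks the stop codon. The function names parse_fasta and lengths_consistent are hypothetical helpers, and the toy sequences are illustrative, not taken from this record.

```python
def parse_fasta(text):
    """Return {header: sequence} for FASTA-formatted text."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line  # join wrapped sequence lines
    return records


def lengths_consistent(transcript, protein, stop_symbol="-"):
    """True if the transcript is 3 nt per residue plus one stop codon.

    Assumption: a trailing stop_symbol in the protein marks the stop
    codon and is not counted as a residue.
    """
    residues = protein.rstrip(stop_symbol)
    return len(transcript) == 3 * (len(residues) + 1)


# Lengths reported in this record: 1365 bp and 454 aa.
assert 1365 == 3 * (454 + 1)

# Toy records (not the real sequences): ATG GCC TAA encodes "MA" plus stop.
toy = parse_fasta(">t-TA\nATGGCC\nTAA\n\n>t-PA\nMA-\n")
print(lengths_consistent(toy["t-TA"], toy["t-PA"]))
```

To check the real record, pass the full text of the two FASTA entries to parse_fasta and compare the DPOGS203103-TA and DPOGS203103-PA sequences.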