Monarch geneset OGS2.0

DPOGS202827
TranscriptDPOGS202827-TA1242 bp
ProteinDPOGS202827-PA413 aa
Genomic positionDPSCF300018 + 647042-651740
RNAseq coverage185x (Rank: top 49%)
Annotation
HeliconiusHMEL0092904e-7568.21% 
BombyxBGIBMGA010477-TA3e-14056.20% 
Drosophilasanta-maria-PA9e-8137.08% 
EBI UniRef50UniRef50_D2KXB35e-13856.20%Cameo2 n=3 Tax=Obtectomera RepID=D2KXB3_BOMMO
NCBI RefSeqNP_001164651.11e-13856.20%scavenger receptor class B member 4 [Bombyx mori]
NCBI nr blastpgi|2839454792e-13756.20%scavenger receptor class B member 4 [Bombyx mori]
NCBI nr blastxgi|2839454797e-13556.20%scavenger receptor class B member 4 [Bombyx mori]
Group
Gene OntologyGO:00160206.1e-121membrane
GO:00071556.1e-121cell adhesion
KEGG pathwaydre:3872605e-49 
 K13885 (SCARB1)maps-> Phagosome
InterPro domain[10-358] IPR0021596.1e-121CD36 antigen
Orthology groupMCL14539 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202827-TA
ATGACCATGACCGTGGCCATCAAAAGTCCGTATCGTTTTCGTGAGCATCGTCGGCATATCAACGTGTCCTTCAACAACCAGAACCACTCCGTGTCTTACCGAACTCAACGAAGCTGGTACTTCGACGAGGAGTTCAGCAACGGGACCATGAAGGATAACATAACCATTATAAACGTCATAGCAGCATCTGCGGTCTATCGTTCGAGACACTGGGGTTTTATTCAGCAGAAGGGTCTATCTATGGGTCTCGCTATGCTGGGTCAAGGTTTCTCGGTGACGAGGACCGCTGAAGAACTGTTTTTCGAAGGTTACGAGGATCCATTTTTGGATATAGCGAGGATTCTTCCCTCAACCACAACCGGAGGGGCGCCGCCAGTAGACCGGTTTGGTCTGTTTTACGAGAGAAATAATTCGATGGATACTGAGGGTTTGGTAGAAGTAGCAACAGGAGAGGCCAGTGGAACTTATCCCGGACAGATACTCAGGTGGAACAATTACGAAAGTCTTCCGTTTTATGAAGGGGCGTGCTCCCAGATAACTGGTAGCGCTGGAGAGATGATGGCCCACAATTTGACCGAAGAGCCTTTCACTCTCTTTGTCCCAGATCTGTGTCGGACAGTCCACTTAGAGTATAATAGCAGCGGAAGTTTAGACGGAGTTCTCTACAACAAGTACACCATGACCGAAGCCAGCTTCGATAATTCTTCAAGATCTCCCGACAACGCCTGCTTCTGCAGCGGTGAGTGCAGTTGGAGCGGTACAATGAATGTCTCCGCGTGTCGCTACGGCAGCCCGGCTTTCATGTCCTTGCCTCATTTTCTGTACGGGGATCCTGAACTGAGGTCCTACGTCACAGGCTTATCGCCAGATCCTGAATTGCATTCCTTTTACTTCGCCATAGAACCGAGACTCGGAGTGCCAGTGGATGTGGCGGGAAGATTTCAATTTAATATTTTTATTGAACCGACACCAAATATTGCACTCTATGAAAACGTTCCCCGGATGATGTTCCCGGTGTTTTGGGTGGAGCAAAAGGTTCAAATCTCTCCGGAAGTGCTATCTGAACTGAGATCCGTACGAGCGGTGCTGGAGCGAGGGGGGGCGATCCTAGCGGGAGTGGCGGTGGCTCTCGCAGCCCTCGCACTCGCTCTTCTAACCTGCTGTTCAAAGACAAGCAAGTACACCAGTCCGCAGGAGATCAAAGAAAAGGACGAAGCGGAAGTAAAATTAAACCCTATGTGA

Protein sequence:

>DPOGS202827-PA
MTMTVAIKSPYRFREHRRHINVSFNNQNHSVSYRTQRSWYFDEEFSNGTMKDNITIINVIAASAVYRSRHWGFIQQKGLSMGLAMLGQGFSVTRTAEELFFEGYEDPFLDIARILPSTTTGGAPPVDRFGLFYERNNSMDTEGLVEVATGEASGTYPGQILRWNNYESLPFYEGACSQITGSAGEMMAHNLTEEPFTLFVPDLCRTVHLEYNSSGSLDGVLYNKYTMTEASFDNSSRSPDNACFCSGECSWSGTMNVSACRYGSPAFMSLPHFLYGDPELRSYVTGLSPDPELHSFYFAIEPRLGVPVDVAGRFQFNIFIEPTPNIALYENVPRMMFPVFWVEQKVQISPEVLSELRSVRAVLERGGAILAGVAVALAALALALLTCCSKTSKYTSPQEIKEKDEAEVKLNPM-