Monarch geneset OGS2.0

DPOGS204437
Transcript         DPOGS204437-TA   1155 bp
Protein            DPOGS204437-PA   384 aa
Genomic position   DPSCF300002 - 174999-180200
RNAseq coverage    438x (Rank: top 28%)
Annotation
Heliconius       HMEL004030         4e-173   71.32%
Bombyx           BGIBMGA013572-TA   4e-125   55.10%
Drosophila       CG10345-PA         2e-37    25.97%
EBI UniRef50     UniRef50_D6W974    8e-46    32.81%   Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W974_TRICA
NCBI RefSeq      NP_001164151.1     2e-46    32.81%   scavenger receptor class B, member 1-like [Tribolium castaneum]
NCBI nr blastp   gi|380029081       3e-49    33.43%   PREDICTED: scavenger receptor class B member 1-like [Apis florea]
NCBI nr blastx   gi|340722242       2e-49    31.84%   PREDICTED: scavenger receptor class B member 1-like [Bombus terrestris]
Group
Gene Ontology     GO:0016020   2e-64   membrane
                  GO:0007155   2e-64   cell adhesion
KEGG pathway      dre:387260   6e-29
                  K13885 (SCARB1) maps-> Phagosome
InterPro domain   [3-366] IPR002159   2e-64   CD36 antigen
Orthology group   MCL26090   Lepidoptera specific

Nucleotide sequence:

>DPOGS204437-TA
ATGGGGTATTTCGCGAACGCGGGGGCCTATTACTCCATTTCGACGCTGGGGCCCAAGCTGTTTATTAACATGACAGCTGAGGGTTTACTGTGGGGTTACGACGATCCCCTGGTCAACATCGCCAATAAATTTCTGCCTGGCTGGATTGATTTCGGAAAAATTGGCATTATGGACAGATTCTACGCTCAAAGAAGAGAAGAAGTTGAAATAGAACTAAGGAATAAATCAAGAAAATTTGCGATTAACTCCTGGAACAAAAGCCCCGGACTTGTTGAACAGGGGTTCACTGATTGGAATTCGAGCTATCCGTGCAATCGTTTGAAGGACACCTACGAAGGTCTGTTGTTACCTCCCGGGTTACGAAAGGACTACGAGATCCCGCTGTTCAGGAAGCAGGCTTGTCGAGTCTACCCCTACAGGTTCTCCGAGGAAATCACAAGCGATCATGGCTTCAACTTCTACAGATATATAATGAGCGAGCCTTCCTTCAACCAGTCCTCAAACTACGCCTGTCCATGCTCTCAGAACTGTCTGCCGGATGGCTTTGTGGATATTAGCAGCTGTTATTATGGGTTTCCTATAGCGTTGTCTAAGCCACATTTCCTGGACGCAGATCCCGAACAGCTTTCTTTCTTCCGCGGTTTCAACCCAGACCCCATAAAACACAGGTCCACCCTCGACTTGGAGCCGGTGTTAGGTGTTCCAGTAGCTGTGGAGTCCAACATCCAAGTGAACATCGCGGTGCGAATGTCGTCAGGGAACCCCATCACCAGACCCTTAAAAGATAAGGTCATGCCGCTGATTTGGATGTCTATATATTGTAAAAACCCACCGTCGGATATTATAACTCTGTTACATCTGCGCCTGGTGCTAGCACCTCCACTTGTGATCGCCCTGGAGGTGGTGCTGCTTATCCTAGGGATGTTTTTGGGCATTCAAGCCTTCCACAGAATTTGGAAACCGAAGTACAAACTGGTCCAGAAACCAAAGGAAAAGGTCAGGAGAAAGAGCAGCGAGCGACGAAAAAGCAGCGTGATTTTGAACATGGAGAACACTGGATTCGTTGACGAACAGGAACTGGCAAAGGAAGCCGTTTCTTTACTAGCGATCACGGAGGAAGATAACGACGTACCAGACTTATTGTTGAACGAATGA

Protein sequence:

>DPOGS204437-PA
MGYFANAGAYYSISTLGPKLFINMTAEGLLWGYDDPLVNIANKFLPGWIDFGKIGIMDRFYAQRREEVEIELRNKSRKFAINSWNKSPGLVEQGFTDWNSSYPCNRLKDTYEGLLLPPGLRKDYEIPLFRKQACRVYPYRFSEEITSDHGFNFYRYIMSEPSFNQSSNYACPCSQNCLPDGFVDISSCYYGFPIALSKPHFLDADPEQLSFFRGFNPDPIKHRSTLDLEPVLGVPVAVESNIQVNIAVRMSSGNPITRPLKDKVMPLIWMSIYCKNPPSDIITLLHLRLVLAPPLVIALEVVLLILGMFLGIQAFHRIWKPKYKLVQKPKEKVRRKSSERRKSSVILNMENTGFVDEQELAKEAVSLLAITEEDNDVPDLLLNE-
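
The nucleotide and protein records can be checked against each other. The following is a minimal sketch, assuming the transcript is a complete coding sequence read in frame from its first base and translated with the standard genetic code (NCBI translation table 1); the record itself does not state which table was used. The function name `translate` and the `stop_symbol` parameter are illustrative only. The stop codon is written as "-" to match the protein record above.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
# Assumption: the record was translated with this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol.

    Raises ValueError if the length is not a multiple of three, and KeyError
    on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First 24 and last 12 nucleotides of DPOGS204437-TA, copied from the record above.
cds_start = "ATGGGGTATTTCGCGAACGCGGGG"
cds_end = "TTGAACGAATGA"

print(translate(cds_start))   # MGYFANAG
print(translate(cds_end))     # LNE-

# Length check: 384 residues plus one stop codon account for all 1155 bp.
print(3 * (384 + 1) == 1155)  # True
```

Passing the full DPOGS204437-TA sequence to `translate` should, under the same assumptions, give the 384-residue protein followed by "-".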