Monarch geneset OGS2.0

DPOGS210549
TranscriptDPOGS210549-TA1563 bp
ProteinDPOGS210549-PA520 aa
Genomic positionDPSCF300304 + 3026-8174
RNAseq coverage1x (Rank: top 94%)
Annotation
HeliconiusHMEL0040309e-5130.77% 
BombyxBGIBMGA013437-TA1e-8031.30% 
DrosophilaCG10345-PA1e-5828.57% 
EBI UniRef50UniRef50_D6W9742e-6531.87%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W974_TRICA
NCBI RefSeqNP_001164151.14e-6631.87%scavenger receptor class B, member 1-like [Tribolium castaneum]
NCBI nr blastpgi|3071987946e-6633.19%Scavenger receptor class B member 1 [Harpegnathos saltator]
NCBI nr blastxgi|2824035093e-6631.34%scavenger receptor class B, member 1-like [Tribolium castaneum]
Group
Gene OntologyGO:00160201.5e-76membrane
GO:00071551.5e-76cell adhesion
KEGG pathwaybta:2823461e-30 
 K13885 (SCARB1)maps-> Phagosome
InterPro domain[12-521] IPR0021591.5e-76CD36 antigen
Orthology groupMCL16587 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210549-TA
ATGAAACCCAAAAACCCGAATTATAAACACGCCGAGGACAAAAAAAAAAATTATGTTCTGATGGCCCTAGGAATACTGTTTGTGGTACTACCGATTGTAACTCTCTTTGTGGATCCAGTGCTTATGGCAATGAAATATTTAACTCGTATGTCAGTCGGCTCGAAAATATACACAATGATGAAGGAGGAAATTCCAGGAGCGCTTATTAACGTTTATATTTTCAATATAACCAACGGAGAAGCCTTCGTCTCCGGTGAGGATTATAAACTGAAAGTTGAACAAGTTGGACCTTTTGTTTATCAGGAGTTCCGTACAAACGAAGGCTTTGAAATTGATGAGGAAGCGGGTGTAATGCGTTACACCCCCATTGCTGCCGCGCGGTTCATGCCGGAACGATCTATCGCCGACCCACGACACGTCAATATTACTGTTATCAATACTATAATGCTTGCACTAGCATCTATGTTGAGTTCCTATTCTATATTTGGAAAATCTGGCTACAATTTGTTAATAAATCAACTACAGTCGAAACCATTCCTCAACATTGATGTGGACAGCTATTTTTGGGGGTATGATGACCCACTAATTGCTTTAGGCAACACCCTTATGCCAGGATGGATTACCTTCCAAAGATTGGGTATTTTAGATAGGTTATACGACCCAGCGGCCGTCCCGCGACTGGAACTTGGAATCCATGACGAAGATAAGTTTAATATAAGGACCGCGAACGGATGTCCCGGCCTCAAGGTATGGCAATACGAAAATCCTTCAAAACGTTCTCGATGCAATACTTTCACTGACGCCTATGAGGGCTTTGCCTTCCCCCCAGGACTCACTCCTGACCGAGCTCTCAGACTGTACCGTAACGTGTTCTGTCGGATGCTAGAGCTGAGGTTCGTTGACACTAAGCCACTGGACTTCGGTCCTGAAAGTTTCGTATACCAAATCAGAAACGATAGTTTTGCTGTCAACGCGGAAACCAACTGTCTCTGCGGCGAATATGGTTGCGCCGAAGGATTGTCCAGTGCGGCGCCGTGTTTGTTCGGTTTTGATCTTGGATTGTCTTTCGGACATTTTTGGAACGCTTATCCCAAAGTATATGAACGCATTGAGGGTATGCGTCCCGATGAAAAGGAACACGGTAGCGAGTTCCTGATTGATCCGAAAAGCGGTGCGGTTTTAGCAGCGAGATTCACTCTCCAGTTAAACTTAATTGTTAGAGACGTTAGTTACAACAGCCTAACCAAACCATTCAGTGAAATGGTGATACCTATGACCTATTTAAAAATTGTCCAACCACCGTTACCAAACGAAGCAAAAAATGTATTCAGATTTATGTACCAAGTCCTTCCTAACATTATACTCGGTCTACAGATTATAATATTCGTAATAGGTTTTATAATGATCGCGTACACCGTCAGAAGCATTTATTGGCAGGTTATTGTAAGAAAAGGTATTGATCTCTTAAATGCAAGTAATGAGGATAGGGTCCATGTACCACGTTCTGAAACACTTCTAGTAGAAGAGAAACCTTTGGATGAATATCGATTGTACTCGAATTAG

Protein sequence:

>DPOGS210549-PA
MKPKNPNYKHAEDKKKNYVLMALGILFVVLPIVTLFVDPVLMAMKYLTRMSVGSKIYTMMKEEIPGALINVYIFNITNGEAFVSGEDYKLKVEQVGPFVYQEFRTNEGFEIDEEAGVMRYTPIAAARFMPERSIADPRHVNITVINTIMLALASMLSSYSIFGKSGYNLLINQLQSKPFLNIDVDSYFWGYDDPLIALGNTLMPGWITFQRLGILDRLYDPAAVPRLELGIHDEDKFNIRTANGCPGLKVWQYENPSKRSRCNTFTDAYEGFAFPPGLTPDRALRLYRNVFCRMLELRFVDTKPLDFGPESFVYQIRNDSFAVNAETNCLCGEYGCAEGLSSAAPCLFGFDLGLSFGHFWNAYPKVYERIEGMRPDEKEHGSEFLIDPKSGAVLAARFTLQLNLIVRDVSYNSLTKPFSEMVIPMTYLKIVQPPLPNEAKNVFRFMYQVLPNIILGLQIIIFVIGFIMIAYTVRSIYWQVIVRKGIDLLNASNEDRVHVPRSETLLVEEKPLDEYRLYSN-