Monarch geneset OGS2.0

DPOGS206090
TranscriptDPOGS206090-TA2871 bp
ProteinDPOGS206090-PA956 aa
Genomic positionDPSCF300028 + 11623-27821
RNAseq coverage220x (Rank: top 45%)
Annotation
HeliconiusHMEL0063411e-12975.46% 
BombyxBGIBMGA011213-TA2e-9352.47% 
DrosophilaCG40006-PA8e-10152.66% 
EBI UniRef50UniRef50_E2B9C51e-10152.33%Scavenger receptor class B member 1 n=4 Tax=Endopterygota RepID=E2B9C5_HARSA
NCBI RefSeqXP_968534.12e-11058.79%PREDICTED: similar to scavenger receptor class B (AGAP002738-PA) [Tribolium castaneum]
NCBI nr blastpgi|910853013e-10958.79%PREDICTED: similar to scavenger receptor class B (AGAP002738-PA) [Tribolium castaneum]
NCBI nr blastxgi|910853018e-10958.20%PREDICTED: similar to scavenger receptor class B (AGAP002738-PA) [Tribolium castaneum]
Group
Gene OntologyGO:00160202.3e-163membrane
GO:00071552.3e-163cell adhesion
GO:00081521.2e-43metabolic process
GO:00038241.2e-43catalytic activity
KEGG pathwaytad:TRIADDRAFT_562023e-42 
 K01904 (E6.2.1.12)maps-> Phenylpropanoid biosynthesis
    Phenylalanine metabolism
    Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[174-822] IPR0021592.3e-163CD36 antigen
[2-167] IPR0008731.2e-43AMP-dependent synthetase/ligase
Orthology groupMCL15184 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206090-TA
ATGACGACAATAGTGCCACCGCTCGCTGTGTTCCTTGCCAAACATCCGTTGGTCTCCAAGTATGACCTGAGCTCATTGAACGAAATGTGGTGCGGAGCCGCCCCCCTGTCCAAGGAAATACAGACGCTTGTCACTAAACGAACTGGTATTGATTTCATCAAGCAAGGTTACGGACTGACAGAAGTCACAATGGCCTGTTGTGTGGATTTAGTTGGCAGAAGCAAAGCAGGCTCCTGCGGTACACCTGCGCCTGGCATGAAGATCAAGGTTATAGATACTGAGAGTGGTAAGAAATTAGGTCCCAATGAAGAGGGTGAGCTGTGCATTAAGTCGCCTCTCCGCATGAAGGGATATTTGGGTGATAAAGCATCCGGTGATGCCATGATTGATGAGGAAGGTTATGTTAAGACGGGAGATATTGGGTACTATGACAAGGAAGGATACTTCTACATTGTTGATAGACTCAAAGAACTCATCAAATATAAAGGTTTCCAGATTCCATACTTTTTCCCTCAAGGTCTCTTCATCTTATTCGTCGTCGGCTTCTTGTCTCTCATCACCGGTCTCCTGATATGTTTCCTTCATCCATACGACGCGATCTTCCGATGGAAAGTCGTTATGTCAGACGGCGGTGAGATTTTCGAAATGTGGCGGAAACCGCAGGTGGAGCTGTACGCCAAAATATACTTATTCAACATAACAAACTCACACGAATACATGTCCGGTGTGGATACCAAGTTGAAAGTTCAAGAGGTCGGCCCATATGTATATAGGGAGATCTTCGAGCACGCTGATGTCGTGTTCAATGACAACGGCACCCTGAGCACTATCCCCCGACACCCCCTGGTGTGGCAGCCGGAGCTGTCCGAGGGGAACAGGGAGGACGATGTGCTGTATCTACCGCATATAGCTCTGCTGTCAATTGGGGATGTGGTATCTGAGCAAAGTTATTGGTCCCAATTGGGTTTGAATCAGCTGATCAGCGTCACTAATAGCCAGCCGATAGCGAAAATGACTGCTAAGGAGTTCATGATGGGATACGAGTCACAGCTCATGACGCTGGGGAACACGTTCCTACCGGGATGGATCTATTTTGATAAACTCGGTCTCATTGATAGGATGTACGATTTCAATGGTGACTACGAGACTATATTTACCGGTGAAAACGATGAAACTCTCAGCGGTTTGATAGATACCTATCGGGGATCGACTGATTTGCCGCATTGGGACGGTAAACACTGTTCCAATATACAGTACGCGTCCGATGGCACTAAATTTAGGGGTTCATTGACCTTGAACGACTCGAGCTTATTCTACAGGAAGAGTCTGTGCCGAGCCGCACCTCTGGTCCCAGTTGAAGAAGGTATCAAAAACGGCTTCAGAGCATACAAATACACCTTCCCAGAACATATGCTGGATAACGGGAAAGTTCTGGAAGAAAACAAATGCTTCTGTAGATTAGGTAANGAGTCAATTGGGGATGTGGTATCTGAGCAAAGTTATTGGTCCCAATTGGGTTTGAATCAGCTGATCAGCGTCACCAACAGCCAGCCGATAGCGAAAATGACTGCTAAAGAGTTCATGATGGGATACGAGTCACAGCTCATGACCCTGGGGAACACGTTCCTACCGGGGTGGATCTATTTTGATAAACTCGGTCTCATTGACAGGATGTACGATTTCAATGGTGACTACGAGACTATATTTACCGGTGAAAACGATGAAACTCTCAGCGGTTTGATAGATACCTATCGGGGATCGACTGATTTGCCGCATTGGGACGGTAAACACTGTTCCAATATACAGTACGCGTCCGATGGCACTAAATTTAGGGGTTCATTGACCTTGAACGACTCGAGCTTATTCTACAGGAAGAGTCTGTGCCGAGCCGCACCTCTGGTCCCAGTTGAAGAAGGTATCAAAAACGGCTTCAGAGCATACAAATACACCTTCCCAGAACATATGCTGGATAACGGGAAAGTTCTGGAAGAAAACAAATGCTTCTGTAGATTAGGTAAATGTCTCCCGGAAGGTCTGATAGATGTAACCGATTGCTATTACGGCTTCCCAATAGCGCTGTCCTACCCTCACTTCTACAAGGGAGAGGAAGTCCTGTTCAGCAAAGTGGAAGGTCTCCAACCAGACGAAGAGAAACATAAGACTGAGTTTTGGATCCAACCAGATTCTGGTCTCCCATTGGACATCAGTTCCAAATTCCAGATCAACATGGCGCTTGGAGATCTATCAATGATAACGAACGCTGGAAAATTCTCAAACATGTACCTGCCGATGCTGTGGTTTGATATCAGAATGTACACACTGCCGGCGTCCATGGAACAGAAGTTCAAAATATATTTAAATATTCTACCGTTCATAGAGAAATCATTGATGTATTTGAGCTTTATATCCGGTTCTGCACTCATAATGGCGACGTCGTTCATGATCTACAAACTGTTACATAAGACGTATAACGGCGGAAAAAAAGTTAAATTCAACTGGATGAATGCTAATAGTAAGGATATTTATTCCCCGTGTGAGATACCGATGGGCGATGGTTCAGATGATACAGCGTGCAGAACCCACGGCGACAGGTTCAAACAGCTGGGTCACAGGTTGAGTGACAGAGTCCAAGGGTCAGTCAACAACGTGATAGACAGCTTCCAGAAGAGAAAGGATTCGTTGATAGACAATGACGAGTCGAGCGACAGTAACGCATACAAACGCGACTCGACGTACATCGGGGATATCACCAAGTACCATGAGATCAGGCAGACTGATAGTGACGATGAATATTACAAGTATCTAGAGGTCGTCGATGATGAATTCGGTGATGGTGATGATAAGAAAGACACGTACATACATATCAACTGA

Protein sequence:

>DPOGS206090-PA
MTTIVPPLAVFLAKHPLVSKYDLSSLNEMWCGAAPLSKEIQTLVTKRTGIDFIKQGYGLTEVTMACCVDLVGRSKAGSCGTPAPGMKIKVIDTESGKKLGPNEEGELCIKSPLRMKGYLGDKASGDAMIDEEGYVKTGDIGYYDKEGYFYIVDRLKELIKYKGFQIPYFFPQGLFILFVVGFLSLITGLLICFLHPYDAIFRWKVVMSDGGEIFEMWRKPQVELYAKIYLFNITNSHEYMSGVDTKLKVQEVGPYVYREIFEHADVVFNDNGTLSTIPRHPLVWQPELSEGNREDDVLYLPHIALLSIGDVVSEQSYWSQLGLNQLISVTNSQPIAKMTAKEFMMGYESQLMTLGNTFLPGWIYFDKLGLIDRMYDFNGDYETIFTGENDETLSGLIDTYRGSTDLPHWDGKHCSNIQYASDGTKFRGSLTLNDSSLFYRKSLCRAAPLVPVEEGIKNGFRAYKYTFPEHMLDNGKVLEENKCFCRLGXESIGDVVSEQSYWSQLGLNQLISVTNSQPIAKMTAKEFMMGYESQLMTLGNTFLPGWIYFDKLGLIDRMYDFNGDYETIFTGENDETLSGLIDTYRGSTDLPHWDGKHCSNIQYASDGTKFRGSLTLNDSSLFYRKSLCRAAPLVPVEEGIKNGFRAYKYTFPEHMLDNGKVLEENKCFCRLGKCLPEGLIDVTDCYYGFPIALSYPHFYKGEEVLFSKVEGLQPDEEKHKTEFWIQPDSGLPLDISSKFQINMALGDLSMITNAGKFSNMYLPMLWFDIRMYTLPASMEQKFKIYLNILPFIEKSLMYLSFISGSALIMATSFMIYKLLHKTYNGGKKVKFNWMNANSKDIYSPCEIPMGDGSDDTACRTHGDRFKQLGHRLSDRVQGSVNNVIDSFQKRKDSLIDNDESSDSNAYKRDSTYIGDITKYHEIRQTDSDDEYYKYLEVVDDEFGDGDDKKDTYIHIN-