Monarch geneset OGS2.0

DPOGS214397
TranscriptDPOGS214397-TA1797 bp
ProteinDPOGS214397-PA598 aa
Genomic positionDPSCF300069 - 274400-287596
RNAseq coverage1248x (Rank: top 10%)
Annotation
HeliconiusHMEL0037331e-7587.84% 
BombyxBGIBMGA011350-TA0.069.56% 
DrosophilaCG1887-PD3e-16754.34% 
EBI UniRef50UniRef50_Q7Q6Q62e-16955.37%AGAP005725-PA n=2 Tax=Anopheles RepID=Q7Q6Q6_ANOGA
NCBI RefSeqXP_396852.23e-17858.54%PREDICTED: similar to CG1887-PA [Apis mellifera]
NCBI nr blastpgi|3504186582e-17859.46%PREDICTED: scavenger receptor class B member 1-like [Bombus impatiens]
NCBI nr blastxgi|3504186584e-17659.81%PREDICTED: scavenger receptor class B member 1-like [Bombus impatiens]
Group
Gene OntologyGO:00160208.8e-189membrane
GO:00071558.8e-189cell adhesion
KEGG pathwaybfo:BRAFLDRAFT_1266702e-56 
 K12384 (SCARB2, LIMP2, CD36L2)maps-> Lysosome
InterPro domain[12-519] IPR0021598.8e-189CD36 antigen
Orthology groupMCL15966 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214397-TA
ATGAGCAAAAAAAGCTATGATCTGTTCCAAGGGCGTCTCGCTGTTATAACTTTCAGCATAGCAACAGTGGTTCTTGGCGTGATACTGTCGTTCGTACCCTGGCTCGACTATATCATATTTAAGGAATTGAAATTATGGAATGGGTCGTTGAGTTACAGCTATTGGCAAAGACCTGGTGTCATCAGGCTGACCAAGGTCTACATCTTCAATGTCACCAACCCTCAAGGGTTCCTGGAGAATGGGGAGAAACCGAAACTCACTGAAATAGGCCCTTTTGTGTACAGGGAGGACATGGAGAAAGTTAATATAAAATTCCACGACAACGACACTGTCACCTACCAGCACAACAAGATACTACGCTTCGTGCCTGAACTGTCGGTGGACAAAAATCTGAAGCTGGTCGTGCCAAACATACCTCTCCTGACAGTAACCTCGTTCTCTCCGAATCTCGCTGGCTGGCTTTTCAATCTCCTAGCCACCGGCCTCTCGATAACTTACAGAGAGCGAGCCAAACCTTTCGTCCACGTGACTGCTGAACAACTGGTCTTTGGTTACGACGATCCGTTAGTCACTCTCGCTCATTACTTCTACCCAAAAGGAAAGAGACCTAACACACAAATGGGGCTCCTGCTTGCTAGGAACGGTACTCTAGAGGAGGTGTCCACGATACATACCGGAGAGGACATGGAAAGTTTTGGATATTTAGATCGCATCAATGGTATGGACCATCTGCCCCATTGGAGCGATAAACCCTGTAATGACATAAGAGCCTCCGAGGGTTCCTTCTTCCCGCCGCGTCTGTCAACCAAAGCGGACACTGTCTATGTTTACGACAAAGATTTGTGCAGAATACTTCCGTTCACCTACAGGAAGGATGTATACATAAATGGTATACAAACTGGTCTCTACACTCCACCAAACTCGACCTTCGAGAGCGCAGACGTGAATCCTGACAACAAATGTTTCTGCCAAGGAGAGAAATGTCCTCCGCGTGGTCTACAAAATATCAGCCCCTGTCAATACAACGCTCCCGTTTATTTGTCCTACCCTCACTTCTACGATGCTGAGCCGTCGTTGCTGGAACGCTTCGAAGGACTTAAGCCAGAGCAAAACAAACACGAGAGCTACTTCTACATACAGCCGAAAATCGGCGTGCCTCTGGAAGGTCAAGTTCGTGTCCAACTCAACCTGAAGGTGGACCGCGCTCCCAACATCATGGTTAACGACATTCACAAATTCCCAGACATTATATTTCCTATTATGTGGGTCCAAGAGGGTATTGAGGGCGTCAGCACGCCGATTTGGCGATGGATATTCCTCGCCACAACCTTCGGCCCGATCGCTGCACCCATCATATCCTACTCTCTGATAGTCTTCGGTCTTGCCATTCTCATACACGCCTTCATAAAGGCATATAAAAACATCGTCATAGGCCAAAATTCCTTAGAAATAGTTGAAATAGGAAGAGAAACTATTAGAAGGAGTTCTACACTCCTAATGAACAGCTCCCAAAAACTTTTGGCACACAAAGAAACGTCATACAGACCTCTGAGCCAGTCCACTCCCTCATCCGCCCACGGCAGTCAGGAGGGTAGGCTGCAGGAGCTCAATTATATCGATAAAACGGACAATATAAGTCAGAGTATCGATGAATTGAAATCGATACTCAATAGGGATTTCGTGAAAACAAATTTCTGTGAAACAGAAAAAGAATCGCTGATACATTCGGATAACGCTTTCATAGTATCCGAACATCGTCGATGTAGAGTTTTTAAAGTGCCTGATAGTTATTATTGA

Protein sequence:

>DPOGS214397-PA
MSKKSYDLFQGRLAVITFSIATVVLGVILSFVPWLDYIIFKELKLWNGSLSYSYWQRPGVIRLTKVYIFNVTNPQGFLENGEKPKLTEIGPFVYREDMEKVNIKFHDNDTVTYQHNKILRFVPELSVDKNLKLVVPNIPLLTVTSFSPNLAGWLFNLLATGLSITYRERAKPFVHVTAEQLVFGYDDPLVTLAHYFYPKGKRPNTQMGLLLARNGTLEEVSTIHTGEDMESFGYLDRINGMDHLPHWSDKPCNDIRASEGSFFPPRLSTKADTVYVYDKDLCRILPFTYRKDVYINGIQTGLYTPPNSTFESADVNPDNKCFCQGEKCPPRGLQNISPCQYNAPVYLSYPHFYDAEPSLLERFEGLKPEQNKHESYFYIQPKIGVPLEGQVRVQLNLKVDRAPNIMVNDIHKFPDIIFPIMWVQEGIEGVSTPIWRWIFLATTFGPIAAPIISYSLIVFGLAILIHAFIKAYKNIVIGQNSLEIVEIGRETIRRSSTLLMNSSQKLLAHKETSYRPLSQSTPSSAHGSQEGRLQELNYIDKTDNISQSIDELKSILNRDFVKTNFCETEKESLIHSDNAFIVSEHRRCRVFKVPDSYY-