Monarch geneset OGS2.0

DPOGS203685
Transcript: DPOGS203685-TA; 1332 bp
Protein: DPOGS203685-PA; 443 aa
Genomic position: DPSCF300010 - 2085424-2087423
RNAseq coverage: 32x (Rank: top 75%)
Annotation
Heliconius: HMEL013321; 2e-158; 63.66%
Bombyx: BGIBMGA009172-TA; 6e-27; 42.95%
Drosophila: LpR1-PK; 2e-23; 41.21%
EBI UniRef50: UniRef50_E2A7C9; 3e-55; 49.30%; Low-density lipoprotein receptor-related protein 2 n=3 Tax=Formicidae RepID=E2A7C9_CAMFO
NCBI RefSeq: XP_002415782.1; 2e-46; 46.80%; low-density lipoprotein receptor, putative [Ixodes scapularis]
NCBI nr blastp: gi|383847378; 4e-57; 50.00%; PREDICTED: uncharacterized protein LOC100882865 [Megachile rotundata]
NCBI nr blastx: gi|383847378; 7e-64; 50.00%; PREDICTED: uncharacterized protein LOC100882865 [Megachile rotundata]
Group
Gene Ontology: GO:0005515; 4.2e-14; protein binding
KEGG pathway: aga:AgaP_AGAP012372; 2e-25
    K04550 (LRP1, CD91) maps-> Malaria; Alzheimer's disease
InterPro domain: [407-444] IPR002172; 4.2e-14; Low-density lipoprotein (LDL) receptor class A repeat
Orthology group: MCL34511; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203685-TA
ATGAACACAGCAACGAAATTAGAGCCATCGAAAAATACAATTGAGAGTTGTCCTGACAATATTTGCAGCCCCAACAACTCGTGTAGAAGAAAAACTTCTATTACTGATGTTAACTTTAGTAATGTTCAAGTAGTCCAAAATATACCGACAGTATCACAACATGTCGATGAGATTATCCGCGGTAAGTATAGAAGTGCAGAGTATGTCCAGCCAATCAATGTTTCCCAAAGAAGACTGAGAAAAAGATTTAAAGGCATTGGCTGTGTAGTAAATTTTTCATTACATGACAGCCCAAAGAAGAAACGAACAAAAATTCAAGCATGCTCAATAGGATTAATGATTGTAGCGATAGTACTCATTAGTTTAATATTAGTCAATTTTACAACTCCTAGTTTTATACACGCAACTAATGAAACAAGTGCAAGTTTTGTGCCAATAGAAGACATATATTTTAATGAAACTACTTCAGCCGTTGTTAGTATATATCCGGAAGCATTGTTTCCTAGTGTAAATAAAAGAAACATTACATATGAAACAACAACCGAACATTCTATAGATTATCGTAATAAAAGCAACGATCTTTTAAATATCATATCAAAAATAAGAAAAAACATAAAGACTTACCCCAGGGTGGGGAAAAAGACAGAGGAGGCAAAAGAAATTGTAAACAGAGATCTGTCAGCTGATTTTTGCTCGTGCCAAACAAATGAAGTGTGTATGTTAGATGAGAAAAGTGGAACGTCTATATGCCAAGTTGCCGTAGATTTAGAGGATCCTACTGGTTGCGGAGGACTCTGCGCACTGGAAACAGAAGCTTGTCAGCTTGTGGATAAGTCAAGGGGTGTGCGTATTTGTAAATTGTTAACCCAAATCAAGTGCTCGCCTCAGGATTGGCGATGCCGAGACGGATTCTGTGTCCCATCTGCAGCAAGATGTGATGGTTTCATACAATGCTACGATCGCTCCGATGAAATGCACTGCGAATGTGATTTAAAAAAGCAATTCCGATGTGGCAATTCAATATCTTGTTTTCCCAACAAAAAGAAATGCGACGGTTTTATTGATTGTTGGGATGGATACGATGAAGTTAATTGCACTTTAGAATGTCCAGAAGATCAATTTACTTGCAATGATGGTCAATGTATAATTTCTTCAAGATTTTGCGACGGTCTAGCCGACTGCGCGGATGGGAGCGATGAGCCGCAGGGATGCGGAGGGGCTTGTGGTACACATGAGGTGCAATGCCGAAACCATCGTTGTGTGCCACGCAGTGCTGTGTGCGACGGACAAAATGACTGTGGCGATAATTCGGACGAATCACACTGCTCTTGA

Protein sequence:

>DPOGS203685-PA
MNTATKLEPSKNTIESCPDNICSPNNSCRRKTSITDVNFSNVQVVQNIPTVSQHVDEIIRGKYRSAEYVQPINVSQRRLRKRFKGIGCVVNFSLHDSPKKKRTKIQACSIGLMIVAIVLISLILVNFTTPSFIHATNETSASFVPIEDIYFNETTSAVVSIYPEALFPSVNKRNITYETTTEHSIDYRNKSNDLLNIISKIRKNIKTYPRVGKKTEEAKEIVNRDLSADFCSCQTNEVCMLDEKSGTSICQVAVDLEDPTGCGGLCALETEACQLVDKSRGVRICKLLTQIKCSPQDWRCRDGFCVPSAARCDGFIQCYDRSDEMHCECDLKKQFRCGNSISCFPNKKKCDGFIDCWDGYDEVNCTLECPEDQFTCNDGQCIISSRFCDGLADCADGSDEPQGCGGACGTHEVQCRNHRCVPRSAVCDGQNDCGDNSDESHCS-
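
The transcript and protein lengths listed for this record (1332 bp and 443 aa) can be cross-checked with a short calculation. The Python sketch below is illustrative only and is not part of the database. It assumes the transcript sequence is a complete coding sequence that ends in a stop codon (here TGA, shown as "-" at the end of the protein sequence). The function names expected_protein_length and read_fasta are hypothetical helpers introduced for this example.

```python
def expected_protein_length(cds_length_bp, has_stop_codon=True):
    """Return the protein length implied by a coding sequence length.

    Assumes the coding sequence is in frame; the terminal stop codon,
    if present, does not contribute an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


def read_fasta(text):
    """Parse FASTA-formatted text into a dict of {record name: sequence}."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record name is the first token after '>'.
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


if __name__ == "__main__":
    # 1332 bp = 444 codons, i.e. 443 amino acids plus one stop codon.
    print(expected_protein_length(1332))
```

Passing the nucleotide record above through read_fasta and then len() should give the listed transcript length, which can be fed to expected_protein_length to compare against the listed protein length.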