Monarch geneset OGS2.0

DPOGS212996
Transcript         DPOGS212996-TA  1782 bp
Protein            DPOGS212996-PA  593 aa
Genomic position   DPSCF300024 - 424367-447261
RNAseq coverage    116x (Rank: top 58%)
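The lengths above can be cross-checked with a little arithmetic. The sketch below assumes that the transcript length counts the coding sequence including its stop codon, and that the genomic coordinates are 1-based and inclusive; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
transcript_bp = 1782
protein_aa = 593
span_start, span_end = 424367, 447261

# Assumption: the transcript is pure CDS, so its length is 3 bases per
# residue plus one 3-base stop codon.
expected_cds_bp = protein_aa * 3 + 3

# Assumption: coordinates are 1-based and inclusive.
genomic_span_bp = span_end - span_start + 1

print("expected CDS length:", expected_cds_bp)
print("matches transcript length:", expected_cds_bp == transcript_bp)
print("genomic span:", genomic_span_bp, "bp")
```

Under these assumptions the transcript and protein lengths agree, and the genomic span is far longer than the transcript, which would be consistent with a gene model containing introns.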
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL021006        0.0      74.23%
Bombyx          BGIBMGA006908-TA  7e-180   69.37%
Drosophila      ImpE1-PA          2e-69    56.85%
EBI UniRef50    UniRef50_F4WNZ7   5e-106   40.90%    Low-density lipoprotein receptor-related protein 2 n=5 Tax=Formicidae RepID=F4WNZ7_ACREC
NCBI RefSeq     XP_001604900.1    3e-108   41.34%    PREDICTED: similar to GA16846-PA [Nasonia vitripennis]
NCBI nr blastp  gi|345487495      6e-107   41.34%    PREDICTED: hypothetical protein LOC100121297 [Nasonia vitripennis]
NCBI nr blastx  gi|345487495      2e-120   41.35%    PREDICTED: hypothetical protein LOC100121297 [Nasonia vitripennis]
Group
Gene Ontology    GO:0005515  2.5e-10  protein binding
KEGG pathway     dpe:Dper_GL11210  8e-20
                 K04550 (LRP1, CD91) maps-> Malaria; Alzheimer's disease
InterPro domain  [383-421]  IPR002172  2.5e-10  Low-density lipoprotein (LDL) receptor class A repeat
                 [56-106]   IPR006149  5.2e-07  EB domain
Orthology group  MCL16299  Insect specific
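The bracketed InterPro ranges can be used to pull a domain out of the protein sequence listed in this record. This is a minimal sketch, assuming the ranges are 1-based, inclusive residue positions (the record does not say so); `slice_domain` is a hypothetical helper, and only the first 120 residues of DPOGS212996-PA are embedded, which is enough to cover the EB domain at [56-106].

```python
def slice_domain(protein: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if start < 1 or end > len(protein) or start > end:
        raise ValueError("domain range falls outside the sequence")
    return protein[start - 1:end]

# First 120 residues of DPOGS212996-PA, copied from the protein sequence.
protein_prefix = (
    "MHNFNNMLYQCFIVLNALLSVRAFSLGSSCRSNVECRVTH"
    "AHCSRGVCACQPYYAQVDNSTCLESTLLGSECIVSEQCTL"
    "KVAYSACLDGLCRCSDGHLQFRKHTCLSPAPPGHVCYSNE"
)

# EB domain (IPR006149), annotated at [56-106].
eb_domain = slice_domain(protein_prefix, 56, 106)
print(len(eb_domain), eb_domain)
```

The same helper would apply to the LDL receptor class A repeat at [383-421] once the full 593-residue sequence is supplied.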
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212996-TA
ATGCACAATTTCAATAATATGCTATACCAATGCTTCATTGTTCTTAATGCTCTGTTGAGCGTCAGAGCGTTCTCCTTGGGATCCAGTTGCCGCTCCAATGTAGAGTGTAGAGTCACACATGCGCACTGCTCTAGGGGCGTTTGCGCGTGTCAGCCGTACTATGCACAGGTTGATAACTCTACTTGTCTCGAATCAACACTTCTGGGATCGGAATGTATAGTATCGGAACAATGTACCCTAAAGGTAGCGTACAGCGCGTGCTTGGATGGCCTCTGTCGCTGTTCCGACGGACATTTACAATTCAGAAAGCATACTTGTCTGTCTCCCGCTCCACCGGGACACGTTTGCTACAGCAACGAACATTGTCGATTATGGGAGAAAGAGAGTCACTGTGAATTTGTAATACCAAATCTCTTTGGAAGATGCGCTTGTAACACGAACTTCAAACAAAACGGAGAAAAGTGTTTGCCTCAAAAGGGAACAGCCTTTCCTACTGAAGCAGTGATTCTTGAACACGGTGCTTCTAAAACTGATATTATAATGGCATCGCCCTCAACTGTTGAATTGTCGAATAAATCATCAAAATCGCCAAGTAGTCCTAGCGCTGAAAGTTTGTTTGGTAATCCTAATATCTCAGACTCGGATGACATTAAGCCAAGTAATTTACCCATGTTCACGAGTAACAATGAAGTTCTAAGTACGCAAGAAGATAGTCCATTGATGCAAGCAGTTATGAAAGTTCCGACAGCGGAACCCAGTAAAAGACATCCAGTTTCAATAAGAAAGGATGAAGAATCAAATGAGCTAATAGGTGTAGACGAAATGACAGCTAATTTACATGCAAACGATCAACCTATATACAGAGTAGTGACACAACCATCTGTAGACAAAAAACCAAATAAGAAAGGGGGAGTCAAGGATCCGAATACTAGTAAAACTAATAAAAGTCAGAAAAAATCTTCATTAAAAGAGAAAAATCGTATCCGAGCTGATATAGTATCAACACCAGATCCAGGGCCAGCATCATTGGGGTTGTTCTGTCTATCGGATAGTCAATGTCAGCTCGTTGATCCGAATACAAGATGCTTGAACAAGAGATGCGATTGCACGTACCGAACTAATTCAACGTCTGCGTGCTCAGCGCGTAATCGAGGATGTCTGCCTGGTACATTCCAATGCCGTTCAACTGGTGCGTGTATCAGCTGGTATTTCGTTTGCGACGGACGGAAAGATTGCCCTGATGGGTCGGACGAAAGGTGTCAAGGTACTGGTAAAGACAACACAGGCGAACGTTGTCCGATGCACTCGTTCCGATGCGGTGGTCCTGGCAGCGCGTGCGTGTCTCGAGGAGCTCGGTGTGACGGCATTCCACAATGCGCTGGTGGCGAAGACGAGAGGAACTGTCGTGCTACTAAACGGAGAGGTTGTCCGCGACATACATTCCGCTGCGGCTCAGGCGAGTGTCTACCCGAGTACGAGTTCTGTAACGCGATCATCTCCTGCAAGGACGGCTCTGATGAACCGTCGCATCTATGTAATGAGCAGTCACGGTGGCGTGCGGCGGACTTTTGTCCATTGAGGTGCGGTAATGGAAGGTGTCGGAGCACTGCTGTGGCTTGCTCCGGAAGAGACGGCTGTGGAGACAACAGTGATGAAATAGCCTGCTCTGTCTGCAGTTGCGTCCGAAAAATATTAAATATTCCAGAGTTCAACCAAGGCATTATGTATTCACAACAACTGTCCAAACATGACCTCAATAATTTCGTAGACCCAAATACTTGA

Protein sequence:

>DPOGS212996-PA
MHNFNNMLYQCFIVLNALLSVRAFSLGSSCRSNVECRVTHAHCSRGVCACQPYYAQVDNSTCLESTLLGSECIVSEQCTLKVAYSACLDGLCRCSDGHLQFRKHTCLSPAPPGHVCYSNEHCRLWEKESHCEFVIPNLFGRCACNTNFKQNGEKCLPQKGTAFPTEAVILEHGASKTDIIMASPSTVELSNKSSKSPSSPSAESLFGNPNISDSDDIKPSNLPMFTSNNEVLSTQEDSPLMQAVMKVPTAEPSKRHPVSIRKDEESNELIGVDEMTANLHANDQPIYRVVTQPSVDKKPNKKGGVKDPNTSKTNKSQKKSSLKEKNRIRADIVSTPDPGPASLGLFCLSDSQCQLVDPNTRCLNKRCDCTYRTNSTSACSARNRGCLPGTFQCRSTGACISWYFVCDGRKDCPDGSDERCQGTGKDNTGERCPMHSFRCGGPGSACVSRGARCDGIPQCAGGEDERNCRATKRRGCPRHTFRCGSGECLPEYEFCNAIISCKDGSDEPSHLCNEQSRWRAADFCPLRCGNGRCRSTAVACSGRDGCGDNSDEIACSVCSCVRKILNIPEFNQGIMYSQQLSKHDLNNFVDPNT-
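The protein record corresponds to a codon-by-codon translation of the transcript. The sketch below illustrates this with the standard genetic code, which is an assumption since the record does not name the translation table, and it writes the stop codon as `-` to match the trailing character of the protein sequence. `translate` is a hypothetical helper, and only the first 30 and last 15 bases of DPOGS212996-TA are used.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            residues.append(stop_symbol)
            break
        residues.append(aa)
    return "".join(residues)

# First 30 and last 15 bases of DPOGS212996-TA, copied from the record.
cds_start = "ATGCACAATTTCAATAATATGCTATACCAA"
cds_end = "GACCCAAATACTTGA"

print(translate(cds_start))  # MHNFNNMLYQ
print(translate(cds_end))    # DPNT-
```

Both fragments reproduce the matching ends of DPOGS212996-PA, including the final `-` that stands for the TGA stop codon.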