Monarch geneset OGS2.0

DPOGS202126
TranscriptDPOGS202126-TA3219 bp
ProteinDPOGS202126-PA1072 aa
Genomic positionDPSCF300150 + 522525-576251
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0107510.089.68% 
BombyxBGIBMGA002771-TA0.083.10% 
DrosophilaCG32432-PA0.043.63% 
EBI UniRef50UniRef50_B1B5580.083.72%Low density lipoprotein receptor-related protein-like protein n=2 Tax=Obtectomera RepID=B1B558_BOMMO
NCBI RefSeqNP_001116809.10.083.72%low density lipoprotein receptor-related protein-like protein [Bombyx mori]
NCBI nr blastpgi|1825091960.083.72%low density lipoprotein receptor-related protein-like protein precursor [Bombyx mori]
NCBI nr blastxgi|1825091960.083.72%low density lipoprotein receptor-related protein-like protein precursor [Bombyx mori]
Group
Gene OntologyGO:00055151.2e-11protein binding
KEGG pathway 
InterPro domain[334-424] IPR0008593.1e-13CUB
[46-84] IPR0021721.2e-11Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL10279 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202126-TA
ATGTTTATCGCTGTAATACTTACTCTGATCAGATCCACCGACTGTGATATTGTTGACGGCAGAAACGTTTACCAAAAAGATGGTACATTTTTACAAGAGCCGGAATCACTGCCAGTGCTACATTTAAATGATACCAAATGCAGAATATCAGAGTATCAGTGTTTTAATAAACGTTGTATCCCGATAAACAGATTTTGTGATGGAGGCAACGACTGCGGTGACGCCTCGGATGAACCCCGTCACTGCACACGTTGCAACAAAACATACTATGGAGATATCGGTAGAACATATGAACTGGAATTGCATCGTCCTAGAGAAGATCTGGTACCATATGTCTGCTTACTGACCATCACTGCTGCTGGAGGTGTGCACGGAGATCTTGTTCAGATATTATTCGATAGTTTCACGTTGGGTCGTTTCACATCGTTCGTTGAAGACGGCTGCCCTGATGGCTACATGCAAATTCAAGAGGCAAGCCGTCCCCAAGTAGGCGGTTCGTGGTGTGGAACTTCCTGGGGCCCCTCAATTTATTACAGCGAGACGAAATCTATTACGCTTATTATTCGCCTATTACATTTGTCTAAGGAACAGAATAATTATAACTTTGATTTCCGTATGGCATATAAAGTTTTGAGGAAAGAATATGCCACTGTGAGATACGGTGGAACGCCACCTTCAGAGAGACCTTTTTTCAAACTTAGGCCAATGTTATACCCTGTAAACATATCACACGGAGAAGATACGAACATTGGAAGGACGGCAAAAACTCCAGAGTATTACCTCGGGGATTTGATCTCGGGGACTTATTGCTCTCGGATCTACAGTGACTGCGATCGTAAAAACTGTAGATTACAATCTCCAAACTTCCCCGGCGTTTATCCTAGAAATCTCACTTGTTACTATGCGGTTAGACAGCACGAAGTGCCTCAAGGAAAACATGCTTTAATAGTTGTGAAACAACCGAATGGCCAGTTAATTTCCATAAGAAGTCAAAAGGCTTTGTACGCTCCATCTCAAAATGATAGGGAGACTAACGGTAGAGAATTAAAATTTTGGCAACAGTGCGATGAAGTACAGGACTATGTGACAGTTTATGATGGCTACACGACAAGAGATCCCGTTATACTGAGGTTTTGCGGTGGTGGAGTTTCAGTTCCTGAAGCGATATCAAGTGGTCCGGAATTGTTAGTCGAGTTTACTACTTCACCTTTTGGTACATTTCTCCAACCGGCTACATCACAGTCTCTACATGGATTCCAATTAGAAGTCGAGGTGAGATTTGTTGACCAACAATCTCCAACGTACGCAAAAAATAAAAGAGCTTGTGAATTTTGGATTCGAGGAACTGGTAGGGGAGTTTTGGAGCATCCACAACATTCACTTCCTCCAAACACAACATGCCTTTATCACATGCAAGGAATCGATACTTCCGGTCCATCAGGAAGAAATATTATTTACAGACGGCAATTTTCTGCACCGCGTTTTAGGGTTTGGTTCTCTGTATTAAAATTTTATGTAACAAACGCTATCAATCCCAATTTGCCTGAAGATGAGTATTGTGGGAGCCATTTGAATATATGGGATGGGCCGATGAGGATATCTCCCGGTTGCAGTGATATATTTTGTGATAAAGAACGATCGGTTCAAATGTCAAGAGCCACTTCCGCCCAGTCCGTGATAGTTGGACGGCCACCGACAAATGCAACCCTCATAGCTAGGTTCTGCCGCGAACGGGCTCCAAGGACTTGTGAACATGCGTTATTAAGACGATCTAGAGCATGTGCTAGAACAGAAAGTTTCTTGTCTAAAGGGGACTCTTTAACGTTGGAATTGAAGTTAACACAAGGAACAGCACTGAAGCCAGTATTCTTTAAAGCTCTATACGAGTTTGTAGATTTACATCAAGATGGAGAGGCTTGGGGACGCGGGCCGTGTTCCAGACGTTTTGCGTCAAGGTCATTCCCTGAAGCGCCACCAGACCCGCCGGTTACCTTCTCCTCGCCCAGAGATGTCTTTTTGTATGGTCGTGGCGGTGCTAAAAACATCAGTTGCACTTATCGCTTCGAAGCCAATCCGGGTGAGGTAGTACGTCTCCGAGTGTGGGGCGTCAGGAATGGTGGGAGGTTATGCAGGTCCGTGCACTCGGCTCACTCACCATGGTATCGATGCGGAGGAAATCCTACAGCTGCTGTGAGGATCTTCGAGAGACCATGGAAAGATAATGGAGCTGGTGTTCCAAGGGACTGTTTGTGCAGCGATGTTCCTGACTATGAATTCACATCCACCTCAGCAGTGGCTGAACTAAAGTTTGACGTGGTCAACATGTCCGCTGTTGATGACTTCAGATCCTTCGGTTTTGAGGCATCTGTTGAGTTCGTAAGGCTGAGCAGTGTTTGTGATTCCACCCACCGAGTGTTTGGAGCTAGTGGGGAATTGAGGCTTTCATACTATCCTCCAATGTCTGAAGATGATCGTTGTGACTTTCGTCCCTGGTTGATTGATCCAGCACCAGGCAGGTACCTTTATCTCCAAATCCAAGGCAGCTTGATTAAGGAACCGGAACTCGATAACTTCACGTTACAAAATAATATAACAGTTGCTCCGGATTTGAGGCATTTATGCCAAACACAGAATCGTATTGCAGTCTACGCTGGAGGACTTTCTCCGATATTCATATGTCCTAGGCCACCAGTAGAACAGGAAGACGACGTAGTGGAAGTATTCTCCGAAGGTTGGACTTCGGAAGTGGATGGTTTGTCCAGACACGTGCTGGCGGCGACGCCCTCCCTCGACCGGTTCGATTTGGAATGGCCACGATCGCTCTCTGTTGAACTGATAGCCGTGGAGACTGGCTCCTATTACATTACGTGGTTGGAACTATCGAGGAGACACCCAAGTCCTCGCGGCGGTGTCTTCGCTCTCTCCGGCGAGTGTACTCATAAATGTTCGTCTTTGGACGCGTGCATCGAACCAGCTCTTTGGTGTGACGGAGTGCCGCAGTGTCCGCACGGGGAGGACGAGCGTCTGTCGCAATGCTCTGCTCTCATGAGACTGCCCGCGCATTACGCAGCCGCTCTGCTCGCGCTCACAGTGCTTGCGGTATTTGCTGTGATAGCGGTGATGCGTTCGTGTCGCAGACGGCGGTCAGCTTTCCAACAACGTCTCAAGAGTTTATCATCTGATACAGCCATATTCGACGAGAAAGAAGTGATTTGCTGA

Protein sequence:

>DPOGS202126-PA
MFIAVILTLIRSTDCDIVDGRNVYQKDGTFLQEPESLPVLHLNDTKCRISEYQCFNKRCIPINRFCDGGNDCGDASDEPRHCTRCNKTYYGDIGRTYELELHRPREDLVPYVCLLTITAAGGVHGDLVQILFDSFTLGRFTSFVEDGCPDGYMQIQEASRPQVGGSWCGTSWGPSIYYSETKSITLIIRLLHLSKEQNNYNFDFRMAYKVLRKEYATVRYGGTPPSERPFFKLRPMLYPVNISHGEDTNIGRTAKTPEYYLGDLISGTYCSRIYSDCDRKNCRLQSPNFPGVYPRNLTCYYAVRQHEVPQGKHALIVVKQPNGQLISIRSQKALYAPSQNDRETNGRELKFWQQCDEVQDYVTVYDGYTTRDPVILRFCGGGVSVPEAISSGPELLVEFTTSPFGTFLQPATSQSLHGFQLEVEVRFVDQQSPTYAKNKRACEFWIRGTGRGVLEHPQHSLPPNTTCLYHMQGIDTSGPSGRNIIYRRQFSAPRFRVWFSVLKFYVTNAINPNLPEDEYCGSHLNIWDGPMRISPGCSDIFCDKERSVQMSRATSAQSVIVGRPPTNATLIARFCRERAPRTCEHALLRRSRACARTESFLSKGDSLTLELKLTQGTALKPVFFKALYEFVDLHQDGEAWGRGPCSRRFASRSFPEAPPDPPVTFSSPRDVFLYGRGGAKNISCTYRFEANPGEVVRLRVWGVRNGGRLCRSVHSAHSPWYRCGGNPTAAVRIFERPWKDNGAGVPRDCLCSDVPDYEFTSTSAVAELKFDVVNMSAVDDFRSFGFEASVEFVRLSSVCDSTHRVFGASGELRLSYYPPMSEDDRCDFRPWLIDPAPGRYLYLQIQGSLIKEPELDNFTLQNNITVAPDLRHLCQTQNRIAVYAGGLSPIFICPRPPVEQEDDVVEVFSEGWTSEVDGLSRHVLAATPSLDRFDLEWPRSLSVELIAVETGSYYITWLELSRRHPSPRGGVFALSGECTHKCSSLDACIEPALWCDGVPQCPHGEDERLSQCSALMRLPAHYAAALLALTVLAVFAVIAVMRSCRRRRSAFQQRLKSLSSDTAIFDEKEVIC-