Monarch geneset OGS2.0

DPOGS201942
Transcript: DPOGS201942-TA, 1692 bp
Protein: DPOGS201942-PA, 563 aa
Genomic position: DPSCF300224 + 112826-126918
RNAseq coverage: 830x (Rank: top 15%)
Annotation
Heliconius | HMEL002312 | 0.0 | 73.96%
Bombyx | BGIBMGA011852-TA | 5e-40 | 42.65%
Drosophila | cue-PA | 1e-85 | 33.44%
EBI UniRef50 | UniRef50_D6WHF9 | 6e-94 | 40.37% | Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WHF9_TRICA
NCBI RefSeq | XP_975303.1 | 2e-96 | 39.54% | PREDICTED: similar to cueball CG12086-PA [Tribolium castaneum]
NCBI nr blastp | gi|91079096 | 4e-95 | 39.54% | PREDICTED: similar to cueball CG12086-PA [Tribolium castaneum]
NCBI nr blastx | gi|91079096 | 2e-103 | 39.54% | PREDICTED: similar to cueball CG12086-PA [Tribolium castaneum]
Group
KEGG pathway | dre:565797 | 5e-31
  K04550 (LRP1, CD91) maps-> Malaria; Alzheimer's disease
InterPro domain | [47-294] | IPR011042 | 3.8e-49 | Six-bladed beta-propeller, TolB-like
InterPro domain | [179-224] | IPR000033 | 1.1e-09 | LDLR class B repeat
Orthology group | MCL14946 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201942-TA
ATGTGCAAAATTAAGTGTTTGATCGCATTCCTCATGCTATCTATCAAGTTAGTTTATTCATGGGATATCGCCATAACAACAGGGGACCAACTGGAATTTTACACTAATCAAACTAAAACTAACAATGAAGGCATAAGATTCAGAGATCTGACGGCATTGGCCTATGACGCCGTACATAATATGCTGTTGTTCGTCGACAAACAGAATGACAACGCTTCGATATTTAGTTACAATCTAATCACGAAGAAATATCTGCCGTTAGTCGGCAGAAGATCCTACGAGAACATACAAGGCCTAACATTTGATCCCGTCAGCGGAAAAATGTTCTGGACCGACACCAATGAGAGAAGTATTTACTGGGTTTCTCTGTGGCCGAGGTCAAAGGACAGCATCTATGGAAATCTTTTAATGAAGATGAACGACGAAATCCCAAGAGACATTGCTGTTGATAGTTGTAGAGGCTATATCTATTGGACGAACACCAATATTACTAAAGCAACAATAGAAAGAGCTCGCTTGGATGGCAGTGAGAGGCGTGTTATTGTTTCCACTGATATCCATATGCCGGTAAGCATTGCAGTCGACCAGCGAACCAAGAGACTGTACTGGGCTGATGATAAAGAAGGAATCCATTACTCCATAGAATCGTCTGATTTAGATGGCAAGGACAGGAGTACAATATACGCCGGAACTTACCACCAGCCGAACACGCTAACTGTCTCAAAGAATGATGTTTACTGGGTGGATTGGGGATACAAATCAGTATGGAGGATATCGAAAAATGCAACTAATACGGAACCCGAGGAACTGATAAAGTTCTCCACCGAAGTCCCTTTTGGGATTGTAGCTAACTATCAAGTAGCTGATCAAACAGAAGGAGTATCGGGATGCGAAGTTTTGGTTAAATTGCAGAAAAATCACAGCTCAGTGAACGATTCCATCAACATTCCTAGAGATGCCGGTCTATTTTGTCTTCATGGAGTCAAGACCGGGATATTTGATTGTAAATGTTCCCCGGGATACATCGGAGACAGGTGTGAAATATCTGTTTGTCAGAACTTCTGTTTGAACGGCGACTGCACCAGCAACAGTGAAGGAAAACCTCAATGCAAGTGTCAAGCCGGTTTCACTGGACAAAGGTGTGAACTAAATGTATGCTATGGATATTGTTTGAATGACGGCGAGTGCTCGCTCATACAAAACAAGCCAAGCTGTAAATGTGCTAATAACTTCGAGGGCGTCCGCTGTGAAACTCTCAAACCTGAGCCAATTAAAACAACGGACACACCCTTTGTAGAACCCTGCAACTGTACCCAAGTGAACTCTTCATCGTTACTGATGTTAGGGTGCGTGTCTGGGTGGGATACTGTACGTGATCCAGTGCTGTTAGCGCTGGGAGTGCTGGCTGGACTGCTGGCACTCACCAGCGCTGTACTGGCAGCTAATGTACTGTACCTTCGACGGAGGCCCAGAATAAAGAAGCGTATAATAGTTAACAAGAGCGGCACTCCGCTGACGGCCCGACCAGACGCTTGTGAGATAACCATCGAGGACTGCTGCAATATGAACATTTGTGAAACGCCATGTTTCGAGCCTCGTAACTCCATTCGGCCGACGCTAGTCGACGCTAAACCCGGCAAGGAAGAGAAAAGGAATTTGATATCGAACATGGAGCAGGAAGATATATACTGA

Protein sequence:

>DPOGS201942-PA
MCKIKCLIAFLMLSIKLVYSWDIAITTGDQLEFYTNQTKTNNEGIRFRDLTALAYDAVHNMLLFVDKQNDNASIFSYNLITKKYLPLVGRRSYENIQGLTFDPVSGKMFWTDTNERSIYWVSLWPRSKDSIYGNLLMKMNDEIPRDIAVDSCRGYIYWTNTNITKATIERARLDGSERRVIVSTDIHMPVSIAVDQRTKRLYWADDKEGIHYSIESSDLDGKDRSTIYAGTYHQPNTLTVSKNDVYWVDWGYKSVWRISKNATNTEPEELIKFSTEVPFGIVANYQVADQTEGVSGCEVLVKLQKNHSSVNDSINIPRDAGLFCLHGVKTGIFDCKCSPGYIGDRCEISVCQNFCLNGDCTSNSEGKPQCKCQAGFTGQRCELNVCYGYCLNDGECSLIQNKPSCKCANNFEGVRCETLKPEPIKTTDTPFVEPCNCTQVNSSSLLMLGCVSGWDTVRDPVLLALGVLAGLLALTSAVLAANVLYLRRRPRIKKRIIVNKSGTPLTARPDACEITIEDCCNMNICETPCFEPRNSIRPTLVDAKPGKEEKRNLISNMEQEDIY-
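
The lengths in this record are mutually consistent: 563 residues plus one stop codon give 3 x 564 = 1692 nucleotides, and the protein sequence ends in "-" where the transcript ends in the stop codon TGA. A minimal Python sketch of that check follows. It assumes the standard genetic code and a complete coding sequence with no UTRs. The function names translate_cds and expected_cds_length are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
# Sketch only: assumes the standard genetic code and a full-length CDS.
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; "-" marks a stop codon,
# matching the trailing "-" used in the protein sequence of this record.
AMINO = ("FFLLSSSSYY--CC-W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(nucleotides: str) -> str:
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for the residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First five codons of DPOGS201942-TA followed by its stop codon.
    print(translate_cds("ATGTGCAAAATTAAGTGA"))
    print(expected_cds_length(563))
```

To check the full record, pass the complete DPOGS201942-TA sequence to translate_cds and compare the result with the DPOGS201942-PA sequence, including its trailing "-".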