Monarch geneset OGS2.0

DPOGS210658
Transcript: DPOGS210658-TA; 2430 bp
Protein: DPOGS210658-PA; 809 aa
Genomic position: DPSCF300401; + strand; 148841-157241
RNAseq coverage: 2312x (Rank: top 5%)
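The lengths and coordinates in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the transcript is coding sequence only (start codon through stop codon) and that the genomic coordinates are 1-based and inclusive; neither assumption is stated in the record itself, and the function name `cds_length` is hypothetical.

```python
# Values copied from the record; variable names are illustrative only.
transcript_bp = 2430
protein_aa = 809
start, end = 148841, 157241  # assumed 1-based, inclusive


def cds_length(n_residues: int, with_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues, optionally plus one stop codon."""
    return 3 * (n_residues + (1 if with_stop else 0))


expected_bp = cds_length(protein_aa)   # 3 * (809 + 1)
genomic_span = end - start + 1         # length of the genomic interval

print(expected_bp == transcript_bp)    # True
print(genomic_span)                    # 8401
print(genomic_span - transcript_bp)    # 5971
```

Under these assumptions, the 809 aa protein plus one stop codon accounts for exactly 2430 bp, and the gene spans 8401 bp on the scaffold, leaving 5971 bp that are not part of the transcript (presumably intronic sequence).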
Annotation
Heliconius: HMEL010791; 2e-112; 55.68%
Bombyx: BGIBMGA001644-TA; 0.0; 73.64%
Drosophila: larp-PD; 3e-117; 51.92%
EBI UniRef50: UniRef50_Q9VAW5; 5e-115; 51.92%; La-related protein n=17 Tax=Bilateria RepID=LARP_DROME
NCBI RefSeq: XP_001659688.1; 7e-124; 42.53%; lupus la ribonucleoprotein [Aedes aegypti]
NCBI nr blastp: gi|157120615; 1e-122; 42.53%; lupus la ribonucleoprotein [Aedes aegypti]
NCBI nr blastx: gi|157120615; 5e-120; 41.08%; lupus la ribonucleoprotein [Aedes aegypti]
Group
KEGG pathway: lma:LmjF21.0540; 5e-09
    K11090 (LA, SSB) maps-> Systemic lupus erythematosus
InterPro domain: [48-125] IPR006630; 2e-28; RNA-binding protein Lupus La
    [45-116] IPR011991; 3e-23; Winged helix-turn-helix transcription repressor DNA-binding
    [567-608] IPR006607; 1.3e-22; Protein of unknown function DM15
Orthology group: MCL11240; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210658-TA
ATGTCCCCGCCCCAGCTCCGCGTGGCTCAGCCGGGCCTGCCGCAGGTGGTGCTCCGGTACGGGCTCGGGGCCCTGCCGCTCGCCCCGCCGATCGCGCAGGCCTTCTACTACGGCAGCGCCCCCTACGTCGGACTGGACCAGGCGACGCTCAAAGACCTCATCAAGAAGCAGATCGAGTACTACTTCAGTCCCGATAACCTGGCCCGGGATTTCTTCCTGCGGCGTAAGATGTCCCCGGACGGCACCATACCCGTCACCCTGATCGCGTCCTTCCACCGCGTGAGGGCGCTCACCGCTGACGTGCAGCTCGTGCTGGACGCCATACGCGACTCGGACAGGCTGCAGCTGATAGGAGGGTTCAAGGTCAGAACGGCGTTCGAGCCTACAAAATGGCCGATACTGGATCTACCCTCCAACTCCGATGACGGCGAGGGGCAGGAGAAGAACGAGAGAGACGAACACACAGACAGACACGACGCGGACGCGGAGAGAATACACGGAGCGCGACACGAGGAGCCGGAGCAGAAGACTGGACACGAGGAGCGGCCAGCGGAGCGGGACCAGGCCGCGCATGCTGACGCTGAGGAGGCGGGGGCCGGCGAGGACGGAGCGGACGCGGCGCTGAAGGACGGAGAACGGACTCTAGGGAACGAGGCGAATCCCGAACCGCCGTCAGAAAAACAGGATGAAGACAAACGAGCGCCGGCGGATGGGAAACAGGAAGCGGATGTGGACAGCCAGGAGAAGGATATCAACAACGACCCGCCGCTGCAACCGGATGTAGAGAAGCCCGGGGAGAAGAGACGTAAAAAGCAAATGGTCGGCATCTTCCCCCTGGCGACCATGGGCGGGCCCCTAGTGGCGCCCGTGTCCAGTCTGCTGAGAGCCGTCCCTCCGCCGCCGCTGCCCAGGATATTCCGCACACGTCCGGGGACCGCGCCCCTCGCCGTGGATCACCTCAACCCTGACGTCCCGGAGTTCGTCCCCCAGGCCAGGAGAAACGAGGAACCGATGGATTCCGCGGAGAAACAGGACAATGGTTCCAGTGCATCATCAGACAGAGGAGATGCTGATGTATGGACGGAGGTGAAGCGCCGTACCAAGGCGGGGTCTCGGGAGCGTGCGCCCCGACCTGACGCGGCGGACGACGAGCCCAAGGAGGAGCTGCACTTCCAGCTGGATGAAGAGCTGGAGCTGCCCCCGCCCAGAAACAACACCTTCACTGACGGCTGGTCGGACGAGGAATCTGATCTCGAGCTGTCGGACCGTGACATCGGCCGGCTGCTTATAGTGACTCAGACCGCGGCCCGCGCGCCCAAACACGACGGACACGACCGACAGGGCGACTGGGCCACCAGGACCAAGATCACGCAGGACCTGGAGCAGCCGAACGAGCAAAAGACAGAGGGCGGCGGTAAGGCAGCCCGCAGACGAACGGCGCGCTTCTACGCGGCTAGCAAGGATCCGCACGCTACGGACGTTATAAGCAGCAGGAAGCACAAGACGCGGCACAGCCTGAACCCGCCGGTGGAGCACCATGTGGGGTGGATCATGGACGTGCGCGAGCACAGGAGACAGCACAGGGACAGCACGGGATCGTCCCTGGGAACGTCACCCACCCTGGGCTCGTCCTGCGGCTCCGTCCCTCAGTCCCTGCCGGCCTTCCATCACCCCAGCCACGGGCTGCTGAGAGAGAACCACTTCACACAGCAGGCCTACCACAAGTATCACTCGAGATGTCTTAAAGAACGCAAGAAGCTGGGCATAGGCCAGTCCCAGGAGATGAATACTCTCTTCAGATTCTGGTCCTTCTTCCTCAGAGATCACTTCAACAGGACTATGTATAATGAGTTTAGGAGCCTAGCTACTGAGGACGCAGCGGCAGGATTCCGCTACGGCCTGGAATGTCTGTTCAGATTCTACTCGTACGGCCTGGAACGCAAGTTCAGACCTGAGCTGTACCAGCACTTCCAGGTGGAGACGCTAGCTGACTATGAGAA
AGGTCAGCTGTACGGCCTGGAGAAGTTCTGGGCGTTCTTGAAGTACTACAAGCACGCGGCGGCACTTAACGTGGAGCCTACACTTAAAGCACACCTCGCTAACTTCAATACCGTGGAGGACTTCCGGGTGCTGGAGCCCCAGCTGAACGAGCTGCTAGCGGCGGGCGGGCCGGGGGCTGGCAGGCCCCACGCCCGCGTGTACGAGAGACATCGCTCCGTGTCCGAGAGCGAGAGGACCAAGGCCTGTCGGAACACGTTCAGCAGCCGTCCCGCGGCCGCCACCAACCGCGGTCGTGCTGGTTCCTGCGGCGCCCAGGGTACAGTCGCCGCTGAGAGGAGGCAGCGCTGCAGCCTAGCCGCGGTGGCCGCGAGACTGGAGCCGTGGTCCGGGGCTACGTCGGAGCGGCGCACACCGACACACATACATTAA

Protein sequence:

>DPOGS210658-PA
MSPPQLRVAQPGLPQVVLRYGLGALPLAPPIAQAFYYGSAPYVGLDQATLKDLIKKQIEYYFSPDNLARDFFLRRKMSPDGTIPVTLIASFHRVRALTADVQLVLDAIRDSDRLQLIGGFKVRTAFEPTKWPILDLPSNSDDGEGQEKNERDEHTDRHDADAERIHGARHEEPEQKTGHEERPAERDQAAHADAEEAGAGEDGADAALKDGERTLGNEANPEPPSEKQDEDKRAPADGKQEADVDSQEKDINNDPPLQPDVEKPGEKRRKKQMVGIFPLATMGGPLVAPVSSLLRAVPPPPLPRIFRTRPGTAPLAVDHLNPDVPEFVPQARRNEEPMDSAEKQDNGSSASSDRGDADVWTEVKRRTKAGSRERAPRPDAADDEPKEELHFQLDEELELPPPRNNTFTDGWSDEESDLELSDRDIGRLLIVTQTAARAPKHDGHDRQGDWATRTKITQDLEQPNEQKTEGGGKAARRRTARFYAASKDPHATDVISSRKHKTRHSLNPPVEHHVGWIMDVREHRRQHRDSTGSSLGTSPTLGSSCGSVPQSLPAFHHPSHGLLRENHFTQQAYHKYHSRCLKERKKLGIGQSQEMNTLFRFWSFFLRDHFNRTMYNEFRSLATEDAAAGFRYGLECLFRFYSYGLERKFRPELYQHFQVETLADYEKGQLYGLEKFWAFLKYYKHAAALNVEPTLKAHLANFNTVEDFRVLEPQLNELLAAGGPGAGRPHARVYERHRSVSESERTKACRNTFSSRPAAATNRGRAGSCGAQGTVAAERRQRCSLAAVAARLEPWSGATSERRTPTHIH-
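
The protein record appears to be the direct translation of the transcript under the standard genetic code, with `-` marking the stop codon. The following is a minimal sketch for checking that on fragments of the two sequences; the `translate` helper and the `-` stop symbol convention are assumptions made for illustration, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (NCBI table 1 layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon; stops become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First 18 and last 21 nucleotides of DPOGS210658-TA, copied from the record.
print(translate("ATGTCCCCGCCCCAGCTC"))     # MSPPQL
print(translate("ACACCGACACACATACATTAA"))  # TPTHIH-
```

Both fragments match the start (`MSPPQL`) and the end (`TPTHIH-`) of DPOGS210658-PA. To check the full sequence, join the wrapped nucleotide lines into one string before calling `translate`.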