Monarch geneset OGS2.0

DPOGS211838
TranscriptDPOGS211838-TA1155 bp
ProteinDPOGS211838-PA384 aa
Genomic positionDPSCF300031 + 994126-996715
RNAseq coverage1753x (Rank: top 7%)
Annotation
HeliconiusHMEL0164956e-14366.50% 
BombyxBGIBMGA006022-TA3e-12458.69% 
DrosophilaLa-PA1e-7045.35% 
EBI UniRef50UniRef50_Q264572e-7042.94%La protein homolog n=5 Tax=Culicidae RepID=LA_AEDAL
NCBI RefSeqXP_002066385.17e-7241.11%GK18125 [Drosophila willistoni]
NCBI nr blastpgi|3838488212e-7345.45%PREDICTED: la protein homolog [Megachile rotundata]
NCBI nr blastxgi|1954368842e-7940.99%GK18125 [Drosophila willistoni]
Group
Gene OntologyGO:00063967.3e-36RNA processing
GO:00056347.3e-36nucleus
GO:00037237.3e-36RNA binding
GO:00305297.3e-36ribonucleoprotein complex
GO:00001661.6e-24nucleotide binding
GO:00036769.1e-10nucleic acid binding
KEGG pathwaydwi:Dwil_GK181252e-71 
 K11090 (LA, SSB)maps-> Systemic lupus erythematosus
InterPro domain[38-119] IPR0066306.4e-40RNA-binding protein Lupus La
[47-64] IPR0023447.3e-36Lupus La protein
[34-119] IPR0119916.2e-26Winged helix-turn-helix transcription repressor DNA-binding
[243-358] IPR0126771.6e-24Nucleotide-binding, alpha-beta plait
[257-358] IPR0148866e-20RNA-binding motif
[139-215] IPR0005049.1e-10RNA recognition motif domain
Orthology groupMCL13603 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211838-TA
ATGACTGAAGAAAAAGAGGTACAAGCTGAAAATAACGAGAGTCAAGATAAAGAAAATAAAAATGAGGATGAACAAAACAAAAATGCAGAAGTTGACAGTATAGAAGAAAAATCTGAATTGGACAGTAGTATAATTCGTCAAATCGAATACTATTTTGGTGACCTTAACCTACCAAGAGACAAATTTCTCAGAGAGCAAGTCAAGCTTGATGATGGCTGGGTGCCACTGGAAGTGTTGACCAGATTTAACAGACTTGCAAAACTGACGACTGACATCGGAGTTATTGCAAATGCTATCAGCAAGTCTACATCAGGTCTTCTAGAGATATCTGATGACAATCTAAAAGTCAGACGGAATCCTGAATTGCCGATACCTGAGATGAATGAAGAAAGACGTAAGGAGTTAGTTTCTAGAACAATCTATGCTAAAGGATTCGGCAAAGACGCCTCCCTCGATGACATACTAAAATATTTCAAGCAGTTTGAAGAAGTGGAAAACATTATTATGAGGAAATATCAAGATAGGAAGACAAAGACATTCATTTTTAAGGGTTCAGTATTCGCTACCTTCAAGACAAAAGATCAGGCAGATAAATTTATGGAAACAAAAGATTATAAATTTAATGATACCGACCTACTGGTTATGTGGCAAGACGCATACGTAGAGAAAAAGAGGGAGGAATATGCAAAATTGTCAGCAAACAAAAAGAATAAAAACAAAAATGGAGAATCCGAGCAAAAAGAGAAAAGTGAATTCAAACTGCCGACCGGAACAGTGCTTCACTTCAGCCAGGGTCATGACAAAATGACCAGAGAGGATGTCAAAGAAGTCCTCACTCCATTAGGTGGGGAGGTGGCTTTTATCAGTTTTAAGGTGGGAGATACAGAGGGTTGGGTCCGTCTGGCTAATGAGGGTGATGCTAAGAAAGTAGCCGAGAAGATTCCGGATGGTAAAATCAAGATCGGCGAATCTGAGGTCGTTTTCCGCGTATTAGAAGGGGAAGAAGAGAAGAATTACCTCGATAAGACTATAGAAGAAATGTCAAAGAGACGACAGAATATGAAAAAACAGAAGAAAGGTGGCAAACCACAGTACAGGAAAAGGAAACAAGATAATCACGACGATGCACCAAGAGCAAAAACAAGAGCAAGCTAA

Protein sequence:

>DPOGS211838-PA
MTEEKEVQAENNESQDKENKNEDEQNKNAEVDSIEEKSELDSSIIRQIEYYFGDLNLPRDKFLREQVKLDDGWVPLEVLTRFNRLAKLTTDIGVIANAISKSTSGLLEISDDNLKVRRNPELPIPEMNEERRKELVSRTIYAKGFGKDASLDDILKYFKQFEEVENIIMRKYQDRKTKTFIFKGSVFATFKTKDQADKFMETKDYKFNDTDLLVMWQDAYVEKKREEYAKLSANKKNKNKNGESEQKEKSEFKLPTGTVLHFSQGHDKMTREDVKEVLTPLGGEVAFISFKVGDTEGWVRLANEGDAKKVAEKIPDGKIKIGESEVVFRVLEGEEEKNYLDKTIEEMSKRRQNMKKQKKGGKPQYRKRKQDNHDDAPRAKTRAS-