Monarch geneset OGS2.0

DPOGS212613
TranscriptDPOGS212613-TA1503 bp
ProteinDPOGS212613-PA500 aa
Genomic positionDPSCF300245 + 122017-125003
RNAseq coverage160x (Rank: top 52%)
Annotation
HeliconiusHMEL0164951e-1432.43% 
BombyxBGIBMGA005210-TA8e-13558.12% 
DrosophilaCG42569-PA1e-5831.96% 
EBI UniRef50UniRef50_D6WH198e-6736.65%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WH19_TRICA
NCBI RefSeqXP_974730.22e-6736.65%PREDICTED: similar to multi-sex-combs [Tribolium castaneum]
NCBI nr blastpgi|1892353003e-6636.65%PREDICTED: similar to multi-sex-combs [Tribolium castaneum]
NCBI nr blastxgi|1892353004e-7637.65%PREDICTED: similar to multi-sex-combs [Tribolium castaneum]
Group
Gene OntologyGO:00001661.3e-13nucleotide binding
GO:00036761.3e-11nucleic acid binding
GO:00063967.2e-10RNA processing
GO:00056347.2e-10nucleus
GO:00037237.2e-10RNA binding
GO:00305297.2e-10ribonucleoprotein complex
KEGG pathwaynve:NEMVE_v1g2365788e-17 
 K11090 (LA, SSB)maps-> Systemic lupus erythematosus
InterPro domain[28-106] IPR0066303.6e-26RNA-binding protein Lupus La
[25-106] IPR0119919.1e-25Winged helix-turn-helix transcription repressor DNA-binding
[113-180] IPR0126771.3e-13Nucleotide-binding, alpha-beta plait
[119-178] IPR0005041.3e-11RNA recognition motif domain
[37-54] IPR0023447.2e-10Lupus La protein
Orthology groupMCL15790 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212613-TA
ATGTCTGAATCCGAAAGTGCCGAAGTTATCGAAGAAACAAAACTTGATAACAGCCCCCGAAAACGGGTTCGACATCGGAAAAAGCAGATATATGAGAATATTATGAAACAAATGGAATTCTACTTTAGCGATGCTAATCTCAGTAAAGATAGATTTTTAGGGGATTTGGTTAAAAACGATCCCTATGTACCGATAACAGAATTTTTGAAATTTAATAAGATTCGATCTATGACTCAGGATGTTGGTGATATTGTTAAGGCCATGAAGCACTCAACTTTTTTGGAATTATCAGAAGACAAAACCAAAGTCGCTCGGAAAACTCCCATGCTTCCATATGATGCAGACCAGAGGACCATCTATGTTGAGTCAATACCAGTTACAGCAAGTCTTAATTGGCTGGATAGAGTCTTCTCTGACTATGGACAAGTTGCATATATATCACTGCCAAAGTTTAAGAACTCACAGAAAAACAAGGGTTTTGCATTCATAGAATTTGCATCACCTCAAGACGCTCAAAACTGTATCAGTACATTTACCAAAATGGGATGTAAATTACCGTCATGTATGCCCCCGGAGGAATTATCTTCCATCAAAATGTTCAGTGCGGAACCAGCGGAGTTATCAGAGAACAACTATGAGCCTCCTAAAAAGAAATCTAAAAAGATTAAGGAGAAAAAACCGAAAGGCCTAGACCTAAAACAGGACGAGTCTGATAGTGAAAAAAATTTGGATACACCAACGGAGGAGTCTAAGGAAAATATCTCTACAGCATGTGAGGAAACTAAAATAACAGATGCAGAATCCAAAGACGATATAACAAGTCATGATGAAATGACCGATGAAGTTCCAAGGAAGAAAAAAGTTAAAAAACGCTCTTCCAAAGAAAAGGGTAAATCGAAGGCTCTCGCTAAAGGTGAAGTATCCAAGGGAGCTCTCTGGGGTCTGCAAGTGTTACCCAAATGTGAATGGAAAGCTCTTAGGAATAAATATCTCAATCTACAAAGGAAATACATGAAGGAAATTAAAAGCAATTGGCAGAATAGGAAGCACACCTACCCTGGAATACTCCCACCAGATCCAGCTATTGAAGTACCAGTAAACACCACACCGGACATAGAAAGCGGCAAGAATTTAGAGCCGTTACAAAAGATACCGGGAGTATTTGTTAAGGAGAATCTAAAGGAACCTTGTCTAGATGTGAGGTTACTGAAAAGAACTATCAAAACTAACATTCATGTTCTACATGTCGAGGCGAAAGAGGGACAGTCTGAAGTCATAATACGCTTTGATTCTTCCAAAGCGGCCGATGAGTATTGTACTAGCAATACACACGCGTTAGTACTCCGCGGTTCCGACGAGCAAGAGCAATGGCGGCGCGCGGAATCAGCACGCGCGTGTAAAAGGACACGAGGCCGCACACGCGTGCTGGCTCGCACCGCGCCGCACACACTCGCACCCGCCTCGCCGCAACCCACACACACACATATTCGATTTGATGAATAA

Protein sequence:

>DPOGS212613-PA
MSESESAEVIEETKLDNSPRKRVRHRKKQIYENIMKQMEFYFSDANLSKDRFLGDLVKNDPYVPITEFLKFNKIRSMTQDVGDIVKAMKHSTFLELSEDKTKVARKTPMLPYDADQRTIYVESIPVTASLNWLDRVFSDYGQVAYISLPKFKNSQKNKGFAFIEFASPQDAQNCISTFTKMGCKLPSCMPPEELSSIKMFSAEPAELSENNYEPPKKKSKKIKEKKPKGLDLKQDESDSEKNLDTPTEESKENISTACEETKITDAESKDDITSHDEMTDEVPRKKKVKKRSSKEKGKSKALAKGEVSKGALWGLQVLPKCEWKALRNKYLNLQRKYMKEIKSNWQNRKHTYPGILPPDPAIEVPVNTTPDIESGKNLEPLQKIPGVFVKENLKEPCLDVRLLKRTIKTNIHVLHVEAKEGQSEVIIRFDSSKAADEYCTSNTHALVLRGSDEQEQWRRAESARACKRTRGRTRVLARTAPHTLAPASPQPTHTHIRFDE-