Monarch geneset OGS2.0

DPOGS201947
TranscriptDPOGS201947-TA2181 bp
ProteinDPOGS201947-PA726 aa
Genomic positionDPSCF300244 - 219393-223840
RNAseq coverage57x (Rank: top 69%)
Annotation
HeliconiusHMEL0038240.076.68% 
BombyxBGIBMGA006580-TA0.068.55% 
DrosophilaCG31156-PA1e-13337.00% 
EBI UniRef50UniRef50_D6X0H82e-13939.51%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X0H8_TRICA
NCBI RefSeqXP_971655.13e-14039.51%PREDICTED: similar to S1 RNA binding domain protein, putative [Tribolium castaneum]
NCBI nr blastpgi|3071793487e-14139.26%S1 RNA-binding domain-containing protein 1 [Camponotus floridanus]
NCBI nr blastxgi|910911761e-13739.29%PREDICTED: similar to S1 RNA binding domain protein, putative [Tribolium castaneum]
Group
Gene OntologyGO:00061393.1e-35nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
GO:00167883.1e-35hydrolase activity, acting on ester bonds
GO:00037236.9e-09RNA binding
KEGG pathway 
InterPro domain[453-500] IPR0233238.3e-51Tex-like domain
[21-202] IPR0189742.2e-37Tex-like protein, N-terminal
[330-452] IPR0066413.1e-35YqgF/RNase H-like domain
[501-632] IPR0230971.9e-33Tex RuvX-like domain
[13-115] IPR0233199.7e-22Tex-like protein, HTH domain
[656-724] IPR0160271.3e-12Nucleic acid-binding, OB-fold-like
[653-720] IPR0123407.3e-11Nucleic acid-binding, OB-fold
[659-722] IPR0229672e-10RNA-binding domain, S1
[658-719] IPR0030296.9e-09Ribosomal protein S1, RNA-binding domain
Orthology groupMCL16471 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201947-TA
ATGCAAAGTGACCAGTATGATGTTATATTTGACCAAGCTAAAATGTTATCATGTTCTGAAAAAATATCTGTAAGTGTAGCTCAAAACTTTATTAATCTATTATCAGAAGGATGTACATTACCATTTATTGCAAGGTATAGGAAAGATGCCGTTGATCACCTTATGCCTGACAGACTTCAAGAACTCTATGAAAGCTATCAACATATAATACAACTTAAAAAAAAGGTTAAATCTGTGTTAGAAACTCTAAAGAAATCAAATAAATTGACCCCAGAGATTGAACAAAGCCTTTTAAGTGCGAGGAATTTATCAGAAGTTGATTTGGTGTATGGTCCTTTAAAATCACATTCTCAATCATTAGCTGAAAGAGCCAGAAGTTTGGGTCTTGAACCTCATGCATTGAACGCTCTTAATGGTGATTATGTCGAAATCAAATCTCTTTGCGATGGAAGTGAAGAGTTGGCTAATTTTGAAAAAGTTGAAGCCCATGTGACTCATATTGTAGCCGATATAATATATAAAGATACAAGAGTGATTGAACAAATGAGGAATTTGAAAGAAGAAACAAGGTTTACATTACAAAGCAGTAGAGTGAAATCCACAAAGACAAAACAGGAGACGGTTATGAAATATGACACCAAATCTGATCCCCAGACTTATAAATTGTATTTCGACTGGAAGTGTCCTATTCAATTTGTAAAATGCTATCAGACATTGGCTGTGAATAGGGGGGAAGATGAAAAGATACTTTCTGTGAAAGTTATTATTCCTGATTGGTTCTACAATAAACTTGAACGGTTCTGTCTCACTTTATGGAAAAGCAATTACTGGGTTCATAAAGGTCTCGGTGATGCTTACAATCGTCTGATAAAACCCTGGCTCTCAAGGAAAGTCAGATCAGATTTGACAAGTTTGGCCGAAAAGGAAGCTGTTAAGACATTTAGCACAAATTTAGAGAAATATTTATTGACTGAACCCATAAAGAATAAAACTATAGCTGGATTGGACCCTGGTTTCAAAGCAGGATGCAAGGTTGGTATAATAGATGCTACTGGAACTATGTTGGAAGCATGCAACATATACCCAAATTTTAATTGCAACAATAATGATCCAGCGGCCAGACAACTAAGTGGTCTCTTATCTAAACATAGTGTAGATCTCATTGGTCTTGGTAATGGAACAGCGTGTAGAGAAACTGAATCCTGGTTAAAGAGACACAAAATATCAGAACACATTCCTGTCATCATAGTTCCGGAGCAAGGCGCTTCTATATATTCAATTAGTAAGGAAGCTCAGAAAGAACATCCAAATATGGATCCAAATTTGATATCGGCTTTGTCATTAGCTAGAAGAGTGTTGGATCCGTTAGGGGAATTGATAAAGGTGGAACCGAAGAATTTAGGTGTTGGTTTATACCAACATGATATTCCACCTAAATTGTTGGAATCAGCCTTAGATATGACGGTGGAGAAAGTTGTGAGTTTGGTTGGAGTTGATATCAATACTGCATCACAGGCCATGTTGAGGCGTATTGCTGGTTTAAATGACGGCCGTGCGAAAAAAATAATAGCGTATAGACAAGAAAATGAAAGGTTTAAAACTCGTGCTGAGTTATTAAAAGTTCCTGGTATAGGAAAAGTTACGTACCAACAATGTGCTGGGTTTTTGAAAGTTTTGGGAGGTCTAGAGCCGTTGGACACGACTATTATACATCCTGAGAGCTATTCCGTTGCTAAAACATTTGCAAAAAAAATCGGCGTAAACGTCAAAGACTTAACCGACGCCCGATTTCCTGAAGATGTAGAAAGGAAATCCAGAAGTATAGATATATCGGCTATGAGTAAAGAACTTGACACCGATATAAGTAATTTAGAGTTGATTATAAATGCGTTCAAACTGAAGGCCTATGAAGACAATGTGATTACGTTCTGTAGACCGGTGTACTCTATGGTAGTTCAAGCAAGCGATCAATTGGAGAAAGGAATGTCTTTGACAGGTGTAGTCCGCAATGTGGTGCCGTTCGGTTGTTTCGTGGATTGCGGTGTTGGTGACAACGGACTCATACACACCAGCAACATGGCGAACGCTAACCTCAAGCTGGGAGACAGGGTCGCCGTTACGGTCATATCAACACCGAAACCAAAGAAAATACAACTCAAACTAGACAGAATATTGGACTAG

Protein sequence:

>DPOGS201947-PA
MQSDQYDVIFDQAKMLSCSEKISVSVAQNFINLLSEGCTLPFIARYRKDAVDHLMPDRLQELYESYQHIIQLKKKVKSVLETLKKSNKLTPEIEQSLLSARNLSEVDLVYGPLKSHSQSLAERARSLGLEPHALNALNGDYVEIKSLCDGSEELANFEKVEAHVTHIVADIIYKDTRVIEQMRNLKEETRFTLQSSRVKSTKTKQETVMKYDTKSDPQTYKLYFDWKCPIQFVKCYQTLAVNRGEDEKILSVKVIIPDWFYNKLERFCLTLWKSNYWVHKGLGDAYNRLIKPWLSRKVRSDLTSLAEKEAVKTFSTNLEKYLLTEPIKNKTIAGLDPGFKAGCKVGIIDATGTMLEACNIYPNFNCNNNDPAARQLSGLLSKHSVDLIGLGNGTACRETESWLKRHKISEHIPVIIVPEQGASIYSISKEAQKEHPNMDPNLISALSLARRVLDPLGELIKVEPKNLGVGLYQHDIPPKLLESALDMTVEKVVSLVGVDINTASQAMLRRIAGLNDGRAKKIIAYRQENERFKTRAELLKVPGIGKVTYQQCAGFLKVLGGLEPLDTTIIHPESYSVAKTFAKKIGVNVKDLTDARFPEDVERKSRSIDISAMSKELDTDISNLELIINAFKLKAYEDNVITFCRPVYSMVVQASDQLEKGMSLTGVVRNVVPFGCFVDCGVGDNGLIHTSNMANANLKLGDRVAVTVISTPKPKKIQLKLDRILD-