Monarch geneset OGS2.0

DPOGS207375
Transcript: DPOGS207375-TA | 1044 bp
Protein: DPOGS207375-PA | 347 aa
Genomic position: DPSCF300267 | 32031-33269
RNAseq coverage: 122x (Rank: top 57%)
Annotation
Heliconius: HMEL012236 | 7e-122 | 63.02%
Bombyx: (no value listed)
Drosophila: CG31812-PB | 1e-15 | 32.21%
EBI UniRef50: UniRef50_UPI00022C92C4 | 9e-43 | 45.32% | UPI00022C92C4 related cluster n=3 Tax=unknown RepID=UPI00022C92C4
NCBI RefSeq: XP_001604779.1 | 1e-36 | 40.10% | PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|332022768 | 3e-42 | 29.53% | tRNA-splicing endonuclease subunit Sen2 [Acromyrmex echinatior]
NCBI nr blastx: gi|332022768 | 3e-41 | 29.70% | tRNA-splicing endonuclease subunit Sen2 [Acromyrmex echinatior]
Group
Gene Ontology:
GO:0006388 | 1.1e-28 | tRNA splicing, via endonucleolytic cleavage and ligation
GO:0000213 | 1.1e-28 | tRNA-intron endonuclease activity
GO:0004518 | 3e-28 | nuclease activity
GO:0003676 | 3e-28 | nucleic acid binding
KEGG pathway 
InterPro domain:
[190-317] | IPR006676 | 1.1e-28 | tRNA-splicing endonuclease
[233-317] | IPR011856 | 3e-28 | Endonuclease TnsA, N-terminal/resolvase Hjc/tRNA endonuclease, C-terminal
[236-317] | IPR006677 | 1.1e-27 | tRNA intron endonuclease, catalytic domain-like
[190-226] | IPR006678 | 1.7e-06 | tRNA intron endonuclease, N-terminal
Orthology group: MCL15377 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207375-TA
ATGCAATCAAAAGATTGCTGTCAAATGAATGAGAAAAGTAGCACATCTATGGATGAGGAAAAGACACTTCGTTTTCCTTTGGTTGACTCTATGTGTATTCTGTTTACTGGTTATTTTAATGGTGTAGGTGTTGAGATTAGGTCTACTGATGCAATGGCACTCTTGCATCATATAGGTTGCTATGGTAAAGGAAATACTTCCAAATCTAGACCTAAAGTTAAGAAGGAAGGAAGTCCAATTATTATGAGAAAGAGACAATTTTTAAAAAGAAGTTATTGGTACAAGAGGTTTGGAAATGAAGAGAAAAATGAAGAGTCAGATTTTTTTTTTAAAGATGTTTATGATTTGATAAAAAAAATCAAACGGGACACAAAAAAGGGTGTTATTGACTTGGTTTCGAGTGATGATGATGGTGGTGATGGTATGTCACAATTTTTAAACAACCATTCACCATCTCACACTCCTCATGAACAAGACATTGTAGTTGTTGTGGCCAATAGTGACTCTGAGGATGATAATTATTTTGCAAATTTAAAACCTCAATGTTGCCTGAATAAAGTATCCCTGCAGGAAAAGCTTATGCTTACATTACAAGAAGCATTCTTCTTGGTATACGGACTTGGTTGTTTAAAAATTGTGAAGGAAGAAGACCAAGTATTGAACATAGAGGAATGCTGGTCACTCTTTTGTAATACAGACAAATACTTTGTCAGTAAATACATAGTTTACCATCACTTTAGATCTAAAGGGTATGTTGTGAAATCAGGAATTAAATTTGGTGGGGATTTCTTGTTGTATAAAGAGGGACCGGAAGTAAATCATGCTGATTATATTGTTGTTATCAAGACTGAAAATGATACATTTAACTGGATATCTTTGTTGGGTCATGTCAGAATGGCTACAACAACGGTGAAGGAAGTTATGATTGCTGAGGTGAAATCTGAAGGTGAAAATCTACGTCTACCACATGACTTGTGTAAATATAGTGTTCGGGAGTTGGTACTGTCAAGGAACTTACCAGTAATAAACAATGAAATAGACTAA

Protein sequence:

>DPOGS207375-PA
MQSKDCCQMNEKSSTSMDEEKTLRFPLVDSMCILFTGYFNGVGVEIRSTDAMALLHHIGCYGKGNTSKSRPKVKKEGSPIIMRKRQFLKRSYWYKRFGNEEKNEESDFFFKDVYDLIKKIKRDTKKGVIDLVSSDDDGGDGMSQFLNNHSPSHTPHEQDIVVVVANSDSEDDNYFANLKPQCCLNKVSLQEKLMLTLQEAFFLVYGLGCLKIVKEEDQVLNIEECWSLFCNTDKYFVSKYIVYHHFRSKGYVVKSGIKFGGDFLLYKEGPEVNHADYIVVIKTENDTFNWISLLGHVRMATTTVKEVMIAEVKSEGENLRLPHDLCKYSVRELVLSRNLPVINNEID-