Monarch geneset OGS2.0

DPOGS202601
TranscriptDPOGS202601-TA1032 bp
ProteinDPOGS202601-PA343 aa
Genomic positionDPSCF300140 - 408407-410908
RNAseq coverage310x (Rank: top 36%)
Annotation
HeliconiusHMEL0024545e-12390.58% 
BombyxBGIBMGA006346-TA3e-16181.69% 
DrosophilaPrp18-PA3e-11560.29% 
EBI UniRef50UniRef50_B0WCY01e-12062.03%Pre-mRNA-splicing factor 18 n=1 Tax=Culex quinquefasciatus RepID=B0WCY0_CULQU
NCBI RefSeqXP_002424411.15e-12563.34%pre-mRNA-splicing factor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3838550073e-12463.24%PREDICTED: pre-mRNA-splicing factor 18-like [Megachile rotundata]
NCBI nr blastxgi|3838550076e-12063.24%PREDICTED: pre-mRNA-splicing factor 18-like [Megachile rotundata]
Group
Gene OntologyGO:00056811.7e-66spliceosomal complex
GO:00083801.7e-66RNA splicing
KEGG pathwayphu:Phum_PHUM1286602e-124 
 K12817 (PRPF18, PRP18)maps-> Spliceosome
InterPro domain[157-336] IPR0040981.7e-66Prp18
[79-129] IPR0036483.2e-16Splicing factor motif
[84-113] IPR0149063.8e-10Pre-mRNA processing factor 4 (PRP4)-like
Orthology groupMCL13636 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202601-TA
ATGGACATTCTTAGGGCTGAAATAGCTAAAAAAAGAAAGCAGTTAGAAGAGAAAGATATTTTGAGACCGGCAAAAAAATATTTTAAAAGAAGTGAACTCTTAGCTAAAGAGCATGAAGAATATTTGAAAAAATATTGTCCAAGTATTACCAATAATGAAGAGGACAATGAGACAGAAAATGCTGAGAAAGCAAAAAAGCAAGAGAATGAAACTGATCAAACAGAAAGTATATCATTACCAAGAGCAGAAGTTGTCAAAAGGTTAAGGGATCGTGGACATCCTATAGTACTATTTGGTGAAAGCGAATTACAATCTTTTAAAAGATTGAGAAGAATAGAAATACAGGAACCAGAAGCTAACAGGGGTTTTAGGAATGATTTCCAAGAAGCTATGGAACAGGTAGATCAGGCCTATCTAGATGAAATATTGGCATTGGGTACTCAGAATGACAGTGAAAAACTTCAAAAAGAAGATGCCCTAGATGATTCAGTTACATATGAATACATTCAGGAAATGTCTGTAACTATGGGAAAAGGAGATAGAAACCATGATATGAATGTTATCATGACATTATTACAGTTTCTACTCAAGCTTTGGGGACAACAGTTAAATGCAGCCACTGGGGATCAGAAGACAGCAATTAAACACAAAATGACGAGGGCGACATATACACAAACGCAAGTTTATCTTAAACCTTTAATGAGAAAACTCAAGAAGAAGAATCTTCCTGAAGATATATGCGACAGTCTTACAGAAATCACAAAACATTTGCTGGATAGAAATTATATTATGGCGAGTGATGCTTATCTACAGATGGCTATTGGTAATGCACCATGGCCTATTGGTGTTACTATGGTTGGTATCCACGCCCGTACAGGTCGTGAAAAGATTTTCTCAAAAAATGTGGCTCATGTGATGAATGATGAAACACAGAGAAAATATATTCAGGCTTTGAAGAGATTAATGACCAAATGTCAAGAATACTTCCCAACGGATCCTTCTAGATGTGTAGAATATAGTACAAATAAATGA

Protein sequence:

>DPOGS202601-PA
MDILRAEIAKKRKQLEEKDILRPAKKYFKRSELLAKEHEEYLKKYCPSITNNEEDNETENAEKAKKQENETDQTESISLPRAEVVKRLRDRGHPIVLFGESELQSFKRLRRIEIQEPEANRGFRNDFQEAMEQVDQAYLDEILALGTQNDSEKLQKEDALDDSVTYEYIQEMSVTMGKGDRNHDMNVIMTLLQFLLKLWGQQLNAATGDQKTAIKHKMTRATYTQTQVYLKPLMRKLKKKNLPEDICDSLTEITKHLLDRNYIMASDAYLQMAIGNAPWPIGVTMVGIHARTGREKIFSKNVAHVMNDETQRKYIQALKRLMTKCQEYFPTDPSRCVEYSTNK-