Monarch geneset OGS2.0

DPOGS202283
TranscriptDPOGS202283-TA2901 bp
ProteinDPOGS202283-PA966 aa
Genomic positionDPSCF300032 - 266610-273659
RNAseq coverage1171x (Rank: top 11%)
Annotation
HeliconiusHMEL0047300.072.17% 
BombyxBGIBMGA004939-TA0.070.79% 
DrosophilaCG1646-PB1e-9954.55% 
EBI UniRef50UniRef50_E0VIE70.046.45%Pre-mRNA-splicing factor clf-1, putative n=3 Tax=Neoptera RepID=E0VIE7_PEDHC
NCBI RefSeqXP_970329.10.053.60%PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog (yeast) [Tribolium castaneum]
NCBI nr blastpgi|910861670.053.60%PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog (yeast) [Tribolium castaneum]
NCBI nr blastxgi|910861670.047.17%PREDICTED: similar to PRP39 pre-mRNA processing factor 39 homolog (yeast) [Tribolium castaneum]
Group
Gene OntologyGO:00054888.7e-11binding
GO:00063962.9e-05RNA processing
GO:00056222.9e-05intracellular
GO:00063973.7e-05mRNA processing
GO:00056343.7e-05nucleus
KEGG pathway 
InterPro domain[693-874] IPR0119908.7e-11Tetratricopeptide-like helical
Orthology groupMCL12857 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202283-TA
ATGGATGATGAAATGAGTTCAGATTTACAGAAATGTTTATCAGAGGGTGCTATGGACACTGATGAGGGTCCATTAATAAATGAAAATTCAAATGGTTTGTCCCATTTATTAACCGACGGAGAGTTTAACAGTGTTGACGCTGCGGTTGTTGACGAGTCGTCTACCTCGAATCAGGCTTTTTTAAATGCAGTTGAGTTATCGACTGGCAGCGTCGGTGATTATCTCGATAACAACACGGGCGGTTTTAACACTGATTCATTCGATGTCGGCGACAACGCTGTCAACTTCCCCAGCGACTCTTTAAACAATGTACTGTCAGACGGCGATTCCCGGTTGAGTGATGCTTTCAAAACGACGCCAAATGATATAGATCTTACAGAGAATCAGAAGACGGTGTCTCAATTAGAGCTTGATATGGAGTTGACGCAGGATACGTCTAACGAAACTAGAAGCAATCAGGATTTTAACATGTCCTCTTTTAACGAGAGAAATGATACGTTACCAAGTGACCAAGACAACACCAACGACACGCTTGTCGAACAGTCAGTAAAGAAATCTAGAAAGAAAAGACGTTCATCAAAAGATGAAGATGATAACAAGAAAAACCAGAACTCCAATGATAGGAATAGATTAAGCGAAAGGAAACAGGATTCGGTATCAGATAATACTCAAGGAAACAGCCAGGATAGTTCATATGGAAGATCGAAAGAAAAAGTGAAGGTTGGAAGTGCTGAAGAGACCGAGGTAGTTTCCGAAGATGAACTGCCGGTTATTCAGAAACCGAGTGTAAAAGATGCAGAGAATGTGTCCGATGATGAATTGCCGGGACCTAAACCAGCCGAACTACCGGCAGATACCGAGGTTGTATCCGAGGACGAACTGCCGACATCAAAGAAGGATGGAAAAGAGTCTCGGAAACGAAAAACTGAGGAAGGTGACGGTTATGACCCTGGTTCACCAACATCTGAATCGGAGTCGGCTAACAAAAAACAAGCTGTTTCTAAGAACGGTGAAAGCAAACCAGTGTCGGCAGAAAAAAGATTCTCAGGGGATGAAAAGCCTAAGAAGAAAACTCTCCCCGACCTAGACAAGTACTGGAAGGTGGTTAACGATGATCCGACAGATTTCACAGGCTGGACGTACTTGTTGCAGTACGTCGACCAAGAGAGCGACGCCGAAGCGGCCCGCGAAGCCTACGACGCGTTCCTCTCCCACTACCCGTACTGCTACGGCTACTGGCGCAAGTACGCCGATTACGAGAAGCGTAAAGGAAGTAAAAAGAAATGCCTGGAGGTGTTGGAAAGGGGGCTGAAAGCCATACCACTGTCGGTTGACCTTTGGATACATTATCTCAACCACATTAAGACCACGAGGACTGAAGATCACACCTTCATTAGGTCGCAGTACGAGAGGGCTATCGAAGCATGCGGCCTGGAGTTTCGTTCCGACCGTCTCTGGGAGTCGTACATCAAGTGGGAGGCCGAGAACGGCTCGGCCCTCAACGTCACCAACATATATGACCGTCTGCTGGCAACACCGACACTTGGATACACCTCGCACTTTGACAATTTCCAGGAGCACGTGATGTCGGAGCCGGCTTGCGGAGCGGTTTCCGCTGAGGAGCTCGTTCGCCTCCGCGCTGAGGTGAGGGACTCCGCCCCCGCTCAGCCGCCGCCCGACCTGCCCCCCGGCGAGGACGTCGGGCGACTAGCTTCAGAAGACGAGGCTCAAGCCATCAAAGAGCGAATTATAGCAGCGCGAAGAAAAGTTCACAAGACGACAGGAGAAGAAGTAGCGGCCAGGTGGGCATTTGAAGAAGGGATAAAACGCCCATACTTCCACGTGAAGCCTCTCGAGAGATGTCAGCTGAAGAACTGGAAGGCGTACCTGGAGTGGGAGAAGCAGCACGGCTCCTTTAAACGAGCACTGGTGTTACACGAGCGCTGTCTCATAGCATGTGCTCTGTATGAAGAGTTCTGGATGAGGTTAATAAAGTTTCTGGAAGAACATTCAGCCTCGGACCCCTCAGTGATTCCCCTCCAGCGGGATGCTCTAGAGCGAGCGTGTACTGTACATCACCTGGACAAGCCCGAGCTGCACCTGCACTGGGCGCACTTCGAGGAGGCTAATGGGAACACGAGTCGTGCTGCTGAAATATTAGATAGGATCGAGAAGACCTGCCCCAACCTGGTGCAGATACAGTACAGGCGAATCAATCTTGAGAGGCGTCGCGGGGAGTACGATAAGTGCGTCCAGCTGTATGAAGGTTACATTTCATCAGCTAAAAACAAAGCTATAGCATCCGCGCTCGCTATTAAATACGCACGCTTCCTGTTTCACGTGAAGAGGGAACCGGAAGCCGCGAGGAAGGTGCTGGATGATGCGGTACTTAAGGATCCTCTCAACGCCAGACTACACATGCAGCGGTTGGATCTGGCCCTCCACACACCAGGCACCAAGTACGAGGAGTTGGAAGAATTGCTGATGAGCTACGAGAAGCAAGAGGGTGCGGAGATCGAGACGAGTACGGCGCTGGCGGTGCGGAGGAGGGAACTGGCCGAGGAGCTCGGAGACGCGGCCTCGGCCAGACAAGCACACACGCACGCACGAACACTCTACAAACACATGAGGAAGAGGGCGCGGGCGGCCAAACATGACACGCACCATCACACGGCTTGCGCGGACCCGTCAAAGAAGAAAGAGAACTGTGCAACCACCACCAGCACCACCACAGCCAGTAGCGCAAACCAATACTACCAGAACGCGGCGGCGACTGCGCAGTCATACGACCAATCGTATGCACAGCCCTACACGCCGCCGTGGGGCTACCAGCAAGCGGCAGGGCCTTACCCCCACCACCCCCACCCGCACCCCTGGCCGCAGTACCCCAACTACTATTAA

Protein sequence:

>DPOGS202283-PA
MDDEMSSDLQKCLSEGAMDTDEGPLINENSNGLSHLLTDGEFNSVDAAVVDESSTSNQAFLNAVELSTGSVGDYLDNNTGGFNTDSFDVGDNAVNFPSDSLNNVLSDGDSRLSDAFKTTPNDIDLTENQKTVSQLELDMELTQDTSNETRSNQDFNMSSFNERNDTLPSDQDNTNDTLVEQSVKKSRKKRRSSKDEDDNKKNQNSNDRNRLSERKQDSVSDNTQGNSQDSSYGRSKEKVKVGSAEETEVVSEDELPVIQKPSVKDAENVSDDELPGPKPAELPADTEVVSEDELPTSKKDGKESRKRKTEEGDGYDPGSPTSESESANKKQAVSKNGESKPVSAEKRFSGDEKPKKKTLPDLDKYWKVVNDDPTDFTGWTYLLQYVDQESDAEAAREAYDAFLSHYPYCYGYWRKYADYEKRKGSKKKCLEVLERGLKAIPLSVDLWIHYLNHIKTTRTEDHTFIRSQYERAIEACGLEFRSDRLWESYIKWEAENGSALNVTNIYDRLLATPTLGYTSHFDNFQEHVMSEPACGAVSAEELVRLRAEVRDSAPAQPPPDLPPGEDVGRLASEDEAQAIKERIIAARRKVHKTTGEEVAARWAFEEGIKRPYFHVKPLERCQLKNWKAYLEWEKQHGSFKRALVLHERCLIACALYEEFWMRLIKFLEEHSASDPSVIPLQRDALERACTVHHLDKPELHLHWAHFEEANGNTSRAAEILDRIEKTCPNLVQIQYRRINLERRRGEYDKCVQLYEGYISSAKNKAIASALAIKYARFLFHVKREPEAARKVLDDAVLKDPLNARLHMQRLDLALHTPGTKYEELEELLMSYEKQEGAEIETSTALAVRRRELAEELGDAASARQAHTHARTLYKHMRKRARAAKHDTHHHTACADPSKKKENCATTTSTTTASSANQYYQNAAATAQSYDQSYAQPYTPPWGYQQAAGPYPHHPHPHPWPQYPNYY-