Monarch geneset OGS2.0

DPOGS215370
TranscriptDPOGS215370-TA1842 bp
ProteinDPOGS215370-PA613 aa
Genomic positionDPSCF300088 - 1219547-1225440
RNAseq coverage323x (Rank: top 35%)
Annotation
HeliconiusHMEL0034540.079.38% 
BombyxBGIBMGA012387-TA0.072.12% 
DrosophilapUf68-PA8e-12472.03% 
EBI UniRef50UniRef50_Q17CW33e-13667.32%Fuse-binding protein-interacting repressor siahbp1 n=11 Tax=Neoptera RepID=Q17CW3_AEDAE
NCBI RefSeqXP_001121000.10.063.85%PREDICTED: similar to poly U binding factor 68kD CG12085-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3071950170.065.82%Poly(U)-binding-splicing factor half pint [Harpegnathos saltator]
NCBI nr blastxgi|3838610590.066.11%PREDICTED: poly(U)-binding-splicing factor half pint-like [Megachile rotundata]
Group
Gene OntologyGO:00036769.4e-23nucleic acid binding
GO:00001661e-22nucleotide binding
KEGG pathwayame:7251150.0 
 K12838 (PUF60)maps-> Spliceosome
InterPro domain[88-456] IPR0065322.9e-158Poly-U binding splicing factor, half-pint
[163-236] IPR0005049.4e-23RNA recognition motif domain
[133-235] IPR0126771e-22Nucleotide-binding, alpha-beta plait
[516-599] IPR0039544.9e-09RNA recognition motif domain, eukaryote
Orthology groupMCL12755 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215370-TA
ATGGAATTGTTGCAGGGCATCGAGATGAACGCAGGGATGGGGGTGCCGGCTTCTTTGGGCGTCGTCTGTCCGCCGAGTGCGGCGAGCTCGGTGGGCGCGGTGGGAGTGATGGGCGCGATGGGTGGCGCTTGTCCGACCGTGAGCGGAGACTTCCACGCGGCGCCTATCTATGACCTGTTGCAGGTCGGCGACGTGTTCACAGGTGAGGCAGAGAACGTTTCAAGTAAGTGTATGAGTCGGGCGGTGCTGAGGTGTTTTGTGTCCTCAGGGCCGGGCGCCAAGTGTTCTTCGCTGCCTGCCATCCTTGGCGGAAACATGCCGCGACTGTCCTCGGAACAAGCGGACGCCGTGGCGCGAGCCAAGAAGTACGCCATGGAGCAGAGCATCAAGATGGTGCTGATGAAGCAGACGCTGGCGCACCAACAGCAGCAGATGGCCTCGCAAAGGACGCAGGTGCAGCGGCAACAGGCGCTCGCGCTCATGTGCAGAGTGTACGTGGGGTCGATATCGTTCGAGCTCAAAGAGGACACGATCCGCCAAGCGTTCCTGCCGTTCGGGCCGATCAAGTCCATCAACATGTCGTGGGACCCAGTCACTCAGAAACACAAAGGGTTCGCCTTCGTCGAGTACGAGATACCGGAGGCGGCGCAGCTGAGTCTCGAGCAGATGAACGGAGTGATGCTGGGCGGGAGAAATATCAAAGTGGTAGGGAGACCTTCCAACATGCCGCAGGCCCAGGCCGTCATAGACGAGATACAGGAGGAAGCAAAGCAGTACAACAGAATATACGTCGCCTCCATACACCCGGAGCTGACGGAGGACGATATTAAGAACGTGTTCGAGGCGTTCGGTCCCATCACGTATTGCAAGCTGGCATACGGAGCGTCCGCGCACAAACACAAGGGCTACGGGTTCATCGAGTATGCGACTCTCCCGGCCGCGCTGGAGGCGATCGCCTCCATGAACCTGTTCGACCTCGGTGGCCAGTACCTGCGGGTGGGACGCGCCATCACTCCGCCCAATGCTCTCGCCGGCCCGCCGCAAGCCTCCGCCATGCCGACCGCGGCCGCCGTGGCCGCCGCCGCCGCCACCGCCAAGATACAGGCCATGGACGCCGTCGCCAGCAACGCCGTTGCGCTCGGACTGACCAAGCTCAACGCGCTCGGCGTTCCGCCCGCCGCCGCGCTGCCGACGCTCGCCGCCGCGCTGCCGGTGGCGCTGCCCGCCGCTCTGCCCGCCGCGCTCCCGGTCACTCTGCCGACCGCGCTTCCGGTCACTCTGCCGGTCACTCTGCCCGCCTCTCTGCCGGCCGCCCTGCCCCCGGCGCCGGTCATCCCGCCGCCGGGTGTGGTGATCCCGCCGCCTCCCCGTCCGCCCGCGGCCGAGCCCTCGGCGGACGGCGAGGGTGGCCAGCAGGCGGCGCTACAACGCAAGCTGCTGGACAGTTCGCCGGATACGCTCCAGCAGCAGGAGTCTTTGTCGATCTCGGGTCAGTCGGCGCGACACCTCGTCATGCAGAGACTGATGAGGCGCCGCGCGAGCAGGACCGTGCTGCTCGAGAACATGGTGGCGGCTCACGAGGTGGACGACGCGCTCCACCATGAGATACAGGAGGAGTGTTGCAAGTGGGGCCGGGTGGAGAGACTAGTCATATACAACGAGAGACAAAGCGAGGACGATGACCCTGCACATGCTGACGTTAAGATATTCGTCCAGTTCGCGGACCCCGAGGAGGCGGGAGCTGCGGCCGGGGCTCTATCCGGCCGATACTTCGGAGGTCGTACGGTGCGCGCTCGGCTCTACGACCAGGACCTGTTCGACCACGGGGACCTCTCGGGCTGA

Protein sequence:

>DPOGS215370-PA
MELLQGIEMNAGMGVPASLGVVCPPSAASSVGAVGVMGAMGGACPTVSGDFHAAPIYDLLQVGDVFTGEAENVSSKCMSRAVLRCFVSSGPGAKCSSLPAILGGNMPRLSSEQADAVARAKKYAMEQSIKMVLMKQTLAHQQQQMASQRTQVQRQQALALMCRVYVGSISFELKEDTIRQAFLPFGPIKSINMSWDPVTQKHKGFAFVEYEIPEAAQLSLEQMNGVMLGGRNIKVVGRPSNMPQAQAVIDEIQEEAKQYNRIYVASIHPELTEDDIKNVFEAFGPITYCKLAYGASAHKHKGYGFIEYATLPAALEAIASMNLFDLGGQYLRVGRAITPPNALAGPPQASAMPTAAAVAAAAATAKIQAMDAVASNAVALGLTKLNALGVPPAAALPTLAAALPVALPAALPAALPVTLPTALPVTLPVTLPASLPAALPPAPVIPPPGVVIPPPPRPPAAEPSADGEGGQQAALQRKLLDSSPDTLQQQESLSISGQSARHLVMQRLMRRRASRTVLLENMVAAHEVDDALHHEIQEECCKWGRVERLVIYNERQSEDDDPAHADVKIFVQFADPEEAGAAAGALSGRYFGGRTVRARLYDQDLFDHGDLSG-