Monarch geneset OGS2.0

DPOGS206260
TranscriptDPOGS206260-TA1290 bp
ProteinDPOGS206260-PA429 aa
Genomic positionDPSCF300290 - 413171-416334
RNAseq coverage2013x (Rank: top 6%)
Annotation
HeliconiusHMEL0131260.086.74% 
BombyxBGIBMGA010739-TA0.074.50% 
DrosophilaNop56-PA0.070.60% 
EBI UniRef50UniRef50_Q95WY30.070.60%FI04781p n=90 Tax=Eukaryota RepID=Q95WY3_DROME
NCBI RefSeqXP_001603746.10.072.20%PREDICTED: similar to nucleolar KKE/D repeat protein; DmNOP56 [Nasonia vitripennis]
NCBI nr blastpgi|3504250810.072.45%PREDICTED: nucleolar protein 56-like [Bombus impatiens]
NCBI nr blastxgi|3504250814e-17971.15%PREDICTED: nucleolar protein 56-like [Bombus impatiens]
Group
KEGG pathwaynve:NEMVE_v1g1931354e-25 
 K12844 (PRPF31)maps-> Spliceosome
InterPro domain[236-382] IPR0026872.6e-60Pre-mRNA processing ribonucleoprotein, snoRNA-binding domain
[140-192] IPR0129761.8e-28NOSIC
[4-69] IPR0129742.5e-20NOP5, N-terminal
Orthology groupMCL13982 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206260-TA
ATGAGCAAGTTATACGTGCTCTATGAGCACAGCGCAGGCTTTGCGCTCTTCCGAGTAACAGAGTTTGAAGAGCTGGCAGCCTTTCTGCCGCAAGTGGAGGAATCCGTTACAGACTTGCAAAAATTCAATTCCGTTGTTACTTTGGTAGCTTTCCAGCCCTTCAAATCAGCAGTGCTGGCTTTGGAGAATATTAATGCCGTTTCTGAAGACCCCAAGCTTGGGGCTGCAATTAGTGAGGCTCTGGAGATACCTTGCTCTCATACCGGCGCTGTTCCAGAAGTTATCCGAGGTATAAGACATCACTTCCATTCGCTCATCAAAGGTCTTACAGCGAAGGCGTGTAGTGTGGCCCAGCTGGGTCTCGGTCACTCTTATTCGAGAGCTCGAGTAAAGTTCAATGTTCACCGAGTTGATAACATGATCATACAATCTATAGCCCTGCTCGATCAACTCGACAAGGATGTTAACACGTTTTCTATGAGGATCAGAGAATGGTATTCCTACCACTTCCCGGAGCTTGTGAATATTGTGCCTGAAAATTATCTTTACTCGAAATGTGCGGAATTCATCAAAGACAGGAAGACATTGACGGATGAGTCTGTGGAACCGCTCACAGATATCCTCGGTGACTCTGAAAAGGCCCAGGCTATAATTGATGCATCAAAAATGTCAATGGGGATGGATATATCACCCGTCGACTTGATCAATATTCAGATGTTTGCTAGCAGAGTCGTTGCATTGAGTAATTACAGGAAGCAAATCGCTGAATATCTTCATACAAAAATGGCGTCAGTGGCACCAAATCTGACGACACTAGTTGGTGATCAGGTTGGGGCCCGGCTTATATCCAAAGCTGGTTCCCTTACCAGTTTGGCCAAATATCCTGCTTCTACATTACAAATTTTAGGTGCCGAGAAAGCTTTGTTCCGAGCTTTGAAAACACGTTCGGCGACACCGAAGTATGGTTTGCTGTATCATTCGAGTTTCATTGGTAGGGCTGGAGTGAAGAACAAAGGTCGCATCAGTCGCTACCTAGCTAACAAGTGCTCTATTGCCAGCAGGATCGACTGTTTCTCGGAAAACTTATCAAGCGTTTTTGGTGAGAAATTAAGACAGCAGGTGGAAGACCGCTTGAAGTTCTATGAGACGGGTGACATTCCTATGAAGAATATTGATGTCATGAAACAGGCTATAGAGGAAATAACACAACAAGAGGATGCGTCAGCTAAGAAGAAGAAGAAGAAAAAGAAGAAGGAACAGAATAACGAGGTGCCTATGGATGAAGATTGA

Protein sequence:

>DPOGS206260-PA
MSKLYVLYEHSAGFALFRVTEFEELAAFLPQVEESVTDLQKFNSVVTLVAFQPFKSAVLALENINAVSEDPKLGAAISEALEIPCSHTGAVPEVIRGIRHHFHSLIKGLTAKACSVAQLGLGHSYSRARVKFNVHRVDNMIIQSIALLDQLDKDVNTFSMRIREWYSYHFPELVNIVPENYLYSKCAEFIKDRKTLTDESVEPLTDILGDSEKAQAIIDASKMSMGMDISPVDLINIQMFASRVVALSNYRKQIAEYLHTKMASVAPNLTTLVGDQVGARLISKAGSLTSLAKYPASTLQILGAEKALFRALKTRSATPKYGLLYHSSFIGRAGVKNKGRISRYLANKCSIASRIDCFSENLSSVFGEKLRQQVEDRLKFYETGDIPMKNIDVMKQAIEEITQQEDASAKKKKKKKKKEQNNEVPMDED-