Monarch geneset OGS2.0

DPOGS203263
TranscriptDPOGS203263-TA2397 bp
ProteinDPOGS203263-PA798 aa
Genomic positionDPSCF300229 - 44499-51035
RNAseq coverage531x (Rank: top 24%)
Annotation
HeliconiusHMEL0153620.082.69% 
BombyxBGIBMGA000244-TA0.080.83% 
Drosophilascaf6-PA2e-9942.51% 
EBI UniRef50UniRef50_E0VUQ40.049.94%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VUQ4_PEDHC
NCBI RefSeqXP_002429848.10.049.94%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420187730.049.94%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|1892408410.052.53%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
Gene OntologyGO:00063964.7e-23RNA processing
GO:00037234.7e-23RNA binding
GO:00056224.2e-11intracellular
GO:00036764.2e-11nucleic acid binding
KEGG pathwayphu:Phum_PHUM4529100.0 
 K12841 (CHERP)maps-> Spliceosome
InterPro domain[13-65] IPR0000614.7e-23SWAP/Surp
[720-769] IPR0004674.2e-11D111/G-patch
[245-305] IPR0069034.4e-10Domain of unknown function DUF618
[180-329] IPR0089424.9e-06ENTH/VHS
Orthology groupMCL12398 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203263-TA
ATGGAGCTTCCTCAACCACCTCAGGATCAAGATCTACGCAACATAATTGACAAACTTGCTCAATTTGTCGCAAGAAATGGCCCAGAATTCGAAAAGATGACCAAGACAAAACAAAAAAACAATCCCAAATTCAGCTTTCTCTATGGTGGAGAATACTTCAACTACTATCAATATAAAGTCACCACCGAGCAAGCAATTCTAAAGCAATCTGCCGGTGGCCAGGCGGCCGCTCCCGCTGGGGCTTATATGGCACAGAGCAGGCAATATGTAATGCCCCAACAGGCCAGTACACCAGCGCAGCTCGGGCTGGCGGCGACGATAGCGGCGGCGCCGGCGCTGCAGCAGTGGCTCGCAGCCAACTCCGCGCCCTCGCAGCGACCCGACATCGACAGCGTGAATACTCAGATCAACATACTTAAGGAACAAATTACTCAGTCGGAAAACAATCTCAACGCCCAACATACGGTGTTGATACAGCAACAGCAAGCTAAGATTAATGAGCTGGTCAACAAGGCACAGATGGAAACAATTCAGATCATGGCAGAAGAAAACAACATCAGCTTATCGGAACTGGATAGCGTCTTGCAGCCCATCATTGACACATGCACTAAAGACAGTATATCTGCAGGCAAGGGGTGGATTTTACAACATGCAACTTCAAATGATGCCGGGAAAGTTATCTCACTCCATCTCCTGAGAAAGGTCACACAGTCTGGAGCGCTGTTCACACAGAAGCTGCACATCATATACCTTGTCAATGATGTTTTACATCATTGTGCACGCAAAAACGCGGAAGATCTCAAGAAAAATTTGGAAAATGTGGTGGTTCCAATGTTTTGCAATGCCAGTATAGCCGTCACAGAGGAACAGGAGGCTAAGTTGAATAAACTGCTGCGTCTTTGGGAATCTAAGTCTAACTACTTCGACAGCGCTGTCATTGAAAAAATGAAGAGTCCTACGAGTTCGTACCAGGAGTATCAGAACAACCTCATCGCCCAGCACTCTAACGCTATAGCGCATCTGGCGCAACAGACGAAGGCAACATTTGAGAACTACCAATCTCAACATCAAGCGTTCGTCAGTCACACAATGCAGCAGATACAACAGATGGAGATGCAGAAACACGCCTTGGAACTACAGAGCGGACACAACGATCACAAAGACGCTCAGCCCCAGAACAACATACCGCATCCATTCCAGAGCAACTTCAACGATCAATACAACAACCAGCAACACAACTACGACCAGAACTTCAATTCATCTCATCAGTACGGCAGCGAGAGCGGCGACAACTATCCATCAAACAACAGCGTCAGCGGCAACGAGAATAGTTACGACAGTCAGACATTCGACCAATTGACGAATAAGCCGGATATAGAAGAGCCGGATTTATCAAATCTGCCGAAAATCAATTTCTCTCAACCCCCGCCTGGATTCAAGCCTCAAGAGAACAGTACAATAATACCGTTTCAGAACGGTCTGCCAGATCTAAGCAAACCCCCGCCCGGCTTCCCGCCGTTCCCAGAAATTAACAACGAAGACCTGATGCCCTCCGTACCTTACTTTGAGTTACCAGCGGGGCTGATGGTACCACTCATTATGTTAGAAGACGACAGATACCGCCCCCTGGACCCGGCCGCCATCCGTCTACCGCCGCCCGCGCCGCCCAACGAGCGACTGCTGGCCGCCGTAGACGCTTTCTACGCTTCACCAAACCACGAACGACCTAGAGATAATGAAGGCTGGGAGAAATTAGGTTTATATGAATACTACAAAGCCAAAAACTCAAGTAGAAAGAAAAAAGAAGATGAAATCGCTCAAGGTTTACGGGAAAAGTCAAAATCACCAAGTCCCATACCTAAAGACCTACAGAAGCAGCCCACGCCGCCCGGGAGGAGATATAGGTCATTAAGTAGAACACCAGAAAAAACCCAGTCACCGCAGAGAAAGTCACGATCAAGGACGCCCTCGCCCAGAAGAAAGTCGTCATGGAGGCGGGAACGGGAACGGGAAAGTCGACGCCGTTCCCGGTCCCGGTCTCGTTCCCGGTCCCGAAGTCGATCACGAGAACGTGAAAGACATCGCGAGTCACCTCGCAATAGACGAGTAGAACGATCTAGATCACCCACACCGCCTAGCTTCCTTTCATCGAGCGGTTTGGACCCCAGCAACAAAGGTCACCAGCTGCTACAGAAGATGGGCTGGAGTGCGGGCGGGCTGGGAGCTGCTGGGCAGGGCATCGCTGAACCCATCAGTGGAGGAACAGTACGAGACAAACAGGATCAGTATAAGGGCGTAGGAGTAAATCTGAATGACCCATATGAAAACTTCAGAAAGAACAAGGGAGCAGCTTTTATAACGCGAATGAAAGAAAGAGCTTTGGAGCGATCGTCGTGA

Protein sequence:

>DPOGS203263-PA
MELPQPPQDQDLRNIIDKLAQFVARNGPEFEKMTKTKQKNNPKFSFLYGGEYFNYYQYKVTTEQAILKQSAGGQAAAPAGAYMAQSRQYVMPQQASTPAQLGLAATIAAAPALQQWLAANSAPSQRPDIDSVNTQINILKEQITQSENNLNAQHTVLIQQQQAKINELVNKAQMETIQIMAEENNISLSELDSVLQPIIDTCTKDSISAGKGWILQHATSNDAGKVISLHLLRKVTQSGALFTQKLHIIYLVNDVLHHCARKNAEDLKKNLENVVVPMFCNASIAVTEEQEAKLNKLLRLWESKSNYFDSAVIEKMKSPTSSYQEYQNNLIAQHSNAIAHLAQQTKATFENYQSQHQAFVSHTMQQIQQMEMQKHALELQSGHNDHKDAQPQNNIPHPFQSNFNDQYNNQQHNYDQNFNSSHQYGSESGDNYPSNNSVSGNENSYDSQTFDQLTNKPDIEEPDLSNLPKINFSQPPPGFKPQENSTIIPFQNGLPDLSKPPPGFPPFPEINNEDLMPSVPYFELPAGLMVPLIMLEDDRYRPLDPAAIRLPPPAPPNERLLAAVDAFYASPNHERPRDNEGWEKLGLYEYYKAKNSSRKKKEDEIAQGLREKSKSPSPIPKDLQKQPTPPGRRYRSLSRTPEKTQSPQRKSRSRTPSPRRKSSWRRERERESRRRSRSRSRSRSRSRSRERERHRESPRNRRVERSRSPTPPSFLSSSGLDPSNKGHQLLQKMGWSAGGLGAAGQGIAEPISGGTVRDKQDQYKGVGVNLNDPYENFRKNKGAAFITRMKERALERSS-