Monarch geneset OGS2.0

DPOGS201642
TranscriptDPOGS201642-TA1650 bp
ProteinDPOGS201642-PA549 aa
Genomic positionDPSCF300254 - 100464-105565
RNAseq coverage2141x (Rank: top 6%)
Annotation
HeliconiusHMEL0156650.083.75% 
BombyxBGIBMGA008202-TA0.086.09% 
DrosophilaBx42-PA0.065.47% 
EBI UniRef50UniRef50_Q135739e-17259.25%SNW domain-containing protein 1 n=128 Tax=Metazoa RepID=SNW1_HUMAN
NCBI RefSeqXP_623623.10.068.28%PREDICTED: similar to Bx42 CG8264-PA [Apis mellifera]
NCBI nr blastpgi|3838560580.069.22%PREDICTED: puff-specific protein Bx42-like [Megachile rotundata]
NCBI nr blastxgi|3838560580.067.51%PREDICTED: puff-specific protein Bx42-like [Megachile rotundata]
Group
Gene OntologyGO:00056814.8e-62spliceosomal complex
GO:00003984.8e-62nuclear mRNA splicing, via spliceosome
KEGG pathwayame:5512250.0 
 K06063 (SNW1, SKIIP, SKIP)maps-> Spliceosome
    Notch signaling pathway
InterPro domain[2-549] IPR0178620SKI-interacting protein, SKIP
[174-334] IPR0040154.8e-62SKI-interacting protein SKIP, SNW domain
Orthology groupMCL12200 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201642-TA
ATGGCGTCGTTGTCGAGCCTATTACCGGCTCCCGTGCAGCCAGTATGGGATCGAGATGACGAATTAAAGGCAAAGCGTGTTGGTAGTGCCCTTGTTGTTTCGCAGACCAGTGTTCCACCTTACGGTCAAAGAAGAGGATGGGTCCCACGCACGGAAGAAGATTTTGGAGATGGTGGTGCGTTTCCCGAGATACACGTAGCTCAATATCCATTGGGTATGGGAGCCCGTGGCAAGGAGAGCACATCGAATGCTTTAGCTGTCCAATTAGATGAGTCAGGAAGGGTGAAATACTCGGCTATAGCTCGTCAAGGCCATGGAGCAGATAAGATAATCTACTCAAAGCTAACAGATCTTCTGCCATCGGAGGTGCTGTCAGAAGACGATCCAAGTCTCCAGAAACCTTCAGAGGAGGATATACAGGAAATAACGGAGAAAACAAAATTAGCTCTTGAGAAACTGACCAATGCAAAGATATCAGCCGCTATGCCGGTTAAAGCTGCTCCCAAGGCTGCTCCCGCCCAATACATCAGATACACTCCCGCCCAACAGGGTGGACAGTTTAATTCGGGCGCCAAACAGAGAGTTATCAGAATGGTTGAAGCCCAATCAGATCCTTTGGAGCCGCCTCGCTTTCAAATTAACCGTAAGATACCCCGCGCGGCGCCTTCGCCCCCGGCCCCCGTGCTCCACTCGCCACCACGGAGAGTCTCCGTCAAACAACAGAGAGATTGGAAAGTCCCACCCTGTGTGTCACACTGGAAGAACGCCAAGGGTTACACAATACCGTTAGACAAACGTCTAGCGGCTGACGGCCGTGGTCTCCAACAGGTTCACATCAACGAGAACTTCTCCAAGTTGGCGGAGGCTTTGTACATAGCTGATAGGAAAGCTAGAGAGGCTGTGGAGGCGAGAGCACAGCTGGAAAGGAGATTGGCTCAGAGGGAGAAGGAGAAGAAGGAAGAACATTTGAGGATGCTGGCGCAGAGAGCAAGAGATCATAGAGCAGGTATAAGGAATCCGGAAGATGAAGCAGAAGAGGGCTTAGACGCTGCCCCAGAGGGAGAGCTCTCTGTGGCGGAGAGAGATAAGTTGAGAGCGGAGAGACACAGAGATAGGCAGAGAGATAGGAATCTGGCTCGAGCTGCACCAGACAAGAGGTCCAAACTGGTGAAGGAGAGAGAACGCGACATATCCGAGCAAATAGCTCTGGGACTCCCGGCCAAGAACAACACGGGGGATGCCATGTTCGATCAGAGATTGTTCAACAACAGCAAAGGAATGGACAGCGGTTACGGTGATGACGAGGCCTATACGGTGTACGATAAGCCGTGGAGAAATCAGGACGGCATTGGATCACATATATACAGACCCTCGAGGAACGCTGATAAGGATAACTACGGAGACGTTGATAGTCTAGCGGCTAACAAACGTTTCGTTGCTGACAAGACATTTGCTGGGAGTAGCGGTGGAGCGCCTCGTTCAGGACCCGTCAACTTTGAGAAGGATACCAGAGAGGAACCGAGTCGAGGCCAGCCCGAAGCTGATCCTGATCCGTTTGGTTTGGATCGGTTCTTGAGTGAAGCCAAACGAGCTGATAAGGCGAGGAAGAGAGATCACCACGAGCCACACGCCAAGAGGAGGAGGGATTAG

Protein sequence:

>DPOGS201642-PA
MASLSSLLPAPVQPVWDRDDELKAKRVGSALVVSQTSVPPYGQRRGWVPRTEEDFGDGGAFPEIHVAQYPLGMGARGKESTSNALAVQLDESGRVKYSAIARQGHGADKIIYSKLTDLLPSEVLSEDDPSLQKPSEEDIQEITEKTKLALEKLTNAKISAAMPVKAAPKAAPAQYIRYTPAQQGGQFNSGAKQRVIRMVEAQSDPLEPPRFQINRKIPRAAPSPPAPVLHSPPRRVSVKQQRDWKVPPCVSHWKNAKGYTIPLDKRLAADGRGLQQVHINENFSKLAEALYIADRKAREAVEARAQLERRLAQREKEKKEEHLRMLAQRARDHRAGIRNPEDEAEEGLDAAPEGELSVAERDKLRAERHRDRQRDRNLARAAPDKRSKLVKERERDISEQIALGLPAKNNTGDAMFDQRLFNNSKGMDSGYGDDEAYTVYDKPWRNQDGIGSHIYRPSRNADKDNYGDVDSLAANKRFVADKTFAGSSGGAPRSGPVNFEKDTREEPSRGQPEADPDPFGLDRFLSEAKRADKARKRDHHEPHAKRRRD-